DEV Community

Cover image for ML for Google CDAP
Predictive Works
Predictive Works

Posted on

ML for Google CDAP

Google CDAP (former Cask Data Application Platform) is a next-generation data fusion platform. It is at the heart of Google Cloud Data Fusion, but with a significant advantage: It is not lock-in to the Google Cloud Platform and thus, can be operated quite cost-effective.

Even if CDAP is a great tool to realize plenty of data fusion use cases, e.g., move siloed data to a data warehouse, without writing a single line of code, we often face the same dilemma:

Having configured a data integration pipeline e.g., for data streaming, our customers do not understand why they have to operate another platform to also implement complex event processing or use machine learning to classify real-time events.

In order to avoid operational complexity, Google CDAP finally is discarded to be used in production.

This recurring situation was so annoying for us, that's why we decided to equip Google CDAP with extra plugins to seamlessly implement a wide variety of smart data processing use cases (including all kinds of machine intelligence) with always the same platform.

Today, we use Google CDAP with 150+ plugins, from deep and machine learning, business rule processing to natural language processing and more.

Doing ML/AI projects this way, is a customer success story. At the end, they often have to operate just a single platform to move their data from sources to sinks and vice versa in a smart and extensible way.

That is the real potential of Google CDAP, and data fusion just marked the beginning.

We are excited to share this experience with you. And in one of the next posts, we talk about ML/AI model management inside Google CDAP.

Top comments (0)