Description: Cloudera CDH (Cloudera Distribution Including Apache Hadoop) is an open source data platform that combines Hadoop ecosystem components like HDFS, YARN, Spark, Hive, HBase, Impala, Kudu, and more into a single managed platform.
Type: software
Pricing: Open Source
Description: StreamSets is an open source data integration platform for building and managing big data pipelines. It offers a simple and intuitive drag-and-drop interface to help users quickly build pipelines to transfer data between a variety of sources and destinations including databases, data lakes, and cloud platforms.
Type: software
Pricing: Open Source