G2 takes pride in showing unbiased reviews on user satisfaction in our ratings and reports. We do not allow paid placements in any of our ratings, rankings, or reports. Learn about our scoring methodologies.
A weekly snapshot of rising stars, new launches, and what everyone's buzzing about.
Cloud Dataflow is a fully-managed service for transforming and enriching data in stream (real time) and batch (historical) modes with equal reliability and expressiveness -- no more complex workaround
At Cloudera, we believe data can make what is impossible today, possible tomorrow. We deliver an enterprise data cloud for any data, anywhere, from the Edge to AI. We enable people to transform vast a
Cloud-native service for data in motion built by the original creators of Apache Kafka® Today’s consumers have the world at their fingertips and hold an unforgiving expectation for end-to-end real-ti
Qubole is the open data lake company that provides a simple and secure data lake platform for machine learning, streaming, and ad-hoc analytics. No other platform provides the openness and data worklo
Hadoop HDFS is a distributed, scalable, and portable filesystem written in Java.
Apache Ambari is a software project designed to enable system administrators to provision, manage and monitor a Hadoop cluster, and also to integrate Hadoop with the existing enterprise infrastructure
Posit was founded with the mission to create open-source software for data science, scientific research, and technical communication. We don’t just say this: it’s fundamentally baked into our corporat
TIMi is the most efficient Data Science and Data Processing Platform. Since 2007, we have been creating and improving the most powerful framework to push the barriers of analytics, predictive analyt
Apache Druid is an open source real-time analytics database. Druid combines ideas from OLAP/analytic databases, timeseries databases, and search systems to create a complete real-time analytics soluti