Introducing G2.ai, the future of software buying.Try now
Starburst
Sponsored
Starburst
Visit Website
Product Avatar Image
Cloudera Data Flow

By Cloudera

Unclaimed Profile

Claim your company’s G2 profile

Claiming this profile confirms that you work at Cloudera Data Flow and allows you to manage how it appears on G2.

    Once approved, you can:

  • Update your company and product details

  • Boost your brand's visibility on G2, search and LLMs

  • Access insights on visitors and competitors

  • Respond to customer reviews

  • We’ll verify your work email before granting access.

Claim Now
3.5 out of 5 stars
5 star
0%
2 star
0%
1 star
0%

How would you rate your experience with Cloudera Data Flow?

Starburst
Sponsored
Starburst
Visit Website
It's been two months since this profile received a new review
Leave a Review

Cloudera Data Flow Reviews & Product Details

Product Avatar Image

Have you used Cloudera Data Flow before?

Answer a few questions to help the Cloudera Data Flow community

Cloudera Data Flow Reviews (3)

Reviews

Cloudera Data Flow Reviews (3)

3.5
3 reviews

Search reviews
Filter Reviews
Clear Results
G2 reviews are authentic and verified.
Aditya K.
AK
Lead Software Engineer
Enterprise (> 1000 emp.)
"Cloudera Data Flow(CDF) honest reviews"
What do you like best about Cloudera Data Flow?

We are leveraging Kafka of Cloudera Data flow for streaming analytics. CDF provides us real time data which is critical for producing live dashboards and also the amount of data streaming (in petabytes) helps us to have CDF as one stop shop for live data analysis Review collected by and hosted on G2.com.

What do you dislike about Cloudera Data Flow?

Kafka of CDF although is scalable however it has a lot of lag problems and needs complex tuning. When the lag occurrs that is the current offset is more than consumer end offset, a lag in 6-7 figures can be seen that means the stale records reaches to around 1 million at times due to which the dashboard waits for latest data and it sometimes takes hours to fetch that and sometimes restart of service is also required to fix that Review collected by and hosted on G2.com.

Bidisha P.
BP
Senior Speclialist (Vendor Master Data)
Enterprise (> 1000 emp.)
"CDF review"
What do you like best about Cloudera Data Flow?

Cloudera Data Flow(CDF) provides us a single platform for analysis of real time streaming data. We mostly use CFM, CEM to push agents data and Kafka to push live data which is then consumed by spark and after cleaning the financial reports are created. Review collected by and hosted on G2.com.

What do you dislike about Cloudera Data Flow?

Kafka which was earlier a part of CDP(cloudera data platform) has been moved to CDF which makes us buy a separate subscription and hence incur more costs to the project. This was a smart move by Cloudera to make more money but surely hurts us as the service that we used along with CDP now has to be purchased as it comes under CDF umbrella Review collected by and hosted on G2.com.

Verified User in Real Estate
UR
Mid-Market (51-1000 emp.)
"Close to success"
What do you like best about Cloudera Data Flow?

Hortonworks two main pillars are HDP (Hortonworks Data Platform) and HDP (Hortonworks Data Flow). The former applies to the infrastructure required for building and deploying a data lake, and the latter is about ingestion, in batch or realtime.

Both HDP and HDF rely entirely on opensource projects, this is a distinctive point about Hortonworks. Review collected by and hosted on G2.com.

What do you dislike about Cloudera Data Flow?

As an open source project collection, it relies strongly on community activity. You still have the option to contract premium consulting or training services.

Altough it is quickly evolving into Data Science tools availability (eg. Tensorflow incorporate in HDP 3), it can be cumbersome from a developer transitioning from a traditional IDE, into the notebook vs. datalake metaphore. Review collected by and hosted on G2.com.

There are not enough reviews of Cloudera Data Flow for G2 to provide buying insight. Below are some alternatives with more reviews:

1
MATLAB Logo
MATLAB
4.5
(760)
MATLAB is a programming, modeling and simulation tool developed by MathWorks.
2
Google Cloud BigQuery Logo
Google Cloud BigQuery
4.5
(1,201)
Analyze Big Data in the cloud with BigQuery. Run fast, SQL-like queries against multi-terabyte datasets in seconds. Scalable and easy to use, BigQuery gives you real-time insights about your data.
3
Snowflake Logo
Snowflake
4.6
(666)
Snowflake’s platform eliminates data silos and simplifies architectures, so organizations can get more value from their data. The platform is designed as a single, unified product with automations that reduce complexity and help ensure everything “just works”. To support a wide range of workloads, it’s optimized for performance at scale no matter whether someone’s working with SQL, Python, or other languages. And it’s globally connected so organizations can securely access the most relevant content across clouds and regions, with one consistent experience.
4
Alteryx Logo
Alteryx
4.6
(665)
Alteryx drives transformational business outcomes through unified analytics, data science, and process automation.
5
Databricks Data Intelligence Platform Logo
Databricks Data Intelligence Platform
4.6
(634)
Making big data simple
6
HubSpot Data Hub Logo
HubSpot Data Hub
4.5
(567)
HubSpot Operations Hub allows you to keep all your contacts in 2-Way, Real Time Sync no matter if you use (Gmail/Outlook, Salesforce, Pipedrive, Constant Contact, Prosperworks, HubSpot, MailChimp or ActiveCampaign to name a few).
7
Tealium Customer Data Hub Logo
Tealium Customer Data Hub
4.3
(405)
Tealium AudienceStream™ is the market-leading Customer Data Platform, combining robust audience management and data enrichment capabilities resulting in unified customer profiles and the ability to take immediate, relevant action.
8
Spotfire Analytics Logo
Spotfire Analytics
4.2
(363)
Self-service data discovery. Fastest to actionable insight. Collaborative, predictive, event-driven data analysis - free from IT.
9
Teradata Vantage Logo
Teradata Vantage
4.3
(360)
The Teradata Database easily and efficiently handles complex data requirements and simplifies management of the data warehouse environment.
10
Qubole Logo
Qubole
4.0
(259)
Qubole delivers a Self-Service Platform for Big Data Analytics built on Amazon, Microsoft and Google Clouds
Show More
Pricing

Pricing details for this product isn’t currently available. Visit the vendor’s website to learn more.

Product Avatar Image
Cloudera Data Flow
View Alternatives