
We are leveraging Kafka of Cloudera Data flow for streaming analytics. CDF provides us real time data which is critical for producing live dashboards and also the amount of data streaming (in petabytes) helps us to have CDF as one stop shop for live data analysis Review collected by and hosted on G2.com.
Kafka of CDF although is scalable however it has a lot of lag problems and needs complex tuning. When the lag occurrs that is the current offset is more than consumer end offset, a lag in 6-7 figures can be seen that means the stale records reaches to around 1 million at times due to which the dashboard waits for latest data and it sometimes takes hours to fetch that and sometimes restart of service is also required to fix that Review collected by and hosted on G2.com.
At G2, we prefer fresh reviews and we like to follow up with reviewers. They may not have updated their review text, but have updated their review.
The reviewer uploaded a screenshot or submitted the review in-app verifying them as current user.
Validated through LinkedIn
This reviewer was offered a nominal gift card as thank you for completing this review.
Invitation from G2. This reviewer was offered a nominal gift card as thank you for completing this review.

