Kudu is a big improvement for Impala data that needs to be quickly queried ad hoc while still being able to incrementally add to that data.
What do you dislike?
Kudu seems to have limited use in Impala. I have tried using is in Spark but didn't have great results (some of that was my fault). Impala isn't really for streaming data where the speed comes in so I'm not sure Kudu fits my use case.
What problems are you solving with the product? What benefits have you realized?
Streaming data into Kudu to quickly query that data from Impala or Spark. Impala for ad hoc queries and Spark for batch processing of data. We saw significant improvements in Kudu but not that same improvements for Spark so we are not using Kudu for production work loads as ad hoc querying was not as important for us.
* We monitor all Apache Kudu reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. Validated reviews require the user to submit a screenshot of the product containing their user ID, in order to verify a user is an actual user of the product.