It's been two months since this profile received a new review

Druid Reviews & Product Details

Apache Druid is an open source real-time analytics database. Druid combines ideas from OLAP/analytic databases, timeseries databases, and search systems to create a complete real-time analytics solution for real-time data. It includes stream and batch ingestion, column-oriented storage, time-optimized partitioning, native OLAP and search indexing, SQL and REST support, flexible schemas; all with true horizontal scalability on a shared nothing, cloud native architecture that makes it easy to deploy, monitor and manage at scale. It is downloadable for free for unlimited use from druid.apache.org and also hosted in the cloud by Imply Data.

Seller

Druid

Discussions

Druid Community

Overview by

Robert Meyer

Value at a Glance

Averages based on real user reviews.

Perceived Cost

$$$$$

View More Pricing Information

Top-Rated Alternatives

Snowflake

4.6/5

(637)

Google Cloud BigQuery

View All Alternatives

G2 reviews are authentic and verified.

Here's how.

Verified User in Computer Software

Mid-Market (51-1000 emp.)

6/15/2021

"Open-Source distributed OLAP datastore"

4/5

What do you like best about Druid?

1. Pre-aggregate capability which allows to pre-calculate aggregations and save aggregates in segments. Thus, reduces compute and storage costs.

2. Druid UI (0.14+) which has many improvements and allows creating ingestion_specs via UI

3. REST interface for druid_broker for communication, makes it easy to integrate with microservices

4. Druid is a NoSQL DB still it has SQL query support and BAs/Analysts are comfortable using SQL to query Druid

5. Data Security options - Basic HTTP Auth and LDAP supported Review collected by and hosted on G2.com.

What do you dislike about Druid?

1. Complex Architecture - Steep learning curve and has 6 core services which makes deployment & management of Druid cluster complex

2. Memory intensive Historical services - Druid services are quite memory intensive and requires high compute+memory cloud instances.

3. Indexing support - Druid supports only 1 indexing (Inverted Index) which limits the idea of optimizing datasources as per usecase Review collected by and hosted on G2.com.

Shashank N.

Software Engineer III

Enterprise (> 1000 emp.)

1/11/2021

"Druid, Kafka and your favourite Dashboard"

4/5

What do you like best about Druid?

Druid is amazingly fast and has built-in connectors for most of the popular datasources .

It supports variety of dashboards which makes druid a perfect choice for any Real Time Streaming Application . Review collected by and hosted on G2.com.

What do you dislike about Druid?

Druid natively queries in Json format which is hard to pick up for a SQL user.

Rollover queries are not dynamic . Example - If you want to roll up for a specific time of one day to a specific time of another day , that might not be possible .

Web GUI is also not so user friendly for a business user .

Missing operations friendly cluster manager console.

Druid needs a dedicated server and cannot utilise existing Hadoop resources. Review collected by and hosted on G2.com.

Recommendations to others considering Druid:

Druid is a perfect database to power real-time analytic workloads for event-driven data.It is fast, has column-oriented storage and is a time series database . It is just fits fine in any big data stack .

Note - Druid might not be a good choice if you are a heavy dependent on joins .It might slow down the performance Review collected by and hosted on G2.com.

What problems is Druid solving and how is that benefiting you?

We needed a database where we could persist our data from Kafka and could also do some rollup .

It was also required that the database should be fast enough to do aggregations when displaying on the dashboard . Everything had to be in realtime .

Druid was fast and capable enough to acknowledge all the requirements.

Built in connectors save much of the time and effort while integrating with other applications. Review collected by and hosted on G2.com.

Mohan S.

Senior Software Engineer

Mid-Market (51-1000 emp.)

12/14/2020

"Senior Software Engineer"

4/5

What do you like best about Druid?

Druid is best for low latency analytics, as it combines the best qualities of a column store and inverted indexing. With column stores, the druid can minimize I/O costs for analytical queries.

It supports OLTP and OLAP.

Real-Time Aggregation.

Batch & Real-Time Ingestion Review collected by and hosted on G2.com.

What do you dislike about Druid?

1. No fault-tolerance on the query execution path. ex: A single query be processed on hundreds of historical nodes — it completely lacks any fault-tolerance on the query execution path.

2. Straggling sub-queries on the historical nodes takes a lot of time.

3. Back filling takes lot of time. But its understandable as to update old segment and update it takes lot of time. I wouldn't consider it as a drawback.

4. As Druid Brokers need to keep the view of the whole cluster in memory , it require significantly more memory and also cause lot lot JVM GC pause.

5. In case of large queries, it saturate the processing capacity of the entire historical layer for up to tens of seconds. Review collected by and hosted on G2.com.

Recommendations to others considering Druid:

I would recommend anyone who wants to use the Time Series database for the realtime use case. Druid is best among its peer in TSDB. If your company is into big data analysis need to do drill down. Druid is a great match for the historical data with the medium-size cluster. Review collected by and hosted on G2.com.

What problems is Druid solving and how is that benefiting you?

In our company, we are using the Druid as a Time-series Database to query User Related behaviour and perform the analytical queries on it. We have both realtime and batch ingestion usecase. In case of realtime ingestion , it benefits us with Analytics capability and windowing function on realtime data. Our use case if generally require 1 hour rolling window computation , computation on it takes hardly 1 sec. Review collected by and hosted on G2.com.

Duminda Kaviranga G.

Computer Engineer

Small-Business (50 or fewer emp.)

6/17/2021

"It is a good open source scalable big data analytics database suitable for your business ."

5/5

What do you like best about Druid?

It is easy to integrate with other database engines like MySQL. That is called integration features are good! Review collected by and hosted on G2.com.

What do you dislike about Druid?

It will be unable to configure with some of the data analytics platforms like "Metatron." Metatron uses a modified version of druid! Review collected by and hosted on G2.com.

Adam W.

Software Engineer

Mid-Market (51-1000 emp.)

12/1/2020

Business partner of the seller or seller's competitor, not included in G2 scores.

"Apache Druid has provided us with fast access to vast quantities of data."

5/5

What do you like best about Druid?

The community behind Druid and its docs are great. The scale at which Druid can ingest and query data is impressive. Review collected by and hosted on G2.com.

What do you dislike about Druid?

Only recent versions have support for joins between data sources. Some log messages could be more verbose. Review collected by and hosted on G2.com.

Recommendations to others considering Druid:

Consider the cardinality of dimensions in your data and how wide your aggregates will be. Druid allows for reindexing of data and schema evolution so it is possible to keep high cardinality dimensions for a short time before removing them. Also consider hardware requirements and who will manage its operation and maintenance. Review collected by and hosted on G2.com.

What problems is Druid solving and how is that benefiting you?

Powering front ends and reporting for users. We can update customer dashboards in realtime and provide self service access for users to drill down to answer questions. Review collected by and hosted on G2.com.

Verified User in Market Research

Enterprise (> 1000 emp.)

12/8/2020

"It is a good time series database, Best suited for append only data"

4/5

What do you like best about Druid?

It excellently supports horizontal scalability, The deep storage functionality improves data resilience and makes it easy to add a new node. Since the data is partitioned by time out of the box, time-based queries perform exceedingly well. It can ingest a large amount of data very quickly. It has multiple plugins to suffice your need and it can integrate with many cloud infrastructure out of the box. Review collected by and hosted on G2.com.

What do you dislike about Druid?

Need to provide better features to accommodate multi-tenants. Updates to existing data are currently supported by rebuilding the corresponding time segment entirely from the true source, Instead, it should support tenant id based updates. Same-day updates are a little bit tricky and need to iron it out.

One of the places we use it to calculate demographic-based suppression of data and it is slow in that particular scenario. Review collected by and hosted on G2.com.

What problems is Druid solving and how is that benefiting you?

We are using it to analyze survey responses and it massively helps us to analyze trends over time. It also made our reports highly interactive and we are able to support more users parallelly. We have ported our reports from the SQL server to the druid and it has considerably reduced the number of lines of code. It is also easier to maintain and make changes to the reports quickly. Review collected by and hosted on G2.com.

Verified User in Computer & Network Security

Mid-Market (51-1000 emp.)

12/3/2020

"Great analytics database but only for immutable time series data"

3.5/5

What do you like best about Druid?

Apache Druid works very well if you need basic aggregations across immutable time series data. It has some really useful approximations such as HyperLogLog for fast cardinality estimations that converge to exact counts for small datasets. It also now supports Druid Sql as a query language which doesn't have the steep learning curve native Druid query language requires. Review collected by and hosted on G2.com.

What do you dislike about Druid?

Apache Druid becomes hard to use and very inefficient when your data is 1) updated 2) ingested out of order (based on timestamp) or 3) requires joins. Unfortunately this greatly limits the number of use-cases that Druid readily supports. Tooling can be built around it to support things like out of order ingestion but it makes Druid very inefficient.

Druid also has inherent bottlenecks in its design: each cluster can have only one coordinator and one overlord. We found that this made it impossible to scale a single cluster out to meet our needs. Review collected by and hosted on G2.com.

Mike S.

Lead Engineer

Enterprise (> 1000 emp.)

12/2/2020

"Druid Review"

5/5

What do you like best about Druid?

Easy to use, good documentation, flexible, scaleable. Review collected by and hosted on G2.com.

What do you dislike about Druid?

Performance is not always predictable. Ingestion specs can be difficult to create and debug. Review collected by and hosted on G2.com.

Recommendations to others considering Druid:

Build a pipeline from data origin through caching in Druid and build some reporting with time and data filtering. Review collected by and hosted on G2.com.

What problems is Druid solving and how is that benefiting you?

Providing reporting for a very large retail business. We've been able to retire several existing 3rd party systems. Review collected by and hosted on G2.com.

NEERAJ S.

Enterprise (> 1000 emp.)

12/22/2020

"Hot Datastore is the new normal and Druid aptly fulfils the criteria"

4.5/5

What do you like best about Druid?

Real-time ingestion and querying capability

Sub-second query performance

Time Series based datastore

Slice N Dice support

Data Compression Review collected by and hosted on G2.com.

What do you dislike about Druid?

Inability to support nested data

Partial Join Support

Setup to bring it up for the first time Review collected by and hosted on G2.com.

Ashish M.

Technical Lead - Big Data

Enterprise (> 1000 emp.)

12/1/2020

"Work on realtime rollup & analysis overto clickstream data"

4/5

What do you like best about Druid?

1) Pre-rolled up data into dimension and metrics

2) Lighting fast data/ query result retrieval Review collected by and hosted on G2.com.

What do you dislike about Druid?

Managing the broker/cluster if load is high

Limitation in dynamic dimensions Review collected by and hosted on G2.com.

Recommendations to others considering Druid:

I definitely recommend to those who want realtime aggregation and derive analytics in realtime Review collected by and hosted on G2.com.

What problems is Druid solving and how is that benefiting you?

We want to replace google analytics due to cost implications. So we are serving real-time analytics dashboard data source as a druid.

The best benefit is that druid keep pre-aggregated rolled up data in dimension and metric form and that can be further queried very fast Review collected by and hosted on G2.com.