Druid Reviews & Product Details

Druid Overview

What is Druid?

Apache Druid is an open source real-time analytics database. Druid combines ideas from OLAP/analytic databases, timeseries databases, and search systems to create a complete real-time analytics solution for real-time data. It includes stream and batch ingestion, column-oriented storage, time-optimized partitioning, native OLAP and search indexing, SQL and REST support, flexible schemas; all with true horizontal scalability on a shared nothing, cloud native architecture that makes it easy to deploy, monitor and manage at scale. It is downloadable for free for unlimited use from druid.apache.org and also hosted in the cloud by Imply Data.

Druid Details
Website
Discussions
Druid Community
Product Description

Open source streaming data store for interactive analytics at scale.


Seller Details
Seller
Druid
Company Website
Year Founded
1998
HQ Location
Rio de Janeiro, Rio de Janeiro
LinkedIn® Page
www.linkedin.com
45 employees on LinkedIn®

Overview Provided by:
Show More
Answer a few questions to help the Druid community
Have you used Druid before?
Yes

Druid Reviews

Write a Review
Filter reviews
LinkedIn®
Connections
Popular Mentions
Showing 29 Druid reviews
Popular Mentions
Showing 29 reviews
Filter Reviews
Filter Reviews
Sort by
Ratings
Company Size
User Role
For Category
All Industries
Region
Already have Druid?
Write a Review
Programmer Analyst
Enterprise(> 1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

Druid is amazingly fast and has built-in connectors for most of the popular datasources .

It supports variety of dashboards which makes druid a perfect choice for any Real Time Streaming Application . Review collected by and hosted on G2.com.

What do you dislike?

Druid natively queries in Json format which is hard to pick up for a SQL user.

Rollover queries are not dynamic . Example - If you want to roll up for a specific time of one day to a specific time of another day , that might not be possible .

Web GUI is also not so user friendly for a business user .

Missing operations friendly cluster manager console.

Druid needs a dedicated server and cannot utilise existing Hadoop resources. Review collected by and hosted on G2.com.

Recommendations to others considering the product:

Druid is a perfect database to power real-time analytic workloads for event-driven data.It is fast, has column-oriented storage and is a time series database . It is just fits fine in any big data stack .

Note - Druid might not be a good choice if you are a heavy dependent on joins .It might slow down the performance Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

We needed a database where we could persist our data from Kafka and could also do some rollup .

It was also required that the database should be fast enough to do aggregations when displaying on the dashboard . Everything had to be in realtime .

Druid was fast and capable enough to acknowledge all the requirements.

Built in connectors save much of the time and effort while integrating with other applications. Review collected by and hosted on G2.com.

Show More
Show Less
Senior Software Engineer
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Druid is best for low latency analytics, as it combines the best qualities of a column store and inverted indexing. With column stores, the druid can minimize I/O costs for analytical queries.

It supports OLTP and OLAP.

Real-Time Aggregation.

Batch & Real-Time Ingestion Review collected by and hosted on G2.com.

What do you dislike?

1. No fault-tolerance on the query execution path. ex: A single query be processed on hundreds of historical nodes — it completely lacks any fault-tolerance on the query execution path.

2. Straggling sub-queries on the historical nodes takes a lot of time.

3. Back filling takes lot of time. But its understandable as to update old segment and update it takes lot of time. I wouldn't consider it as a drawback.

4. As Druid Brokers need to keep the view of the whole cluster in memory , it require significantly more memory and also cause lot lot JVM GC pause.

5. In case of large queries, it saturate the processing capacity of the entire historical layer for up to tens of seconds. Review collected by and hosted on G2.com.

Recommendations to others considering the product:

I would recommend anyone who wants to use the Time Series database for the realtime use case. Druid is best among its peer in TSDB. If your company is into big data analysis need to do drill down. Druid is a great match for the historical data with the medium-size cluster. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

In our company, we are using the Druid as a Time-series Database to query User Related behaviour and perform the analytical queries on it. We have both realtime and batch ingestion usecase. In case of realtime ingestion , it benefits us with Analytics capability and windowing function on realtime data. Our use case if generally require 1 hour rolling window computation , computation on it takes hardly 1 sec. Review collected by and hosted on G2.com.

Show More
Show Less
Software Engineer
Mid-Market(51-1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
Business partner of the seller or seller's competitor, not included in G2 scores.
What do you like best?

The community behind Druid and its docs are great. The scale at which Druid can ingest and query data is impressive. Review collected by and hosted on G2.com.

What do you dislike?

Only recent versions have support for joins between data sources. Some log messages could be more verbose. Review collected by and hosted on G2.com.

Recommendations to others considering the product:

Consider the cardinality of dimensions in your data and how wide your aggregates will be. Druid allows for reindexing of data and schema evolution so it is possible to keep high cardinality dimensions for a short time before removing them. Also consider hardware requirements and who will manage its operation and maintenance. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Powering front ends and reporting for users. We can update customer dashboards in realtime and provide self service access for users to drill down to answer questions. Review collected by and hosted on G2.com.

Show More
Show Less
AM
Enterprise(> 1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

It excellently supports horizontal scalability, The deep storage functionality improves data resilience and makes it easy to add a new node. Since the data is partitioned by time out of the box, time-based queries perform exceedingly well. It can ingest a large amount of data very quickly. It has multiple plugins to suffice your need and it can integrate with many cloud infrastructure out of the box. Review collected by and hosted on G2.com.

What do you dislike?

Need to provide better features to accommodate multi-tenants. Updates to existing data are currently supported by rebuilding the corresponding time segment entirely from the true source, Instead, it should support tenant id based updates. Same-day updates are a little bit tricky and need to iron it out.

One of the places we use it to calculate demographic-based suppression of data and it is slow in that particular scenario. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

We are using it to analyze survey responses and it massively helps us to analyze trends over time. It also made our reports highly interactive and we are able to support more users parallelly. We have ported our reports from the SQL server to the druid and it has considerably reduced the number of lines of code. It is also easier to maintain and make changes to the reports quickly. Review collected by and hosted on G2.com.

Show More
Show Less
AC
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Apache Druid works very well if you need basic aggregations across immutable time series data. It has some really useful approximations such as HyperLogLog for fast cardinality estimations that converge to exact counts for small datasets. It also now supports Druid Sql as a query language which doesn't have the steep learning curve native Druid query language requires. Review collected by and hosted on G2.com.

What do you dislike?

Apache Druid becomes hard to use and very inefficient when your data is 1) updated 2) ingested out of order (based on timestamp) or 3) requires joins. Unfortunately this greatly limits the number of use-cases that Druid readily supports. Tooling can be built around it to support things like out of order ingestion but it makes Druid very inefficient.

Druid also has inherent bottlenecks in its design: each cluster can have only one coordinator and one overlord. We found that this made it impossible to scale a single cluster out to meet our needs. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

We get very low latency query results from Druid in our UI. Prior to implementing Druid we were using MongoDB, which does not perform well for analytic queries. Druid sped up our UI a great deal. Review collected by and hosted on G2.com.

Show More
Show Less
Lead Engineer
Enterprise(> 1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

Easy to use, good documentation, flexible, scaleable. Review collected by and hosted on G2.com.

What do you dislike?

Performance is not always predictable. Ingestion specs can be difficult to create and debug. Review collected by and hosted on G2.com.

Recommendations to others considering the product:

Build a pipeline from data origin through caching in Druid and build some reporting with time and data filtering. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Providing reporting for a very large retail business. We've been able to retire several existing 3rd party systems. Review collected by and hosted on G2.com.

Show More
Show Less
Enterprise(> 1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

Real-time ingestion and querying capability​

Sub-second query performance​

Time Series based datastore​

Slice N Dice support​

Data Compression Review collected by and hosted on G2.com.

What do you dislike?

Inability to support nested data

Partial Join Support

Setup to bring it up for the first time Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Faster Querying Capabilities

Slice and Dice

Stable Datastore setup with minimal maintenance Review collected by and hosted on G2.com.

Show More
Show Less
Technical Lead - Big Data
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

1) Pre-rolled up data into dimension and metrics

2) Lighting fast data/ query result retrieval Review collected by and hosted on G2.com.

What do you dislike?

Managing the broker/cluster if load is high

Limitation in dynamic dimensions Review collected by and hosted on G2.com.

Recommendations to others considering the product:

I definitely recommend to those who want realtime aggregation and derive analytics in realtime Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

We want to replace google analytics due to cost implications. So we are serving real-time analytics dashboard data source as a druid.

The best benefit is that druid keep pre-aggregated rolled up data in dimension and metric form and that can be further queried very fast Review collected by and hosted on G2.com.

Show More
Show Less
Senior Database Administrator
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

it is Column oriented and open source distributed data store .it is awesome in ingesting massive amount of even driven data and provide low latency queries on the data Review collected by and hosted on G2.com.

What do you dislike?

limitations with auto scaling(scale up & scale down of the druid servers on the basis of demand ). Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

it has helped in building cubes and visualizations for the real time data. we have been thinking it as an alternative for BI tools to some extent & exploring further on the same. Review collected by and hosted on G2.com.

Show More
Show Less
UB
Enterprise(> 1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

Druid is very fast to query results and libraries like pydruid help increase the usability Review collected by and hosted on G2.com.

What do you dislike?

The errors are not very intuitive for instance if more than one dimensions have high cardinality and the query times out, error do not hint the same! Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Daily metrics and log data is pipelined to Druid clusters and UI built on the same helps even the non technical users to easily find insights Review collected by and hosted on G2.com.

Show More
Show Less
Specialist Engineer
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Horizontal scalable

Support of Druid Kafka indexer task to ingest data directly from Kafka

Support for schema less datasource Review collected by and hosted on G2.com.

What do you dislike?

Once metadata is corrupted then it's very difficult to recover. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Using it in our IIOT product to store huge volume of sensor data.

By using Druid Kafka indexer task, we are ingesting data directly from Kafka so it's avoiding need of some Kafka source/sink connector. Review collected by and hosted on G2.com.

Show More
Show Less
UC
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

I have hoped on using Druid very early in the day, using it from early 2018 , the potential it unlocks with all the easy to use and inbuilt capabilities of looking at different analytics perspectives is amazing. All the options of flexible filters, approximate algorithms, exact calculations etc makes our life lot simpler. Review collected by and hosted on G2.com.

What do you dislike?

Due to the initial days, we had our challenges in working with Druid ,but is fast evolving and enabling so much more new functionally Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Fast analytics by per-calculated values is the biggest inbuilt benefit we took from Druid Review collected by and hosted on G2.com.

Show More
Show Less
Software developer engineer
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Easy to learn and can ingest & query huge amount of data with fast speed Review collected by and hosted on G2.com.

What do you dislike?

limitation while using multiple joins in complex queries Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Good alternative for visualisation and BI tools. We are looking forward to retire existing BI tools which are currently used as reporting and visualisation. Review collected by and hosted on G2.com.

Show More
Show Less
CC
Small-Business(50 or fewer emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

easy integration with existing framework , good fit for realtime analytics which need to be performant Review collected by and hosted on G2.com.

What do you dislike?

The major drawback of this solution is that with commodity deep storage (Amazon S3) and network, it would make the majority of queries in our use case run for 10 of seconds, instead of current 0 — 3 seconds. I think decoupling of storage and compute is the future including time series databases. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

we were solving realtime aggregation and grouping problem for high volume of data processing Review collected by and hosted on G2.com.

Show More
Show Less
Senior Principal Member of Technical Staff
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Blazing fast query response times, rich integration options Review collected by and hosted on G2.com.

What do you dislike?

Lack of clear documentation on certain functionality Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

I have used druid to power large scale analytics solutions. Because of druid we were able to provide some quick real time insights into business data Review collected by and hosted on G2.com.

Show More
Show Less
Academic Trainer
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Flexible, diverse, fast and offers solutions to a range of problems while developing real-time applications. Review collected by and hosted on G2.com.

What do you dislike?

Requires a quite bulky cluster to even do a basic setup. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Real time analytics at scale. Review collected by and hosted on G2.com.

Show More
Show Less
UC
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

- easy integration with other 3rd party opensource and proprietary software

- easy to setup and maintain

- good community Review collected by and hosted on G2.com.

What do you dislike?

- sometimes data needs to be reindexed if its too large.

- provides approx numbers, and sometimes exact counts are required. However, this is by design. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

- realtime OLAP data analytics

- reporting of business metrics

- powering many different UIs Review collected by and hosted on G2.com.

Show More
Show Less
UO
Small-Business(50 or fewer emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

Easy to maintain and access through restapi and export the data to sql server. Review collected by and hosted on G2.com.

What do you dislike?

Open source and easy to use., so no comments Review collected by and hosted on G2.com.

Recommendations to others considering the product:

Good Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Long running issues in OLTP Review collected by and hosted on G2.com.

Show More
Show Less
Small-Business(50 or fewer emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

It's ability to deploy task using rest api Review collected by and hosted on G2.com.

What do you dislike?

Custom extentions not really configurable Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Data collection from event hub to blob storage and Data processing and decoding using custom extentions Review collected by and hosted on G2.com.

Show More
Show Less
AI
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

The ability to power realtime dashboards well, and ofcourse that its open source (so I can skip messy accounting approvals) Review collected by and hosted on G2.com.

What do you dislike?

Sometimes filtering using HiveQL can cause bugs and unexpected errors to pop up. I have also heard of indexing issues which sometimes occur. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Mostly powering analytical dashboards and ingesting real time streaming data Review collected by and hosted on G2.com.

Show More
Show Less
UM
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Out-of-the-box integration with Kafka, AWS S3, HDFS. Data visibility is quite instantaneous. Review collected by and hosted on G2.com.

What do you dislike?

The ability to modify its configuration could cause a serious threat to the security.

Creation of personalized protocol also would mean that new bugs will be created. So we will need more debuggers. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Assessing and managing the large volumes of data coming in the Advertising domain. Review collected by and hosted on G2.com.

Show More
Show Less
CA
Small-Business(50 or fewer emp.)
Validated Reviewer
Review source: Invitation from G2
Business partner of the seller or seller's competitor, not included in G2 scores.
What do you like best?

Fast and provide support for complex queries Review collected by and hosted on G2.com.

What do you dislike?

I really liked Druid, just faced 1 problem during setup regarding the documentation. Its documentation should be more. Review collected by and hosted on G2.com.

Recommendations to others considering the product:

Sure, definitely recommend to others Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

I have stored time-series telemetry data on Hadoop Cluster. Out of the box integration with cluster and Complex queries. Review collected by and hosted on G2.com.

Show More
Show Less
UR
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Low latency querying, ease of loading data and retrieving data Review collected by and hosted on G2.com.

What do you dislike?

Inefficiency in bulk data extraction, would love to use spark or other big data tools for bulk data extraction and processing from spark Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

We were using druid for time based extraction of transactions through api and also as a data store for dashboards Review collected by and hosted on G2.com.

Show More
Show Less
UC
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Ease of use. Both inserting the data through Kafka or querying, both were pretty intuitive. Review collected by and hosted on G2.com.

What do you dislike?

There were a few production issues in which the data was lost. Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

We're were doing analytics on our payments data. Review collected by and hosted on G2.com.

Show More
Show Less
UF
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Fetching and indexing, easy to adopt and able to fix the error quickly Review collected by and hosted on G2.com.

What do you dislike?

No GUI env for druid directly, scaling to increase the performance Review collected by and hosted on G2.com.

Recommendations to others considering the product:

need more training Review collected by and hosted on G2.com.

What problems are you solving with the product? What benefits have you realized?

Able to access the huge file system records in kafka Review collected by and hosted on G2.com.

Show More
Show Less