Databricks Reviews & Product Details

Databricks Overview

What is Databricks?

Making big data simple



Seller Details
Seller
Databricks Inc.
Year Founded
2013
HQ Location
San Francisco, CA
Twitter
@databricks
38,916 Twitter followers
LinkedIn® Page
www.linkedin.com
1,704 employees on LinkedIn®


Databricks Reviews

Showing 20 Databricks reviews
VP & Head of Data
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
Business partner of the seller or seller's competitor, not included in G2 scores.
What do you like best?

Incidentally, the thing I like most about Databricks isn't a product feature at all; I love Databricks's proactive and customer-centric service, always willing to make an exception or create a unique feature, all the while minimizing costs for the customer - as @Heather Akuiyibo & Shelby Ferson et al. have done for me and my former teams! Review collected by and hosted on G2.com.

What do you dislike?

Broadening programming logic and syntax.

Recommendations to others considering the product:

Be open to the pitch. You may think things are "going fine" or proffer the idea of "if it ain't broke, don't fix it," but these represent short-term thinking traps such that scaling becomes inherently and implicitly constrained and limited. Databricks amounts to the forward-thinking businessperson.

What problems are you solving with the product? What benefits have you realized?

To name seven (7):

(1) User segmentation using a proprietary variation of a hierarchical DBSCAN clustering algorithm of high-dimensional data with novel distance [quasi] metric, based on hubness analysis;

(2) Leveraging the above in email targeting and invoking multi-armed bandit testing methodologies for email timing, frequency, and content, using decreasing-epsilon strategy;

(3) Modeling predicted underwriting criteria with a binary approval odds classification algorithm;

(4) Using a dynamic panel data, fixed effects model to predict the effect of changes in credit reports on user credit score;

(5) Employing an Autoregressive Integrated Moving Average (ARIMA) with optimized Akaike Information Criterion exploits to predict future revenue and growth (lagged results led to average error bounds of only 5 percent; cross-validation results were even stronger, though I was conservative in guaranteeing 7 percent error, on average);

(6) Refining a multiverse (context-aware) recommendation engine as an n-dimensional tensor (rather than the typical two-dimensional user-item matrix) for partner product recommendations, using High-Order Singular Value Decomposition to solve;

(7) Invoking a Convolutional Neural Network framework with a novel architecture and results of a Fourier Transform as input to classify dental x-rays and highlight to the dentist which teeth require fillings (after approximately two months, the model reached ~95 percent accuracy - in terms of actual agreement by dentists using the app - with F1 score in cross-validation performing on par).

Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Very easy to use. No need to install and set up Spark manually.

Provides a notebook environment to write code.

Supports various languages such as Python, Spark SQL, R, and Scala.

Easy to set up and use.

You can choose the cluster according to your needs.

Supports machine learning flows and streaming data.

Automatically suspends a cluster if it is inactive for more than a given time (cost-cutting).

Auto-scaling clusters.

Optimizes the use of cluster resources.

What do you dislike?

No CI/CD features provided by default.

Costly for small enterprises.

Certification cost is high.

Recommendations to others considering the product:

Splunk is one of the best tools when it comes to big data processing. It is easy to use and set up.

What problems are you solving with the product? What benefits have you realized?

We have to develop pipelines. We get data from different sources such as AWS S3 and Redshift, process that large amount of data on Databricks, and load it back into our data warehouse.

Digital Marketing Specialist
Small-Business(50 or fewer emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

It is great when you have a large amount of data, excellent for collaboration, perfect for use with visualisation tools, and works with many programming languages.

What do you dislike?

Difficult to get a grasp on how many applications and functions it has.

Recommendations to others considering the product:

Use it; it's the best available and it's great!

What problems are you solving with the product? What benefits have you realized?

It's great for ELT of data to use with Power BI.

Data Engineer
Small-Business(50 or fewer emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Interactive clusters, user-friendly, excellent cluster management.

What do you dislike?

Clusters take some time to warm up on start. It should also support upserts without Delta, as businesses need pure upserts too.

Recommendations to others considering the product:

It's the best infrastructure to build pipelines if you are planning to use Spark in production.

What problems are you solving with the product? What benefits have you realized?

Can seamlessly use PySpark and Python to build a robust pipeline.

Data Engineer
Mid-Market(51-1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

The different languages used for implementation.

Great user experience.

Easy to understand and use.

Creation of different resources inside it, such as clusters or databases.

Ease of integration with other software such as Azure services.

Great addition to your expertise if you manage to master it completely.

Integration of Spark with different languages (Python, R, Scala).

What do you dislike?

The documentation inside the portal isn't the best; I find better support outside using search engines.

Recommendations to others considering the product:

Great tool for development when you are looking for fast results, as it uses distributed computing across different clusters.

What problems are you solving with the product? What benefits have you realized?

Currently, data transformation. It provides easy access to databases or blobs, and the ability to use a language such as Python to build the solution you need is great.

Senior Consultant
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from the seller
Business partner of the seller or seller's competitor, not included in G2 scores.
What do you like best?

Databricks is a great analytics tool that provides lightning-fast analytics and has given new abilities to data scientists. Additionally, our advanced analytics at scale has gone up 100 times.

What do you dislike?

The learning curve is steep and people would need coding knowledge to work with Databricks. It can also be costly at times.

What problems are you solving with the product? What benefits have you realized?

Problems - Analytics problems

Benefits - Scale and Speed

Head of Data Science
Small-Business(50 or fewer emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

It's like a Jupyter notebook but a lot more powerful and flexible. You can easily switch from Python to SQL to Scala from one cell to the next. With the Spark framework, you can preview your data processing tasks without having to build large intermediate tables.

What do you dislike?

Need better support when it comes to troubleshooting Spark applications. It shows a lot of information, but gives you little sense of how to apply it.

Recommendations to others considering the product:

It's great if you already understand Spark. Otherwise, Spark has quite a learning curve.

What problems are you solving with the product? What benefits have you realized?

We build a lot of large-scale data processing applications. Previously we used databases, but this is more flexible and powerful (and cheap).

Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

1. Good UI

2. Good integrations with other applications/services.

3. Fast and efficient.

4. Updates are good.

What do you dislike?

1. Sometimes it takes a long time to load the Spark notebook.

2. Sometimes there are issues with interpreter settings while running the notebook.

What problems are you solving with the product? What benefits have you realized?

1. Big data - Analyzing large datasets.

Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

You can sync data from different systems all onto this one platform and everything can be analyzed without switching programs since you can also use many different programming languages and reap the benefits of each such as SQL and Python. This makes it so much easier to work with large datasets. Very nice user interface too!

What do you dislike?

It is very difficult to collaborate on projects using Databricks; this is its biggest downfall and in fact almost outweighs the benefits. I also don't think their customer support is the best; I have had some challenges with that. Otherwise a very good product.

Recommendations to others considering the product:

Keep in mind that you cannot collaborate on products. Technical support is also not the best!

What problems are you solving with the product? What benefits have you realized?

Great way to uncover data insights easily from large datasets.

Mid-Market(51-1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

The system architects data delivery in a very easy-to-use and intuitive way. Non-data-savvy individuals are able to access the insights that data can provide, and the support around the product is second to none!

What do you dislike?

Cost is always a concern when working with a system like this, but if the organization can afford it, the insights and ease of use are worth it.

Recommendations to others considering the product:

Try to ask an individual about their experience with the product and how they used it. The work to implement and create connections is often the most complicated part, not the analysis of the data itself.

What problems are you solving with the product? What benefits have you realized?

Similar to Salesforce, we are able to sync our data from disparate systems and visualize it in a single space.

Lead Data Scientist/Analytics Manager
Investment Banking
Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

The ability to automatically load data from AWS into Databricks for collection and analysis. It does a great job of customizing the notebooks and nodes that can be created.

What do you dislike?

This tool is not optimized for R users. They were supposed to have an update in Q1 for RStudio, but their help team informed me that this was no longer a priority. Even so, heavy AI and machine learning algorithms are not optimized for use here besides the usual Theano and Keras on Python. It's also difficult to run analyses on a large volume of data without sampling, which defeats the purpose.

Recommendations to others considering the product:

If you have simple needs and are not looking to run unsupervised AI on this, then it will work for you. I, however, need to create sophisticated models and cannot do so without constantly running into issues left and right.

What problems are you solving with the product? What benefits have you realized?

Data warehousing, sampling, simple models.

Enterprise(> 1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

It is no surprise that Spark is one of the fastest growing technologies today, and Databricks provides a platform that makes transitioning to Spark easier. I like how there are also tutorials for people who are just beginning to learn, which make onboarding to Spark easier. Love the connection with GitHub, which makes it easy to share projects with the world. Love the ease of pipeline creation in whatever language one is comfortable in.

What do you dislike?

Could give a bigger cluster size for individuals and students so that they can explore it to a greater extent. Also, technical support is not good enough. I also do not like that there is no way to collaborate on a project. Visualization could be better.

What problems are you solving with the product? What benefits have you realized?

We are scaling our marketing solutions using this platform. Would love even more tutorials. Should have video tutorials as well.

Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Overall, since we brought in Databricks, our ability to use data science and advanced analytics at scale has gone up 100 times. Our experience has been awesome, and I know we're not even pushing the bounds of what it can do.

What do you dislike?

Overall Databricks has worked well, though it has taken longer than we anticipated to get it up and running.

Recommendations to others considering the product:

Provides support and solutions that are not available in the open-source version. Good communication.

What problems are you solving with the product? What benefits have you realized?

Frees up data scientists to do data science instead of fighting with cluster management.


Partner Account Lead
Internet
Mid-Market(51-1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Get the data you need right at your fingertips.

What do you dislike?

Data can be hard to pull (you have to write code in SQL) compared with other platforms.

What problems are you solving with the product? What benefits have you realized?

Insights to be pulled from our app

Create pulls for selling stories

Benefits: lots of cuts of data

Mid-Market(51-1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

Databricks is my company's one-stop shop for interacting with our expansive datasets. Databricks has been great so far for navigating our complex storage systems, accessing data, and being able to analyze it without having to switch programs. One of the best features of Databricks is that you can use a variety of languages within the program to complete all steps needed to fully use the data. I like being able to switch seamlessly between Python, Spark, and SQL to work on big data sets.

Additionally, the formatting of the workbook is awesome. You can create new spaces below your original data view in order to perform the analysis.

What do you dislike?

When using Databricks on a cloud-based server it is sometimes difficult to search through the folders and tables to find exactly what you need. I think it would be beneficial if they created an S3 browser to speed up this process.

Recommendations to others considering the product:

When looking for a new program to access and query your data, look no further. Databricks is better than any SQL server I've used and allows you to utilize Python, Scala, and Spark without having to waste any time changing workbooks.

What problems are you solving with the product? What benefits have you realized?

Databricks is helping our analytics group to sift through our mountains of data in order to create new and innovative products that paint a picture of our clients' customers (and beyond). Personally, I think this program saves a lot of time compared with having to work on data in silos based on the language you are working with.

Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

I love how accurate and quick Databricks is. Once I started working with Databricks, I couldn’t fathom doing data analysis and comparisons without it.

What do you dislike?

The software is not the cheapest on the market, and that diverts funds that could go elsewhere throughout the hospital. However, Databricks continues to be a great product.

What problems are you solving with the product? What benefits have you realized?

Multiple forms of data analysis along with demographics and data comparison implementations.

Small-Business(50 or fewer emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Software was great and easy. It was fun to use.

What do you dislike?

Nothing at all. It was understandable and fun.

Recommendations to others considering the product:

Yes

What problems are you solving with the product? What benefits have you realized?

To simplify work

Mid-Market(51-1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

Databricks is a great tool to integrate queries from MySQL, Redshift, Python, and Adobe Clickstream data, and the queries run pretty fast too.

What do you dislike?

It takes some coding knowledge to set up and a good Data Engineering team.

What problems are you solving with the product? What benefits have you realized?

It allows me to pull reports in one place.

Enterprise(> 1000 emp.)
Validated Reviewer
Review source: Invitation from G2
What do you like best?

Helpful online resources. Easy to get started.

What do you dislike?

Not enough documentation. Not enough examples.

Recommendations to others considering the product:

Good idea

What problems are you solving with the product? What benefits have you realized?

Predictive modeling. Easy to spin up models.

Enterprise(> 1000 emp.)
Validated Reviewer
Verified Current User
Review source: Invitation from G2
What do you like best?

One of the best features on the platform is the ability to use a notebook environment and attach notebooks to different Spark interpreters. I do like the user interface and the easy access to browsing files stored on the cluster.

What do you dislike?

Controls are not really developed, and it is hard to optimize runtimes. Databricks is expensive and we cannot say it is the best for price/value.

Recommendations to others considering the product:

Try to avoid Databricks proprietary special operators as they will not work outside of this environment.

What problems are you solving with the product? What benefits have you realized?

Tracking performance of millions of loans. We get responses much faster than without using big data tools. Some of the queries we run previously took days, while now they take only minutes.
