
Incidentally, the thing I like most about Databricks isn't a product feature at all; I love Databricks's proactive and customer-centric service, always willing to make an exception or create a unique feature, all the while minimizing costs for the customer - as @Heather Akuiyibo & Shelby Ferson et al. have done for me and my former teams! Review collected by and hosted on G2.com.
Broadening programming logic and syntax. Review collected by and hosted on G2.com.
Very easy to use. No need to install and setup spark manually.
provides a notebook environment to write code.
support various languages like Python, Spark-SQL, R, Scala, etc.
easy to set up and use.
you can choose the cluster according to your need.
Support Machine Learning flows and Streaming Data.
Automatic suspend cluster if inactive for more than a given time( Cost-cutting)
Auto scalable Cluster.
Optimize uses of clusters (resources) Review collected by and hosted on G2.com.
No CI/ CD features given by default.
Costly for small level Enterprise.
Certification cost is high. Review collected by and hosted on G2.com.
It is great when you have large amount of data, excellent for collaboration, perfect for using with visualisation tools and functions with many programming languages. Review collected by and hosted on G2.com.
Difficult to get a grasp on how many applications and funcrions it has. Review collected by and hosted on G2.com.
Interactive clusters, user friendly, excellent cluster management Review collected by and hosted on G2.com.
Cluster takes some time to heat up on start, should support upsert without delta as business need pure upserts too Review collected by and hosted on G2.com.
The different languages used for implementation.
Great user experience.
Easy to understand and use.
Creation of different tools inside such as clusters or database.
Ease of integration with other software such as azure services.
Great addition to your expertise if you manage to master it completely.
Integration of spark with the different languages.(Python, R, Scala) Review collected by and hosted on G2.com.
The documentation inside the portal isn't the best, find better support outside with search engines. Review collected by and hosted on G2.com.
DataBricks is a great analytics tool which provides lightening speed analytics and has given new abilities to Data Scientists. Additionally, our advanced analytics at scale has gone up 100 times. Review collected by and hosted on G2.com.
The learning curve is steep and people would need coding knowledge to work with Databricks. It can also be costly at times. Review collected by and hosted on G2.com.
It's like a Jupyter notebook but a lot more powerful and flexible. You can easily switch from Python to SQL to Scala from one cell to the next. With the Spark framework, you can preview your data processing tasks without having to build large intermediate tables. Review collected by and hosted on G2.com.
Need better support when it comes to troubleshooting spark applications. It shows a lot of information, but gives you little sense of how to apply it Review collected by and hosted on G2.com.
1. Good UI
2. Good integrations with other applications/services.
3. Faster and efficient.
4. Updates are good. Review collected by and hosted on G2.com.
1. Sometimes it take much time to load the Spark notebook.
2. Sometimes having issues with interpreter settings while running the notebook. Review collected by and hosted on G2.com.
You can sync data from different systems all onto this one platform and everything can be analyzed without switching programs since you can also use many different programming languages and reap the benefits of each such as SQL and Python. This makes it so much easier to work with large datasets. Very nice user interface too! Review collected by and hosted on G2.com.
Very difficult to collaborate on projects using Databricks, it is its biggest downfall and in fact just almost outweighs the benefits. I also don't think their customer support is the best, have had some challenges with that. Otherwise a very good product. Review collected by and hosted on G2.com.
The system architects data delivery in a very easy to use and intuitive way. Non-data-savvy individuals are able to access the insights that data can provide and the support around the product is 2nd to none! Review collected by and hosted on G2.com.
Cost is always a concern when working with a system like this, but if the organization can afford it the insights and ease of use are worth it Review collected by and hosted on G2.com.
The ability to automatically load data from aws into databricks for collection, and analysis. It does a great job of customizing the notebooks and nodes that can be created. Review collected by and hosted on G2.com.
This tool is not optimized for R users. They were supposed to have an update Q1 for R studio but their help team informed me that this was no longer a priority. Even so, heavy AI and machine learning algorithms are not optimized for use here besides the usual theano and keras on python. Its also difficult to run analyses on a large volume of data without sampling which defeats the purpose. Review collected by and hosted on G2.com.
It is no surprise that Spark is one of the fastest growing technologies today and databricks provides a platform that makes transitioning to Spark easier. I like how there are also tutorials for people who are just beginning to learn make the onboarding to Spark easier. Love the connection with Github so there is the ease of sharing the projects with the world. Love the ease of pipeline creation in whatever language that one is comfortable in. Review collected by and hosted on G2.com.
Could give a bigger size of the cluster for individuals and students so that they can explore it to a bigger extent. Also, technical support is not good enough. I also do not like there is no way you can collaborate on a project. Visualization could be better. Review collected by and hosted on G2.com.
Overall, since we brought in DataBricks, our ability to use DataScience and advacned analytics at scale has gone up 100 times. Our experience has been awesome, and I know we're not even pushing the bounds of what it can do Review collected by and hosted on G2.com.
Overall Databricks has worked well, though it has taken longer than we anticipated to get it up and running. Review collected by and hosted on G2.com.
Databricks is my company's one stop shop for interacting with our expansive datasets. Databricks has been great so far for navigating our complex storage systems, accessing data, and being able to analyze it without having to switch programs. One of the best features of Databricks is that you can use a variety of languages within the program to complete all steps needed to fully use the data. I like being able to switch seamlessly between python, spark, and sql to work on big data sets.
Additionally, the formatting of the workbook is awesome. You can create new spaces below your original data view in order to perform the analysis. Review collected by and hosted on G2.com.
When using Databricks on a cloud-based server it is sometimes difficult to search through the folders and tables to find exactly what you need. I think it would be beneficial if they created an S3 browser to speed up this process. Review collected by and hosted on G2.com.
I love how accurate and quick databricks is. Once I started working with databricks, I couldn’t fathom doing data analyzation and comparisons without it. Review collected by and hosted on G2.com.
The software is not the cheapest on the market and that detracts funds that could go elsewhere throughout the hospital. However, databricks continues to be a great product. Review collected by and hosted on G2.com.
Databricks is a great tool to integrate queries from MySQL, Redshift, Python, and Adobe Clickstream data and the queries run pretty fast too. Review collected by and hosted on G2.com.
It takes some coding knowledge to set up and a good Data Engineering team. Review collected by and hosted on G2.com.
One of the best features on the platform is the ability to use a notebook environment and attach them to different Spark interpreters. I do like the user interface and the easy access to browsing files stored on the cluster. Review collected by and hosted on G2.com.
Controls are not really developed, it is hard to optimize runtimes. Databricks is expensive and we can not say it is the best for price/value. Review collected by and hosted on G2.com.