We use Databricks for experimentation. For example, we do ML model building and training that is connecting to our data which resides in Azure. It offers very good integration with Azure. We've deployed some of our model inference tools in Databricks.
Chief Data Scientist at Ngenux
Effective integration, helpful support, and simple cloud implementation
Pros and Cons
- "Databricks integrates well with other solutions."
- "Databricks doesn't offer the use of Python scripts by itself and is not connected to GitHub repositories or anything similar. This is something that is missing. if they could integrate with Git tools it would be an advantage."
What is our primary use case?
What is most valuable?
Databricks integrates well with other solutions.
What needs improvement?
Databricks doesn't offer the use of Python scripts by itself and is not connected to GitHub repositories or anything similar. This is something that is missing. if they could integrate with Git tools it would be an advantage.
Along with having connections to different databases for Git tools, adding libraries for easy access would be a benefit. As data scientists, we connect to different databases and different sources of data, having a library would be useful.
For how long have I used the solution?
I have been using Databricks for approximately one year.
Buyer's Guide
Databricks
February 2025

Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.
What do I think about the stability of the solution?
The solution is stable. We did not face any downtime.
What do I think about the scalability of the solution?
Databricks is scalable. It operates three times faster than any of the other ecosystems which we have experimented on.
We have approximately five data scientists using this solution in my organization. We are a small company and as we grow, all our data scientists would be using this platform. We plan to increase usage.
How are customer service and support?
The technical support is good. We didn't need a lot of support. There were a few times we needed some help on how to do certain operations.
How was the initial setup?
The installation was straightforward because it is on the cloud. The full deployment took approximately one week.
What about the implementation team?
We did the implementation of Databricks in-house. It only requires one person for the maintenance of the solution.
What other advice do I have?
My advice to others wanting to implement this solution is to use a cloud environment. For example, we are using Azure with Databricks. It is much better than doing an on-premise implementation.
I rate Databricks an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.

Data Science Consultant at Syniti
Good performance, easy to set up, and easy to use if you have a Python background
Pros and Cons
- "I work in the data science field and I found Databricks to be very useful."
- "It would be very helpful if Databricks could integrate with platforms in addition to Azure."
What is our primary use case?
We are building internal tools and custom models for predictive analysis. We are currently building a platform where we can integrate multiple data sources, such as data that is coming from Azure, AWS, or any SQL database. We integrate the data and run our models on top of that.
We primarily use Databricks for data processing and for SQL databases.
What is most valuable?
I found that PySpark is the most useful tool. It uses in-memory calculation and when you want to run a model it does it very quickly. We used to use Python and when we migrated to PySpark the performance was much better.
What needs improvement?
It would be very helpful if Databricks could integrate with platforms in addition to Azure.
Having an open-source version or having the option to get a trial version of Databricks would be very helpful.
It would be very useful for beginners if there were tutorials and examples on how to write code for PySpark, R, or Scala. Having examples would give people something to refer to and play with.
For how long have I used the solution?
We have been using Databricks for the past two or three years.
What do I think about the stability of the solution?
A couple of times I faced an issue where a long-running process was consuming a lot of time and then stopped abruptly. It necessitated starting the process again.
What do I think about the scalability of the solution?
We are in the prototyping stage so we do not plan on increasing our usage yet.
How are customer service and technical support?
We have not been in contact with technical support.
Which solution did I use previously and why did I switch?
Before using Databricks, we were running our own cluster with a web server that executed our Python queries.
How was the initial setup?
The initial setup is straightforward. With respect to deployment, the development can be done within half an hour and we can use code and deploy from there.
What about the implementation team?
We implemented Databricks on our own. We haven't deployed as such, as we are just running our queries and it is not in production yet.
What other advice do I have?
I work in the data science field and I found Databricks to be very useful. If I want to run any models then I can code them in PySpark. If you are coming from a Python background then you can write code in PySpark and it runs quickly. This is a good solution in terms of performance.
I would rate this solution a nine out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
Buyer's Guide
Databricks
February 2025

Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.
Data Platform Architect at a tech services company with 51-200 employees
Provides seamless integration capabilities, but the cluster management features need improvement
Pros and Cons
- "Databricks is a robust solution for big data processing, offering flexibility and powerful features."
- "The product could be improved regarding the delay when switching to higher-performing virtual machines compared to other platforms."
What is our primary use case?
We use the product as a data science platform that enables me to handle and analyze large datasets efficiently.
What is most valuable?
Databricks can switch easily between cloud providers, such as Azure and GCP. It allows seamless integration with various data platforms and cloud providers, facilitating better data handling and analysis.
What needs improvement?
The product could be improved regarding the delay when switching to higher-performing virtual machines compared to other platforms like Snowflake. The ease and speed of managing clusters can also be enhanced, especially when scaling up resources. They could add more advanced data storage solutions like Iceberg and Delta files.
For how long have I used the solution?
I have been using Databricks for approximately two years.
What do I think about the stability of the solution?
I rate the product stability a seven out of ten.
What do I think about the scalability of the solution?
I rate the product scalability an eight.
How are customer service and support?
The technical support services are good.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup was straightforward. However, configuring policies could have been simpler.
What's my experience with pricing, setup cost, and licensing?
The product pricing is moderate.
Which other solutions did I evaluate?
I evaluated other options, including Snowflake, before choosing Databricks.
What other advice do I have?
Databricks is a robust solution for big data processing, offering flexibility and powerful features. While there are areas for improvement, especially in performance and cluster management, it remains a highly valuable tool in my data science toolkit.
I rate it a seven.
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
Last updated: Jul 16, 2024
Flag as inappropriateData engineer
A stable solution that can be scaled depending on the project, but the price could be cheaper
Pros and Cons
- "The setup was straightforward."
- "The pricing of Databricks could be cheaper."
What is our primary use case?
I primarily use the solution in two conditions: machine learning and big data computing.
What needs improvement?
The pricing of Databricks could be cheaper. The solution can also improve by providing more intelligence to the coder.
For how long have I used the solution?
I have been using Databricks for the past two years.
What do I think about the stability of the solution?
The solution is stable. I would rate the stability a seven out of ten.
What do I think about the scalability of the solution?
The scalability depends on the project. At present, around 20 people use the solution in my company.
How are customer service and support?
How was the initial setup?
The setup was straightforward. It also depends on the projects.
What about the implementation team?
The deployment process was automated.
Which other solutions did I evaluate?
Evaluating solutions is not my work. I depend on Databricks.
What other advice do I have?
I rate Databricks a seven out of ten.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Project Manager at MAQ Software
Integrates well, is scalable, and high availability
Pros and Cons
- "The most valuable feature of Databricks is the integration with Microsoft Azure."
- "Databricks can improve by making the documentation better."
What is our primary use case?
I am using Databricks for creating business intelligence solutions.
What is most valuable?
The most valuable feature of Databricks is the integration with Microsoft Azure.
What needs improvement?
Databricks can improve by making the documentation better.
For how long have I used the solution?
I have been using Databricks for approximately one year.
What do I think about the stability of the solution?
Databricks is stable.
What do I think about the scalability of the solution?
The scalability of Databricks is good.
We have approximately 500 users using this solution in my organization.
How are customer service and support?
I have not used the support from Databricks.
Which solution did I use previously and why did I switch?
We previously used Microsoft stacks. We chose Databricks because the processing power was better and it was a better fit for our use case.
How was the initial setup?
The initial setup of Databricks was not straightforward. We had to do trial and error and we learned as we went along.
I rate the initial setup of Databricks a four out of five.
What about the implementation team?
We did the implementation of Databricks in-house. The solution requires ongoing maintenance.
What other advice do I have?
I would recommend this solution to others.
My advice to others is for them to first do a small proof of concept and then see how it works out and then take it from there.
I rate Databricks an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Business Intelligence Coordinator Latam at a construction company with 5,001-10,000 employees
The capacity of use of the different types of coding is valuable
Pros and Cons
- "The capacity of use of the different types of coding is valuable. Databricks also has good performance because it is running in spark extra storage, meaning the performance and the capacity use different kinds of codes."
- "There would also be benefits if more options were available for workers, or the clusters of the two points."
What is our primary use case?
My company is a customer of Databricks. We use Data Science products for machine learning, engineering, and data preparation.
We have between five and eight people working on coding in Databricks. Indirectly, we have 1500 people consuming the data. We have plans to increase the usage of data bricks by 30% next year.
What is most valuable?
The capacity of use of the different types of coding is valuable. Databricks also has good performance because it is running in spark extra storage, meaning the performance and the capacity use different kinds of codes.
What needs improvement?
Databricks does not always have clear updates. Often we find an update in the tool but we are not really sure what has changed. We would appreciate better communication from Databricks. It could be in the form of a friendly warning that talks about the updates.
There would also be benefits if more options were available for workers, or the clusters of the two points.
For how long have I used the solution?
I have been using Databricks for two years.
What do I think about the stability of the solution?
Databricks is stable, however, we do find some errors and don't understand what has happened. Usually, they are resolved within a few minutes. I would say it is 95% stable.
What do I think about the scalability of the solution?
Scalability is really good.
How are customer service and support?
I have not had to contact Databrick's support other than through the deployment, which they helped a lot.
How was the initial setup?
The initial setup of Databricks is straightforward and simple. It is not complex because they provide a lot of documentation. The deployment was fast, it took less than three days with five people assigned to the task.
What about the implementation team?
We implemented in-house. It is difficult to find a good consultant or reseller for Databricks in Brazil.
What's my experience with pricing, setup cost, and licensing?
We pay monthly on a pay as you go plan.
What other advice do I have?
With Databricks, you may have a lot of devices. It is important to use each cluster for each kind of process and then not use the small clusters. Using the bigger cluster you will receive better performance and the use is closer and will save you money.
It is important to code it in parts because if you code it all in full you could find some problems with performance.
I would rate Databricks a 9 out of 10.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Advanced Analytics Lead at a pharma/biotech company with 1,001-5,000 employees
Better tailored code and automation capabilities needed, but easy to use
Pros and Cons
- "The solution is easy to use and has a quick start-up time due to being on the cloud."
- "The solution could improve by providing better automation capabilities. For example, working together with more of a DevOps approach, such as continuous integration."
What is our primary use case?
Databricks can be used for large-scale data pre-processing and data transformations.
What is most valuable?
The solution is easy to use and has a quick start-up time due to being on the cloud.
What needs improvement?
The solution could improve by providing better automation capabilities. For example, working together with more of a DevOps approach, such as continuous integration. There is a lot of code from places, such as GitHub, but it is not tailored for Databricks. It requires a lot of effort to bring the code to a level where it can be used with Databricks capabilities.
For how long have I used the solution?
I have been using Databricks for two months.
What do I think about the stability of the solution?
The solution is stable.
What do I think about the scalability of the solution?
Databricks is scalable.
How are customer service and technical support?
We did not have a need to use technical support.
How was the initial setup?
The installation is straightforward, and it took approximately one hour.
What about the implementation team?
We did the implementation and maintenance of the solution ourselves using approximately three engineers.
What's my experience with pricing, setup cost, and licensing?
The solution requires a subscription.
Which other solutions did I evaluate?
We are evaluating other solutions.
What other advice do I have?
I would recommend this solution for those wanting to process large data sets, but if it is to be used for smaller data sets, I would not recommend it.
I rate Databricks a five out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Head of Data & Analytics at a tech services company with 11-50 employees
Helpful integration with Python and notebooks, but it should be more user-friendly and less complicated to use
Pros and Cons
- "The integration with Python and the notebooks really helps."
- "Databricks is not geared towards the end-user, but rather it is for data engineers or data scientists."
What is our primary use case?
We are a consulting house and we employ solutions based on our customers' needs. We don't generally use products internally.
I am a certified data engineer from Microsoft and have worked on the Azure platform, which is why I have experience with Databricks. Now that Microsoft has launched Synapse, I think that there will be more use cases.
What is most valuable?
You can spin up an Azure Databricks clustered, and integrating with it is seamless.
The integration with Python and the notebooks really helps.
What needs improvement?
There is definitely room for improvement.
This is the type of solution where you need to have people with technical expertise to use it. Other products are self-service and can be employed by end-users. Databricks is not geared towards the end-user, but rather it is for data engineers or data scientists. I'm not sure whether Databricks is working towards it, or not.
It would be nice if it were more user-friendly, where you don't have to rely on Power BI or a visualization tool. I know that there is integration in the notebook where you can do it, but still, the relationships and semantics make it more difficult. It would be better to do it right in Databricks. You could put them within the portal and I don't have to log out and bring that into Power BI and then visualize.
What do I think about the stability of the solution?
We have not done any major implementation yet, although I think it's stable to an extent. I can't comment on it in terms of benchmark and experiencing any issues. It works seamlessly in the places where I've used it.
What do I think about the scalability of the solution?
Our implementations have been small and we haven't needed to scale as of yet.
Databricks can help you to build a data lake, and it's something that they need to help make more popular. People are slowly understanding it because if you look, there are lots of data lakes that people are trying to create. I'm not intimate with it, but the concept seems complicated. I think they need to write up something where videos can explain it better. What I have seen on YouTube is quite complicated for an end-user to understand.
How was the initial setup?
The initial setup is easy. It's not difficult when you are used to Azure.
What's my experience with pricing, setup cost, and licensing?
I am based in South Africa, where it is expensive adapting to the cloud, and then there is the price for the tool itself.
The cost is difficult to estimate. I've got customers who went to the cloud and then they realized that the costs were more, compared to what they used to be on-premises. Also, because our exchange rate is so weak, I would always advocate that prices being lower is better, although I don't know how feasible it is.
What other advice do I have?
From a purely technical perspective, I would rate Databricks and eight out of ten. However, there is a failure in terms of user adoption. After I look at other products, including Synapse, those are better. I still feel that Databricks is quite complicated for the average person.
I would rate this solution a five out of ten.
Disclosure: I am a real user, and this review is based on my own experience and opinions.

Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: February 2025
Popular Comparisons
Teradata
Dremio
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which do you prefer - Databricks or Azure Machine Learning Studio?
- How would you compare Databricks vs Amazon SageMaker?
- Which would you choose - Databricks or Azure Stream Analytics?
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?
- What cloud data warehouse solution do you recommend?