I believe we are using the new version.
Our company makes comprehensive use of the solution to consolidate data and do a certain amount of reporting and analytics. All the data consumers use Databricks to develop the information.
Coordenador Financeiro at Icatu
Good technical support, but is difficult to set up and integrate
Pros and Cons
- "The technical support is good."
- "The initial setup is difficult."
What is our primary use case?
What needs improvement?
Data governance should be addressed. We have some trouble connecting all of our governance solutions with Databricks, which means its integration capabilities are problematic.
The initial setup is difficult.
For how long have I used the solution?
We have been using Databricks for a year-and-a-half.
What do I think about the stability of the solution?
The solution is stable.
Buyer's Guide
Databricks
February 2025

Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.
What do I think about the scalability of the solution?
The solution is scalable.
How are customer service and support?
The technical support is good.
Which solution did I use previously and why did I switch?
As we are talking about a corporate solution, the deployment of Databricks lasted longer than the one day it took for Alteryx.
We used Alteryx before Databricks and continue to do so; it is the only other solution we have employed. We use the two with different software.
How was the initial setup?
The initial setup is difficult.
While I don't know exactly how long the deployment took, I do know that it lasted longer than the one day needed for Alteryx.
What about the implementation team?
I believe we used a partner for the deployment, although I cannot say for certain, as this is not within my purview.
I don't know how many people are needed for maintenance and deployment.
What's my experience with pricing, setup cost, and licensing?
As the licensing is not within my purview, I am not in a position to comment on this.
What other advice do I have?
My company makes use of the solution. It is employed by my data team and the technology one. I do not have personal experience using the solution.
The solution is deployed on base, on data.
I am not aware of how many people make use of it.
I rate Databricks as a seven out of ten.
Which deployment model are you using for this solution?
Private Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.

Practice Head, Data & Analytics at a tech vendor with 10,001+ employees
Key feature is ability to make changes in structure or data size and align for subsequent consumption
Pros and Cons
- "Can cut across the entire ecosystem of open source technology to give an extra level of getting the transformatory process of the data."
- "Implementation of Databricks is still very code heavy."
What is our primary use case?
We have a team that works on Databricks for our clients. We are customers of Databricks.
What is most valuable?
Databricks can cut across the entire ecosystem of open source technology which gives an extra level in terms of getting the transformatory process of the data. The solution is primarily open source and they have bolstered its components to make it more fit for purpose for a complete Azure Data platform. The solution is responsible for the core transformatory activities. While Azure Data Factory is very good for pulling in the data, doing the basic standardization and profiling, Databricks is more about making fundamental changes in structure or in size of the data and aligning it for subsequent consumption, or for the final layer on Synapse. It also has the power to complement and work with Spark and elements related to Python.
What needs improvement?
In my view, the fundamental approach of implementing Databricks is still very code heavy, more than you find in Azure Data Factory and other technologies like Informatica or SQL Server Integration Services. From my perspective, that could be improved. I'd also like to have the ability to facilitate predictive analytics within the solution.
For how long have I used the solution?
I've been using the solution for a year and a half.
What do I think about the stability of the solution?
Stability of the product is good, whether it's handling large volumes, diverse elements of data or processing data at speed. It has stood the test of time. It's a solution that really lends itself to that higher level of stability, versatility and diversity in terms of its capability to process different forms of data.
What's my experience with pricing, setup cost, and licensing?
The cost of the solution is slightly on the high side so it's important to use it efficiently.
What other advice do I have?
Use the solution wisely and in tandem with Azure Data Factory. Keep this in view across the overall design of your pipelines and data flows so that Databricks is used to its full potential. It adds significant and varied transformation and data-tranching capabilities to the Azure data stack. In terms of the license, ensure that the customer is getting what they paid for so that the value for money is realized.
I rate the solution eight out of 10.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Business Intelligence and Analytics Consultant at a tech services company with 201-500 employees
Easy to switch loads between clusters and automation is easy using the API
Pros and Cons
- "Automation with Databricks is very easy when using the API."
- "Some of the error messages that we receive are too vague, saying things like "unknown exception", and these should be improved to make it easier for developers to debug problems."
What is our primary use case?
I am a developer and I do a lot of consulting using Databricks.
We have been primarily using this solution for ETL purposes. We also do some migration of on-premises data to the cloud.
What is most valuable?
The most valuable feature is the ability to switch loads between multiple clusters.
Automation with Databricks is very easy when using the API.
The ability to write code and SQL in the same interface is useful.
It is easy to connect notebooks to a cluster.
There are a large number of inbuilt functions that help to make things easier.
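As a rough illustration of the API automation mentioned above, here is a minimal sketch that builds a cluster-creation request for the Databricks Clusters REST API (`/api/2.0/clusters/create`). The workspace URL, token, cluster name, runtime version, and node type are placeholder assumptions rather than values from this review, and the helper name `build_create_cluster_request` is hypothetical. The request is only constructed, not sent.

```python
import json
import urllib.request


def build_create_cluster_request(host, token, cluster_spec):
    """Build (but do not send) a POST request that would create a cluster."""
    url = f"{host.rstrip('/')}/api/2.0/clusters/create"
    body = json.dumps(cluster_spec).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            # Personal access tokens are passed as a bearer token.
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Placeholder values: replace with your own workspace settings.
spec = {
    "cluster_name": "nightly-etl",        # hypothetical name
    "spark_version": "13.3.x-scala2.12",  # assumed runtime version string
    "node_type_id": "i3.xlarge",          # assumed AWS node type
    "num_workers": 2,
}
request = build_create_cluster_request(
    "https://example.cloud.databricks.com",  # placeholder workspace URL
    "dapi-EXAMPLE",                          # placeholder token
    spec,
)
# Sending is left out on purpose; urllib.request.urlopen(request) would submit it.
```

Keeping the request construction separate from the send step makes it possible to check the payload in a test before anything touches a real workspace.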
What needs improvement?
Some of the error messages that we receive are too vague, saying things like "unknown exception", and these should be improved to make it easier for developers to debug problems. As it is now, we have to go into the driver logs to identify the error messages properly.
There is not much information about Databricks available online, such as cost. Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful.
I would like to see integration with Power BI or Tableau for the business users. They may use Databricks to check on things, but it will be a little bit complicated for them. The GUI interfaces for Tableau and Power BI are ones that they are used to, so the integration would help.
For how long have I used the solution?
I have been using Databricks for about five and a half years.
What do I think about the stability of the solution?
We have found that in the development environment, Databricks is pretty stable. We have had problems where something works in development but does not work in production, and this can happen when the version is updated and certain features have been deprecated. This means that more testing is required before moving to production, but this is the only drawback that we have seen.
Basically, when we move across versions we have found issues, but otherwise, it's pretty stable.
What do I think about the scalability of the solution?
Scalability is one of the main features of Databricks. We have used datasets that are one hundred megabytes in size up to one terabyte, and we can manage, so it's easily scalable.
We have a large company with between 400 and 500 people using this solution.
How are customer service and technical support?
We have not reached out for technical support on Databricks.
How was the initial setup?
I found the initial setup easy because I had previously worked on Spark.
If somebody goes through the training, which is available on the website, then it should be straightforward. I don't think that it is very hard.
When it comes to developing things based on use cases, it can take between three days and two weeks, plus two to three days for testing and deploying it. I would say that for an entire use case, it will take a maximum of three weeks.
What other advice do I have?
My advice for developers who are interested in working with this solution is to first go through the Spark architecture.
I would rate this solution a nine out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Chief Research Officer at a consumer goods company with 1,001-5,000 employees
Ability to work collaboratively without concerns regarding the infrastructure is very beneficial to us
Pros and Cons
- "Ability to work collaboratively without having to worry about the infrastructure."
- "Would be helpful to have additional licensing options."
What is our primary use case?
Our primary use case of Databricks is for advanced analytics. I'm the chief research officer of the company and we're customers of Databricks.
What is most valuable?
I think the features I like the most are the scalability of the solution as well as its ability to share. We work with multiple people on notebooks and it enables us to work collaboratively in an easy way without having to worry about the infrastructure. I think the solution is very intuitive, very easy to use. And that's what you pay for.
What needs improvement?
I'd like to see more licensing options for the solution, with additional pricing tiers available. I understand it's not easy to achieve because it's a kind of platform-as-a-service type of solution. If you could be more specific about the parts, and what you might or might not need, then you could save some money and go for a lower level. Of course, that would then mean you'd have to manage more configurations which, as a user, would make things more complex, but it would be good to have that option. The pricing is not the cheapest, but that's understandable because it's a very high-end solution that is easy to use, with a lot of complexity masked away.
I would like to see additional monitoring tools and, in general, anything that can improve visualization of data. I know it's not the main point of Databricks and there are other tools that can be used, but anything that facilitates the integration of Databricks with visualization tools could be really useful. Increasing data scalability would also be great.
For how long have I used the solution?
I've been using this solution for a year.
What do I think about the stability of the solution?
The solution has been very stable.
What do I think about the scalability of the solution?
Scalability of the solution seems very easy to achieve.
How are customer service and technical support?
We haven't had contact with technical support.
How was the initial setup?
The initial setup was very straightforward because it's also in our Azure cloud, so it was quite easy to set up and configure. Very intuitive.
What other advice do I have?
I would rate this solution an eight out of 10.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Cloud Administrator at a retailer with 5,001-10,000 employees
A simple and stable solution that can help with business engineering
Pros and Cons
- "The solution is very simple and stable."
- "The tool should improve its integration with other products."
What is our primary use case?
We use the solution for business engineering.
What is most valuable?
The solution is very simple and stable.
What needs improvement?
The tool should improve its integration with other products.
For how long have I used the solution?
I have been using the solution for around two years.
What do I think about the stability of the solution?
I would rate the product’s stability a seven out of ten.
What do I think about the scalability of the solution?
I would rate the tool’s scalability a seven out of ten.
How was the initial setup?
The solution is very easy to set up. I would rate its setup a ten out of ten.
What's my experience with pricing, setup cost, and licensing?
I would rate the tool’s pricing an eight out of ten.
What other advice do I have?
The tool’s performance is great. I would rate it an eight out of ten.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Co - Founder & Chief Data Officer -CDO at Data360
Allows us to automate the creation of a cluster, optimized for machine learning, and construct AI machine learning models for the client
Pros and Cons
- "Databricks allows me to automate the creation of a cluster, optimized for machine learning and construct AI machine learning models for the client."
- "There could be more support for automated machine learning in the database. I would like to see more ways to do analysis so that the reporting is more understandable."
What is our primary use case?
I use this for database machine learning, to construct different models for supermarkets, drug store management, and market involvement to identify business opportunities for clients.
We provide different statistical models and use different algorithms depending on the client.
I was a Lead Data Scientist in different companies. I implement data and build and optimize processes using machine learning techniques, aided by science and advanced analytics.
What is most valuable?
Databricks allows me to automate the creation of a cluster, optimized for machine learning and construct AI machine learning models for the client.
What needs improvement?
There could be more support for automated machine learning in the database. I would like to see more ways to do analysis so that the reporting is more understandable.
What do I think about the stability of the solution?
It's stable.
What do I think about the scalability of the solution?
It's scalable.
How are customer service and support?
I would rate technical support 4 out of 5.
How was the initial setup?
Setup isn't difficult. We used about 15 people for deployment and maintenance. We have data scientists and statisticians using this solution and doing different analyses.
What other advice do I have?
I would rate this solution 9 out of 10.
My advice is to use the different high analytics methodology, plan for the project, and recognize the different activities for the design.
Disclosure: PeerSpot contacted the reviewer to collect the review and to validate authenticity. The reviewer was referred by the vendor, but the review is not subject to editing or approval by the vendor. The reviewer's company has a business relationship with this vendor other than being a customer: Partner
Data Science Developer at a tech services company with 501-1,000 employees
Good performance and support for big data, built-in machine learning libraries are powerful
Pros and Cons
- "Databricks is based on a Spark cluster and it is fast. Performance-wise, it is great."
- "It should have more compatible and more advanced visualization and machine learning libraries."
What is our primary use case?
We use this solution for streaming analytics. We use machine learning functions that output to the API and work directly with the database.
How has it helped my organization?
Prior to using Azure Databricks in the cloud, we had Databricks installed in clusters. Since our implementation, the performance has increased and our cost has been reduced.
What is most valuable?
Databricks is based on a Spark cluster and it is fast. Performance-wise, it is great.
This solution has very good machine learning libraries built-in.
The support for big data is good.
What needs improvement?
Databricks should have more libraries for predictive analysis and machine learning.
It should have more compatible and more advanced visualization and machine learning libraries. As it is now, I have to try a custom algorithm in order for things to be compatible.
I would like to see more deep learning analytics.
For how long have I used the solution?
I have been using Databricks for about one year.
What do I think about the stability of the solution?
This is a cluster-based solution, so it is stable.
What do I think about the scalability of the solution?
We started using Databricks with a small PoC application, and then we developed it into a larger one. It's scalable, and it's a simple process to scale.
We have eight people in our team who are using this solution. We do not plan to increase usage at this time.
How are customer service and technical support?
I did not contact technical support myself, but when one of our team members contacted them they were given good answers. I would say that the support is good.
How was the initial setup?
It is not difficult to deploy this solution because it is well documented. We followed the normal steps that included all of the APIs.
What's my experience with pricing, setup cost, and licensing?
I do not know the exact costs, but one of our clients pays between $100 and $200 USD monthly.
What other advice do I have?
Databricks has been good and I like it. However, it would be improved with the enhancement of the machine learning libraries, and with the inclusion of visualization libraries.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Data Architect at a tech services company with 201-500 employees
A reliable solution for processing and transforming data
Pros and Cons
- "The fast data loading process and data storage capabilities are great."
- "There are no direct connectors — they are very limited."
What is our primary use case?
We specialize in project consulting for our clients. Whenever we get the opportunity, we recommend Databricks to them.
What is most valuable?
The fast data loading process and data storage capabilities are great.
Based on the data loads and the performance, you can easily scale up the clusters.
What needs improvement?
Sometimes we experience issues connecting our database to Databricks. There are no direct connectors — they are very limited. This should be addressed and corrected in the next release.
Reading past data can also be tricky as there is no data spectrum like you would find with Snowflake and other solutions.
For how long have I used the solution?
We have been using Databricks for one and a half years.
What do I think about the scalability of the solution?
Both the scalability and the stability of Databricks are good.
How are customer service and technical support?
Technical support is good but I have not interacted with them directly. We have a point of contact. We used to interact with tech support on a regular basis and they would respond quickly. We would get a response on the same day based on the priority level. Keep in mind, my company is in a partnership with them which could be a factor in their quick response time.
How was the initial setup?
The initial setup was not very complex. We had it up and running in no time; it's a quick process.
What about the implementation team?
We have just one solution architect and one data architect who handle all maintenance-related issues.
What other advice do I have?
I would recommend purchasing a package that includes technical support. Compared to other companies, they offer great support to their clients.
On a scale from one to ten, I would give Databricks a rating of eight.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner

Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: February 2025