We use it for data analysis and testing of high volume web user behavioral data.
Sr. Data Quality Analyst at a computer software company with 11-50 employees
Can use different technologies to do data analysis and can quickly get data
Pros and Cons
- "Databricks makes it really easy to use a number of technologies to do data analysis. In terms of languages, we can use Scala, Python, and SQL. Databricks enables you to run very large queries, at a massive scale, within really good timeframes."
- "Databricks has added some alerts and query functionality into their SQL persona, but the whole SQL persona, which is like a role, needs a lot of development. The alerts are not very flexible, and the query interface itself is not as polished as the notebook interface that is used through the data science and machine learning persona. It is clunky at present."
What is our primary use case?
What is most valuable?
Databricks makes it really easy to use a number of technologies to do data analysis. In terms of languages, we can use Scala, Python, and SQL. Databricks enables you to run very large queries, at a massive scale, within really good timeframes.
I'm starting to build a solution using Delta Live Tables and Delta Live pipelines, and it is proving to be exceptionally easy to use. I have also been able to quickly implement a pipeline.
What needs improvement?
Databricks has added some alerts and query functionality into their SQL persona, but the whole SQL persona, which is like a role, needs a lot of development. The alerts are not very flexible, and the query interface itself is not as polished as the notebook interface that is used through the data science and machine learning persona. It is clunky at present.
For how long have I used the solution?
I've been using Databricks for a year.
Buyer's Guide
Databricks
January 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,565 professionals have used our research since 2012.
What do I think about the stability of the solution?
It is a stable and reliable solution. I'd rate stability at eight out of ten.
What do I think about the scalability of the solution?
Databricks is absolutely scalable, and I'd rate scalability at eight out of ten. We probably have between 60 and 100 users in our organization, and we hope to increase usage in the future.
How are customer service and support?
The technical support staff we have worked with have been amazing. They helped us initially with our Delta Live pipelines. I would give them a rating of ten out of ten.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I have previously worked with Apache Hadoop, and Databricks is definitely a better product. It's much easier to get data quickly in Databricks. As a result, a lot of the drudgery is taken away. Whereas with Hadoop, it's a bit more tricky to get data together.
What's my experience with pricing, setup cost, and licensing?
We're charged on what the data throughput is and also what the compute time is.
What other advice do I have?
I'd strongly recommend giving Databricks a try. We have found it to be a fantastic tool that has accelerated some of our solutions. We're an AI-heavy shop, and there are a lot of data scientists using the MLflow capabilities. I hear a lot of good things from that side as well. From a data analysis point of view, Databricks has been fantastic, and I would rate it at eight on a scale from one to ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Executive Manager at a computer software company with 10,001+ employees
Excellent data transformation but data-serving performance could be better
Pros and Cons
- "Databricks' most valuable feature is the data transformation through PySpark."
- "Databricks' performance when serving the data to an analytics tool isn't as good as Snowflake's."
What is our primary use case?
We mainly use Databricks to process ingest and do the ELT processes of data to get it ready for analytics and to serve the data to ThoughtSpot, which calls queries and Databricks to get the data.
How has it helped my organization?
We didn't have any good tooling for ELT processing prior to Databricks. We were using Microsoft HD Insight, but it was taking too long to process the data. When we changed our data-processing ELT processes over to Databricks, the amount of time to process the data was reduced to a fraction of what HD Insight used, so we were able to run jobs much faster.
What is most valuable?
Databricks' most valuable feature is the data transformation through PySpark.
What needs improvement?
Databricks' performance when serving the data to an analytics tool isn't as good as Snowflake's. In the next release, Databricks should include a better data-sharing platform to facilitate data sharing between companies.
For how long have I used the solution?
I've been using Databricks for three years.
What do I think about the stability of the solution?
Databricks' stability has been great, and I would rate it eight out of ten.
What do I think about the scalability of the solution?
Databricks is very scalable because it's very easy to spin up multiple clusters, but the cost of doing that is tremendous. I'd rate its scalability nine out of ten, but you'll pay for it.
How are customer service and support?
The technical support has been really bad, but that's because we don't have a direct agreement with Databricks.
How would you rate customer service and support?
Neutral
Which solution did I use previously and why did I switch?
I previously used HD Insight from Microsoft, but it took many, many hours to process data, so we switched to Databricks.
How was the initial setup?
The initial setup was pretty complex and required three people.
What about the implementation team?
We used an in-house team with some consulting help.
What was our ROI?
We've had a low ROI from Databricks.
What's my experience with pricing, setup cost, and licensing?
I would rate Databricks' pricing seven out of ten.
What other advice do I have?
I would advise anyone thinking of implementing Databricks to know their use case. For example, if you're looking for a big data repository to query data and do ELT processing, I recommend looking at other platforms, like Snowflake. However, if you're going to do AI and machine learning, then Databricks is probably stronger in that area. Overall, I would rate Databricks seven out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Databricks
January 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,565 professionals have used our research since 2012.
Senior Software Engineer at a computer software company with 201-500 employees
Valuable data analysis and engineering features with an easy setup
Pros and Cons
- "The setup is quite easy."
- "Can be improved by including drag-and-drop features."
What is our primary use case?
Our primary use case for the solution is data analysis by providing a Spark cluster environment with a driver to analyze a huge amount of data and gigabytes of data and can create Notebooks in Databricks. We can write SQL commands, Python code, Scala, or Spark with Python. With Databricks, we get a cluster hosted in the public cloud and we adjust it based on how much we use it.
What is most valuable?
The most valuable features are data engineering and data science because we can create Notebooks on them. We can use any Python library to build data science models, or we can use libraries like Seaborn or Matplotlib to create charts based on data for data analysis. It is a really valuable capability.
What needs improvement?
Microsoft Azure has its learning environment on the Microsoft website. We can complete certifications, but the Databricks certification is more expensive than Microsoft. It costs between $2,000 and $2,500, and the knowledge is linked. They're also charged based on whether a person doesn't want to analyze large amounts of data. Hence, we want to have the capacity for free student users so that people can learn and build their professional skills.
For how long have I used the solution?
We have been using the solution for approximately one year.
What do I think about the stability of the solution?
The solution is stable. Microsoft offers a public service, and we can get it from the Databricks website. Additionally, many companies use it to analyze their data or create a Spark cluster to run Python or SQL scripts based on their data. I rate the stability a nine out of ten.
How was the initial setup?
The setup is quite easy, and Databricks has also partnered with Microsoft, so we get this service on Microsoft Azure.
What was our ROI?
We have seen a return on investment.
What's my experience with pricing, setup cost, and licensing?
We have a pay-as-you-go subscription and pay for it based on our usage.
Which other solutions did I evaluate?
We chose this solution because my company uses Microsoft Azure for a project, and my role as a data engineer primarily focuses on data-related services. For storing data, we use Data Lake; similarly, for the data processing engine, we use Spark, which Databricks provides.
What other advice do I have?
I rate the solution an eight out of ten. The solution is good but can be improved by including drag-and-drop features because it can be helpful for users who are unfamiliar with coding. I advise new users to have prior experience with Python or SQL before utilizing this solution if they use it for data science or model building.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Data Analyst at a manufacturing company with 1-10 employees
Fast and does what it needs to but customer service should be improved upon
Pros and Cons
- "It is fast, it's scalable, and it does the job it needs to do."
- "I would like to see the integration between Databricks and MLflow improved. It is quite hard to train multiple models in parallel in the distributed fashions. You hit rate limits on the clients very fast."
What needs improvement?
I would like to see the integration between Databricks and MLflow improved. It is quite hard to train multiple models in parallel in the distributed fashions. You hit rate limits on the clients very fast.
For how long have I used the solution?
I have been using Databricks for three years.
What do I think about the stability of the solution?
I would rate the stability of this solution a nine out of 10, with one being not stable and 10 being very stable.
What do I think about the scalability of the solution?
I would rate the scalability of this solution an eight out of 10, with one being not scalable and 10 being very scalable.
There are three people using this solution in our organization.
How are customer service and support?
I would rate the available customer service a three. It's worth mentioning that this is Microsoft and not Databricks itself. I haven't spoken to Databricks people directly, but I know the people who have and they have been a lot more pleased.
How would you rate customer service and support?
Negative
What's my experience with pricing, setup cost, and licensing?
I would rate their pricing plan a six (on a scale of one to 10, with one being cheap and 10 being expensive). I think the prices could be lowered a little bit.
What other advice do I have?
Overall, I would rate this solution an eight out of 10, with one being quite poor and 10 being excellent. It is fast, it's scalable, and it does the job it needs to do.
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Vice President - Data Engineering and Analytics at a financial services firm with 10,001+ employees
A good, but expensive, web-based platform for automated cluster management with some coding limitations
Pros and Cons
- "We like that this solution can handle a wide variety and velocity of data engineering, either in batch mode or real-time."
- "This solution only supports queries in SQL and Python, which is a bit limiting."
What is our primary use case?
We use this solution for advanced civilization power.
What is most valuable?
We like that this solution can handle a wide variety and velocity of data engineering, either in batch mode or real-time.
This product allows us to write the email models in a way that allows us to take the advantage of the parallel scaling computer window backend on any of the satellite services.
What needs improvement?
This solution only supports queries in SQL and Python, which is a bit limiting.
This is a fairly expensive solution for any service outside of the basic package, and costs can add up quite quickly if there are large scaling requirements.
What do I think about the stability of the solution?
This is a stable solution in our experience.
What do I think about the scalability of the solution?
We have found that part of the beauty of this platform is that it is easy to scale and expand.
How are customer service and support?
The support for this product uses Microsoft as a middle man, and due to this there have been times when we experienced communication delays, as well as misunderstandings of what our issues are.
How would you rate customer service and support?
Neutral
How was the initial setup?
The initial setup for this solution is very simple.
What's my experience with pricing, setup cost, and licensing?
The basic version of this solution is now open-source, so there are no license costs involved. However, there is a charge for any advanced functionality and this can be quite expensive.
Which other solutions did I evaluate?
We looked at both Snowflake and BigQuery as a comparison with this solution. We choose this product as it offered more scalability and a higher level of security, which is extremely important in our banking environment.
What other advice do I have?
We would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Head of Business Integration and Architecture at a logistics company with 51-200 employees
Highly scalable data platform that offers exceptional performance and value data types unique to this solution
Pros and Cons
- "The Delta Lake data type has been the most useful part of this solution. Delta Lake is an opensource data type and it was implemented and invented by Databricks."
- "The data visualization for this solution could be improved. They have started to roll out a data visualization tool inside Databricks but it is in the early stages. It's not comparable to a solution like Power BI, Luca, or Tableau."
What is our primary use case?
We use this solution for the Customer Data Platform(CDP). My company works in the MarTech space and usually we implement custom CDP.
What is most valuable?
The Delta Lake data type has been the most useful part of this solution. Delta Lake is an opensource data type and it was implemented and invented by Databricks. It is the most important element of the solution. Databricks also offers exceptional performance and scalability.
What needs improvement?
The data visualization for this solution could be improved. They have started to roll out a data visualization tool inside Databricks but it is in the early stages. It's not comparable to a solution like Power BI, Luca, or Tableau.
In a future release, we would like to have a better ETL designer tool to assist in the way we move data from one place to another.
For how long have I used the solution?
We have been using this solution for four years.
What do I think about the stability of the solution?
This is a stable solution.
What do I think about the scalability of the solution?
This is a scalable solution.
How was the initial setup?
The initial setup is very easy. It is a managed solution inside Azure so you just need to search for Databricks. There are a couple of pages to follow in the setup wizard and Databricks is up and running.
What's my experience with pricing, setup cost, and licensing?
We implement this solution on behalf of our customers who have their own Azure subscription and they pay for Databricks themselves. The pricing is more expensive if you have large volumes of data.
Which other solutions did I evaluate?
When we first started using Databricks in 2018, there were not many comarable solutions to consider. Right now there are many solutions to consider including Snowflake, Azure Synapse, Redshift and BigQuery.
Databricks continues to be our solution of choice but Snowflake does have a better user interface and is easier to work with the data pipelines and with the overall UI.
What other advice do I have?
I would advise others to first define a strong data strategy and then choose which data platform suits your needs.
I would rate this solution a nine out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Lead Data Scientist at a manufacturing company with 10,001+ employees
A great solution that has allowed for collaboration within our organization
Pros and Cons
- "We have the ability to scale, collaborate and do machine learning."
- "The product cannot be integrated with a popular coding IDE."
What is our primary use case?
Our primary use case for this solution is research for data scientists. The solution is deployed on cloud.
How has it helped my organization?
It has allowed our data engineers, data scientists, and analysts to collaborate and work on the same platform.
What is most valuable?
We have the ability to scale, collaborate and do machine learning.
What needs improvement?
The product cannot be integrated with a popular coding IDE.
For how long have I used the solution?
We have been using this solution for approximately three years.
What do I think about the stability of the solution?
The solution is stable.
What do I think about the scalability of the solution?
The solution is scalable. There are five people using it in our organization.
How are customer service and support?
I rate my experience with customer service and support an eight out of ten.
Which solution did I use previously and why did I switch?
We previously used H2O.
How was the initial setup?
The initial setup was straightforward.
What about the implementation team?
Implementation was done in-house.
What was our ROI?
We have seen a return on investments.
What's my experience with pricing, setup cost, and licensing?
Licensing costs are charged on a yearly basis and costs between 25,000 and 30,000.
Which other solutions did I evaluate?
We evaluated other options but this solution was the best fit for what we required.
What other advice do I have?
I rate this solution nine out of ten. The solution is good but can be improved by integrating with a popular coding IDE.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Vice President at a tech services company with 51-200 employees
Very easy to use and requires minimal coding and customizations
Pros and Cons
- "Easy to use and requires minimal coding and customizations."
- "Doesn't provide a lot of credits or trial options."
What is our primary use case?
Our primary use case of this product is for our customers who are running large systems and looking for an API -- a quick, easy integration with their own system. We use Databricks to create a secure API interface. I'm vice president of data science and we are customers of Databricks.
What is most valuable?
Databricks is quite easy to use and requires less coding and customizations than a solution like AWS SageMaker which I'd previously used on a lot of projects. Databricks enables more people to efficiently build and host their ML code. Another great aspect is that MLflow is already integrated with Databricks which makes a big difference. It enables us to track and monitor all our different experiments. We have mostly used the MLflow part and generic notebooks with the ML building machine learning model, as well as using Pytorch for some of our medical imaging. We were able to quickly deploy both these features without requiring anything extra.
What needs improvement?
I'm struggling a little because I wanted to do some POC solutions. I present a lot of projects in various forums and seminars and there aren't a lot of credits and trial options with Databricks. Even if we want to explore, we're not able to and that's a challenge. The solution is quite expensive.
For how long have I used the solution?
I've been using this solution for a year.
What do I think about the stability of the solution?
It's currently stable although we have not yet tested it with a huge volume of data. We'll focus on the performance and model serving capability in the near future. We're still carrying out performance testing, developing the models and figuring out the infrastructure.
What do I think about the scalability of the solution?
Scalability is quite good because we just used 128 GB of resources. It's quite easy to scale.
How was the initial setup?
It was relatively simple, we didn't face any challenges. Deployment takes around two days.
Which other solutions did I evaluate?
We did a PSU in Azure ML Studio which is quite a good solution, easy to deploy and use. It's almost a no-code platform. We've also found Azure ML Studio to be quite cost-effective.
What other advice do I have?
I would recommend trying Databricks because it's cloud agnostic. A lot of customers currently use Azure but want to build something on their own down the track. Databricks makes that easy with its integration with other cloud customers. If somebody wants to build something on their infrastructure or their own virtual cloud, this is a good platform.
I rate the solution eight out of 10 because of the issue I'm having with a lack of trial options.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: January 2026
Product Categories
Cloud Data Warehouse Data Science Platforms Data Management Platforms (DMP) Streaming AnalyticsPopular Comparisons
Azure Data Factory
Teradata
Palantir Foundry
Snowflake
KNIME Business Hub
Qlik Talend Cloud
IBM SPSS Statistics
Amazon SageMaker
Dataiku
Alteryx
Microsoft Azure Machine Learning Studio
Confluent
Coralogix
Altair RapidMiner
Apache Kafka
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which do you prefer - Databricks or Azure Machine Learning Studio?
- How would you compare Databricks vs Amazon SageMaker?
- Which would you choose - Databricks or Azure Stream Analytics?
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?
- What cloud data warehouse solution do you recommend?





















