What is our primary use case?
We are using this solution to run large analytics queries and to prepare datasets for machine learning with SparkML, working in PySpark.
We ran multiple clusters configured for a minimum of three and a maximum of nine nodes, each with 16 GB of RAM.
For one ad hoc requirement, a 32-node cluster was required.
Databricks clusters were set to autoscale and to time out after forty minutes of inactivity. Multiple users attached their notebooks to a shared cluster; when a workload required different libraries, a dedicated cluster was spun up for that user.
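For reference, the autoscaling and idle-timeout behaviour described above maps onto a Databricks cluster specification along these lines (a sketch in the Clusters API payload format; the cluster name, node type, and Spark version are placeholders, not our actual configuration):

```json
{
  "cluster_name": "analytics-shared",
  "spark_version": "7.3.x-scala2.12",
  "node_type_id": "Standard_DS3_v2",
  "autoscale": {
    "min_workers": 3,
    "max_workers": 9
  },
  "autotermination_minutes": 40
}
```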
How has it helped my organization?
Databricks took care of all the underlying cluster management seamlessly. We could configure our clusters to run and deliver results without any delays due to hardware configuration or installation issues.
Databricks allowed us to go from non-existent insights (because the datasets were just too large) to immediate and rich insights once the datasets were ingested into our PySpark notebooks.
What is most valuable?
The most valuable aspect is the immense ease of running very large-scale analytics, through a convenient and slick UI. This saved us from tweaking and tuning clusters, diving into lower-level abstractions, getting involved in hardware procurement, and waiting for other workloads to finish.
The built-in optimization recommendations halved the run time of our queries and allowed us to reach decision points and deliver insights very quickly.
The Delta data format proved excellent. Databricks had already done the heavy lifting of optimizing the format for large-scale interactive querying, which saved us a lot of time.
What needs improvement?
The product could be improved by expanding its visualization capabilities, which currently only assist development inside the notebook environment. Perhaps a few connectors that auto-deploy to a reporting server?
More parallelized Machine Learning libraries would be excellent for predictive analytics algorithms.
For how long have I used the solution?
I have been using this solution for three years.
What do I think about the stability of the solution?
This solution is stable and proved very robust. The clusters only required restarting when obvious programming recommendations were not followed, causing memory overruns on the driver.
What do I think about the scalability of the solution?
It is seamlessly and massively scalable, limited only by budget. The product itself also offers real-time efficiency and optimization recommendations.
How are customer service and support?
So brilliant, it was never required. Their documentation is comprehensive, clear, and simple.
Which solution did I use previously and why did I switch?
Previously, I used Hive and Livy in Zeppelin on an in-house Hadoop installation. Queries constantly threw exceptions and timed out, and the configuration changes needed to address this proved time-consuming and problematic. Databricks, on the other hand, simply made all those problems vanish.
How was the initial setup?
Setup and support are effectively single-click.
What about the implementation team?
We used an in-house team for implementation.
What was our ROI?
Our ROI was on the order of USD 75k per year for one deployment. We were able to switch our workloads from an on-site Hadoop cluster, billed to our department at more than USD 100k per year, to a Databricks workspace in the cloud at a quarter of that expenditure.
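The arithmetic behind that figure is straightforward (using the on-site bill of USD 100k per year as a lower bound):

```python
# Annual cost comparison from the figures above (USD).
onsite_hadoop_cost = 100_000                # billed to the department per year
databricks_cost = onsite_hadoop_cost / 4    # "a quarter of that expenditure"
annual_savings = onsite_hadoop_cost - databricks_cost
print(annual_savings)  # 75000.0
```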
Further, we were able to scale our queries transparently and efficiently, bringing each major analytics use case under fifteen minutes, where the in-house Hadoop cluster had subjected us to unstable queries and highly brittle data flows.
We are further reducing spending on our traditional RDBMS solution by offloading reporting workloads to Databricks PySpark notebooks, which cuts expensive datacenter resources and frees the RDBMS for OLTP loads.
What's my experience with pricing, setup cost, and licensing?
You can set up a cluster in your cloud of choice, but Databricks' own managed service may also prove very competitive, as its pricing units are built in.
I would counsel against on-site licensing, as on-site hardware issues tend to seriously delay and slow down delivery.
Which other solutions did I evaluate?
I evaluated Hortonworks, Livy, and Zeppelin. These were unsuitable due to the unavailability of sufficiently skilled personnel.
What other advice do I have?
Invest in people skilled in data querying, Python coding, and even basic data science, and a Databricks setup will reward the business.
Once the Databricks data flows are established, it is a matter of a few incremental steps to open up streaming and run up-to-the-minute queries, allowing the business to build out its data-driven processes.
Databricks continues to advance the state-of-the-art and will be my go-to choice for mission-critical PySpark and ML workflows.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.