Try our new research platform with insights from 80,000+ expert users
reviewer1270416 - PeerSpot reviewer
Vice President, Business Intelligence and Analytics at a tech services company with 10,001+ employees
Real User
Stable cloud platform for data engineering and has a straightforward setup
Pros and Cons
  • "I haven't heard about any major stability issues. At this time I feel like it's stable."
  • "Pricing is one of the things that could be improved."

What is our primary use case?

We are still exploring the solution. We utilize it much, much better than their star schema models that they are trying to replace it with. We bring in Databricks and then see how they can leverage the additional analytical functionalities around the Databricks cloud. It's more in exploratory ways. We recommend Databricks, especially with the Azure cloud frameworks.

What needs improvement?

Pricing is one of the things that could be improved.

Also, there could be improvement in the visual analytics space there and on the machine learning functions. I haven't explored so I don't know about the functions and features that are there. If it is not there, then I think that's something which they should consider including.

For how long have I used the solution?

My team has been exploring Databricks for close to five or six months.

What do I think about the stability of the solution?

I haven't heard about any major stability issues. At this time I feel like it's stable.

Buyer's Guide
Databricks
February 2025
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.

What do I think about the scalability of the solution?

In terms of scalability, I think once we put it across for larger use-cases the scalability question will really arise. So we'll need detailed information. I assume that we will be able to scale up.

I think we do not have more than 10 people working on it now. Because we are in the earlier stages of implementation, it's more like a POC now. I really don't know whether it's been open for the larger audience yet.

How was the initial setup?

The initial setup was straightforward.

What about the implementation team?

It is better to be installed with the help of integrators, or consultants, or with an experienced team.

What other advice do I have?

It's more data scientists using Databricks. I would call them power users trying to see how they can get a hand on it, though they are not data scientists. They try to understand it a little bit better for their future use.

On a scale of one to ten, I would rate it an eight, easy. 

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user
it_user1235523 - PeerSpot reviewer
Machine Learning Engineer at a tech vendor with 51-200 employees
Real User
A convenient notebook, good stability, and a straightforward setup
Pros and Cons
  • "The most valuable aspect of the solution is its notebook. It's quite convenient to use, both terms of the research and the development and also the final deployment, I can just declare the spark jobs by the load tables. It's quite convenient."
  • "The solution could be improved by integrating it with data packets. Right now, the load tables provide a function, like team collaboration. Still, it's unclear as to if there's a function to create different branches and/or more branches. Our team had used data packets before, however, I feel it's difficult to integrate the current with the previous data packets."

What is our primary use case?

We primarily use the solution to run current jobs; to run the spark jobs as the current job.

What is most valuable?

The most valuable aspect of the solution is its notebook. It's quite convenient to use, both terms of the research and the development and also the final deployment, I can just declare the spark jobs by the load tables. It's quite convenient.

What needs improvement?

The solution could be improved by integrating it with data packets. Right now, the load tables provide a function, like team collaboration. Still, it's unclear as to if there's a function to create different branches and/or more branches. Our team had used data packets before, however, I feel it's difficult to integrate the current with the previous data packets.

The support could be improved a bit around the database. When we stream it to Data Lake, some data cannot be loaded. It should be a priority to fix this.

For how long have I used the solution?

I've been using the solution for half a year.

What do I think about the stability of the solution?

The solution is stable.

What do I think about the scalability of the solution?

The solution is scalable. However, it still needs us to manually set out the number of nodes in a cluster. It's really dependent on the application. Sometimes, when the tasks are bigger, and it gets a little difficult for us to define the number of nodes in a cluster. If the solution could allow users to set up the clusters, I think that'll be good.

Currently, we have three people using the solution. We may increase usage in the future.

How are customer service and technical support?

The technical support is quite good. In the beginning, when we had a few POC projects, they were very supportive.

Which solution did I use previously and why did I switch?

We didn't previously use a different solution, however, we built our own from scratch. This is the first unified platform that we've used.

How was the initial setup?

The initial setup is very straightforward. We just use their job functions. To deploy as a spark job is quite straightforward. 

In our use case, we also had some external databases to handle the deployment. For example, we only generated some prediction results. We saved the results into an external database. The solution takes time to deploy to the external database, but the spark job is quite easy.

What other advice do I have?

I'm a software development engineer. I'm working with the latest version.

As long as the developers have an understanding of spark, and understanding technical tricks, it's very fast in terms of using the database.

I'd rate the solution eight out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Databricks
February 2025
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
838,713 professionals have used our research since 2012.
reviewer1888527 - PeerSpot reviewer
Big Data and Cloud Architect at a computer software company with 201-500 employees
Real User
Excellent workspace and notebooks
Pros and Cons
  • "Databricks' most valuable features are the workspace and notebooks. Its integration, interface, and documentation are also good."
  • "Databricks' technical support takes a while to respond and could be improved."

What is our primary use case?

I primarily use Databricks for data pipelines.

What is most valuable?

Databricks' most valuable features are the workspace and notebooks. Its integration, interface, and documentation are also good.

For how long have I used the solution?

I've been working with Databricks for around five years.

What do I think about the stability of the solution?

Databricks is stable.

What do I think about the scalability of the solution?

Databricks is scalable.

How are customer service and support?

Databricks' technical support takes a while to respond and could be improved.

How was the initial setup?

The initial setup was easy.

What's my experience with pricing, setup cost, and licensing?

Databricks' cost could be improved.

What other advice do I have?

I would give Databricks a rating of eight out of ten.

Which deployment model are you using for this solution?

Private Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Enterprise Data Architect at a financial services firm with 51-200 employees
Real User
Assists with quickly computing a considerable amount of historical data and helps us with data ingestion
Pros and Cons
  • "Its lightweight and fast processing are valuable."
  • "The Databricks cluster can be improved."

What is our primary use case?

Our primary use case for this solution is for data ingestion and the DQ rules we are implementing. We deploy the solution on Azure cloud.

How has it helped my organization?

Whenever we send data to downstream applications for creating a file, multiple business rules are involved, and this solution assists with quickly computing a considerable amount of historical data.

What is most valuable?

Its lightweight and fast processing are valuable.

What needs improvement?

The product could include some UI features to improve the ease of use, like drag and drop for a few aggregated functions. Additionally, the Databricks cluster can be improved.

For how long have I used the solution?

We have been using Databricks for approximately two years and are currently using the latest version.

What do I think about the stability of the solution?

The solution is very stable. However, sometimes it intermittently restarts. I rate the stability an eight out of ten.

What do I think about the scalability of the solution?

The solution is scalable, and we are trying to implement more use cases with Databricks in our organization as we advance. I rate the scalability an eight out of ten.

How are customer service and support?

I rate customer service and support a nine out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup was not very complex. We deploy the solution manually and the time required depends on the complexity of the business logic. I rate it an eight out of ten.

What about the implementation team?

We implemented the solution through an in-house team.

What other advice do I have?

I rate the solution an eight out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer: Gold Partners
PeerSpot user
reviewer2058678 - PeerSpot reviewer
Director of Data (Engineering & Science) at a tech services company with 11-50 employees
Real User
An easy-to-use solution useful to run patch jobs
Pros and Cons
  • "The ease of use and its accessibility are valuable."
  • "The integration and query capabilities can be improved."

What is our primary use case?

Our primary use case for the solution is to run batch jobs.

What is most valuable?

The ease of use and its accessibility are valuable.

What needs improvement?

The solution can be improved by expanding its integration capabilities and providing the ability to query external vendors directly.

For how long have I used the solution?

We have been using the solution for a little less than a year, and we deploy it on the Amazon cloud.

What do I think about the stability of the solution?

The solution is stable.

What do I think about the scalability of the solution?

The solution is scalable, and there are approximately seven developers and two DevOps employees utilizing the solution.

How are customer service and support?

We have had a good experience with customer service and support. I rate them a nine out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup for the solution is a bit complex.

What's my experience with pricing, setup cost, and licensing?

I wouldn't consider it a costly solution. Like all other solutions, it depends on how you use them. If you provision sparked clusters much larger than what you need, it becomes costly. For example, it is not more costly than EMR, the AWS equivalent, and from my perspective, it is much better.

What other advice do I have?

I rate the solution a nine out of ten. The solution is good, but the integration and query capabilities can be improved.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer1708788 - PeerSpot reviewer
Technical Architect at a tech services company with 10,001+ employees
Real User
Facilitates robust solutions through collaboration but non-SQL users may struggle
Pros and Cons
  • "I like the ability to use workspaces with other colleagues because you can work together even without seeing the other team's job."
  • "Anyone who doesn't know SQL may find the product difficult to work with."

What is most valuable?

I like the ability to use workspaces with other colleagues because you can work together even without seeing the other team's job. So you can create a robust solution by working together with other professionals.

What needs improvement?

One area for improvement would be that anyone who doesn't know SQL may find the product difficult to work with. It would also be useful to have a remote support team inside Databricks, which would collect and analyze user feedback.

For how long have I used the solution?

I have been using Databricks since 2018.

How are customer service and support?

I had a little trouble with customer support but this was solved.

How was the initial setup?

The initial setup was a little complex because it was a new architecture for the customer, so there was nothing to compare it to in order to accelerate the project. This meant the deployment of the first project using Databricks took almost nine months and the second took almost a year.

Which deployment model are you using for this solution?

Hybrid Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer1479477 - PeerSpot reviewer
Business Development Specialist at a tech services company with 51-200 employees
Real User
Useful end-to-end data analytics, highly stable, and scalable
Pros and Cons
  • "Databricks covers end-to-end data analytics workflow in one platform, this is the best feature of the solution."
  • "Databricks could improve in some of its functionality."

What is our primary use case?

Databricks is the full data analytics platform. It involves end to end data analytics process.

What is most valuable?

Databricks covers end-to-end data analytics workflow in one platform, this is the best feature of the solution.

What needs improvement?

Databricks could improve in some of its functionality.

For how long have I used the solution?

I have been using Databricks for approximately a year and a half.

What do I think about the stability of the solution?

Databricks is very stable.

What do I think about the scalability of the solution?

The scalability of Databricks is good.

We have 30 to 40 people are using this solution in my company.

What other advice do I have?

I rate Databricks a nine out of ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer: Partners
PeerSpot user
reviewer1276107 - PeerSpot reviewer
Engineer at a tech services company with 10,001+ employees
Real User
An easy initial setup with a good time travel feature, but needs better model scoring
Pros and Cons
  • "The time travel feature is the solution's most valuable aspect."
  • "Databricks is an analytics platform. It should offer more data science. It should have more features for data scientists to work with."

What is our primary use case?

We use the solution for multiple items. We use lots of data crunching, development, and algorithms on it.

What is most valuable?

The time travel feature is the solution's most valuable aspect.

What needs improvement?

The management of the solution needs to be modernized. Managing the radius data is hard.

The solution requires modern scoring. There's not a good way of knowing how the models are performing from a data science perspective. The solution needs more model scoring abilities. It doesn't necessarily need more model monitoring, but more model scoring and performance from a data science perspective. 

Databricks is an analytics platform. It should offer more data science. It should have more features for data scientists to work with.

For how long have I used the solution?

I've been using the solution for one year so far.

What do I think about the stability of the solution?

The solution is not exactly stable. We've faced a few bugs which have really affected it. There are bugs especially when it comes to connecting with Spark.

What do I think about the scalability of the solution?

It's hard to say how scalable the solution is. The scalability comes into play on the Spark side, not on the Databricks side.

We have about 20 people on the solution right now.

How are customer service and technical support?

We've never been in touch with technical support, so I don't have any experience in terms of dealing with them.

How was the initial setup?

The initial setup is straightforward. I wouldn't say that it's complex in any way.

Deployment times vary and really depend on multiple factors. It can take anywhere from a few weeks to a few months to deploy the solution. In our case, it took us about three months to fully deploy it.

It takes two to three people to deploy the solution.

What about the implementation team?

I deployed the solution with the help of my team.

What's my experience with pricing, setup cost, and licensing?

I'm not sure what the licensing costs are on the solution.

Which other solutions did I evaluate?

We did evaluate Amazon PageMaker before ultimately choosing Databricks. It's the only other solution we evaluated at the time.

What other advice do I have?

We're partners with Databricks.

We're using the latest version of the solution, but I can't recall what version number we are on.

I'd advise others considering the solution to look at usage. They shouldn't adopt the solution blindly. How the implementation and usage will go will depend on the skill of the data engineer and what your requirements are.

I'd rate the solution seven out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user