Try our new research platform with insights from 80,000+ expert users

Databricks vs Infobright DB comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Databricks
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
88
Ranking in other categories
Cloud Data Warehouse (7th), Data Science Platforms (1st), Streaming Analytics (1st)
Infobright DB
Average Rating
7.6
Reviews Sentiment
6.3
Number of Reviews
10
Ranking in other categories
Relational Databases Tools (37th), Data Warehouse (27th)
 

Featured Reviews

ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.
SD
If you need a real big data solution, look for a distributed solution that actually has a proven track record.
This version of Infobright has zero support for distributed scalability. The internal smart grid employed for each table has a major flaw in that the data size cannot be expunged until 2GB of data is reached at the column-level. This is a major flaw, making usage in a big-data scenario impossible. This means that you can delete as many records from a database table as you want. However, unless the 2GB aggregate size threshold was reached for some of the columns in the table, no reduction in disk space usage will occur. Only the data from the columns that reached 2GB will actually decrease. Other columns below 2GB in size do not leave the disk. I spent countless hours trying to find some workaround for this. I have nightmares of my e-mail inbox full of unsolvable questions about data size reduction from our field engineers.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The main features of the solution are efficiency."
"There are good features for turning off clusters."
"The processing capacity is tremendous in the database."
"When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks."
"Databricks gives us the ability to build a lakehouse framework and do everything implicit to this type of database structure. We also like the ability to stream events. Databricks covers a broad spectrum, from reporting and machine learning to streaming events. It's important for us to have all these features in one platform."
"The solution's features are fantastic and include interactive clusters that perform at top speed when compared to other solutions."
"Easy to use and requires minimal coding and customizations."
"Databricks is a robust solution for big data processing, offering flexibility and powerful features."
"It has very amazing smart grid query feature for very fast aggregate queries across millions of rows"
 

Cons

"I would like it if Databricks made it easier to set up a project."
"The product could be improved regarding the delay when switching to higher-performing virtual machines compared to other platforms."
"I would like to see the integration between Databricks and MLflow improved. It is quite hard to train multiple models in parallel in the distributed fashions. You hit rate limits on the clients very fast."
"The product needs samples and templates to help invite users to see results and understand what the product can do."
"Databricks' performance when serving the data to an analytics tool isn't as good as Snowflake's."
"Anyone who doesn't know SQL may find the product difficult to work with."
"Would be helpful to have additional licensing options."
"The API deployment and model deployment are not easy on the Databricks side."
"Only the data from the columns that reached 2GB will actually decrease. Other columns below 2GB in size do not leave the disk."
 

Pricing and Cost Advice

"My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month."
"I would rate the tool’s pricing an eight out of ten."
"The solution is based on a licensing model."
"The price of Databricks is reasonable compared to other solutions."
"Databricks uses a price-per-use model, where you can use as much compute as you need."
"I'm not involved in the financing, but I can say that the solution seemed reasonably priced compared to the competitors. Similar products are usually in the same price range. With five being affordable and one being expensive, I would rate Databricks a four out of five."
"The solution is a good value for batch processing and huge workloads."
"It is an expensive tool. The licensing model is a pay-as-you-go one."
"Our pricing was based on server instances and it was actually very cheap compared to Oracle. I guess you get what you pay for."
report
Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
846,617 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
11%
Manufacturing Company
9%
Healthcare Company
6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
Ask a question
Earn 20 points
 

Comparisons

No data available
 

Also Known As

Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
Infobright
 

Overview

 

Sample Customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
REZ-1, SonicWALL, IntegriChain, Fuseforward International Inc., Polystar, Live Rail, Mavenir Systems, JDSU Partners, Bango
Find out what your peers are saying about Databricks vs. Infobright DB and other solutions. Updated: March 2025.
846,617 professionals have used our research since 2012.