What Data Science Platform is best suited to a large-scale enterprise?

Data Science Platforms empower data analysts to develop, evaluate, and deploy analytical models efficiently. They integrate data exploration, visualization, and predictive modeling in one cohesive environment.These platforms serve as indispensable tools for data-driven decision-making, providing intuitive interfaces and scalable computing power. They enable seamless collaboration between data scientists and business stakeholders, allowing actionable insights to drive strategic initiatives...

Download Data Science Platforms Report Read more

Related categories

Data Mining

Predictive Analytics

Data Preparation Tools

Related Q&As

Dec 5, 2024

Why is Data Science Platforms important for companies?

Aug 14, 2021

What enterprise data analytics platform has the most powerful data visualization capabilities?

Data Science Platforms experts

Arun Srivastav

CEO at Planfirma Technologies Private Limited

SaurabhSingh1

Solution Sales Architect at Softline

Dimitris Iracleous

Lead Technical Instructor at Code.Hub

SaurabhSingh4

Data Analyst at Wespath Benefits and Investments

Laurence Moseley

Emeritus Professor of Health Services Research at University of South Wales

NagendraVuppala

Tax Manager at RSM

LJ

LijomonJose

System Architect at UST Global EspaÃ±a

SS

Sachin Shukre

Sr Manager at a transportation company with 10,001+ employees

Join the PeerSpot community

Ziad Chaudhry Sr. Manager - Systems Engineering at L3Harris Technologies · Answer 1 · 2020-10-15T16:18:55Z

ZC

Ziad Chaudhry

Sr. Manager - Systems Engineering at L3Harris Technologies

Real User

Oct 15, 2020

DakaIku is a great general purpose data science platform for both supervised and unsupervised learning. It handles Big Data very well.

AA

Anastasia Ant

Co-Founder at Retable

User

Aug 25, 2021

@Ziad Chaudhry I'd also vote for Dataiku, look at their cases https://www.dataiku.com/storie...

See all 2 replies

AaronCooke Founder at Helio Summit · Answer 2 · 2020-08-18T12:47:29Z

AC

AaronCooke

Founder at Helio Summit

Real User

Aug 18, 2020

Sparkcognition's Darwin product can handle very large data sets.

Rony_Sklar

Community Manager at a tech services company with 51-200 employees

Real User

Aug 19, 2020

Thanks for your input @AaronCooke :)

See all 2 replies

Djalma Gomes, Pmp, Mba Managing Partner at Data Pine · Answer 3 · 2021-08-26T12:58:55Z

Data science platform is a vague term.

It all depends on what you wish to accomplish. Are you talking about fast databases, ETLs, a Machine Learning tool, integration with R or Python, Self-Service Data Visualization Tool, Collaboration? No size fits all...

Jinhyung Cho CEO with 1-10 employees · Answer 4 · 2021-08-26T03:42:21Z

Dataiku, Domino, RapidMiner are notable candidates for your purpose, I presume.

It has been 2 years when I checked several vendors and made the list as candidates. They all support large-scale data manipulation for data analysis and machine learning development as a platform that can be used by many people in a collaborative way.

score 1 · Answer 5 · 2021-08-24T10:48:49Z

I suspect that I cannot answer this. I have used Knime and RapidMiner with data sets that have had up to about 80,000 rows and 1,500 columns and both have performed well. However, I doubt whether the questioner would classify my usage as "large amounts of data". If my usage is like theirs, then both packages can be recommended.

Both Knime and RapidMiner offer the facility to link with Python or R, and those languages have modules or methods which offer better performance on large data sets (multi-processing or using GPUs, etc.), so those combinations might serve their purpose. So, they might use, say, Knime for ease of use and, say, R for the excess power or RapidMiner and Python.

Hyundong Lee Sales & Operations Manager at superbai · Answer 6 · 2020-09-09T22:37:17Z

If you want to handle computer vision data, I recommend the Superb AI Suite.
https://www.superb-ai.com/

score 1 · Answer 7 · 2020-08-18T17:51:00Z

YP

Yogesh PARTE

Data Science Practice Lead at a tech services company with 1,001-5,000 employees

User

Aug 18, 2020

The question also needs to specify which domain, what kind of data and public or private platforms.

For structured/tabular data driverless AI / H20.ai sparkling water is my preferred platform.

Rony_Sklar

Community Manager at a tech services company with 51-200 employees

Real User

Aug 19, 2020

@Yogesh PARTE Good point - this is a more general question, but I do agree that it's easier to make recommendations with more details. Would you mind sharing more about why H20.ai Sparkling Water is your preferred choice in this instance?