Try our new research platform with insights from 80,000+ expert users

Spring Cloud Data Flow vs Teradata comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 6, 2024
 

Categories and Ranking

Spring Cloud Data Flow
Ranking in Data Integration
24th
Average Rating
7.8
Number of Reviews
8
Ranking in other categories
Streaming Analytics (10th)
Teradata
Ranking in Data Integration
17th
Average Rating
8.2
Number of Reviews
74
Ranking in other categories
Customer Experience Management (3rd), Backup and Recovery (20th), Relational Databases Tools (7th), Data Warehouse (3rd), BI (Business Intelligence) Tools (10th), Marketing Management (6th), Cloud Data Warehouse (6th)
 

Featured Reviews

NitinGoyal - PeerSpot reviewer
Aug 15, 2024
Has a plug-and-play model and provides good robustness and scalability
The solution's community support could be improved. I don't know why the Spring Cloud Data Flow community is not very strong. Community support is very limited whenever you face any problem or are stuck somewhere. I'm not sure whether it has improved in the last six months because this pipeline was set up almost two years ago. I struggled with that a lot. For example, there was limited support whenever I got an exception and sought help from Stack Overflow or different forums. Interacting with Kubernetes needs a few certificates. You need to define all the certificates within your application. With the help of those certificates, your Java application or Spring Cloud Data Flow can interact with Kubernetes. I faced a lot of hurdles while placing those certificates. Despite following the official documentation to define all the replicas, readiness, and liveliness probes within the Spring Cloud Data Flow application, it was not working. So, I had to troubleshoot while digging in and debugging the internals of Spring Cloud Data Flow at that time. It was just a configuration mismatch, and I was doing nothing weird. There was a small spelling difference between how Spring Cloud Data Flow was expecting it and how I passed it. I was just following the official documentation.
SurjitChoudhury - PeerSpot reviewer
Feb 20, 2024
Offers seamless integration capabilities and performance optimization features, including extensive indexing and advanced tuning capabilities
We created and constructed the warehouse. We used multiple loading processes like MultiLoad, FastLoad, and Teradata Pump. But those are loading processes, and Teradata is a powerful tool because if we consider older technologies, its architecture with nodes, virtual processes, and nodes is a unique concept. Later, other technologies like Informatica also adopted the concept of nodes from Informatica PowerCenter version 7.x. Previously, it was a client-server architecture, but later, it changed to the nodes concept. Like, we can have the database available 24/7, 365 days. If one node fails, other nodes can take care of it. Informatica adopted all those concepts when it changed its architecture. Even Oracle databases have since adapted their architecture to them. However, this particular Teradata company initially started with its own different type of architecture, which major companies later adopted. It has grown now, but initially, whatever query we sent it would be mapped into a particular component. After that, it goes to the virtual processor and down to the disk, where the actual physical data is loaded. So, in between, there's a map, which acts like a data dictionary. It also holds information about each piece of data, where it's loaded, and on which particular virtual processor or node the data resides. Because Teradata comes with a four-node architecture, or however many nodes we choose, the cost is determined by that initially. So, what type of data does each and every node hold? It's a shared-no architecture. So, whatever task is given to a virtual processor it will be processed. If there's a failure, then it will be taken care of by another virtual processor. Moreover, this solution has impacted the query time and data performance. In Teradata, there's a lot of joining, partitioning, and indexing of records. There are primary and secondary indexes, hash indexing, and other indexing processes. To improve query performance, we first analyze the query and tune it. If a join needs a secondary index, which plays a major role in filtering records, we might reconstruct that particular table with the secondary index. This tuning involves partitioning and indexing. We use these tools and technologies to fine-tune performance. When it comes to integration, tools like Informatica seamlessly connect with Teradata. We ensure the Teradata database is configured correctly in Informatica, including the proper hostname and properties for the load process. We didn't find any major complexity or issues with integration. But, these technologies are quite old now. With newer big data technologies, we've worked with a four-layer architecture, pulling data from Hadoop Lake to Teradata. We configure Teradata with the appropriate hostname and credentials, and use BTEQ queries to load data. Previously, we converted the data warehouse to a CLD model as per Teradata's standardized procedures, moving from an ETL to an EMT process. This allowed us to perform gap analysis on missing entities based on the model and retrieve them from the source system again. We found Teradata integration straightforward and compatible with other tools.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable features of Spring Cloud Data Flow are the simple programming model, integration, dependency Injection, and ability to do any injection. Additionally, auto-configuration is another important feature because we don't have to configure the database and or set up the boilerplate in the database in every project. The composability is good, we can create small workloads and compose them in any way we like."
"The dashboards in Spring Cloud Dataflow are quite valuable."
"There are a lot of options in Spring Cloud. It's flexible in terms of how we can use it. It's a full infrastructure."
"The product is very user-friendly."
"The most valuable feature is real-time streaming."
"The best thing I like about Spring Cloud Data Flow is its plug-and-play model."
"The solution's most valuable feature is that it allows us to use different batch data sources, retrieve the data, and then do the data processing, after which we can convert and store it in the target."
"It is a highly robust software solution."
"It has increased the speed of reporting."
"It's stable and reliable."
"I've never had any issues with scalability."
"It is a stable program."
"We did performance testing. We had a set of real life MicroStrategy reports. Our conditions were: Not allowed to redesign data model, not allowed to rewrite the queries, all queries should be generated by MicroStrategy, no aggregates. Teradata appeared to be way faster than a similarly configured (in terms of hardware) Oracle server."
"The most valuable features are the large volume of data and the structuring of the data to optimize it and get very optimal data warehouse solutions for customers."
"The ease of deployment is useful so clients are up and running quickly in comparison to other products."
 

Cons

"Spring Cloud Data Flow is not an easy-to-use tool, so improvements are required."
"The configurations could be better. Some configurations are a little bit time-consuming in terms of trying to understand using the Spring Cloud documentation."
"The solution's community support could be improved."
"Spring Cloud Data Flow could improve the user interface. We can drag and drop in the application for the configuration and settings, and deploy it right from the UI, without having to run a CI/CD pipeline. However, that does not work with Kubernetes, it only works when we are working with jars as the Spring Cloud Data Flow applications."
"On the tool's online discussion forums, you may get stuck with an issue, making it an area where improvements are required."
"Some of the features, like the monitoring tools, are not very mature and are still evolving."
"I would improve the dashboard features as they are not very user-friendly."
"Since I was working on the very basic, legacy systems, the memory thing was always a challenge. If Teradata is moving to the cloud, the space constraint or the memory issue that my company generally faces will eventually resolve, in time. What I'd like to see in the next release of Teradata is that it becomes full-fledged on the cloud, apart from better connectivity to various systems. For example, if I have to read or include a Python script, if I write some basic codes, I should be able to read even unstructured data. I know that it's not supported even in Snowflake, but at least semi-structured data support, if that can be a little more enhanced, that would be good."
"Data synchronization to the DR site."
"The solution needs improvement in its stability, support and pricing."
"Teradata could improve by being less complicated. There are some aspects that are not available on the Unix server and a Unix system is required to access some data, such as in case of an emergency."
"The setup is not straightforward."
"GUI of administrative tools is really outdated."
"Teradata's UI could be more user-friendly."
"Teradata's pricing is quite high compared to Redshift, Synapse, or GCP alternatives."
 

Pricing and Cost Advice

"The solution provides value for money, and we are currently using its community edition."
"If you want support from Spring Cloud Data Flow there is a fee. The Spring Framework is open-source and this is a free solution."
"This is an open-source product that can be used free of charge."
"In this day and age, we want to get things done quickly. So, we go to the AWS Marketplace."
"The cost is substantial, totaling around $1.2 million, solely dedicated to upgrading the hardware."
"Price is quite high, so if it is really possible to use other solutions (e.g. you do not have strict requirements for performance and huge data volumes), it might be better to look at alternatives from the RDBMS world."
"The tool costs about 30,000 euros a month, while Azure Synapse SQL only costs 10,000."
"Teradata is not cheap, but you get what you pay for."
"Teradata is a very expensive solution."
"The price of the solution could be reduced, it is expensive."
"Teradata's licensing is on the expensive side."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
813,418 professionals have used our research since 2012.
 

Comparison Review

it_user232068 - PeerSpot reviewer
Aug 5, 2015
Netezza vs. Teradata
Original published at https://www.linkedin.com/pulse/should-i-choose-net Two leading Massively Parallel Processing (MPP) architectures for Data Warehousing (DW) are IBM PureData System for Analytics (formerly Netezza) and Teradata. I thought talking about the similarities and differences…
 

Top Industries

By visitors reading reviews
Financial Services Firm
29%
Computer Software Company
17%
Manufacturing Company
7%
Retailer
6%
Financial Services Firm
25%
Computer Software Company
11%
Manufacturing Company
8%
Healthcare Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What needs improvement with Spring Cloud Data Flow?
I would improve the dashboard features as they are not very user-friendly. Another area for improvement is the documentation, as it is not very precise. There are limited resources available on Spr...
What is your primary use case for Spring Cloud Data Flow?
I am a developer using Spring Cloud Dataflow. We primarily use it to convert our applications from monolithic to microservices. The solution is used for scheduling tasks in a specific order and ens...
What advice do you have for others considering Spring Cloud Data Flow?
My advice would be to thoroughly review the documentation and understand if Spring Cloud Dataflow is the right solution for your application. For applications with only one or two microservices, it...
Comparing Teradata and Oracle Database, which product do you think is better and why?
I have spoken to my colleagues about this comparison and in our collective opinion, the reason why some people may declare Teradata better than Oracle is the pricing. Both solutions are quite simi...
Which companies use Teradata and who is it most suitable for?
Before my organization implemented this solution, we researched which big brands were using Teradata, so we knew if it would be compatible with our field. According to the product's site, the comp...
Is Teradata a difficult solution to work with?
Teradata is not a difficult product to work with, especially since they offer you technical support at all levels if you just ask. There are some features that may cause difficulties - for example,...
 

Also Known As

No data available
IntelliFlex, Aster Data Map Reduce, , QueryGrid, Customer Interaction Manager, Digital Marketing Center, Data Mover, Data Stream Architecture
 

Learn More

 

Overview

 

Sample Customers

Information Not Available
Netflix
Find out what your peers are saying about Microsoft, Informatica, Talend and others in Data Integration. Updated: October 2024.
813,418 professionals have used our research since 2012.