Try our new research platform with insights from 80,000+ expert users

Azure Data Factory vs Teradata comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 6, 2024
 

Categories and Ranking

Azure Data Factory
Ranking in Data Integration
1st
Ranking in Cloud Data Warehouse
3rd
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
86
Ranking in other categories
No ranking in other categories
Teradata
Ranking in Data Integration
18th
Ranking in Cloud Data Warehouse
6th
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
76
Ranking in other categories
Customer Experience Management (5th), Backup and Recovery (19th), Relational Databases Tools (7th), Data Warehouse (3rd), BI (Business Intelligence) Tools (10th), Marketing Management (6th)
 

Featured Reviews

Thulani David Mngadi - PeerSpot reviewer
Data flow feature is valuable for data transformation tasks
The workflow automation features in GitLab, particularly its low code/no code approach, are highly beneficial for accelerating development speed. This feature allows for quick creation of pipelines and offers customization options for integration needs, making it versatile for various use cases. GitLab supports a wide range of connectors, catering to a majority of integration needs. Azure Data Factory's virtual enterprise and monitoring capabilities, the visual interface of GitLab makes it user-friendly and easy to teach, facilitating adoption within teams. While the monitoring capabilities are sufficient out of the box, they may not be as comprehensive as dedicated enterprise monitoring tools. GitLab's monitoring features are manageable for production use, with the option to integrate log analytics or create custom dashboards if needed. The data flow feature in Azure Data Factory within GitLab is valuable for data transformation tasks, especially for those who may not have expertise in writing complex code. It simplifies the process of data manipulation and is particularly useful for individuals unfamiliar with Spark coding. While there could be improvements for more flexibility, overall, the data flow feature effectively accomplishes its purpose within GitLab's ecosystem.
SurjitChoudhury - PeerSpot reviewer
Offers seamless integration capabilities and performance optimization features, including extensive indexing and advanced tuning capabilities
We created and constructed the warehouse. We used multiple loading processes like MultiLoad, FastLoad, and Teradata Pump. But those are loading processes, and Teradata is a powerful tool because if we consider older technologies, its architecture with nodes, virtual processes, and nodes is a unique concept. Later, other technologies like Informatica also adopted the concept of nodes from Informatica PowerCenter version 7.x. Previously, it was a client-server architecture, but later, it changed to the nodes concept. Like, we can have the database available 24/7, 365 days. If one node fails, other nodes can take care of it. Informatica adopted all those concepts when it changed its architecture. Even Oracle databases have since adapted their architecture to them. However, this particular Teradata company initially started with its own different type of architecture, which major companies later adopted. It has grown now, but initially, whatever query we sent it would be mapped into a particular component. After that, it goes to the virtual processor and down to the disk, where the actual physical data is loaded. So, in between, there's a map, which acts like a data dictionary. It also holds information about each piece of data, where it's loaded, and on which particular virtual processor or node the data resides. Because Teradata comes with a four-node architecture, or however many nodes we choose, the cost is determined by that initially. So, what type of data does each and every node hold? It's a shared-no architecture. So, whatever task is given to a virtual processor it will be processed. If there's a failure, then it will be taken care of by another virtual processor. Moreover, this solution has impacted the query time and data performance. In Teradata, there's a lot of joining, partitioning, and indexing of records. There are primary and secondary indexes, hash indexing, and other indexing processes. To improve query performance, we first analyze the query and tune it. If a join needs a secondary index, which plays a major role in filtering records, we might reconstruct that particular table with the secondary index. This tuning involves partitioning and indexing. We use these tools and technologies to fine-tune performance. When it comes to integration, tools like Informatica seamlessly connect with Teradata. We ensure the Teradata database is configured correctly in Informatica, including the proper hostname and properties for the load process. We didn't find any major complexity or issues with integration. But, these technologies are quite old now. With newer big data technologies, we've worked with a four-layer architecture, pulling data from Hadoop Lake to Teradata. We configure Teradata with the appropriate hostname and credentials, and use BTEQ queries to load data. Previously, we converted the data warehouse to a CLD model as per Teradata's standardized procedures, moving from an ETL to an EMT process. This allowed us to perform gap analysis on missing entities based on the model and retrieve them from the source system again. We found Teradata integration straightforward and compatible with other tools.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is very modular. It works well. We've used Data Factory and then made calls to libraries outside of Data Factory to do things that it wasn't optimized to do, and it worked really well. It is obviously proprietary in regards to Microsoft created it, but it is pretty easy and direct to bring in outside capabilities into Data Factory."
"One of the most valuable features of Azure Data Factory is the drag-and-drop interface. This helps with workflow management because we can just drag any tables or data sources we need. Because of how easy it is to drag and drop, we can deliver things very quickly. It's more customizable through visual effect."
"Azure Data Factory became more user-friendly when data-flows were introduced."
"We have been using drivers to connect to various data sets and consume data."
"It has built-in connectors for more than 100 sources and onboarding data from many different sources to the cloud environment."
"UI is easy to navigate and I can retrieve VTL code without knowing in-depth coding languages."
"When it comes to our business requirements, this solution has worked well for us. However, we have not stretched it to the limit."
"What I like best about Azure Data Factory is that it allows you to create pipelines, specifically ETL pipelines. I also like that Azure Data Factory has connectors and solves most of my company's problems."
"A conventional and easily defined way to build a data warehouse or a layer of data marts."
"The tool's most valuable feature is the warehousing model."
"When it comes to integration, tools like Informatica seamlessly connect with Teradata. We ensure the Teradata database is configured correctly in Informatica, including the proper hostname and properties for the load process. We didn't find any major complexity or issues with integration."
"It is a highly robust software solution."
"In Data Lab, you can schedule any testing you want to do in production. You can take a small subset of data from production, copy it there, and run all your tests. It reduces your testing costs because it's all in the lab."
"We did performance testing. We had a set of real life MicroStrategy reports. Our conditions were: Not allowed to redesign data model, not allowed to rewrite the queries, all queries should be generated by MicroStrategy, no aggregates. Teradata appeared to be way faster than a similarly configured (in terms of hardware) Oracle server."
"The product's scalability is great. Scalability-wise, I rate the solution a ten out of ten."
"The key advantages are Performance when processing Terabytes of data and scalability."
 

Cons

"I would like to be informed about the changes ahead of time, so we are aware of what's coming."
"The need to work more on developing out-of-the-box connectors for other products like Oracle, AWS, and others."
"Areas for improvement in Azure Data Factory include connectivity and integration. When you use integration runtime, whenever there's a failure, the backup process in Azure Data Factory takes time, so this is another area for improvement."
"It would be better if it had machine learning capabilities."
"There's space for improvement in the development process of the data pipelines."
"The solution needs to be more connectable to its own services."
"Azure Data Factory uses many resources and has issues with parallel workflows."
"The setup and configuration process could be simplified."
"The setup is not straightforward."
"I'm not sure about the unstructured data management capabilities. It could be improved."
"The primary challenge with Teradata lies in its cost structure, encompassing subscription fees, software licenses, and hardware expenses."
"Data ingestion is done via external utilities and not by the query language itself. It would be more convenient to have that functionality within its SQL dialect."
"An additional feature I would you like to see included in the next release, is that it needs to be more cloud-friendly."
"We tried to use case Teradata for a data warehouse system, but we had some problems in relation to the Teradata system, CDC tools, and source databases. We were unable to transfer data from HPE Integrity mainframe to Teradata."
"The reporting side wasn't very good in the past, but with the latest versions, it's getting better. Still, the friendliness of the PDC reporting and functionality needs to be improved."
"They should add more connectors to different platforms."
 

Pricing and Cost Advice

"Data Factory is affordable."
"ADF is cheaper compared to AWS."
"I don't see a cost; it appears to be included in general support."
"This is a cost-effective solution."
"The solution is cheap."
"The cost is based on the amount of data sets that we are ingesting."
"I rate the product price as six on a scale of one to ten, where one is low price and ten is high price."
"The licensing is a pay-as-you-go model, where you pay for what you consume."
"It is still a very expensive solution. While I very much like the pure technological supremacy of the software itself, I believe Teradata as a company needs to become more affordable. They are already losing the market to more flexible or cheaper competitors."
"The initial cost may seem high, but the TCO is low."
"Teradata is a very expensive solution."
"We are looking for a more flexible cost model for the next version that we use, whether it be cloud or on-premise."
"The price of the solution could be reduced, it is expensive."
"In this day and age, we want to get things done quickly. So, we go to the AWS Marketplace."
"Make sure you have the in-house skills to design and support the solution, as relying on external sources is extremely costly and tends to lock you into specific platforms, tools, and paradigms."
"​I would advise others to look into migration and setup as a fixed price and incorporate a SaaS option for other Teradata services​."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
824,053 professionals have used our research since 2012.
 

Comparison Review

it_user232068 - PeerSpot reviewer
Aug 5, 2015
Netezza vs. Teradata
Original published at https://www.linkedin.com/pulse/should-i-choose-net Two leading Massively Parallel Processing (MPP) architectures for Data Warehousing (DW) are IBM PureData System for Analytics (formerly Netezza) and Teradata. I thought talking about the similarities and differences…
 

Top Industries

By visitors reading reviews
Financial Services Firm
13%
Computer Software Company
12%
Manufacturing Company
9%
Healthcare Company
7%
Financial Services Firm
26%
Computer Software Company
11%
Manufacturing Company
8%
Healthcare Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Azure Data Factory compare with Informatica PowerCenter?
Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up an...
How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
Comparing Teradata and Oracle Database, which product do you think is better and why?
I have spoken to my colleagues about this comparison and in our collective opinion, the reason why some people may declare Teradata better than Oracle is the pricing. Both solutions are quite simi...
Which companies use Teradata and who is it most suitable for?
Before my organization implemented this solution, we researched which big brands were using Teradata, so we knew if it would be compatible with our field. According to the product's site, the comp...
Is Teradata a difficult solution to work with?
Teradata is not a difficult product to work with, especially since they offer you technical support at all levels if you just ask. There are some features that may cause difficulties - for example,...
 

Also Known As

No data available
IntelliFlex, Aster Data Map Reduce, , QueryGrid, Customer Interaction Manager, Digital Marketing Center, Data Mover, Data Stream Architecture
 

Overview

 

Sample Customers

1. Adobe 2. BMW 3. Coca-Cola 4. General Electric 5. Johnson & Johnson 6. LinkedIn 7. Mastercard 8. Nestle 9. Pfizer 10. Samsung 11. Siemens 12. Toyota 13. Unilever 14. Verizon 15. Walmart 16. Accenture 17. American Express 18. AT&T 19. Bank of America 20. Cisco 21. Deloitte 22. ExxonMobil 23. Ford 24. General Motors 25. IBM 26. JPMorgan Chase 27. Microsoft (Azure Data Factory is developed by Microsoft) 28. Oracle 29. Procter & Gamble 30. Salesforce 31. Shell 32. Visa
Netflix
Find out what your peers are saying about Azure Data Factory vs. Teradata and other solutions. Updated: December 2024.
824,053 professionals have used our research since 2012.