I primarily use Data Factory for data ingestion and B2B transformation.
Data Warehouse Analyst at ACSO Australia
Good connectivity but monitorability could be better
Pros and Cons
- "Data Factory's best features are connectivity with different tools and focusing data ingestion using pipeline copy data."
- "Data Factory's monitorability could be better."
What is our primary use case?
What is most valuable?
Data Factory's best features are connectivity with different tools and focusing data ingestion using pipeline copy data.
What needs improvement?
Data Factory's monitorability could be better. In the next release, Data Factory should include integrations with open-source tools like Air Flow.
For how long have I used the solution?
I've been working with Data Factory for about a year.
Buyer's Guide
Azure Data Factory
November 2024
Learn what your peers think about Azure Data Factory. Get advice and tips from experienced pros sharing their opinions. Updated: November 2024.
816,406 professionals have used our research since 2012.
What do I think about the stability of the solution?
Data Factory is stable.
What do I think about the scalability of the solution?
Data Factory is scalable.
How are customer service and support?
Microsoft's technical support is good, so long as your company has a good relationship with them.
Which solution did I use previously and why did I switch?
I previously worked with Talend, Matillion, and Fivetran.
What's my experience with pricing, setup cost, and licensing?
Data Factory is expensive.
What other advice do I have?
I would rate Data Factory seven out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
Technical Director, Senior Cloud Solutions Architect (Big Data Engineering & Data Science) at NorthBay Solutions
Great for gathering data and pipeline orchestration; much improved monitoring feature
Pros and Cons
- "An excellent tool for pipeline orchestration."
- "The solution needs to be more connectable to its own services."
What is our primary use case?
We generally implement this product for data transformation for our clients. We create the pipelines and provide training before handing it over to them. We generally deal with large-scale organizations. I'm a senior solutions architect.
How has it helped my organization?
I think the main benefit of this solution is the ease of use, especially for companies that have come from an SSIS type of background where they are used to Microsoft tools.
What is most valuable?
If you have a very simple pipeline you can use Data Factory for transformations, but it's really for serious analytics. This is an excellent tool for pipeline orchestration; connecting the different components and activities as well as gathering data. It's an orchestration tool, not a transformation tool. The monitoring feature has drastically improved.
What needs improvement?
Data Factory is embedded in the new Synapse Analytics. The problem is if you're using the core Data Factory, you can't call a notebook within Synapse. It's possible to call Databricks from Data Factory, but not the Spark notebook and I don't understand the reason for that restriction. To my mind, the solution needs to be more connectable to its own services.
There is a list of features I'd like to see in the next release, most of them related to oversight and security. AWS has a lake builder, which basically enforces the whole oversight concept from the start of your pipeline but unfortunately Microsoft hasn't yet implemented a similar feature.
For how long have I used the solution?
I've been using this solution for five years.
What do I think about the stability of the solution?
From what I've seen this is a stable solution.
What do I think about the scalability of the solution?
The solution is easy to scale keeping in mind that Data Factory doesn't do any computations. We use it mainly to push the computations to Databricks or Synapse. Projects with our clients generally last a few months and only until they go into production. I believe the ability to increase is always there.
How are customer service and support?
We typically do not use customer support, but there were a few cases several years ago as the product was moving to the cloud that things were not so stable and we contacted support services - they were very good.
Which solution did I use previously and why did I switch?
When I first started in this field, everything was basically Hadoop on-premise and Hadoop infrastructure. With the increase in cloud integrations, things have changed. Once the big data services got introduced, we were probably one of the few companies in North America that were actually into analytics and big data and we were the first to implement related Microsoft products in Canada.
How was the initial setup?
The initial setup is straightforward. I'm a huge fan and user of CI/CD pipelines and never do deployments manually. It's all automated and deployment takes a few minutes.
What's my experience with pricing, setup cost, and licensing?
Licensing costs of Data Factory are reasonable. The cost is mainly on the Synapse and Databricks side of things because they are the tools where the computations are done and where you need more nodes and servers.
What other advice do I have?
It's important to study the solution before purchasing it. The problem in this market is that because most users are generally not very knowledgeable, they typically fall for services that are not compatible with their use case. Data Factory comes with all the transformations but that doesn't work for serious analytics customers who generally need to resort to Databricks or Synapse which involves training and education. Since it's a new field and everything has just blasted off, it's very hard for people to catch on.
In my opinion, Airflow still ranks as number one but I would rate Data Factory an eight out of 10.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Buyer's Guide
Azure Data Factory
November 2024
Learn what your peers think about Azure Data Factory. Get advice and tips from experienced pros sharing their opinions. Updated: November 2024.
816,406 professionals have used our research since 2012.
Director of Product Management at EIM solutions
Powerful data transformations and the stability is good. Performance is good
Pros and Cons
- "The most valuable features are data transformations."
- "The speed and performance need to be improved."
What is our primary use case?
Our primary use case is data loading.
What is most valuable?
The most valuable features are data transformations.
What needs improvement?
The speed and performance can be further improved.
This solution should be able to connect with custom APIs.
For how long have I used the solution?
We have been using this solution for one or two months.
What do I think about the stability of the solution?
The stability of the Azure Data Factory is very good.
What do I think about the scalability of the solution?
The scalability is good. We have about five users for this solution and they are all developers.
How are customer service and technical support?
The technical support for this solution is good.
Which solution did I use previously and why did I switch?
We did not use another solution prior to this one.
How was the initial setup?
The initial setup of the Azure Data Factory was simple. It took us one week to deploy it.
What about the implementation team?
We deployed this solution with our in-house team.
What was our ROI?
Good
What's my experience with pricing, setup cost, and licensing?
Our licensing fees are approximately 15,000 ($150 USD) per month. There are no additional fees.
Which other solutions did I evaluate?
We evaluated a solution by Talend before selecting the Azure Data Factory.
What other advice do I have?
My advice for anybody who is implementing this solution is to first get in touch with the Microsoft team to get their support so that you don't spend too much on research.
I would rate this solution a eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Principal at a tech services company with 51-200 employees
Good product integrations but transient issues sometimes cause pipeline failures
Pros and Cons
- "It is beneficial that the solution is written with Spark as the back end."
- "There are limitations when processing more than one GD file."
What is our primary use case?
Our company uses the solution for data ingestion.
What is most valuable?
It is beneficial that the solution is written with Spark as the back end.
The solution is cloud-based and integrates well with other Azure products such as Synapse Analytics.
What needs improvement?
There are limitations when processing more than one GD file.
Data ingestion pipelines sometimes fail because of transient issues that have to do with the cloud network. It takes more than six hours to process or ingest 300,000 records and that is a long time.
For how long have I used the solution?
I have been using the solution for two years.
What do I think about the stability of the solution?
The solution is new in the market and pretty stable because ADF is a little more codified than AWS. Synapse Analytics adds another tool for data.
Stability is not quite at the level of Informatica or DataStage.
What do I think about the scalability of the solution?
The solution is scalable.
For multi-tenant applications connected to multiple databases, Microsoft recommends a share box and a cell post integration run time. But a run time connecting to multiple sources has limitations and requires multiple shares connecting to your data if you are ingesting it from on-premises.
How are customer service and support?
Technical support is okay. Support is contracted or partnered with various companies but is fine as a first level.
Most of the time, technical support has to connect with product engineers who troubleshoot issues.
How was the initial setup?
The setup is not very complex but requires intake, setting up integration services, and connecting to databases like Oracle before you push it to service.
What about the implementation team?
We implemented the solution in-house.
What's my experience with pricing, setup cost, and licensing?
The pricing is pay-as-you-go or reserve instance. Of the two options, reserve instance is much cheaper.
In the cloud, everything is service based and expensive. Users should be knowledgeable enough to maximize the solution.
For example, it makes no sense to run integration services all day if you are not ingesting data because you pay for that usage. It is important to understand how the product works to manage it accordingly and keep costs down.
What other advice do I have?
I rate the solution a six out of ten.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Network Team Lead at a computer software company with 1,001-5,000 employees
A stable solution that helps to move data from on-premises to the cloud
Pros and Cons
- "We use the solution to move data from on-premises to the cloud."
- "The tool’s workflow is not user-friendly. It should also improve its orchestration monitoring."
What is our primary use case?
We use the solution to move data from on-premises to the cloud.
What needs improvement?
The tool’s workflow is not user-friendly. It should also improve its orchestration monitoring.
For how long have I used the solution?
I have been using the product since 2019.
What do I think about the stability of the solution?
The tool is a stable product.
What other advice do I have?
I would rate the solution a seven out of ten.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Senior Consultant at a computer software company with 1,001-5,000 employees
Top notch stability, technical support very good, and multiple project client
Pros and Cons
- "I am one hundred percent happy with the stability."
- "I would like to see this time travel feature in Snowflake added to Azure Data Factory."
What is our primary use case?
Our primary use case is mainly for ETL and transforming the data and then using it for power VA. So there we are handling multiple projects. It is not just a single thing, but mainly used at the end is data analytics only.
What needs improvement?
I would like to see this time travel feature in Snowflake added to Azure Data Factory. In addition to taking care of data internally, how it is instead of using indexing, they have some different mechanisms to quickly execute the query instead of normal indexes. Both of these would be great to see implemented in future upgrades.
For how long have I used the solution?
I have been working with Azure Data Factory for the past twelve months.
What do I think about the stability of the solution?
I am one hundred percent happy with the stability.
How are customer service and support?
Technical support is excellent.
How would you rate customer service and support?
Positive
What other advice do I have?
I would rate Azure Data Factory an eight on a scale of one to ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer: customer/partner
Director Technology at a computer software company with 10,001+ employees
Easy pipeline setup and good integration
Pros and Cons
- "Data Factory's best feature is the ease of setting up pipelines for data and cloud integrations."
- "Data Factory could be improved in terms of data transformations by adding more metadata extractions."
What is our primary use case?
I primarily use Data Factory for creating pipelines on cloud in terms of integrating multiple cloud services.
What is most valuable?
Data Factory's best feature is the ease of setting up pipelines for data and cloud integrations. It also has good integration with other Azure services.
What needs improvement?
Data Factory could be improved in terms of data transformations by adding more metadata extractions.
For how long have I used the solution?
I've been using Data Factory for five years.
What do I think about the stability of the solution?
Data Factory's stability has improved following some initial issues.
What do I think about the scalability of the solution?
Data Factory's scalability is good.
How was the initial setup?
The initial setup was easy as it's a SaaS offering.
What's my experience with pricing, setup cost, and licensing?
Data Factory is affordable.
What other advice do I have?
I would give Data Factory a rating of eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer: Partners
IT Functional Analyst at a energy/utilities company with 1,001-5,000 employees
Is easy to use and is highly scalable
Pros and Cons
- "The two most valuable features of Azure Data Factory are that it's very scalable and that it's also highly reliable."
- "One area for improvement is documentation. At present, there isn't enough documentation on how to use Azure Data Factory in certain conditions. It would be good to have documentation on the various use cases."
What is our primary use case?
We are currently using it as an ETL (Extract, Transform, and Load) tool. We are using it to connect to various information providers or, in general, to various sources, to extract data, and then to insert it to our storage devices, databases, or data warehouses.
What is most valuable?
The two most valuable features of Azure Data Factory are that it's very scalable and that it's also highly reliable.
What needs improvement?
One area for improvement is documentation. At present, there isn't enough documentation on how to use Azure Data Factory in certain conditions. It would be good to have documentation on the various use cases.
Sometimes, it's really difficult to find the answers to very technical questions regarding certain conditions.
For how long have I used the solution?
I've been using Azure Data Factory since 2019.
What do I think about the stability of the solution?
It has been stable so far.
What do I think about the scalability of the solution?
Azure Data Factory is a very scalable solution. Including internal developers and external consultants working for us, we have about 10-15 people using this solution.
Which solution did I use previously and why did I switch?
We had been using various ETL tools during the years before moving to the cloud. We picked Azure Data Factory because we were moving towards the Azure cloud.
How was the initial setup?
The initial setup is very easy.
What about the implementation team?
We used a consultant as it was a big project. We had five to six specialists, including both internal and external employees, working on it. It took about about three to six months to complete.
What other advice do I have?
Azure Data Factory is a very easy to use tool. If you want to extract, manipulate, and load data to any type of Azure repository, I recommend this solution. However, I would not recommend it if the manipulation of data is very deep and complicated.
I would rate this solution at eight on a scale from one to ten.
Which deployment model are you using for this solution?
Private Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Buyer's Guide
Download our free Azure Data Factory Report and get advice and tips from experienced pros
sharing their opinions.
Updated: November 2024
Popular Comparisons
Informatica Intelligent Data Management Cloud (IDMC)
Informatica PowerCenter
Teradata
Oracle Data Integrator (ODI)
Talend Open Studio
IBM InfoSphere DataStage
Oracle GoldenGate
Palantir Foundry
SAP Data Services
Qlik Replicate
StreamSets
Alteryx Designer
Fivetran
Buyer's Guide
Download our free Azure Data Factory Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which solution do you prefer: KNIME, Azure Synapse Analytics, or Azure Data Factory?
- How do Alteryx, Denod, and Azure Data Factory overlap (or complement) each other?
- Do you think Azure Data Factory’s price is fair?
- What kind of organizations use Azure Data Factory?
- Is Azure Data Factory a secure solution?
- How does Azure Data Factory compare with Informatica PowerCenter?
- How does Azure Data Factory compare with Informatica Cloud Data Integration?
- Which is better for Snowflake integration, Matillion ETL or Azure Data Factory (ADF) when hosted on Azure?
- What is the best suitable replacement for ODI on Azure?
- Which product do you prefer: Teradata Vantage or Azure Data Factory?