What is our primary use case?
We don't use profiling as much, however, we do use it, in certain cases, for profiling. We use the Analyst tool to do out-of-box, high-level profiling of data to see high-level quality of completeness, and uniqueness, et cetera. Mainly, we use the Developer tool to connect to the sources and to write data quality rules.
How has it helped my organization?
It has improved our organization.
We started from just pretty much having flat files, and then doing some basic transformations, then writing back to Excel or QFD files.
We gradually moved to more analytical tasks. You don't just do statistical data quality you also do analytical. You do lots of joins with other sources and do the consistency checks, and to do more complex logic, and build metrics.
We use Tableau on the back of it to present the data and data quality, and then monitor it. We use it more like a batch process to build pipelines, and then, using Tableau, monitor the results of it and those metrics. Now, we work more with live updates and do that more than the batch.
What is most valuable?
It's probably one of the leading lights in ETL. They have really good built-in functionalities, or algorithms, that you can use to transform or process data and validate and standardize.
The solution is stable.
It's not too had to set up the cloud version.
Support is helpful and responsive.
What needs improvement?
We are in this transition mode, where we haven't yet got IDMC, the cloud version, so we don't actually have hands-on experience and have not actually seen the features. All we rely on, at the moment, is just the available documentation. What I don't like on the IDQ side is just the fact that in the on-premises version, you have all these applications, with separate configurations. In the cloud solution, it is fixed so that you have everything on one platform.
The performance isn't as good on-premises. For example, when you install clients, it's slow compared to the cloud. Still, we need to see. We haven't experienced it ourselves.
The upgrades are a downside. On-premises you manage all the changes in the software. You have to do that yourself, and if there's some problem with compatibility, it makes things that much harder. With the cloud, everything is managed by Informatica on the servers.
Managing the licenses with the on-premises version was difficult. However, with the cloud, it will be much simpler.
For how long have I used the solution?
I've been using the solution for the last six or seven years.
What do I think about the stability of the solution?
Once you set everything up, it is pretty stable. It's reliable. There are no bugs or glitches and it doesn't crash or freeze. It is way more stable than Hadoop and other applications.
What do I think about the scalability of the solution?
In terms of scaling, we used the clusters, and the processing was on Hadoop side. If we needed any extra space or any service, it was just managed there, so it was outside of Informatica.
Originally, we had 20 people using the solution, and then it was reduced to less than ten.
We do use it as much as we can for its purposes. In the past, we used that for the whole ETL process with data loads, and then we moved to Hadoop storage. At the moment, we are only going to be using Cloud Data Quality and others for cleansing, standardization, and deduplication, and then using some other Azure capabilities.
How are customer service and support?
I've dealt with support in the past. There were issues, and we had to deal directly with Informatica for some hotfixes. They were good. They just got straight to the point and were helpful overall.
How would you rate customer service and support?
How was the initial setup?
It is way more complex to install on-premises than in the cloud.
With the cloud, the installation will be way easier since you only install these secure agents. They have many different connectors, so it is definitely less hustle to install all these machines, and all these applications. On-premises, it was more user-based. Now, it's service-based, and you just pay for what you use and the licenses as well.
We had myself, an architect, and a developer as well as help from Informatica while handling the setup.
We have about two or three people that can deploy and maintain the solution. They also cover other applications, not just Informatica.
What about the implementation team?
We had Informatica support, and we had an internal group of people with Informatica knowledge who handled the solution. For some parts, we were involved as well, and we handled them ourselves.
What was our ROI?
We're still in the early stages of moving toward the cloud. We have not seen an ROI yet.
What's my experience with pricing, setup cost, and licensing?
When you are using the on-premises version, managing the licenses is quite difficult. However, on the cloud, you just pay for what you use, and it's a lot easier. With the cloud, if you want MDM, you pay for it, and if you want PowerCenter, you pay for it; however, if you don't want it or don't use it, you don't pay. We'll just pay for Data Quality, as it has all of the features we need inside it.
I'm not involved in the conversations around licensing and agreements. That said, my understanding is that Informatica is pretty expensive. I'd likely rate it two to two and a half out of five in terms of affordability.
Which other solutions did I evaluate?
We definitely considered others and had StreamSets used for some other purposes. The company that I moved out of was going to be switching off Informatica at some point due to licensing, et cetera, and they just chose to go to StreamSets with Snowflake for storage.
I haven't researched enough about other products in relation to Informatica.
What other advice do I have?
We are moving to the cloud version. On-premises, we were on version 10.4.2, and that moved to 10.5. Soon, we will be on the cloud.
We're using IDMC, which is not just Data Quality. It has governance, Axon, and other applications in it.
We're just a customer.
I'd advise people to research use cases before beginning. Companies need to understand what they are trying to achieve, figure out their requirements, and then appraise the solution.
While Informatica is good in terms of Data Quality and is probably the leading option, you need to be clear about budget, et cetera.
I would rate the solution seven out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.