Try our new research platform with insights from 80,000+ expert users
it_user827655 - PeerSpot reviewer
Principal Developer
Real User
​It lowers the amount of time in development from weeks to a day
Pros and Cons
  • "​It lowers the amount of time in development from weeks to a day.​"
  • "If the SQL input controls could dynamically determine the schema-based on the SQL alone, it would simplify the steps of having to use a manually created and saved schema for use in the TMap for the Postgres and Redshift components. This would make things even easier."

What is our primary use case?

We use it to load our big data system with S3 and Redshift. We also use it to process in HL7 from hospitals in real-time.

How has it helped my organization?

It lowers the amount of time in development from weeks to a day.

What is most valuable?

The ease of transforming data with inputs to TMaps and tJavaRow makes life so easy.

What needs improvement?

There is one place where I would appreciate an upgrade, if it is possible. If the SQL input controls could dynamically determine the schema-based on the SQL alone, it would simplify the steps of having to use a manually created and saved schema for use in the TMap for the Postgres and Redshift components. This would make things even easier. When it does guess the schema it tends to bring back every column from every table or every column from the table specified in the table name in the component. Sometimes, the SQL comes from multiple tables and has some transformations of data. 

I do not know if it would even be possible, but if this could be figured out automatically for the column names and types, that would be amazing.

Buyer's Guide
Talend Data Quality
August 2024
Learn what your peers think about Talend Data Quality. Get advice and tips from experienced pros sharing their opinions. Updated: August 2024.
814,763 professionals have used our research since 2012.

For how long have I used the solution?

More than five years.

What other advice do I have?

I have not run into anything we could not use Talend to find a solution for.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user153093 - PeerSpot reviewer
Data Architect with 5,001-10,000 employees
Vendor
I like the option to start with the community edition. At the same time, it uses a huge amount of memory.

Valuable Features

The option to start with the community edition

Improvements to My Organization

Solves problems with the quality of data applying some business rules. And with the data integrator load data from multiple source to a target source.

Room for Improvement

The usage of memory. This tool uses a huge amount of memory.

Use of Solution

Around 3 years

Deployment Issues

I haven't had problems.

Stability Issues

Yes, I had some problems with the Linux version because of launch some exceptions.

Scalability Issues

No

Customer Service and Technical Support

Customer Service: 5/5 - good customer serviceTechnical Support: 5/5 - good technical support

Initial Setup

We haven't experienced any problems.

ROI

You move data from one source to another without problem and apply some business rules in data.

Pricing, Setup Cost and Licensing

Basically a good server.

Other Solutions Considered

Yes we evaluated: Kettle, CloverEtl and Oracle Data Integrator

Other Advice

Put in a server with a lot of memory and if it’s a hard process then put in a dedicated server.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Talend Data Quality
August 2024
Learn what your peers think about Talend Data Quality. Get advice and tips from experienced pros sharing their opinions. Updated: August 2024.
814,763 professionals have used our research since 2012.
it_user848511 - PeerSpot reviewer
VP of Professional Services at a tech services company with 51-200 employees
Real User
Enables robust data matching, merging, Data Stewardship; needs operationalization of meta data
Pros and Cons
  • "The solution enables robust data matching, merging, survivorship, and Data Stewardship that can be a part of data quality workflows or true master data management."
  • "Needs integrated data governance in terms of dictionaries, glossaries, data lineage, and impact analysis. It also needs operationalization of meta-data."

What is our primary use case?

  • Fixing data by using regular expressions or synonyms and Data Stewardship.
  • Using data profiling to gauge the quality of the data before and after it’s used/needed.
  • Master Data Management - Authoring and matching survivorship, including Data Stewardship.

How has it helped my organization?

It allows our customers to master and expand their products to an international scale. In addition, it enables customers to consolidate multiple, disparate sources of data into a centralized, master data hub which can used for operations or analytics.

What is most valuable?

The solution enables robust data matching, merging, survivorship, and Data Stewardship that can be a part of data quality workflows or true master data management.

What needs improvement?

Needs integrated data governance in terms of dictionaries, glossaries, data lineage, and impact analysis. It also needs operationalization of meta data.

For how long have I used the solution?

Three to five years.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Information Architect at a healthcare company
Vendor
Good and easy debugging functions while better tools for geo-data are needed.

Valuable Features

Maybe the best thing is the product's easy start-up level when you are familiar with Java. Also job creation is fast compared to some other tools. One more good thing is that tables' metadata is easy to bring into the tool and utilize. Last thing to mention here is flexibility to use Java code inside the job.

Improvements to My Organization

These are: fast job creation from start to finish which improves ROI, good and easy debugging functions.

Room for Improvement

First, We faced problems with stability of the products. Also some components were clearly not tested well, which meant that there were bugs. Better tools for geo-data are needed. Documentation was poor in the beginning but it got better over time.

Use of Solution

Talend Enterprise Data Integration 5.1 (1) and Talend Platform for
Data Services (2)

2 years by one customer (without Data Quality (1)), 6 months in other customer (with Data Quality(2))

Deployment Issues

At the customer deployment to the production environment from the test one was a bit exhausting. This could be because they didn't use/know the best-practices.

Stability Issues

Yes we had issues. Quite often the server needed rebooting as if there were memory leaks. Sometimes the CVS version management got stuck.

Scalability Issues

No issues. Only issues were with the Java memory which is scalable and changeable from the job settings.

Customer Service and Technical Support

Customer Service:

Customer service was good most of the time. Answers came in a timely fashion.

Technical Support:

It was good most of the time. Answers came in a timely fashion.

Initial Setup

It was pretty straightforward. Memory settings by the client needed some modification in the first place. From the server point of view I cannot say.

Implementation Team

In house team.

Other Solutions Considered

Yes. We evaluated IBM DataStage.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user154314 - PeerSpot reviewer
Technical Team Lead at a pharma/biotech company with 1,001-5,000 employees
Vendor
Although we faced memory issues with 3GB of RAM, I would recommend this product.

What is most valuable?

JRules, TMap, TParallel, ELT, etc

How has it helped my organization?

It has provided the feature wherein the business could make the changes as requested without performing the ETL deployment code to production.

What needs improvement?

I think the memory issues we faced when using the 3GB RAM compared to the 4GB RAM computers caused lot of issues. Probably can improve in that.

For how long have I used the solution?

4 years - Talend Open Studio 3.1.2, 4.1.3, 5.0, Talend Integration Suite 4.1.3, Talend Data Quality 4

What was my experience with deployment of the solution?

Intially we did encountered issues with the deployment, but over the period of time we were able to find the proper way to perform the deployment and also used a tool called HERMES for the deployment.

What do I think about the stability of the solution?

No issues

What do I think about the scalability of the solution?

No issues

How are customer service and technical support?

Customer Service:

Very nice customer service

Technical Support:

Excellent support from the technical support team

Which solution did I use previously and why did I switch?

Yes earlier we had Ab Initio but switched to Talend because initially it was an Open Studio with no cost involved and also it was supported by the JRules component.

How was the initial setup?

It was not straight forward as it was pretty new to everyone among our team, but over the period of time when we had hands on the tool everything got smooth.

What about the implementation team?

It was a in-house team.

Which other solutions did I evaluate?

Ab Initio, Informatica etc.

What other advice do I have?

I would definitely recommend others to implement this product as it is really helpful, easy to learn, user friendly, provides lot of enhanced features, etc.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Associate Team Lead at a tech services company with 51-200 employees
Real User
Leaderboard
We needed to stop manually finding and cleaning data through Excel spreadsheets.

How has it helped my organization?

Data Quality easily identifiable instead of manual finding and cleaning the data through Excel (earlier used to follow) before ETL

What is most valuable?

Currently the best open source data quality tool available as compared to other open DQ tools ('DataCleaner', 'Open Source Data Quality & Profiling') for of a variety of reasons:

  1. Vast connectors to different DB, Web, CRM, etc
  2. Custom code is allowed
  3. Wide range of advanced algorithms
  4. Recommended for advanced users
  5. Detailed analysis, etc
  6. Large community of users

The most valuable features for us are: custom code, connectors, algorithms.

What do I think about the stability of the solution?

As it is a open source tool, some minor bugs are there.

How was the initial setup?

Fairly straightforward. Lots of user guides and tutorials are available to get started.

What's my experience with pricing, setup cost, and licensing?

The best part is that it is open source.

What other advice do I have?

Great product, surely give it a try.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Senior Consultant at a tech services company with 201-500 employees
Consultant
Customizable and straightforward implementation
Pros and Cons
  • "The solution is customizable."
  • "The performance is one area that Talend Data Quality could improve in because large volumes take a lot of time."

What is most valuable?

The solution is customizable.

For how long have I used the solution?

I have been using Talend Data Quality for approximately four and a half years.

What do I think about the scalability of the solution?

The performance is one area that Talend Data Quality could improve in because large volumes take a lot of time.

How are customer service and support?

I have not needed the technical support.

How was the initial setup?

The implementation is not difficult, it has been straightforward for the implementations we have done.

What about the implementation team?

We do the implementation of this solution.

What other advice do I have?

I rate Talend Data Quality a nine out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user497733 - PeerSpot reviewer
Executive Director and Business Unit Manager at a tech company with 51-200 employees
Vendor
It helps more accurately identify data-quality issues, and it is simple to install.

What is most valuable?

  • Analysing data trends: This works when you add a column to analyse. It shows you max, min, nulls, etc. per field. It allows a snapshot of your data.
  • Duplication

How has it helped my organization?

  • More accurate data-quality issue identification
  • Reporting

What needs improvement?

I would like to see them add a configuration wizard.

For how long have I used the solution?

I have been using for two years.

What do I think about the stability of the solution?

I did not encounter any stability issues.

What do I think about the scalability of the solution?

I encountered scalability issues.

How is customer service and technical support?

I consulted a lot of product forums, but I did not ask for support from Talend.

How was the initial setup?

The Talend software is very simple to install. Because it runs on the Java platform, you need to make sure you have a JRE installed. Then, you download the ZIP file from the Talend website. You extract the file, and the software is ready to use by executing the EXE file.

What's my experience with pricing, setup cost, and licensing?

Try the free version first!

What other advice do I have?

It is a good tool; include it in your planning.

Disclosure: My company has a business relationship with this vendor other than being a customer: We are a Talend distribution partner
PeerSpot user