Antoine JACOB - PeerSpot reviewer
Decision-making & data management manager at Erilia
Real User
Top 20
Data quality features shine with an okay setup and good stability
Pros and Cons
  • "Stability feels fine."
  • "There is a need for mastery in some areas."

What is most valuable?

The data quality features are quite noteworthy and important.

What needs improvement?

There is a need for mastery in some areas.

What do I think about the stability of the solution?

Stability feels fine.

How are customer service and support?

Customer service needs a better approach.

Buyer's Guide
Talend Open Studio
January 2025
Learn what your peers think about Talend Open Studio. Get advice and tips from experienced pros sharing their opinions. Updated: January 2025.
838,713 professionals have used our research since 2012.

How was the initial setup?

The initial setup was okay.

What about the implementation team?

The implementation team used Studio.

What's my experience with pricing, setup cost, and licensing?

The setup cost is expensive, so there are considerations.

What other advice do I have?

I'd rate the solution eight out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer2128914 - PeerSpot reviewer
Business Intelligence Intern at a computer software company with 1,001-5,000 employees
Real User
Many high functioning components, no maintenance required, and straightforward setup
Pros and Cons
  • "The most valuable feature of Talend Open Studio is the tMap component. There is a lot of functionality in one component."
  • "The stability of the solution could improve when running jobs. There can be errors when running projects but in the end, it works well and the errors do not impact the result."

What is our primary use case?

I am using Talend Open Studio for ETL. I extract data from APIs and transform it for the data warehouse.

What is most valuable?

The most valuable feature of Talend Open Studio is the tMap component. There is a lot of functionality in one component.

What needs improvement?

The stability of the solution could improve when running jobs. There can be errors when running projects but in the end, it works well and the errors do not impact the result.

For how long have I used the solution?

I have been using Talend Open Studio for approximately one month.

What do I think about the stability of the solution?

There are times when the solution crashes. When we run more than one or two jobs at once, it can crash or not work well.

I rate the stability of Talend Open Studio a five out of ten.

What do I think about the scalability of the solution?

We have approximately eight people using this solution in my company.

The solution is scalable.

Which solution did I use previously and why did I switch?

I have used Informatica before Talend Open Studio and when comparing the two, the interface is more intuitive in Talend Open Studio.

How was the initial setup?

The setup of Talend Open Studio is simple. The process took approximately two hours and it was my first experience.

I rate the initial setup of Talend Open Studio a ten out of ten.

What about the implementation team?

I implemented the solution.

What's my experience with pricing, setup cost, and licensing?

I am using the open-source version and it is free.

What other advice do I have?

The solution does not require any maintenance.

I recommend this solution to beginners. It is free and the interface is intuitive and has a lot of helpful components.

I rate Talend Open Studio an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Hassan-Mustafa - PeerSpot reviewer
Senior Data Engineer at a financial services firm with 1,001-5,000 employees
Real User
Top 10
A good API integration tool with big data capabilities that needs to be priced better
Pros and Cons
  • "The API integration and big data approach are very good because of how you extract data from JSP files or big data web repositories like MongoDB."

    What is our primary use case?

    We are currently using the free version. We use the solution to extract data from the source system and load it into the data warehouse. We are also doing API and data integrations from TXT or XML files.

    Though we are not currently using it for big data, there is a plan to extract data from MongoDB.

    What is most valuable?

    The API integration and big data approach are very good because of how you extract data from JSP files or big data web repositories like MongoDB. This is an excellent tool for big data and API integration.

    What needs improvement?

    Comparing Talend Open Studio with SSIS, Informatica, or Data Services, I think the overall workspace concept can be improved since these other solutions have workflows and data flows.

    For how long have I used the solution?

    I have been using Talend Open Studio for the past three months.

    What do I think about the stability of the solution?

    The solution is stable enough. It's a mature tool. It is a bit complex in a few areas, such as when writing your ETL Jobs, because tLoop is slightly confusing.

    Which solution did I use previously and why did I switch?

    I have used SSIS, SAP Data Services, and Informatica to a small extent in my previous companies. My current company uses Talend.

    How was the initial setup?

    We currently use the free version, which does not include the built-in scheduler. We had to deploy the job on the server as a file and schedule its batch or shell file with cron jobs. We can deploy the solution and the ETL jobs in 10 to 15 minutes.
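
The cron-based scheduling described above can be sketched as follows. This is a minimal illustration, not the reviewer's actual setup: the launcher path, log path, and schedule are hypothetical, and the helper only builds the crontab line (it does not install it).

```python
from pathlib import PurePosixPath


def crontab_entry(launcher: str, minute: int, hour: int, log_file: str) -> str:
    """Build one crontab line that runs a job launcher script once a day."""
    if not (0 <= minute <= 59 and 0 <= hour <= 23):
        raise ValueError("minute must be 0-59 and hour must be 0-23")
    script = PurePosixPath(launcher)
    # Change into the script's folder first so relative paths in the job resolve.
    command = f"cd {script.parent} && ./{script.name} >> {log_file} 2>&1"
    # Five schedule fields: minute, hour, day of month, month, day of week.
    return f"{minute} {hour} * * * {command}"


# Hypothetical paths: a nightly load at 02:30 with output appended to a log.
entry = crontab_entry(
    "/opt/etl/load_sales/load_sales_run.sh", 30, 2, "/var/log/etl/load_sales.log"
)
print(entry)
```

The printed line could then be added with `crontab -e` on the server. Redirecting output to a log file is one simple way to get the job logging that the free version leaves to the user.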

    What's my experience with pricing, setup cost, and licensing?

    We are using the tool's free version because the enterprise version is a little expensive.

    You would need three enterprise servers if you are going for a full-scale lifecycle, like development, quality, and production. That would be expensive because the servers have yearly costs.

    What other advice do I have?

    We can only have one user in the free version. We are now wondering whether we should go for the enterprise version or switch to a different platform, like the HANA platform, HANA cloud, Datasphere, or another platform called Encarta. There are multiple technologies we are exploring.

    It's a good tool overall, but that depends on whether you have the enterprise version. The free version is not for a team in which you have more than one member. In this case, you have to go for the enterprise version, which is a bit expensive.

    Likewise, you do not get support with the free version, so you won't speak with tech support. There is, however, a community with a knowledge base in forums.

    You also get the scheduler, the deployment, and everything with the enterprise version. You could deploy the solution on the server and then schedule the jobs. Likewise, you get logging and an audit trail of your jobs. Currently, we maintain these jobs manually.

    They should make purchasing the servers for the enterprise version a one-time payment, where you buy the server, and that should be it. The yearly cost is what makes Talend not a great tool.

    Talend is fine if you are only using it for ETL. However, there are other options, such as SSIS, Informatica, and SAP Data Services, and the comparison does not favor Talend Open Studio. Some things are easier with other tools. Talend is logical, but a few things are not very straightforward. If I could choose among ETL tools, I would rank Informatica first, SSIS perhaps second, and then perhaps Talend.

    If Talend Open Studio is cheaper than other solutions, you can go for it.

    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    PeerSpot user
    (2IC) Senior System Analyst at an insurance company with 10,001+ employees
    Real User
    Creates a job stream that connects to multiple data sources, but needs better installation configuration for other databases
    Pros and Cons
    • "The Talend Studio connected to the Talend MDM (Master Data Management) is the most valuable feature. Talend Studio is used to create a job stream that connects to multiple data sources, matches, compares or creates a golden record for overall identification. It also has a good catalogue of objects that can be dragged and dropped for building models."
    • "It needs better installation configuration for other databases. Although the installation allows you to select another database, this doesn't mean that all connection points in the application point to the database selected. You actually need to do a search through the entire install to locate the configuration settings and change them."
    • "In version 6.2 we did encounter issues with the job servers and specifically with ESB. Version 6.3 is better but large jobs can cause the MDM server to fall over, requiring a reboot."

    How has it helped my organization?

    By being able to cross-match records across multiple data sources and create a logical dataflow with options to place rejected records in a separate table, we are able to cleanse and create golden records in multiple categories. Rejected records, once identified, can be assessed for repair. This also means that we can identify how and where the rejected record occurred.
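
The cross-matching flow described above, with rejected records routed to a separate output, can be sketched in plain Python. This is only an illustration of the idea, not Talend's matching components; the field names, sample rows, and reject reasons are all assumptions.

```python
def match_records(primary, secondary, key):
    """Split secondary rows into merged matches and rejects with a reason."""
    # Index the primary source by the match key, ignoring rows without one.
    index = {row[key]: row for row in primary if row.get(key)}
    matched, rejected = [], []
    for row in secondary:
        value = row.get(key)
        if not value:
            rejected.append({**row, "reject_reason": f"missing {key}"})
        elif value not in index:
            rejected.append({**row, "reject_reason": "no match in primary source"})
        else:
            # Primary values win on conflict; the secondary row fills the gaps.
            matched.append({**row, **index[value]})
    return matched, rejected


# Hypothetical sample data from two sources.
crm = [
    {"customer_id": "C1", "name": "Ana Ruiz"},
    {"customer_id": "C2", "name": "Ben Cole"},
]
billing = [
    {"customer_id": "C1", "email": "ana@example.com"},
    {"customer_id": "C9", "email": "unknown@example.com"},
    {"customer_id": "", "email": "nobody@example.com"},
]

golden, rejects = match_records(crm, billing, "customer_id")
print(golden)
print(rejects)
```

Keeping a reason on every rejected row is what makes it possible to assess the record for repair later and to see where the rejection occurred.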

    What is most valuable?

    The Talend Studio connected to the Talend MDM (Master Data Management) is the most valuable feature. Talend Studio is used to create a job stream that connects to multiple data sources, matches, compares or creates a golden record for overall identification. It also has a good catalogue of objects that can be dragged and dropped for building models.

    What needs improvement?

    It needs better installation configuration for other databases. Although the installation allows you to select another database, this doesn't mean that all connection points in the application point to the database selected. You actually need to do a search through the entire install to locate the configuration settings and change them.

    For how long have I used the solution?

    One to three years.

    What do I think about the stability of the solution?

    In version 6.2 we did encounter issues with the job servers and specifically with ESB. Version 6.3 is better but large jobs can cause the MDM server to fall over, requiring a reboot.

    We've built in some self-healing scripts to detect a loss of connectivity and force a restart of the services.
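
A self-healing check of this kind might look like the sketch below. The reviewer's scripts are not shown, so everything here is an assumption: the host name, the port, and the restart action are hypothetical, and the demo uses stand-in probes so it runs without a real server.

```python
import socket
import time


def is_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def heal(host, port, restart, probe=is_reachable, attempts=3, delay=0.0):
    """Probe the service; call restart() once if every probe fails.

    Returns True if a restart was triggered, False if the service answered.
    """
    for attempt in range(attempts):
        if probe(host, port):
            return False
        if attempt < attempts - 1:
            time.sleep(delay)  # wait before retrying, to ride out brief blips
    restart()
    return True


# Demo with stand-in probes (hypothetical host and port, no network access).
restarts = []
healthy = heal("mdm.example.internal", 8180,
               restart=lambda: restarts.append("restart"),
               probe=lambda host, port: True)
down = heal("mdm.example.internal", 8180,
            restart=lambda: restarts.append("restart"),
            probe=lambda host, port: False)
print(healthy, down, restarts)
```

In a real deployment, `restart` would wrap whatever command restarts the affected service on that host, and the check would itself be run on a schedule.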

    What do I think about the scalability of the solution?

    Our Talend installation has been deployed onto Red Hat OpenStack, separating out MDM, TAC, DQ, and the job servers. I made a point of determining data storage requirements for each server, and a memory ulimit setting to match the resource profile of the components. It was trial and error but it paid off by allowing the Talend system to process large jobs of 200-300 million records over a number of hours, rather than days.

    How are customer service and technical support?

    Support tends to be good for the usual types of issues, but once a problem gets more complex and deeply into the nuts and bolts of the product, support struggles.

    Which solution did I use previously and why did I switch?

    Initially we used Pentaho; however, it was determined that it was not as feature-rich as Talend.

    How was the initial setup?

    The initial setup out of the box is straightforward. However, it becomes more complex as you start to distribute the components and get forced down a path of connecting to one type of database for all the components. In my case, I had to deploy Talend using RedHat Ansible and use only a PostgreSQL database.

    I needed to first install the software, search for all references to H2 or PostgreSQL, change the configuration files, and then do it all over again for the distributed installs; then translate this into Ansible scripts. So although it's not directly Talend that made this complex, the installation by Talend gives the option to install to PostgreSQL but doesn't use PostgreSQL for all database repositories.
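
The search step described above (finding every H2 or PostgreSQL reference in the installed configuration) can be sketched as a small scan. The file extensions and the demo files are assumptions for illustration; an actual installation may keep its settings in other file types and locations.

```python
import re
import tempfile
from pathlib import Path

# Match "h2" or "postgresql" as whole words, in any letter case.
PATTERN = re.compile(r"\b(h2|postgresql)\b", re.IGNORECASE)
# Assumed set of configuration file types worth scanning.
CONFIG_SUFFIXES = {".properties", ".xml", ".conf", ".cfg", ".ini", ".yml", ".yaml"}


def find_db_references(root):
    """Yield (path, line_number, line) for each database reference found."""
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file() or path.suffix.lower() not in CONFIG_SUFFIXES:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            if PATTERN.search(line):
                yield path, number, line.strip()


# Demo on a throwaway folder with hypothetical config files.
with tempfile.TemporaryDirectory() as tmp:
    Path(tmp, "app.properties").write_text("db.url=jdbc:h2:mem:test\nname=x\n")
    Path(tmp, "datasource.xml").write_text("<url>jdbc:postgresql://db/mdm</url>\n")
    Path(tmp, "notes.txt").write_text("postgresql is mentioned here\n")
    hits = [(path.name, number) for path, number, _ in find_db_references(tmp)]

print(hits)
```

A report like this gives a checklist of files to change, which is also a convenient starting point for turning the manual edits into automation scripts.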

    What's my experience with pricing, setup cost, and licensing?

    Pricing and licensing are fairly straightforward. It is reasonably priced and managed. It's a good solution overall.

    Which other solutions did I evaluate?

    Pentaho, and prior to that SAS MDM, which was similar but made it harder to create models. We also ran a PoC for IBM InfoSphere MDM, but the cost was considered unacceptable.

    What other advice do I have?

    Make sure you have someone with technical skills and patience to install in a distributed deployment. Learn the product well and build in your own log shipping with either Splunk or Elastic or Telegraf to ease your diagnostic pains.

    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    Architect at Elections Canada
    Real User
    A stable solution that is easy to install and works well so far, but the error-handling is not user-friendly and it needs a built-in tool for geospatial data
    Pros and Cons
    • "I didn't have many problems installing it. It seemed very straightforward to me."
    • "I think my biggest problem with the tool is that the errors are very hard to debug."

    What is our primary use case?

    I use the open-source version of this solution. It's data integration-related. Right now, I'm doing some legacy ETL into a new enterprise data repository.

    What needs improvement?

    When the tool has an error, the error-handling is not that user-friendly. It might just be my inexperience with the tool, but I struggle a lot with finding my layers sometimes. I think my biggest problem with the tool is that the errors are very hard to debug.

    Our special need is geospatial data, so that could be included as part of the tool and not only as an add-on. It's always hard to do add-ons and tools when it's a third party, and every time there's going to be an upgrade, you don't know if it will keep working. It would be great if they were included as part of the main tool, like FME. That would be my main concern about the Talend tool right now.

    For how long have I used the solution?

    I have been using this solution for about a month. 

    What do I think about the stability of the solution?

    The solution is stable. 

    What do I think about the scalability of the solution?

    The solution has add-ons that you can add. I'm dealing with geospatial data, and you have to have add-ons to do that, but I haven't gotten a chance to add them yet. The first add-on I tried didn't work, but I think it might just be outdated and not compatible with the version of the product that I have.

    There are two of us using this solution, my colleague and myself, but in the next month or so we are going to get the Professional version in a private cloud environment. We're exploring the tool to decide if it's going to become our tool of choice for the data repository and data warehouse. Right now, we're working locally with the open-source version of it, but we're trying to decide if Talend is the right tool for us and in order to do that, we want to have the proper version of Talend in the proper environment and then do all the tests.

    Which solution did I use previously and why did I switch?

    FME has been used for a long time at my company, strictly for managing geospatial data. Now, we're creating an enterprise repository, and we need to deal with the regular data plus the geospatial data together, so we're trying to find a tool that will be able to handle both of these types of data, and manage all of the enterprise requirements that we have. We have used solutions like FME, but I don't believe there was any other tool that was used on the scale that we're trying to use Talend now.

    How was the initial setup?

    The initial setup was pretty straightforward. I didn't have many problems installing it. It seemed very straightforward to me. 

    Deployment can take a few hours, depending on your environment, because it requires some things that you may need to install if you're missing them. I think it took me probably under an hour to do it.

    What about the implementation team?

    I did the deployment myself on my local workstation. 

    What's my experience with pricing, setup cost, and licensing?

    Right now, because we're using the open-source version, there's no cost. However, down the road when we use the Professional version, there will be costs. I don't know what the cost will be because I'm not involved in that, but I know that there is a license at that point.

    What other advice do I have?

    I would rate this solution as a seven out of ten. 

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    Omar_Ismail - PeerSpot reviewer
    ECM, Archives and Digital Preservation Consultant at DataServe
    Real User
    Top 5 Leaderboard
    An integration and warehousing solution with reasonable pricing
    Pros and Cons
    • "It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. The commercial version is the same, with additional connectors. The graphical design environment is also very easy."
    • "The solution needs more integrations."

    What is our primary use case?

    We use the solution for integration, warehousing, and enterprise service bus (ESB).

    How has it helped my organization?

    The technical team uses media resources from Facebook and other social media platforms for ETL.

    What is most valuable?

    It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. The commercial version is the same, with additional connectors. The graphical design environment is also very easy.

    What needs improvement?

    The solution needs more integrations.

    For how long have I used the solution?

    I have been using Talend Open Studio for two or three years.

    What do I think about the stability of the solution?

    The product is very stable.

    What do I think about the scalability of the solution?

    The solution’s scalability is good. We have five users and are building ETL integration and design to implement with another system. We have three licenses for developers.

    Which solution did I use previously and why did I switch?

    We worked with SSIS, but it had only a few primitive integrations and was hard to use. We switched to Talend because of its two-way communication integration. You can add components for the warehouse and for quality. It is a complete platform for integration.

    How was the initial setup?

    The initial setup is very efficient and takes about a half day to complete with configuration and needed components.

    What was our ROI?

    Talend gives you what you want without hiring new people and offers extra services. They train mid-senior developers to use it perfectly.

    What's my experience with pricing, setup cost, and licensing?

    The product’s pricing is reasonable. It has an annual subscription.

    What other advice do I have?

    I recommend the solution because it has a huge community in which to get support. We have additional connectors from the community and support.

    Overall, I rate the solution a nine out of ten.

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    Data Analyst at Object Nation PL
    Real User
    Top 20
    A powerful solution for data integration with user-friendly interface, flexibility, and extensive component library
    Pros and Cons
    • "The standout feature for me is the user-friendly nature of the components."
    • "When faced with a challenge, such as the necessity to link up with an unconventional data source like the legacy Cyprus Vision database that wasn't inherently supported by Talend, I had to resort to writing Python code to establish the connection."

    What is our primary use case?

    We use it to construct a robust data warehouse. This involves seamlessly integrating data from diverse sources into our data warehouse. It plays a crucial role in ensuring data cleanliness and quality throughout the integration process. We also use it for effective fraud detection measures, further enhancing the reliability and security of our data warehouse.

    What is most valuable?

    The standout feature for me is the user-friendly nature of the components. They make the entire process exceptionally easy. Another aspect I appreciate is the scripting part, where Java code can be written. Many of the components I've utilized, especially those for data connections and recommendations to various sources, have proven highly beneficial. The capability to seamlessly integrate Python code within Talend Open Studio adds another layer of versatility.

    What needs improvement?

    When faced with a challenge, such as the necessity to link up with an unconventional data source like the legacy Cyprus Vision database that wasn't inherently supported by Talend, I had to resort to writing Python code to establish the connection. The administration console could potentially streamline payment management processes.

    For how long have I used the solution?

    I have been working with it for ten years now.

    What do I think about the stability of the solution?

    The stability provided is excellent, we didn't have any issues with it.

    What do I think about the scalability of the solution?

    It provides good scalability capabilities.

    Which solution did I use previously and why did I switch?

    We used DataStage some time ago. More recently, I've also had experience with Microsoft Integration Services. Comparing the two, I found Microsoft Integration Services to be more user-friendly than DataStage.

    How was the initial setup?

    The initial setup was straightforward.

    What about the implementation team?

    We simply download the tool and place it into a designated folder. After setting up, we execute the tool and commence job development. Once the job development phase is complete, we build the jobs, generating a Java file that can be further exported to a job scheduler for seamless integration into our workflow.

    What's my experience with pricing, setup cost, and licensing?

    The cost, particularly in Africa, is quite high.

    What other advice do I have?

    Overall, I would rate it eight out of ten.

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    Archan Chatterje - PeerSpot reviewer
    Consultant at Keyrus
    Real User
    A solution that offers good scalability and stability with a responsive technical support team
    Pros and Cons
    • "The initial setup was quite straightforward. The deployment took between two and three days."
    • "The profiling perspective needs improvement. Instead of using it in the studio, we are using a different tool which is also provided by Talend. It's redundant."

    What is our primary use case?

    We primarily use the solution just to pull the data from the source to the landing zone. We are using the administrative console as well.

    What needs improvement?

    The profiling perspective needs improvement. Instead of using it in the studio, we are using a different tool which is also provided by Talend. It's redundant. They should remove that from studio to make it more lightweight or improve upon its interface.

    The two things that Talend lacks are MDM and CDC. I know Talend has both of them, but neither is exactly usable in actual scenarios. I know that Talend has some integrations with other companies for MDM, but I'm not sure what they're doing for CDC. Maybe Talend can work something out regarding the CDC feature. I know the solution has its own CDC, but they're not focused on it as much as people would like them to be. I would like to know what their plans for it are or if they plan to partner with another organization to market it.

    For how long have I used the solution?

    I've been using the solution for four or five years.

    What do I think about the stability of the solution?

    Currently, the solution is stable. However, three or four years back, it used to be a bit unstable. The software is quite stable as well.

    What do I think about the scalability of the solution?

    The solution has quite a good range of scalability. The jobs are quite scalable since we have now integrated them with Docker so we can follow all the principles of microservices. 

    We have a license for five users. We have someone focused on data quality. Someone else is looking into the solution as a tester, and testers are working on the stewardship tool to correct the data and verify it with the business. Then there are two free developers who are solely working on the Open Studio to turn out code. 

    We do have plans to increase usage.

    How are customer service and technical support?

    I don't have any complaints with technical support. They've been very responsive.

    Which solution did I use previously and why did I switch?

    We previously used a wide variety of solutions, including Informatica.

    How was the initial setup?

    The initial setup was quite straightforward. The deployment took between two and three days. 

    We deployed it in our Unix boxes using a third-party tool called Jenkins.

    The whole deployment model is automated and it happens every three weeks. There are just two people needed for maintenance.

    What about the implementation team?

    We are the consultants. We handled the implementation ourselves for our clients.

    What's my experience with pricing, setup cost, and licensing?

    There are costs above the standard licensing fee, for example, if you need storage space.

    Which other solutions did I evaluate?

    We evaluated other options, such as Azure Data Factory.

    What other advice do I have?

    We are using the AWS public cloud deployment model.

    I would recommend the product. As long as you follow the best practices you will get what you want out of it.

    I would rate the solution eight out of ten.

    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    PeerSpot user
    Buyer's Guide
    Download our free Talend Open Studio Report and get advice and tips from experienced pros sharing their opinions.
    Updated: January 2025
    Product Categories
    Data Integration