What is our primary use case?
It used to be Talend Open Studio, and I've got a paid version that's fully supported by Talend. Now, it's called Talend Cloud. Talend Open Studio parts are exactly the same as the open-source version. But with the cloud platform, when you build a job or an API, you publish it onto the cloud platform, and that manages all the API functionality really well.
The best part is that if something goes wrong with an error or an issue, the logging is phenomenal. It tells you exactly what's wrong. That, coupled with Talend Open Studio, makes it really easy to resolve any issues.
Talend Open Studio itself is fantastic. When you build a job or an API, it tells you what's wrong, what needs to be fixed, and how to fix it. Plus, Talend does the support documentation, and it's really straightforward to follow.
There's plenty of videos online on how to build jobs to transform data, build APIs, and link to other databases. Talend can handle quite major transformations and ETLs. So it's really simple and easy to follow, with all the documentation being intuitive.
We use probably about 10% of Talend's functionality. But the primary use is keeping multiple databases (near 200 different databases) in sync. Talend integrates that information with REST APIs. We're moving a lot of our integration into the Talend platform, taking it from old XML-based integration to REST APIs.
We use Crunchy Data's Crunchy Bridge solution for a lot of our databases in the back end, and Talend, coupled with Postgres, creates a very simple platform for us to build integrations really fast. It also means that we're using intelligence there on change events that occur in the database to keep multiple repositories in sync.
We're moving towards more of a single source of truth platform rather than multiple sources of truth that we've been dealing with for the last fifteen or so years.
How has it helped my organization?
Data transformation has proven to be most effective for data management. That's amazing. We have a lot of external parties that gather information for us. We're a global government council, so we're transforming XML, JSON, comma-separated files, emails—every format that you can deliver data to us. GeoJSON, geometry, and spatial data—we're all feeding it through the Talend platform, and we're bringing it all into a single format from Postgres.
It's just so simple to be able to take those foreign formats and transform them into the standard format that we have in-house. Likewise, we can take data from our database and transform it back into those other formats so that external parties can use their existing platforms without involving a human to do any data set in between. It's just made a huge improvement in how fast we can get data out and also consume data for the scientists that we have internally.
The error handling capability in Talend is really what sold us on the product. You can get everything from a full Java stack trace right down to putting a job through and stepping through it one step at a time.
Each component within that job can be set to different levels, from warnings to the actual error itself. It's incredible how easy it is to find a problem and resolve it. Nine times out of ten, it prompts you on what you should be doing. So, if you've done something wrong, maybe you haven't got the schema quite right; it'll actually tell you that you have an error in the schema before you publish the job.
It's subtle, so when it actually even finds errors, it won't go live to production or development until you fix those errors. And if an email does go through, or if there's an ETL with data, it tells you exactly what's wrong. And, generally, most of the time, it gives you some indication of what you need to do to fix it.
Data Governance Maturity
We're on the low end of maturity when it comes to data governance. But one of the beauties of Talend is that we now have one place to go to define business rules around data quality and data ownership. We're moving down the path of a more domain-oriented architecture, giving business owners more control of their own data by structuring our architecture around our business services.
Empowering Analysts
Talend Cloud is making it easy for us to have both developers and data analysts. Data analysts are able to manipulate the data through the cloud solution without having to write jobs and APIs. We publish those APIs and jobs, and they can even manipulate things through pipelines within Talend Cloud.
This means you don't need a highly trained developer. You can just have an analyst who knows how to use the toolset but doesn't need to know the details or mechanics of how everything works.
Streamlined Data Management
This makes it easier for us to align our services with our data. Validation and other processes are pre-built into the jobs themselves, which means that data quality, data control, and data ownership are a lot clearer and in line with our business services.
This streamlines things and makes them a lot easier. However, our maturity level around data governance is still low. We didn't really have any until about two years ago. It was a bit of a wild west when it came to data.
What is most valuable?
The interaction with Talend is minimal, mainly because the documentation is very mature. For most of the issues we've had or things that we've needed to try that we weren't quite sure about, we just went straight back to the documentation, and there was an example of pretty much everything that we needed to do.
We have had quite a bit of interaction with the support side of Talend, but almost every instance has come back to something with our firewall rules or the way we've set up our security internally. Our operations team just started to learn about Azure. We've been on Azure for about four years, and our operations team still hasn't quite gotten the grasp of turning out all of our hands there.
But most of the issues that we have had, our calls with Talend, have been around security and how we basically get out of our Azure environment to keep somebody outside of our banks.
What needs improvement?
Talend is doing a lot of work at the moment, and it's not there yet, but the whole platform could be managed in a SaaS-type environment. You still need to have the Studio running on a virtual desktop or a PC. They will get to be able to do the whole thing inside your browser, so you don't need to install anything locally. It's down the track, and it's the nirvana that we were looking for in Boomi.
But the biggest challenge they have is that the platform is so focused on the Studio for all of its development. They'll probably get there, but they have such a mature Studio client that it's a huge amount of work to get all of that functionality into a browser or SaaS platform. That's pretty much the biggest flaw with the Talend environment—being reliant on the Studio, which needed to be on a local machine.
The only other thing is that you have to integrate into an API gateway. We're in Azure, so we use Microsoft Azure Gateway. It doesn't come with its own gateway, which is another sort of big plus side that we saw in Boomi. Talend isn't quite there yet with the API gateway.
Other than that, it's bloody hard to find something because it just seems to be all good.
For how long have I used the solution?
We've been using it now for 24 months. We ran it parallel with Boomi at the same time, so we ran Boomi for about a year and a half. And we've been using Talend now for about 24 months. It'll be 24 months in June. What took us about six months in Boomi, we accomplished in about two months in Talend. It was a lot easier to pick up and use.
What do I think about the stability of the solution?
We haven't noticed any slowness. We have an internal KPI for APIs to be responsive within 400 milliseconds, and if not, we have thresholds and alerts set up. We haven't had any APIs even come close to those thresholds.
This is partly due to our architecture, which uses storage queues or message queues for a lot of the ETL or data transfer. The APIs themselves get a sort of special treatment that ensures they have all the resources they need.
- Standard Configuration Sufficient:
As for the Talend solution itself, we haven't experienced any impact from updates causing slowness or needing to increase the service to a larger amount of RAM beyond the standard configuration.
What do I think about the scalability of the solution?
Scalability is part of our licensing agreement. We can scale both horizontally and vertically. Adding additional servers is quite easy; we just spin up a new engine.
Load Balancing:
The nice thing is that when you create engines, you load them into a cluster pool. If a certain job or API is under heavy load, it will automatically move the load across multiple servers. So, it does a kind of load-balancing
Resource Optimization:
So, it does a kind of load balancing. We use Azure load balancer, but Talend is able to see load balancing across different environments. A job comes in, and Talend determines which server is free. If five servers are all under stress, it will bring up another server and use that at the same time. It's very easy for us to scale both horizontally and vertically.
How are customer service and support?
Talend's support is good. We've got a lot of solution internally, and they provide third-party support.
The only negative is that their support all comes out of Singapore, so we do have about a five-hour time gap between New Zealand and Singapore. But the people are great, even though English isn't their first language.
They provide really good documentation, so 90% of the time we can sort things out just by reading the FAQs or the documents they have.
For the times we do need to talk to people, nine times out of ten, they respond within two or three hours of us logging a call. And pretty much every job we've logged with them has been a problem in our environment, and we created ourselves within our own security. We actually haven't had a problem with the Talend software itself.
Talend's support is really good. They're actually very, very patient and helped us resolve the issues we've had within our own environment. It's not actually part of the Talend platform. It's interesting enough that our security team learns more from them than they do from our security provider.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
We used Boomi. We ended up switching from Boomi to Talend Open Studio. When things started getting really challenging, we had another project that was already using a solution that was very similar to Boomi, and it turned out that that solution was actually better suited to our needs than the Boomi platform.
Mainly around debugging or finding errors and stuff like that, so many other platforms can be a little bit easier to use or even troubleshoot.
Talend is a lot easier to manage and a little bit more mature product.
With Boomi, we were constantly moving from one screen to another, and you could tell it wasn't reaching its potential. So, we ended up going for Talend's own platform, and that's when we cut out development time for APIs and data ETL jobs down to just a few hours, if not minutes.
With Boomi, we were spending quite a bit of time trying to figure things out. Talend is just a phenomenal product. Boomi will get there eventually, they're just putting a lot of time and money into development, but Talend is already able to deliver the functionality that we're after.
How was the initial setup?
It's actually one of the simplest things that we've had to deploy. You quite literally go to the manual and follow it step by step. Our main operations guy did the installation. He was absent sometime, and we had a new contractor start up. We just gave them the documentation from the website, and they followed it step by step.
Everything just seemed to work. There was no real fireworks. The fireworks did come about when we put on the security certificate onto the machine. All the certificates that we had in the Java environment were using relative paths, and we found that they needed to use a full path instead. It took us two or three weeks to figure it out.
We even worked with Talend support for a while until we figured it out. The documentation is an example of what most vendors strive for, but Talend has documented everything really, really well.
What about the implementation team?
Our main operations guy did the installation.
Deployment time:
For deployment time, a remote engine, which is what everything runs on, takes about 45 minutes to do a full install and configuration. And that's not just double-clicking the executable; it's doing all the configuration as well.
We've had to do that because we've upgraded to Java 17, which required us to set everything up on a new VM. That was an internal decision, not a Talend decision.
We just wanted to set up a fresh server. It was 45 minutes to an hour, and it was all installed and running. We were able to deploy APIs and jobs onto that and do the testing.
If it's just a job or an API that we've created, you can build it and deploy it within 30 seconds to a minute after you've built it. It's really easy for developers to deploy new jobs onto the server, test them, and give you feedback very quickly.
Deployment resources:
We generally have two people working on it at any one time. We've created several different templates, so if someone wants a new API or a new job to transfer data, we have pre-built templates to speed up development.
Most of the customization is done via configuration. They just type the information into variables, and in nine times out of ten, they don't actually have to change anything in the job itself. The entire job can be deployed as a complete brand new job, without having to build everything from scratch.
These templates make it really easy to deploy new work. My job is to define what models or templates to use, and then I hand them over to the developers. We've gone from six or seven people spending a week building things in Python or various other languages to just two developers spending half a day a week building jobs.
We've been able to realign our resources to move on to tougher challenges. Our integration now is significantly easier than what it used to be a few years ago.
What was our ROI?
Return on investment is kind of hard to quantify, but initially we sped up development with nine people for at least three years.
After the first year, we really had enough with three people. But as a council, it's hard to calculate with a caveat or put it under internal costs.
The easiest way to quantify it is that rather than years for full deployment, we're probably about two and a half. So, we've reduced the number of staff needed to support the platform by about 300%. If we figure out that in reference to an investment, that'd be fantastic.
What's my experience with pricing, setup cost, and licensing?
Pricing is always a challenge. It is quite an expensive model, but because the platform is so simple to use, we haven't had to purchase any additional licenses.
Unexpected Efficiency:
We started out with three developer licenses. I'm not sure we've even used all of them. After the first year, we didn't actually need any more licenses because the guys that were using it were getting through ten times as much work as we expected.
So we don't actually have enough work for six developers, let alone the three developers that use it one day a week. We've spent time on building templates and standard patterns, and as a result, within a year, our workload was pretty much down to zero. We're probably about a year ahead of schedule compared to our original roadmap.
Overall Value:
So, it is very pricey, but we've been able to chew through the work so fast that it's actually hard to justify buying more licenses.
What other advice do I have?
Overall, I would rate it a nine out of ten. I've been recommending Talend for six or seven months now to almost anybody who's interested in that space.
Compared to other products, we had about 12 on the list that we analyzed, and we ended up choosing Boomi and Talend. And even then, those two were quite significantly out in front. It's a low-code, no-code, simple-to-use platform that's easy to support.
Talend is so far out in front of everybody else that we know it's very hard not to recommend them to anybody. It's all around error handling and ease of use. Boomi does so many tasks so well, but in some areas, it's really quite hard to use.
Those areas are where Talend is so easy. It's a real big difference. Functionally wise, they're both providing the same functions, but Talend is significantly easier to use.