In a recent question posted on IT Central Station, a community member asked for advice about designing reports using Jaspersoft. He's comparing Pentaho vs. Jaspersoft which he would like to use to clean the data. Here's a quick roundup of what our community members had to say:
reviewer111504 wrote that it "Makes no difference really from a design and report point of view. They both are created from the Kettle codebase however Pentaho with its latest release is revamping its data ingestion solutions so its ahead of the curve against Jasper ETL. If you already paid for the full Jasper licences then its a no brainer to use Jasper toolkit."
Several users were quick to explain that "Jaspersoft is strictly a reporting tool and has no ETL capability. Pentaho has some ETL capabilities, but out of my experience I Will always use Talend DI." (Igor Korelič). reviewer72435 also wrote "Jaspersoft is for embedding analytics/reports within applications predominantly in a Java application framework. I don't believe it has any "clean the data first" capabilities, which is more of a data quality/ETL requirement."
Another user, Ivan de Vargas Lopes Jr says "Ideally, you use an ETL tool like Talend or SSIS (SQL Server Integration Services), because they are specialized tools for data processing. But between the two tools mentioned, the Pentaho is simpler than the Jaspersoft."
BIExpert221 is clearly a Pentaho fan. He wrote "Only have background in Pentaho usage so can’t speak to how easy Jaspersoft is to use, however, I have to say Pentaho DI is excellent and is one of the most straightforward ETL tools I’ve seen in 15 years of working in BI. It is an extremely user friendly interface and has an impressive and passionate user community who can help you if you get stuck, as well as excellent documentation on how to do things. I’d thoroughly recommend using Pentaho DI"
CIO Marty Smith notes that neither product is good for large data volumes. Mainly, he says "The data cleansing capability of the two products is very comparable. But it really depends on the volume of data and the time allowed to clean and load the data."
Do you agree with these recommendations? Please let the community know by commenting below.
Jaspersoft had no ETL tools, or tools to generate cubes Mondrian although it is perfectly integrated with tools from Pentaho, however I think it is a great success which has included Talend ETL tool for Jaspersoft (called JasperETL), which is best ? Well, I think it depends on who is going it to use Talend is a code generator and more powerful metadata that Kettle, has many more components in tool, however Kettle can use Open Office to define components this is an advantage that formulas can use this utility, also because it is more portable Kettle will generate an XML file as a project while Talend is a complete project as a workspace with their own files, in performance at least I have not had problems with any although some claim that Talend has the advantage.
Jaspersoft does have a ETL tool called Jaspersoft-ETL which is an OEM of the Talend BI tool. Having this as part of the JS toolset enabled a better overall solution than Pentaho. The Pentaho ETL seems like an ETL lite, a little more intuitive but nowhere near as functional. Another differentiator for Jasperosft is that it offers a better embedding solution, the Javascript API really blows away the traditional ways of embedding.
I am using Pentaho's di (spoon) atm at work for data warehousing and dashboard design, and it's quite good but I'm not too sure if its as well established as the Talend BI which is pretty much an OEM that Jaspersoft-ETL uses as @elesh mentioned. I think they are both great tools but if you were limited to time and had to chose among the two I would personally suggest the Talend or Jaspersoft-ETL over the spoon.