Try our new research platform with insights from 80,000+ expert users

Grooper vs IBM Datacap comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 10, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Grooper
Ranking in Intelligent Document Processing (IDP)
28th
Average Rating
8.6
Number of Reviews
4
Ranking in other categories
No ranking in other categories
IBM Datacap
Ranking in Intelligent Document Processing (IDP)
7th
Average Rating
7.6
Reviews Sentiment
6.7
Number of Reviews
27
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of April 2025, in the Intelligent Document Processing (IDP) category, the mindshare of Grooper is 0.5%, up from 0.3% compared to the previous year. The mindshare of IBM Datacap is 4.7%, up from 4.4% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Intelligent Document Processing (IDP)
 

Featured Reviews

reviewer1552698 - PeerSpot reviewer
Good data ingestion and classification capabilities, supports various media types and formats, and the interface is easy to use
Currently, we're still using version 2-7-2, and now they're about to do the beta release on their version 2021. In this coming version, we expect that some of our issues will be fixed. We've had challenges in classification tasks where similar documents were flagged as multiple matches. The system would identify them and say, "Hey, I think I've got multiple matches. It could either be this one or that one." Because of that, it required us to instruct the system to either leave it unclassified, or we had to halt the process for somebody to look at it. With the new version for 2021, they have changed the paradigm. As it is now, we're using something called a form type, where pages within the document are referenced using a specific page number. For example, in a ten-page document, you might refer to information specifically on the first or fifth page. In the new paradigm, there is a first, middle, and last page concept, as opposed to having the different form types with all of the different pages. What they're telling me is that it's going to make the classification more accurate. Just because the first page of two different documents looks the same, they will not be considered duplicates. Having multiple points of reference will now allow it to better distinguish them. The other area we have had challenges with is table extractions, where if the data headers were not defined, or the tables did not have descriptions for the columns. My understanding is that in the 2021 version, they've now shown that they're handling that. Again, we don't have it and haven't been able to test it, but it's coming. Technical support is definitely an area that they need improvement in, in terms of the front-line individuals.
DeekshithShetty - PeerSpot reviewer
High-accuracy document processing with easy integration and good image enhancement
The OCR extractions are very good, almost 100% accurate now. The integration with other tools is very easy, with inbuilt actions to configure and directly export to multiple depositories. It is also good at image enhancement, including the removal of lines, which is helpful in processing and automation and ensures very little user intervention, thus saving time.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The user interface is easy to use, and the flexibility is noteworthy."
"Grooper processes difficult sorts of data and unstructured or semi-structured content very well. It's probably one of the better solutions I've seen compared to other solutions I've seen out there. It does a lot more things like segmentation extraction. It does it a lot better. Grooper has more focus on these types of freeform documents where other solutions are very generic and this is a little more elaborate in what they've done. I think they take it to the next level of extracting freeform data."
"There are many options and customizations that you can make to each individual extractor that allows you to tweak it for exactly what you need."
"Lexicons where the key vocabulary can be inputted it is very helpful."
"Both Datacap Studio and Datacap Navigator are great features."
"It is the best solution for scanning purposes."
"It reduces human error and saves time."
"It's resiliency. There are multiple ways of identifying what you are looking for. There are multiple export formats."
"The administration of the application following an error is most valuable. We are able to know easily when something is stuck in the system."
"It is highly extensible, which we found to be most valuable. It is a very extensible solution because it is based on configurable rule sets. We were able to amend and adjust the solution and very easily add custom code and custom components. It does require some programming experience, but we found that not to be an issue. We liked its extensibility."
"Very scalable and stable data capture and extraction solution that's very simple to install."
"It's a platform, not a configured application, so you can do what you want with it."
 

Cons

"Technical support is definitely an area that they need improvement in, in terms of the front-line individuals."
"They should have more sub-extractors or exclusion extractors so that the user does not have to make a parent data type."
"If Grooper could "sense" important fields on the document and auto-build extractors for them, that'd be really cool."
"Grooper is new. It's new beta stuff, so we've had some issues, but that's understandable. Getting the beta product to more of a true release is where it needs improvement. I'm going through training now, so it's hard to judge what they have and don't have until I get through that training. Training is the main thing for me because I'm trying to learn and take things I've learned from other products and try to transfer that knowledge to this one."
"Speed of OCR is one issue. It's a challenge because we have customers that have millions and millions of pages that they want this solution to crank through. In order to do that you have to have a large infrastructure in place, and that directly impacts licensing based on the core count."
"The reading efficiency of the solution needs to be improved."
"I would like to see the product have the ability to process more documents in parallel. Right now, it is a single queue. Therefore, if you want to really test the load and stress test it, having multiple instances and the ability to scale it up would be great."
"Its weaknesses are primarily tied to the lack of available resources and expertise in the market to effectively support and provide solutions and services to each customer for seamless implementation. Expertise in this specific product is rare throughout the market. One key reason is the product's limited downloads. Additionally, archiving solutions are often perceived as complex and challenging, dissuading many companies from venturing into this domain. Consequently, partners who specialize in archiving solutions are always seeking straightforward, uncomplicated options that are easy to manage and meet customer expectations."
"I would like better ease of use and more support options."
"When a client's requirement is not available in Datacap, out-of-the-box actions should be simpler to implement and not very complex."
"When I scan a document in Datacap that has a watermark or the document is a little distorted, the image output is poor. It either becomes completely black, or there is so much distortion that we cannot read the numbers or the addresses mentioned in the POD. When we scan a document, we expect the output to be at least 95 percent accurate."
"There should be an increase in the capacity of the workflows. Datacap is a little limited in this aspect. So, you cannot really implement all the possibilities."
 

Pricing and Cost Advice

"Know how many pages you will be needing to process, as the pricing is based on that."
"Overall, their pricing is higher than the competitors, but they offer functionality that is otherwise not available."
"In Egypt, we have exchange rates that change year to year, and currently, we're facing an increase in the exchange rate between our Egyptian pound and the dollar. Since we have been using IBM DataCap two years ago we had a good price, it was not expensive. However, I cannot say now if it is expensive or not because of the exchange rate. Additionally, I don't have the data of other competitors and I don't know the prices."
"This solution is the most expensive in the market."
"This solution offers seamless integration with other enterprise products, which is my area of responsibility, focusing on government sector projects. Larger enterprise projects don't pose problems. It might be suitable for small businesses as well."
"We were using the User Value Unit licensing, which means we get charged per active user of the system, and if I'm not mistaken, we also had it for the rule runner service. They had a PVU license model, which is a processor value unit. For each process that we have in our system, we pay a certain amount of money. We found the pricing to be quite steep. It was really an expensive solution in comparison to Kofax, which had a different licensing model and was actually cheaper overall because they charge per page and not per user and per process."
"It varies, and it depends on the client's requirements and negotiations. Nowadays, Datacap is also included in the IBM Cloud Pak for Business Automation."
"It is an expensive solution."
"You save a lot of time and money, but the benefit is you have people who are able to run the systems, check to see if there are any errors at all, and there are a lot less errors than a human system."
"This solution offers seamless integration with other enterprise products, which is my area of responsibility, focusing on government sector projects. Larger enterprise projects don't pose problems. It might be suitable for small businesses as well."
report
Use our free recommendation engine to learn which Intelligent Document Processing (IDP) solutions are best for your needs.
845,040 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
19%
Computer Software Company
15%
Government
13%
Insurance Company
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

Ask a question
Earn 20 points
What do you like most about IBM Datacap?
The installation of the solution is very simple.
What needs improvement with IBM Datacap?
When a client's requirement is not available in Datacap, out-of-the-box actions should be simpler to implement and not very complex. There should be an AI feature where a requirement from the clien...
 

Comparisons

No data available
 

Also Known As

No data available
Datacap
 

Overview

 

Sample Customers

Oklahoma DOT, Mercy Hospital System, OLERS, Oklahoma State University, Change Healthcare, U.S. Nuclear Regulatory Commission, American Airlines Credit Union
Turkcell, PowerSouth Energy Cooperative, Central Nacional Unimed, Conqord Oil
Find out what your peers are saying about Grooper vs. IBM Datacap and other solutions. Updated: March 2025.
845,040 professionals have used our research since 2012.