Try our new research platform with insights from 80,000+ expert users

Grooper vs IBM Datacap comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 10, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Grooper
Ranking in Intelligent Document Processing (IDP)
29th
Average Rating
8.6
Number of Reviews
4
Ranking in other categories
No ranking in other categories
IBM Datacap
Ranking in Intelligent Document Processing (IDP)
8th
Average Rating
7.6
Reviews Sentiment
6.7
Number of Reviews
27
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of May 2025, in the Intelligent Document Processing (IDP) category, the mindshare of Grooper is 0.5%, up from 0.3% compared to the previous year. The mindshare of IBM Datacap is 5.0%, up from 4.3% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Intelligent Document Processing (IDP)
 

Featured Reviews

reviewer1552698 - PeerSpot reviewer
Good data ingestion and classification capabilities, supports various media types and formats, and the interface is easy to use
Currently, we're still using version 2-7-2, and now they're about to do the beta release on their version 2021. In this coming version, we expect that some of our issues will be fixed. We've had challenges in classification tasks where similar documents were flagged as multiple matches. The system would identify them and say, "Hey, I think I've got multiple matches. It could either be this one or that one." Because of that, it required us to instruct the system to either leave it unclassified, or we had to halt the process for somebody to look at it. With the new version for 2021, they have changed the paradigm. As it is now, we're using something called a form type, where pages within the document are referenced using a specific page number. For example, in a ten-page document, you might refer to information specifically on the first or fifth page. In the new paradigm, there is a first, middle, and last page concept, as opposed to having the different form types with all of the different pages. What they're telling me is that it's going to make the classification more accurate. Just because the first page of two different documents looks the same, they will not be considered duplicates. Having multiple points of reference will now allow it to better distinguish them. The other area we have had challenges with is table extractions, where if the data headers were not defined, or the tables did not have descriptions for the columns. My understanding is that in the 2021 version, they've now shown that they're handling that. Again, we don't have it and haven't been able to test it, but it's coming. Technical support is definitely an area that they need improvement in, in terms of the front-line individuals.
DeekshithShetty - PeerSpot reviewer
High-accuracy document processing with easy integration and good image enhancement
The OCR extractions are very good, almost 100% accurate now. The integration with other tools is very easy, with inbuilt actions to configure and directly export to multiple depositories. It is also good at image enhancement, including the removal of lines, which is helpful in processing and automation and ensures very little user intervention, thus saving time.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Lexicons where the key vocabulary can be inputted it is very helpful."
"The user interface is easy to use, and the flexibility is noteworthy."
"Grooper processes difficult sorts of data and unstructured or semi-structured content very well. It's probably one of the better solutions I've seen compared to other solutions I've seen out there. It does a lot more things like segmentation extraction. It does it a lot better. Grooper has more focus on these types of freeform documents where other solutions are very generic and this is a little more elaborate in what they've done. I think they take it to the next level of extracting freeform data."
"There are many options and customizations that you can make to each individual extractor that allows you to tweak it for exactly what you need."
"The most valuable feature is its ability to capture data, which changes all the time into different formats."
"At the forefront of my thoughts, the standout feature of this intelligent product is its remarkable capability. This project we're currently engaged in revolves around streamlining workflows within both our company and the customer's company. It entails handling information from various documents with diverse formats and types, even when they contain the same data. The ability to connect this information with the appropriate database and recognize it irrespective of the format or source is an extremely valuable feature. Moreover, leveraging machine learning is crucial since our customer deals with an extensive archive of over five million documents. Machine learning can significantly alleviate the backlog by becoming well-versed in various scenarios they might encounter during their work once we've completed our application."
"I can have all scanners accessible from my end."
"The OCR extractions are very good, almost 100% accurate now."
"It is the best solution for scanning purposes."
"Very scalable and stable data capture and extraction solution that's very simple to install."
"The second thing that I like about Datacap is the fingerprint capture which is easy to configure on Datacap. From the form of the document, if a document is redundant in the same department, we can configure the capture based on the form of the documents"
"It is highly extensible, which we found to be most valuable. It is a very extensible solution because it is based on configurable rule sets. We were able to amend and adjust the solution and very easily add custom code and custom components. It does require some programming experience, but we found that not to be an issue. We liked its extensibility."
 

Cons

"Technical support is definitely an area that they need improvement in, in terms of the front-line individuals."
"If Grooper could "sense" important fields on the document and auto-build extractors for them, that'd be really cool."
"They should have more sub-extractors or exclusion extractors so that the user does not have to make a parent data type."
"Grooper is new. It's new beta stuff, so we've had some issues, but that's understandable. Getting the beta product to more of a true release is where it needs improvement. I'm going through training now, so it's hard to judge what they have and don't have until I get through that training. Training is the main thing for me because I'm trying to learn and take things I've learned from other products and try to transfer that knowledge to this one."
"Datacap's technology seems a little behind the industry. It's still using the old .NET framework. They should move to .NET Core and start integrating some machine learning. You can do some integration yourself, but you expect a solution to include the latest machine-learning approaches if you're paying reasonable money for it."
"Datacap has performance issues when processing large volumes of documents. We're doing 18,000 pages daily. Scanning takes almost 20-30 minutes, but it normally takes one or two minutes. We informed IBM and opened a ticket for that. They forwarded the issue to developers but didn't give a specific timeline for it to be resolved. Version 8.1 is already at the end of support."
"Speed of OCR is one issue. It's a challenge because we have customers that have millions and millions of pages that they want this solution to crank through. In order to do that you have to have a large infrastructure in place, and that directly impacts licensing based on the core count."
"There should be an increase in the capacity of the workflows. Datacap is a little limited in this aspect. So, you cannot really implement all the possibilities."
"I would like to see the product have the ability to process more documents in parallel. Right now, it is a single queue. Therefore, if you want to really test the load and stress test it, having multiple instances and the ability to scale it up would be great."
"Its weaknesses are primarily tied to the lack of available resources and expertise in the market to effectively support and provide solutions and services to each customer for seamless implementation. Expertise in this specific product is rare throughout the market. One key reason is the product's limited downloads. Additionally, archiving solutions are often perceived as complex and challenging, dissuading many companies from venturing into this domain. Consequently, partners who specialize in archiving solutions are always seeking straightforward, uncomplicated options that are easy to manage and meet customer expectations."
"It can take some time to implement."
"Third-party integration could be improved; it's very slow."
 

Pricing and Cost Advice

"Overall, their pricing is higher than the competitors, but they offer functionality that is otherwise not available."
"Know how many pages you will be needing to process, as the pricing is based on that."
"IBM could offer more competitive pricing. This would allow them to attain more users. Some of our clients are considering moving to a different solution called Encapture which is similar but offers more competitive pricing."
"If you want IBM Datacap on cloud, which is a service run by IBM, the price can be quite expensive, but if you want to just purchase the licenses and own those yourself, then the price is very competitive."
"This solution is the most expensive in the market."
"Pricing depends on how much we use it. We pay per bulk quantity. We pay as you go. Therefore, it sort of depends on our usage of it."
"It is an expensive solution."
"This solution offers seamless integration with other enterprise products, which is my area of responsibility, focusing on government sector projects. Larger enterprise projects don't pose problems. It might be suitable for small businesses as well."
"Pricing needs to stay competitive."
"You save a lot of time and money, but the benefit is you have people who are able to run the systems, check to see if there are any errors at all, and there are a lot less errors than a human system."
report
Use our free recommendation engine to learn which Intelligent Document Processing (IDP) solutions are best for your needs.
850,671 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
18%
Government
14%
Computer Software Company
13%
Insurance Company
9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

Ask a question
Earn 20 points
What do you like most about IBM Datacap?
The installation of the solution is very simple.
What needs improvement with IBM Datacap?
When a client's requirement is not available in Datacap, out-of-the-box actions should be simpler to implement and not very complex. There should be an AI feature where a requirement from the clien...
 

Comparisons

No data available
 

Also Known As

No data available
Datacap
 

Overview

 

Sample Customers

Oklahoma DOT, Mercy Hospital System, OLERS, Oklahoma State University, Change Healthcare, U.S. Nuclear Regulatory Commission, American Airlines Credit Union
Turkcell, PowerSouth Energy Cooperative, Central Nacional Unimed, Conqord Oil
Find out what your peers are saying about Grooper vs. IBM Datacap and other solutions. Updated: April 2025.
850,671 professionals have used our research since 2012.