The accuracy of OCR depends on the document quality.
For unstructured documents embedded OCR tools (from Google, Microsoft) are fairly acceptable.
For semistructured documents (invoices & receipts & bank statements) we are working with Abbyy FlexiCapture tools.
For each document type it is possible to define a layout for data that must be extracted using Abbyy FlexiLayout. This layout is then used in Abbyy FC Document Definition to test the extraction. This will generate so called .fcdot that can be integrated in UIPath workflows using Abbyy Intelligent OCR package. In this way the extraction is very accurate and automation works excellent. Still, the confidence level should be defined in Abbyy FC. In the case of any uncertainties, UIPath raises Abbyy Verification station. It is possible to extract data not only from pdf files, but also from .tiff, .png and .jpg.
Still, to use Abbyy Intelligent OCR package is mandatory to use Abbyy FC Engine. This is licensed separately by UIPath directly and not by Abbyy.
One last recommendation:
Use Abbyy FC Standalone instead of Abbyy FC Distributed. It is mandatory to use integration in UIPath activities for the .fcdot files.
The accuracy of OCR depends on the document quality.
For unstructured documents embedded OCR tools (from Google, Microsoft) are fairly acceptable.
For semistructured documents (invoices & receipts & bank statements) we are working with Abbyy FlexiCapture tools.
For each document type it is possible to define a layout for data that must be extracted using Abbyy FlexiLayout. This layout is then used in Abbyy FC Document Definition to test the extraction. This will generate so called .fcdot that can be integrated in UIPath workflows using Abbyy Intelligent OCR package. In this way the extraction is very accurate and automation works excellent. Still, the confidence level should be defined in Abbyy FC. In the case of any uncertainties, UIPath raises Abbyy Verification station. It is possible to extract data not only from pdf files, but also from .tiff, .png and .jpg.
Still, to use Abbyy Intelligent OCR package is mandatory to use Abbyy FC Engine. This is licensed separately by UIPath directly and not by Abbyy.
One last recommendation:
Use Abbyy FC Standalone instead of Abbyy FC Distributed. It is mandatory to use integration in UIPath activities for the .fcdot files.
Graduate Engineering Trainee at Vodafone Shared Services India
0
0
Most of the times OCR is fairly accurate. But when there are similar texts on the screen it behaves ambiguously. For instance if you have "mode-I" and "mode-II" on the same page it might confuse between the two.
It depend on the content you are scanning. my assumption is if content is clear then it can give 100% but if content is blur or hand written then it can reduce to 50% depend on the clarity of the content
Intelligent Automation Specialist | RPA, AI and Digital Solutions | Hackett Consulting Services
#HCS #RiseOfTheMidMarket
0
0
UiPath uses many different methods of text recognition and reading, including multiple OCR engines such as Microsoft and Google, all of which are very reliable.
Already have UiPath Platform™ for Agentic Automation?
About UiPath Platform™ for Agentic Automation
UiPath (NYSE: PATH) is a global leader in agentic automation, empowering enterprises to harness the full potential of AI agents to autonomously execute and optimize complex business processes. The UiP
With over 2.5 million reviews, we can provide the specific details that help you make an informed software buying decision for your business. Finding the right product is important, let us help.