We don't know how many types of documents will be coming from the client side. We have to read these documents with 100% accuracy. How can we do that? Do you have a solution for this?
Hi Vishal,
You can read the documents in a couple of different ways.
1. If it is a PDF document, you can extract the whole text from the document.
2. If it is a PDF document and you want specific fields from it, you can use the traditional OCR engines available in UiPath, such as Microsoft OCR and Tesseract OCR (Google), to get the data.
If you are not sure where the data is positioned on a document, you can use a newer UiPath feature called Document Understanding to train the bot on your document types so that the bot can extract the data.
Document Understanding from UiPath is relatively new.
If you would rather use a more mature OCR engine for dynamic document extraction, ABBYY offers one called ABBYY FlexiCapture.
Please let me know which one works :)
There are three major file types you’ll likely run into:
1- PDF: you will need to use regular expressions (regex). However, you'll need to test the output on different PDFs to catch any inconsistencies and account for them in the code.
2- Word documents: UiPath offers a Word package with the tools needed to automate Word processes.
3- Excel: UiPath integrates best with Excel.
A general rule is to test the output.
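To make the regex and testing advice concrete, here is a rough sketch in Python (not a UiPath workflow). It assumes the PDF text has already been extracted into a string. The field names, patterns, and sample texts are all made up for illustration, so your documents will need their own.

```python
import re

# Hypothetical patterns for two example fields; adjust them to your documents.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:No\.?|Number|#)\s*[:\-]?\s*([A-Z0-9\-]+)", re.IGNORECASE
    ),
    "total": re.compile(
        r"Total\s*(?:Amount|Due)?\s*[:\-]?\s*\$?\s*([\d,]+\.\d{2})", re.IGNORECASE
    ),
}


def extract_fields(text):
    """Return field name -> captured value, or None when a pattern finds nothing."""
    result = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result


def missing_fields(text):
    """List the fields that could not be found, so they can go to manual review."""
    return [name for name, value in extract_fields(text).items() if value is None]


# Made-up sample texts standing in for different PDF layouts.
SAMPLES = [
    "Invoice No: INV-1001\nTotal: $1,250.00",
    "INVOICE NUMBER INV-2002\nTotal Due - 89.90",
    "Receipt only, no recognizable fields",
]

if __name__ == "__main__":
    for sample in SAMPLES:
        print(extract_fields(sample), "missing:", missing_fields(sample))
```

Running the same patterns over several layouts shows quickly which documents the regex misses. Anything reported as missing is a candidate for a new pattern or for manual review.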
I am a certified UiPath developer and I can help you with your project. You can find me on Upwork, Guru, Freelancer, and Fiverr. Feel free to reach out to me.
I hope that helped.