This project interprets text images via the system file explorer and utilizes 3 OCR engines to convert images within a folder into respective text files. This enables the user to process images in a compact, efficient and convenient manner. This allows the user to easily convert properly scanned images while maintaining the integrity of the document format. This robot helps us in the following ways:
- It removes the intermediate steps of locating an Image-to-Text conversion website or software.
- It utilizes three OCR Engines to provide options to the User.
- It provides a direct interface to the text files, with a distinction as to which OCR engine was used.
Process Definition: Images are read from file and the robot uses 3 OCR engines to convert the images into text files.
Inputs: Image files present in a folder.
Output: Text files that are ready to be utilized by the user.
Process Design:
- The file path is recorded to “imgs” and the number of arguments are recorded to “args”.
- A loop within the “imgs” begins and the images are being converted to a UiPath.Core.Image object.
- This object is then converted into the respective text object using three OCR engines, Google Tesseract OCR Engine, Abbyy Cloud OCR Engine and the Microsoft OCR Engine.
- Finally, the text objects are flushed to the disk and the task is completed.