maxinexiong / scraping-scanned-pdf-docs-using-ocr-with-rpa

This repository contains automation solutions that efficiently extract text from scanned PDF documents with consistent layouts. Using the Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.

License: MIT License

ocr optical-character-recognition robotic-process-automation rpa scanned-documents scanned-receipts screen-scraping uipath uipath-classic-design uipath-modern-design

UiPath RPA Robot

Extracting Text from Scanned PDF Documents using Optical Character Recognition (OCR)


This repository houses automation solutions for extracting text from a series of scanned PDF documents that share an identical layout and format. With the built-in Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy in text recognition; with paid OCR services such as the Microsoft Azure Computer Vision OCR engine or the Google Cloud Vision OCR engine, accuracy can reach up to 98%. The robot extracts the relevant information from each scanned document and consolidates it into a single text file, as demonstrated by the output text files in the Outputs folder. This significantly reduces the time-consuming manual effort such tasks otherwise require.
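The extract-and-consolidate flow described above can be sketched outside UiPath as follows. This is only an illustration under stated assumptions, not the workflow's actual implementation: the function name `consolidate_ocr_text`, the `=== file ===` section header, and the injected `ocr` callable are all hypothetical. In practice the callable might wrap a Tesseract binding or a cloud OCR client.

```python
from pathlib import Path
from typing import Callable


def consolidate_ocr_text(
    input_dir: Path,
    output_file: Path,
    ocr: Callable[[Path], str],
    pattern: str = "*.pdf",
) -> int:
    """Run `ocr` on every matching file and write all results to one text file.

    `ocr` is a placeholder for whatever OCR engine is used; it receives a
    file path and returns the recognized text. Returns the number of
    documents processed.
    """
    documents = sorted(input_dir.glob(pattern))  # stable, name-based order
    sections = []
    for doc in documents:
        text = ocr(doc).strip()
        # Label each section with its source file so results stay traceable.
        sections.append(f"=== {doc.name} ===\n{text}\n")
    output_file.parent.mkdir(parents=True, exist_ok=True)
    output_file.write_text("\n".join(sections), encoding="utf-8")
    return len(documents)
```

Passing the OCR step in as a function keeps the consolidation logic independent of the engine, which mirrors the idea that Tesseract can be swapped for a paid service without changing the rest of the process.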

This repository includes solutions created using both Classic Design and Modern Design in UiPath Studio.

You can watch the automation demo video for the classic solution below:

screen-scraping-OCR-classic.mp4

The modern solution extracts information directly from the scanned documents without opening them, which makes it significantly faster but also means there is no on-screen activity to show in a demo video. Below is a snapshot of an example of its final output:

scraping-scanned-receipt-modern-OCR
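Because every document shares the same layout, the relevant values can in principle be pulled out of the raw OCR text with a fixed set of patterns. The sketch below illustrates that idea only; the field names (`date`, `total`) and the regular expressions are hypothetical and would need to be adapted to the actual receipts.

```python
import re
from typing import Dict, Optional

# Hypothetical field patterns for illustration; real receipts will differ.
FIELD_PATTERNS = {
    "date": re.compile(r"\bDate[:\s]+(\d{2}/\d{2}/\d{4})", re.IGNORECASE),
    "total": re.compile(r"\bTotal[:\s]+\$?(\d+(?:\.\d{2})?)", re.IGNORECASE),
}


def extract_fields(ocr_text: str) -> Dict[str, Optional[str]]:
    """Return the first match for each known field, or None if not found."""
    fields: Dict[str, Optional[str]] = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        fields[name] = match.group(1) if match else None
    return fields
```

Returning `None` for a missing field, rather than raising, makes it easy to spot documents where OCR misread a label and to route them for manual review.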


Installation

Before installing UiPath software, make sure your system meets the hardware and software requirements outlined in the UiPath documentation.

The UiPath Platform includes the following tools:

  • UiPath Studio
  • UiPath Assistant
  • UiPath Automation Cloud, including UiPath Orchestrator

To run this project successfully, follow these steps to install UiPath Studio:

Step 1: Visit uipath.com and click the Try UiPath Free button.

Step 2: Sign up for a personal account.

Step 3: Verify your account using the confirmation email.

Step 4: Log into the UiPath Automation Cloud using your account, and click the Download UiPath Studio button.

Step 5: Click Sign in.

Step 6: Select UiPath Studio Pro.

Step 7: Follow the system instructions to complete the installation of UiPath Studio Pro.

If you also want to deploy this workflow, follow the steps below to connect your local machine to the UiPath Automation Cloud:

Step 1: Sign up and log into UiPath Automation Cloud.

Step 2: Add a Tenant.

Step 3: Edit the user and assign the Automation User role to grant them permission to execute processes.

Step 4: Go to the Orchestrator interface and click on Tenant in the left pane.

Choose Folders and then click the + icon to create a new folder.

Step 5: Navigate back to the Tenant interface and, under Manage Access, follow the steps below to add an Automation User for the Unattended Robot.

a) Scroll down to locate the target user, then assign the Automation User role to grant them the necessary permissions. Click the Next button to move on to the next page.

b) In the Personal automations setup page, select the options to Enable user to run automations and Create a personal workspace for this user and enable optimal Studio Web experience, then click on the Next button.

c) On the Unattended setup page, check the option to Enable this user to run unattended automations and choose Specific Windows credentials for the local machine connection to Orchestrator. Provide the Domain\Username of your user account on the local machine (which you can find by running whoami in Command Prompt) and enter the Password for that account. Finally, click the Update button.
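If you prefer to look up the Domain\Username value from a script rather than from Command Prompt, the following sketch builds it from the standard Windows `USERDOMAIN` environment variable and the current login name. The helper name `windows_account_name` is hypothetical, and the result may differ in letter case from what whoami prints, so treat whoami as the reference.

```python
import getpass
import os
from typing import Mapping, Optional


def windows_account_name(
    env: Optional[Mapping[str, str]] = None,
    user: Optional[str] = None,
) -> str:
    """Build a Domain\\Username string similar to the output of `whoami`.

    `env` and `user` can be passed explicitly for testing; by default the
    process environment and the current login name are used.
    """
    env = os.environ if env is None else env
    user = user or getpass.getuser()
    # USERDOMAIN is set by Windows; on other systems it is usually absent.
    domain = env.get("USERDOMAIN")
    return f"{domain}\\{user}" if domain else user


if __name__ == "__main__":
    print(windows_account_name())
```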

Step 6: Go to the Machines page, where the workspace machine for the target user should already have been created. Click the ellipsis and select Edit Machine.

Enter 1 for both the Production (Unattended) and Testing fields, then click the Update button.

Step 7: Return to the newly created folder, choose the Machines menu, and click the Manage Machines in Folder button to assign the machine you just configured to the folder.

You should now have both the User and Machine assigned to the new folder.

Step 8: Open UiPath Assistant and click Sign In. If you see the green circle in the top right corner, you’ve successfully connected your local UiPath Studio to the UiPath Automation Cloud.

You can confirm the connection by opening UiPath Studio and checking for a green circle at the bottom.


To publish a process from UiPath Studio to Orchestrator, switch to the new folder you just created in Orchestrator, and then click Publish to publish the process as a package.

For more Orchestrator best practices, refer to the Orchestrator User Guide.


Usage

To run the RPA workflow on your local machine, follow these steps:

  1. Either download this repository to your local machine or clone it directly within your UiPath Studio.
  2. Open the UiPath Studio software on your machine.
  3. Locate and open the Main.xaml file from the downloaded repository in UiPath Studio.
  4. Run the Main_modern.xaml or Main_classic.xaml file to start the OCR-based scraping process on the scanned documents.
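After cloning or downloading, a quick check like the one below can confirm that the entry workflows named in the steps above are present before opening UiPath Studio. It is a convenience sketch only: the helper `find_entry_workflows` is hypothetical, and because the repository's folder layout is not described here, it searches recursively rather than assuming a path.

```python
from pathlib import Path
from typing import List


def find_entry_workflows(repo_dir: Path) -> List[Path]:
    """Return any Main_modern.xaml / Main_classic.xaml files under repo_dir."""
    wanted = {"Main_modern.xaml", "Main_classic.xaml"}
    # Search every subfolder, since the project layout is not assumed.
    return sorted(p for p in repo_dir.rglob("*.xaml") if p.name in wanted)


if __name__ == "__main__":
    for workflow in find_entry_workflows(Path(".")):
        print(workflow)
```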

Acknowledgement

I would like to express my gratitude to the UiPath community for providing resources, tutorials, and a platform for automation enthusiasts to learn and collaborate.


License

This project is licensed under the MIT License, which means you're free to modify, distribute, and use the code in your own projects.
