Giter VIP home page Giter VIP logo

igorxcardoso / ocr-fast Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 2.19 MB

Este projeto é uma implementação de Reconhecimento Óptico de Caracteres (OCR) utilizando as bibliotecas PyTesseract e OpenCV em Python. A combinação dessas poderosas ferramentas permite a extração eficiente de texto a partir de imagens, abrindo portas para aplicações diversas, como processamento de documentos, automação de tarefas e reconhecimento.

Python 81.56% Dockerfile 18.44%

ocr-fast's Introduction

OCR with Python, Tesseract, and OpenCV

This project demonstrates Optical Character Recognition (OCR) on images using Python, Tesseract, and OpenCV.

Requirements

  • Python
  • OpenCV
  • Tesseract

Setup

  1. Install the required packages:
    • Download and install Tesseract from here.
  2. Update the tesseract_cmd variable in run.py with the path to the Tesseract executable on your system.

Usage

  1. Place the image you want to perform OCR on in the img directory.
  2. Update the cv2.imread function in run.py with the path to your image.
  3. Run the script:
python run.py

The script will load the image, convert it to grayscale (optional, depending on the image), and then apply OCR using Tesseract. The resulting text will be printed to the console.

Note

  • The current script is set to recognize the Portuguese language. If you want to use another language, change the lang parameter in the pytesseract.image_to_string function to the appropriate language code. You can find the list of supported languages here.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

This project is licensed under the MIT License.

ocr-fast's People

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.