Giter VIP home page Giter VIP logo

image-to-text-speech-converter's Introduction

Image-to-text-Speech-converter

forthebadge forthebadge forthebadge forthebadge

Optical Character Recognition (OCR) is one of the most widely implemented types of data entry methods.With this tool, you can save a lot of time.Productivity needs more time and with this project, you can save your precious time by getting text/audio in seconds.Scanned documents need to be edited most of the time, particularly when some information must be updated. OCR converts data to text, which can be easily edited.In adition to that ,you can also generate Audio file from Picture or Text written in the GUI.

Screenshot from 2021-03-22 09-32-12

I have used here,

  1. Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and “read” the text embedded in images.Python-tesseract is a wrapper for Google’s Tesseract-OCR Engine. It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Pillow and Leptonica imaging libraries, including jpeg, png, gif, bmp, tiff, and others. Additionally, if used as a script, Python-tesseract will print the recognized text instead of writing it to a file.
  2. Tkinter is the standard GUI toolkit for Python.
  3. Pyperclip is a cross-platform Python module for copy and paste clipboard functions.
  4. gTTS (Google Text-to-Speech), a Python library and CLI tool to interface with Google Translate's text-to-speech API. Write spoken mp3 data to a file, a file-like object (bytestring) for further audio manipulation.
  5. PIL (Python Imaging Library), provides extensive file format support, an efficient internal representation, and fairly powerful image processing capabilities.

Some Previous Blogs:

Made with ❤️ by

image-to-text-speech-converter's People

Contributors

abhishek-iiit avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.