Giter VIP home page Giter VIP logo

tesseractstudio.net's People

Contributors

doculynxdev avatar farhadkhalafi avatar opaitsoftware avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

tesseractstudio.net's Issues

Text OCR method

Current windows Tesseract gives output in "Vector" mode. But Tesseract studio is having Text OCR option, Can someone help me to achieve "Text only" OCR option in command prompt(Windows Tesseract environment)?

Page Range Scan Only Scans First Page

When performing an OCR scan with page range selected to "Page" and range entered in the format [start_range]-[end_range], Tesseract Studio only performs an OCR scan on the [start_range] page. The OCR scan does not include any other pages. OCR scanning behavior appears to be equivalent to having the page range selected to "Current". Error is repeatable on multiple documents, possibly every document (occurs on every document I have tried).

Tesseract Studio Version: 1.1.5.36939
OS: Windows 10 Professional Version 1809
OCR Language: Eng
Spell Checker Dictionary: None/en_US (same behavior with both settings)
Parallel Processes: Automatic
Overwrite Existing OCR: Selected/Deselected (same behavior with both settings)

All Page Scan Does Not Function

When performing an OCR scan with page range selected to "All", Tesseract Studio does not perform the OCR scan and returns an error dialog saying "Error: Failed to recognize page". Error is repeatable on multiple documents, possibly every document (occurs on every document I have tried).

Scanning with page range selected to "Current" works as expected.

Tesseract Studio Version: 1.1.5.36939
OS: Windows 10 Professional Version 1809
OCR Language: Eng
Spell Checker Dictionary: None/en_US (same behavior with both settings)
Parallel Processes: Automatic
Overwrite Existing OCR: Selected/Deselected (same behavior with both settings)

Error when saving file

I am getting the error "_VT: Font has not been set" when trying to save a PDF after processing. The full error message is:

_VT: Font has not been set
   at _MpA._mCA(String , Boolean )
   at _MpA._mCA(Double , Double , String )
   at _5Hb._vb(String , _0jA )
   at _wY._URb(String )
   at _wY._DYA(Command , String )

I am on Version 1.5.0.33332 and using Windows 10 Pro 1803.

Cannot start OCR

Open a PDF then Click Start OCR Button in the toolbar and Start button in the dialog.

image

I tested three pdf files include ccitt.pdf sample and the results are the same.

c# project

where I can get C# project code ?

  1. how to run from command line prompt ?
    visualstudio.exe -l eng file.pdf file.xml
    and so on...

community license file please

Hi
I have extracted the installer as an archive so that I can use this on a work / organisation computer in a portable way.

The programme opens but there seems to be an issue with the license

Is it possible to get a community license key or license file please?

Thank you

Crashes on OCR and will not open again

I was able to OCR one page (Windows 10 64bit), but doing the entire PDF (40MBs) made Tesseract Studio disappear after 20 seconds and will not open again. No error message was given. Have uninstalled, restarted, and reinstalled with a new download, but the program will not open running as admin or with compatibility mode on.

Unable To Save PDF Document

After opening a PDF, I am unable to save that document either overwriting the existing or with a new file name. Tesseract Studio returns an error saying "FormatException: The input string format is incorrect" (translated). Note: OS Language is French, error message in original language is "FormatException: Le format de la chaîne d'entrée est incorrect."

It does not matter whether any OCR action has been performed on the opened document.

Tesseract Studio Version: 1.2.0.41757
OS: Windows 10 Professional Version 1809
OS Language: French

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.