Giter VIP home page Giter VIP logo

serak-tesseract-trainer's People

Contributors

serak avatar

Watchers

 avatar

serak-tesseract-trainer's Issues

Training is taking too long to complete

What steps will reproduce the problem?
1. Using serak trainer and box files created by jtessbox of total 1000 page of 
Arabic language. 
2. .tr files are generated with huge size. 
3. on screen showing mftraing.exe running. Now it completes 6 days and didn't 
finish yet. 

What is the expected output? What do you see instead?
To complete training after sometime!

What version of the product are you using? On what operating system?


Please provide any additional information below.


Original issue reported on code.google.com by [email protected] on 3 Feb 2015 at 8:12

Error on "Train Tesseract"

What steps will reproduce the problem?
1. Created TIFF and BOX file with jTessBoxEditor
2. Created new project
3. Add Train Image
4. Click Train Tesseract - Error Occurs

What is the expected output? What do you see instead?
I haven't got a successful run yet. I get this error:
See the end of this message for details on invoking 
just-in-time (JIT) debugging instead of this dialog box.

************** Exception Text **************
System.ComponentModel.Win32Exception (0x80004005): The system cannot find the 
file specified
   at System.Diagnostics.Process.StartWithShellExecuteEx(ProcessStartInfo startInfo)
   at System.Diagnostics.Process.Start(ProcessStartInfo startInfo)
   at SerakTesseractTrainer.ShellExcutor.cmdExcute(String command, String TessractFolder, String parameter, String projectFolder)
   at SerakTesseractTrainer.TessMain.TrainTesseract()
   at SerakTesseractTrainer.MainForm.button3_Click(Object sender, EventArgs e)
   at System.Windows.Forms.Button.OnMouseUp(MouseEventArgs mevent)
   at System.Windows.Forms.Control.WmMouseUp(Message& m, MouseButtons button, Int32 clicks)
   at System.Windows.Forms.Control.WndProc(Message& m)
   at System.Windows.Forms.ButtonBase.WndProc(Message& m)
   at System.Windows.Forms.Button.WndProc(Message& m)
   at System.Windows.Forms.NativeWindow.Callback(IntPtr hWnd, Int32 msg, IntPtr wparam, IntPtr lparam)


************** Loaded Assemblies **************
mscorlib
    Assembly Version: 4.0.0.0
    Win32 Version: 4.0.30319.18444 built by: FX451RTMGDR
    CodeBase: file:///C:/Windows/Microsoft.NET/Framework64/v4.0.30319/mscorlib.dll
----------------------------------------
SerakTesseractTrainer
    Assembly Version: 1.0.0.0
    Win32 Version: 1.0.0.0
    CodeBase: file:///C:/Users/Hyubusa/Desktop/SerakTesseractTrainer.exe
----------------------------------------
System.Windows.Forms
    Assembly Version: 4.0.0.0
    Win32 Version: 4.0.30319.18408 built by: FX451RTMGREL
    CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Windows.Forms/v4.0_4.0.0.0__b77a5c561934e089/System.Windows.Forms.dll
----------------------------------------
System.Drawing
    Assembly Version: 4.0.0.0
    Win32 Version: 4.0.30319.18408 built by: FX451RTMGREL
    CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Drawing/v4.0_4.0.0.0__b03f5f7f11d50a3a/System.Drawing.dll
----------------------------------------
System
    Assembly Version: 4.0.0.0
    Win32 Version: 4.0.30319.18408 built by: FX451RTMGREL
    CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System/v4.0_4.0.0.0__b77a5c561934e089/System.dll
----------------------------------------
System.Xml
    Assembly Version: 4.0.0.0
    Win32 Version: 4.0.30319.34234 built by: FX452RTMGDR
    CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Xml/v4.0_4.0.0.0__b77a5c561934e089/System.Xml.dll
----------------------------------------
System.Configuration
    Assembly Version: 4.0.0.0
    Win32 Version: 4.0.30319.18408 built by: FX451RTMGREL
    CodeBase: file:///C:/Windows/Microsoft.Net/assembly/GAC_MSIL/System.Configuration/v4.0_4.0.0.0__b03f5f7f11d50a3a/System.Configuration.dll
----------------------------------------

************** JIT Debugging **************
To enable just-in-time (JIT) debugging, the .config file for this
application or computer (machine.config) must have the
jitDebugging value set in the system.windows.forms section.
The application must also be compiled with debugging
enabled.

For example:

<configuration>
    <system.windows.forms jitDebugging="true" />
</configuration>

When JIT debugging is enabled, any unhandled exception
will be sent to the JIT debugger registered on the computer
rather than be handled by this dialog box.

What version of the product are you using? On what operating system?


Please provide any additional information below.


Original issue reported on code.google.com by [email protected] on 17 Sep 2014 at 12:24

Box File Not Found With The Image Folder

What steps will reproduce the problem?
1. When adding new image .tiff file, the summary message appears
2. Choices are: Create New Box with help saying its based on default lang (eng)
3. Or Bootstrapping a new character set with help saying it will create a new 
character set for your lang.

What is the expected output? What do you see instead?
In video tutorial this screen doesnt appear, and using either option when 
Combining tessdata the file: 'normproto' is missing.
I would like to know how to generate this normproto file.
Couldnt find the answer online anywhere.

What version of the product are you using? On what operating system?
Serak trainer for tesseract 3.0X (the most recent availible)
Tessract 3.02.02 (installed with .exe)
Windows 8 pro 64-bit

Please provide any additional information below.
i attached 2 print screen of error messages.
1 is the error when adding the image.
2 is the erros when combining tessData.

Thanks ahead for any feedback.
Feel free to contact to my e-mail: [email protected]


Original issue reported on code.google.com by [email protected] on 18 Apr 2013 at 2:43

Attachments:

new font doesn't recognize


1. successfully created the box file for the tiff image 
2. double checked with the jtessboxeditor for the boxed characters
3. training has successfully completed  

The problem is, when I test the OCR using Serak-trainer the output of the image 
is not in the expected font. I wanted to identify certain mathematical symbols 
from the OCR. so ive downloaded a maths font, and created a tiff file, created 
the box file, and modified the characters according to the font.
finally when I trained and check through the OCR test, the result is in 
english. not in the font that i wanted.


im using serak-tesseract- trainer in windows

please help me on this, thank you in advance ! 

Original issue reported on code.google.com by [email protected] on 22 Jul 2013 at 12:57

Attachments:

Character height for common languages

My issue is not a problem, it's a doubt.

I can't get the zero character recognized for sans-serif font, but I think that 
this is happening because of row height (size font).

Is it necessary to indicate during the training the row height?

Version 1.0.0.0


Best regards,

Carlos

Original issue reported on code.google.com by [email protected] on 4 Dec 2013 at 4:04

Net framwork initialiation error

What steps will reproduce the problem?
1. downloaded the version v.04 rar unzipped
2. when clicked on exe file it displayed netframe initialisation error
3. screenshot attached for ready reference.

What is the expected output? What do you see instead?
does not open GUI trainer

What version of the product are you using? On what operating system?
WinXP(with sp3). trainer version v.04.

Please provide any additional information below.
 I have also installed jboxeditor 1.0. How to point to tesseract files?
wanted to test for kannada script.

Original issue reported on code.google.com by [email protected] on 15 Dec 2013 at 2:44

Attachments:

FIleNotFound at Test OCR / Recognize

What steps will reproduce the problem?
1. TestOCR
2. Recognize

No Output, error pops up: The FIle 
"C:\User\username\Desktop\sevenseg\output.txt" could not be found.

Using Serak tesseract V0.4 on Win7

************** Ausnahmetext **************
System.IO.FileNotFoundException: Die Datei 
"C:\Users\Gabriel\Desktop\sevenseg\output.txt" konnte nicht gefunden werden.
Dateiname: "C:\Users\Gabriel\Desktop\sevenseg\output.txt"
   bei System.IO.__Error.WinIOError(Int32 errorCode, String maybeFullPath)
   bei System.IO.FileStream.Init(String path, FileMode mode, FileAccess access, Int32 rights, Boolean useRights, FileShare share, Int32 bufferSize, FileOptions options, SECURITY_ATTRIBUTES secAttrs, String msgPath, Boolean bFromProxy, Boolean useLongPath)
   bei System.IO.FileStream..ctor(String path, FileMode mode, FileAccess access, FileShare share, Int32 bufferSize, FileOptions options, String msgPath, Boolean bFromProxy)
   bei System.IO.FileStream..ctor(String path, FileMode mode, FileAccess access, FileShare share, Int32 bufferSize, FileOptions options)
   bei System.IO.StreamReader..ctor(String path, Encoding encoding, Boolean detectEncodingFromByteOrderMarks, Int32 bufferSize)
   bei System.IO.StreamReader..ctor(String path, Encoding encoding)
   bei System.IO.File.InternalReadAllLines(String path, Encoding encoding)
   bei SerakTesseractTrainer.MainForm.recognize(Object sender, EventArgs e)
   bei System.Windows.Forms.Button.OnMouseUp(MouseEventArgs mevent)
   bei System.Windows.Forms.Control.WmMouseUp(Message& m, MouseButtons button, Int32 clicks)
   bei System.Windows.Forms.Control.WndProc(Message& m)
   bei System.Windows.Forms.ButtonBase.WndProc(Message& m)
   bei System.Windows.Forms.Button.WndProc(Message& m)
   bei System.Windows.Forms.NativeWindow.Callback(IntPtr hWnd, Int32 msg, IntPtr wparam, IntPtr lparam)



Original issue reported on code.google.com by [email protected] on 24 Jun 2013 at 10:31

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.