The header line (title) of a document is often written in a larger font than the normal te

Line detection with different font sizes (dup-ocropy, closed)

ocropus-archive commented on July 29, 2024

Line detection with different font sizes

Comments (2)

tmbdev commented on July 29, 2024

ocropus-gpageseg assumes that text lines are roughly the same scale. In return, it can detect even touching text lines in noisy documents pretty well. But that's only one of many strategies and possible tradeoffs. Your documents look like they are quite clean but have large variations in font size.

The best way to do text line recognition reliably is probably to run multiple different line detectors and combine their outputs.

As a simple version of that, you could run ocropus-gpageseg at several different scales, run the recognizer on all the candidate text lines from the different parameter settings, and throw away those that give gibberish because they were merged or split up.
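The multi-scale idea above can be sketched roughly as follows. This is not ocropy code: `segment` and `recognize` are hypothetical callables standing in for a line segmenter run at a given scale and a recognizer that returns text plus a confidence in [0, 1], and the box layout and thresholds are assumptions chosen for illustration.

```python
from typing import Callable, Iterable, List, NamedTuple, Tuple

# Hypothetical box layout: (top, left, bottom, right) in pixels.
Box = Tuple[int, int, int, int]


class Candidate(NamedTuple):
    box: Box
    text: str
    score: float  # assumed recognizer confidence in [0, 1], higher is better


def overlap_ratio(a: Box, b: Box) -> float:
    """Intersection area divided by the area of the smaller box."""
    top, left = max(a[0], b[0]), max(a[1], b[1])
    bottom, right = min(a[2], b[2]), min(a[3], b[3])
    if bottom <= top or right <= left:
        return 0.0
    inter = (bottom - top) * (right - left)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / min(area_a, area_b)


def select_lines(candidates: Iterable[Candidate],
                 min_score: float = 0.5,
                 max_overlap: float = 0.2) -> List[Candidate]:
    """Greedily keep the best-scoring candidates that do not overlap."""
    kept: List[Candidate] = []
    for cand in sorted(candidates, key=lambda c: c.score, reverse=True):
        if cand.score < min_score:
            break  # everything after this point is treated as gibberish
        if all(overlap_ratio(cand.box, k.box) <= max_overlap for k in kept):
            kept.append(cand)
    return sorted(kept, key=lambda c: c.box[0])  # reading order, top to bottom


def multiscale_lines(scales: Iterable[float],
                     segment: Callable[[float], List[Box]],
                     recognize: Callable[[Box], Tuple[str, float]],
                     min_score: float = 0.5,
                     max_overlap: float = 0.2) -> List[Candidate]:
    """Segment at every scale, recognize every candidate, keep the plausible ones."""
    candidates = []
    for scale in scales:
        for box in segment(scale):
            text, score = recognize(box)
            candidates.append(Candidate(box, text, score))
    return select_lines(candidates, min_score, max_overlap)
```

The greedy selection is one possible way to "throw away" merged or split lines: a badly segmented line should get a low recognizer score and lose out to an overlapping candidate from another scale. Every candidate is recognized once per scale, which is where the cost comes from.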

Obviously, that is not going to be cheap. But ultimately, the only arbiter of whether a text line has been correctly segmented is whether you can recognize it, so for general purpose text line segmentation, invoking a recognizer somewhere is necessary.

For Latin script, you can also try to classify individual connected components as text/non-text and then attempt to group those together.
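As a rough illustration of the component-based approach, the sketch below takes bounding boxes of connected components, filters them with a placeholder text/non-text test, and groups the survivors into lines by relative vertical overlap. The size and aspect-ratio test is only a stand-in assumption for a real classifier, and all names and thresholds are hypothetical.

```python
from typing import List, Tuple

# Hypothetical box layout: (top, left, bottom, right) in pixels.
Box = Tuple[int, int, int, int]


def looks_like_text(box: Box, min_size: int = 3, max_aspect: float = 10.0) -> bool:
    """Placeholder text/non-text test; a trained classifier would go here."""
    height, width = box[2] - box[0], box[3] - box[1]
    if height < min_size and width < min_size:
        return False  # speckle noise
    aspect = max(height, width) / max(1, min(height, width))
    return aspect <= max_aspect  # rejects long rules and borders


def vertical_overlap(a: Box, b: Box) -> float:
    """Shared vertical extent relative to the shorter of the two boxes."""
    inter = min(a[2], b[2]) - max(a[0], b[0])
    smaller = min(a[2] - a[0], b[2] - b[0])
    return inter / smaller if inter > 0 and smaller > 0 else 0.0


def union(a: Box, b: Box) -> Box:
    """Smallest box containing both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def group_into_lines(boxes: List[Box], min_overlap: float = 0.5) -> List[List[Box]]:
    """Group text-like components into lines, scanning left to right."""
    lines: List[List[Box]] = []
    extents: List[Box] = []  # running bounding box of each line
    for box in sorted(filter(looks_like_text, boxes), key=lambda b: b[1]):
        best, best_overlap = None, min_overlap
        for i, extent in enumerate(extents):
            overlap = vertical_overlap(box, extent)
            if overlap >= best_overlap:
                best, best_overlap = i, overlap
        if best is None:
            lines.append([box])
            extents.append(box)
        else:
            lines[best].append(box)
            extents[best] = union(extents[best], box)
    # Return lines in reading order, top to bottom.
    order = sorted(range(len(lines)), key=lambda i: extents[i][0])
    return [lines[i] for i in order]
```

Because the overlap is measured relative to the shorter box, the grouping does not assume a single global text scale, so a large title and small body text can end up in separate lines without any scale parameter.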

I'm planning on releasing a 2D LSTM based segmenter at some point, but that will still take a while.

zuphilip commented on July 29, 2024

Actually, in my example above the layout segmentation is perfect with ocropus-gpageseg --vscale 2.
