Giter VIP home page Giter VIP logo

eynollah's People

Contributors

bertsky avatar cneud avatar emresvd avatar kba avatar mikegerber avatar vahidrezanezhad avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

eynollah's Issues

CLI still fails

Another one surfaced:

2 [[3.6625916e-17 1.0000000e+00 8.9200320e-18 2.0860562e-30 3.3768879e-20
  2.0703751e-30]] num_col_classifier
300
False is_image_enhanced
enhancing: 12.93634581565857
97.62038955952693 ratio_of_two_models
textregion: 238.8552384376526
textline: 301.728711605072
-0.8484848484848477 slope_deskew
deskewing: 311.6365988254547
marginals: 312.0726170539856
2 num_col_classifier
boxes in: 313.63718938827515
Traceback (most recent call last):
  File "/data/ocr-d/eynollah/sbb_newspapers_org_image/eynollah.py", line 3635, in run
    self.write_into_page_xml(txt_con_org, page_coord, self.dir_out, order_text_new, id_of_texts_tot, all_found_texline_polygons, all_box_coord, polygons_of_images, polygons_of_marginals, all_found_texline_polygons_marginals, all_box_coord_marginals, self.curved_line, slopes, slopes_marginals)
  File "/data/ocr-d/eynollah/sbb_newspapers_org_image/eynollah.py", line 1941, in write_into_page_xml
    page_print_sub = ET.SubElement(page, "PrintSpace")
TypeError: SubElement() argument 1 must be xml.etree.ElementTree.Element, not lxml.etree._Element

reading order representation (XML order vs index)

The reading order detection capabilities of eynollah look pretty amazing AFAICS – if viewed through the glasses of PageViewer. But it is noteworthy that the actual representation for PAGE-XML does not correspond to that schema's documentation regarding @index.

It surprisingly turns out that PageViewer gets it wrong too. See here for full report.

So IMO eyhollah needs to actually inverse its representation: the (currently correct) XML ordering needs to also become the (currently broken) @index ordering.

Sharing models through Hugging Face Hub

Hi there!

This project is very cool. I see you host and share models with your own hosted server. Would you be interested in sharing your models in the Hugging Face Hub? The Hub offers free hosting of over 25K models, and it would make your work more accessible and visible to the rest of the ML community. We can help you set up a Qurator organization if it makes sense.

Some of the benefits of sharing your models through the Hub would be:

  • wider reach of your work to the ecosystem
  • freely hosted demos with Streamlit/Gradio. See demos from TrOCR and DocTR.
  • versioning, commit history and diffs
  • repos provide useful metadata about their tasks, languages, metrics, etc that make them discoverable
  • multiple features from TensorBoard visualizations, PapersWithCode integration, and more

Creating the repos and adding new models should be a relatively straightforward process if you've used Git before. This is a step-by-step guide explaining the process in case you're interested. Please let us know if you would be interested and if you have any questions.

Happy to hear your thoughts,
Omar and the Hugging Face team

cc @NielsRogge

IndexError: list index out of range (slopes[region_idx])

Hi, when running Eynollah on this image, using

en = Eynollah("models_eynollah", imgfile, dir_out=path.split(imgfile)[0], curved_line=True, full_layout=True)
pcgts = en.run()

it fails with

Traceback (most recent call last):
  File "<stdin>", line 6, in <module>
  File "venv/lib/python3.6/site-packages/qurator/eynollah/eynollah.py", line 2074, in run
    pcgts = self.writer.build_pagexml_full_layout(contours_only_text_parent, contours_only_text_parent_h, page_coord, order_text_new, id_of_texts_tot, all_found_texline_polygons, all_found_texline_polygons_h, all_box_coord, all_box_coord_h, polygons_of_images, polygons_of_tabels, polygons_of_drop_capitals, polygons_of_marginals, all_found_texline_polygons_marginals, all_box_coord_marginals, slopes, slopes_marginals, cont_page, polygons_lines_xml)
  File "venv/lib/python3.6/site-packages/qurator/eynollah/writer.py", line 221, in build_pagexml_full_layout
    self.serialize_lines_in_region(textregion, all_found_texline_polygons_h, mm, page_coord, all_box_coord_h, slopes, counter)
  File "venv/lib/python3.6/site-packages/qurator/eynollah/writer.py", line 117, in serialize_lines_in_region
    if self.curved_line and np.abs(slopes[region_idx]) <= 45:
IndexError: list index out of range

Some other pages from the book seem to work, the results are looking really good (except for drop caps, but they are not that easy to identify and put in the correct order for humans as well). I'm segmenting the rest of the book now and will see if there are more errors like that one.

Upgrade to PAGE-XML2019

It should be safe to upgrade to PAGE-XML2019 by replacing occurences of

("xmlns", "http://schema.primaresearch.org/PAGE/gts/pagecontent/2017-07-15")
("xmlns:xsi", "http://www.w3.org/2001/XMLSchema-instance")
("xsi:schemaLocation", "http://schema.primaresearch.org/PAGE/gts/pagecontent/2017-07-15")

with their PAGE-XML2019 equivalents

("xmlns", "http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15")
("xmlns:xsi", "http://www.w3.org/2001/XMLSchema-instance")
("xsi:schemaLocation", "http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15")

adapt to renamed model

In the newest published model, all file paths have changed:

2021-04-25 2022-04-05
model_3up_new_good_no_augmentation.h5 model_bin_sbb_ens.h5 model_enhancement.h5 model_ensemble_s.h5 model_main_covid19_lr5-5_scale_1_1_great.h5 model_main_home_corona3_rot.h5 model_no_patches_class0_30eopch.h5 model_page_mixed_best.h5 model_scale_classifier.h5 model_tables_ens_mixed_new_2.h5 model_textline_newspapers.h5 eynollah-binarization_20210425.h5 eynollah-column-classifier_20210425.h5 eynollah-enhancement_20210425.h5 eynollah-full-regions-1column_20210425.h5 eynollah-full-regions-3+column_20210425.h5 eynollah-main-regions_20220314.h5 eynollah-main-regions-aug-rotation_20210425.h5 eynollah-main-regions-aug-scaling_20210425.h5 eynollah-main-regions-ensembled_20210425.h5 eynollah-page-extraction_20210425.h5 eynollah-tables_20210319.h5 eynollah-textline_20210425.h5

CLI does not work

I cannot get eynollah to run. Here's what I did for installation:

git clone https://github.com/qurator-spk/eynollah
pip install -e eynollah
make -C eynollah models

Everything went smoothly. Now I wanted to run on my image:

mkdir LINES-EYNOLLAH
eynollah -i ORIGINAL/FILE_0002_ORIGINAL.jpg -o LINES-EYNOLLAH -m /data/ocr-d/eynollah/models_eynollah -fl -ae -cl

This returns after about 1sec without doing anything (exit value is 1). It's the same if I remove -fl -ae -cl. (In fact, it's still the same if I remove the entire -m option.) No error, no result.

drop_capitals.py: ValueError: attempt to get argmin of an empty sequence

Hi, I think I found another one:

wget https://api.digitale-sammlungen.de/iiif/image/v2/bsb00052981_00339/full/full/0/default.png
eynollah -i default.png -o . -m eynollah/models_eynollah -fl -cl
13:16:03.204 INFO eynollah - resize and enhance image
13:16:03.204 INFO eynollah - Detected 230 DPI
13:16:19.326 INFO eynollah - Found 3 columns ([[1.6621375e-26 1.6978607e-38 1.0000000e+00 2.5424867e-32 9.4024474e-31
  0.0000000e+00]])
13:16:33.584 INFO eynollah - Image is enhanced
13:16:33.726 INFO eynollah - Enhancing took 30.522119998931885s
13:16:39.280 INFO eynollah - Image dimensions: 448x672
13:16:58.684 INFO eynollah - Image dimensions: 448x672
13:17:19.415 INFO eynollah - Image dimensions: 448x672
13:17:39.792 INFO eynollah - ratio_of_two_models: 99.93604678448163
13:17:40.588 INFO eynollah - Textregion detection took 66.86148571968079s
13:17:47.636 INFO eynollah - Graphics detection took 7.048167943954468s
13:17:47.636 INFO eynollah - cont_page [array([[  88,   87],
       [2933,   87],
       [2933, 4525],
       [  88, 4525]])]
13:17:52.956 INFO eynollah - Image dimensions: 448x672
13:18:04.696 INFO eynollah - textline detection took 17.060104370117188s
13:18:21.939 INFO eynollah - slope_deskew: -0.3636363636363633
13:18:21.939 INFO eynollah - deskewing took 17.242716073989868s
13:18:21.962 INFO eynollah - detection of marginals took 0.022979736328125s
13:18:27.893 INFO eynollah - Image dimensions: 896x896
13:18:33.513 INFO eynollah - Image dimensions: 896x896
13:18:53.899 INFO eynollah - areas_cnt_text [6.06679334e-05 3.96004787e-08 1.24939510e-03 1.53873996e-02
 3.28577052e-03 5.43809614e-03 5.36713208e-03 6.94196391e-05
 1.72341283e-04 1.30660354e-01 1.54637414e-01 7.77194243e-02
 3.97628407e-04 1.18769756e-03 4.26853560e-04]
Traceback (most recent call last):
  File "/.../bin/eynollah", line 33, in <module>
  File "/.../lib/python3.7/site-packages/click/core.py", line 1137, in __call__
    return self.main(*args, **kwargs)
  File "/.../lib/python3.7/site-packages/click/core.py", line 1062, in main
    rv = self.invoke(ctx)
  File "/.../lib/python3.7/site-packages/click/core.py", line 1404, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/.../lib/python3.7/site-packages/click/core.py", line 763, in invoke
    return __callback(*args, **kwargs)
  File "/.../lib/python3.7/site-packages/qurator/eynollah/cli.py", line 142, in main
    pcgts = eynollah.run()
  File "/.../lib/python3.7/site-packages/qurator/eynollah/eynollah.py", line 2024, in run
    all_found_texline_polygons = adhere_drop_capital_region_into_corresponding_textline(text_regions_p, polygons_of_drop_capitals, contours_only_text_parent, contours_only_text_parent_h, all_box_coord, all_box_coord_h, all_found_texline_polygons, all_found_texline_polygons_h, kernel=KERNEL, curved_line=self.curved_line)
  File "/.../lib/python3.7/site-packages/qurator/eynollah/utils/drop_capitals.py", line 157, in adhere_drop_capital_region_into_corresponding_textline
    arg_min = np.argmin(np.abs(y_lines - y_min_d[i_drop]))
  File "<__array_function__ internals>", line 6, in argmin
  File "/.../lib/python3.7/site-packages/numpy/core/fromnumeric.py", line 1267, in argmin
    return _wrapfunc(a, 'argmin', axis=axis, out=out)
  File "/.../lib/python3.7/site-packages/numpy/core/fromnumeric.py", line 61, in _wrapfunc
    return bound(*args, **kwds)
ValueError: attempt to get argmin of an empty sequence

Inference stuck

I tried running this on google colab, took a decent 480p image and ran it against this tool but it gets stuck after image enhancement. Not sure if I'm doing anything wrong.

!eynollah -i "images/passport.jpg" -o "output/" -m "models_eynollah/"

18:17:58.077 INFO eynollah - resize and enhance image
18:17:58.077 INFO eynollah - Detected 96 DPI
18:18:17.683 INFO eynollah - Found 1 columns ([[1.00000000e+00 1.43745085e-11 3.50643126e-11 1.02011668e-14
  1.21690427e-11 8.64729710e-09]])
18:19:56.064 INFO eynollah - Image is enhanced
18:19:56.167 INFO eynollah - Enhancing took 118.0904438495636s 
18:20:03.120 INFO eynollah - Image dimensions: 448x672
18:22:48.668 INFO eynollah - Image dimensions: 448x672

It keeps running forever.

Also, what is the inference time for an image?Thanks!

Run Time and Reading Order (of Text Recognition)

Hallo everyone,

recently I tried eynollah to achieve better region recognition. I noticed two points:

  1. the run time per page took approx. 25 minutes
  2. after the text recognition on the basis of these regions the reading order of recognized text was not correctly displayed
    There have been few regions that eynollah has trouble with. Still, the tool recognized the regions better than tesseract. But the use would hardly be possible unter these condition. Is there some ways to fix this points?

Best,
Aysoltan

Produces text results

Currently, ocrd-eynollah-segment produces (empty) TextEquiv elements. I believe it should not produce any, as this results in OCR processors giving a lot of warnings:

22:28:22.628 WARNING processor.CalamariRecognize - Line 'region_0081_line_0001' already contained text results
22:28:22.636 WARNING processor.CalamariRecognize - Line 'region_0081_line_0002' already contained text results
22:28:22.665 WARNING processor.CalamariRecognize - Line 'region_0081_line_0003' already contained text results
22:28:22.695 WARNING processor.CalamariRecognize - Line 'region_0081_line_0004' already contained text results
22:28:22.734 WARNING processor.CalamariRecognize - Line 'region_0081_line_0005' already contained text results

Use of image from --allow-enhancement option in OCR-D workflow

Hi
A question. Is the --allow-enhancement option resultant image intended to be used in an OCR-D workflow? In some cases, I believe resultant image has dimensions that differ from the dimensions of the input image. So if it is intended to be used in OCR-D, what is the suggested workflow (and the commands & parameters) that preserves the overlaying of the ocr results with the image?
Thanks!

Order of regions

The order of text regions detected by eynollah is not right.
When running eynollah-segment for the attached image the text regions are presented in wrong order.

The workflow used is:
"olena-binarize -I OCR-D-IMG -O OCR-D-BIN"
"eynollah-segment -I OCR-D-BIN -O OCR-D-SEG -P models default -P curved_line true"
"tesserocr-recognize -I OCR-D-SEG -O OCR-D-OCR-TESSEROCR -P model ecco"
'fileformat-transform -I OCR-D-OCR-TESSEROCR -O OCR-D-TEXT -P from-to "page text"'

Test19.zip

trying to get running...

Hi. I am trying to get this running on Windows 10 using Visual Studio Code.

If cd into the repo and run a command like:
eynollah -i C:/Users/Scott/Desktop/Python2/Kpages/Pages/076v.jpg -o C:/Users/Scott/Desktop/Python2/Kpages -m C:/Users/Scott/Desktop/Python2/eynollah/models_eynollah -si C:/Users/Scott/Desktop/Python2/Kpages
it doesn't appear to run. A new command prompt comes up after a couple of seconds -- but no output and no error message.

Any guidance would be appreciated.

Reverse text line order from OCR-D

Hi,
using eynollah in a OCR-D workflow produced a reverse text line order within each region, so that the last actual line is line_001 in the PAGE XML.

I'm new to eynollah and OCR-D, so I might have made a mistake somewhere. Any ideas anyone?
Thanks!

I used this workflow:

ocrd process \
  "sbb-binarize -I OCR-D-IMG -O OCR-D-BIN -P model default" \
  "eynollah-segment -I OCR-D-BIN -O OCR-D-SEG -P models default -P curved_line true" \
  "calamari-recognize -I OCR-D-SEG -O OCR-D-OCR -P checkpoint_dir qurator-gt4histocr-1.0"

Used image

PageView screenshot

PageView screenshot

And here's the xml section corresponding to the first news paragraph:

XML
 <pc:TextRegion id="region_0010" type="paragraph">
<pc:Coords points="372,501 371,501 363,501 363,502 357,502 356,501 350,501 349,502 347,502 347,503 345,505 345,506 345,506 345,520 345,521 345,524 348,526 348,529 361,529 361,528 364,528 365,529 373,529 373,530 373,530 374,531 373,531 372,531 371,532 360,532 360,533 359,533 358,532 348,532 348,533 346,533 345,533 343,533 342,534 341,534 341,535 327,535 327,539 326,540 321,540 321,539 318,539 317,538 316,538 315,538 313,538 313,537 311,537 310,536 309,536 308,536 306,536 305,535 303,535 302,535 294,535 293,535 291,535 290,535 284,535 283,535 281,535 281,535 276,535 275,535 271,535 271,536 263,536 262,536 251,536 250,536 235,536 234,535 224,535 223,536 208,536 207,535 204,535 203,535 181,535 180,535 172,535 171,536 165,536 165,536 165,537 164,536 159,536 158,537 158,537 157,536 156,536 155,536 153,536 153,535 127,535 126,536 118,536 117,535 106,535 105,536 102,536 101,536 101,537 101,538 101,540 100,541 100,542 99,543 99,544 98,545 98,551 99,551 99,555 97,556 96,556 96,557 95,557 94,558 92,558 91,558 90,558 89,559 81,559 80,560 78,560 78,560 76,560 75,561 73,561 72,561 66,561 65,562 60,562 59,563 51,563 50,564 50,574 50,575 50,621 50,621 50,665 49,665 49,681 50,681 50,701 50,701 51,702 51,712 51,713 51,718 52,719 51,720 51,726 52,727 52,731 51,731 51,735 51,735 51,753 51,753 51,757 51,758 51,766 51,767 51,779 52,780 52,795 53,796 53,808 52,809 52,819 53,820 53,830 52,830 52,832 51,833 51,840 52,840 52,850 53,850 53,881 53,881 53,882 53,890 53,890 53,903 53,904 53,909 53,910 53,910 54,910 55,911 127,911 128,911 132,911 133,912 133,912 134,913 138,913 138,913 143,913 144,913 152,913 153,913 155,913 156,914 158,914 159,915 190,915 190,914 205,914 205,915 211,915 212,915 218,915 219,915 224,915 225,913 226,913 227,913 232,913 233,912 235,912 236,911 240,911 240,911 250,911 250,910 270,910 271,910 271,905 271,904 271,898 272,898 272,895 273,894 273,893 273,893 273,892 275,890 276,890 276,890 283,890 283,889 286,889 286,888 288,888 288,888 290,888 290,887 290,886 296,886 296,886 297,886 298,885 302,885 303,886 305,886 306,886 313,886 314,886 323,886 323,885 328,885 328,885 329,885 332,885 333,886 354,886 355,885 357,885 358,885 377,885 378,885 379,885 380,886 394,886 395,885 396,885 396,885 404,885 405,885 410,885 410,886 418,886 419,886 419,888 420,888 428,888 428,888 429,888 429,886 430,886 430,886 431,885 444,885 445,884 452,884 453,885 471,885 471,884 473,884 474,883 478,883 479,884 496,884 496,883 563,883 563,883 565,883 566,883 578,883 578,884 580,884 580,885 591,885 591,884 595,884 596,883 608,883 608,883 633,883 633,883 635,883 635,884 641,884 642,883 650,883 651,883 658,883 658,883 661,883 662,885 663,885 664,886 685,886 686,885 686,885 686,883 687,883 687,880 688,880 688,858 687,858 687,848 686,847 686,841 687,841 687,787 686,786 686,771 686,771 686,769 686,768 686,765 687,765 687,731 686,731 686,725 686,725 686,720 686,719 686,706 686,706 686,698 686,698 686,690 687,689 687,676 686,675 686,668 686,668 686,666 686,666 686,650 686,649 686,633 685,633 685,619 686,618 686,609 685,608 685,598 685,598 685,591 685,590 685,578 685,578 685,541 684,540 684,540 683,539 683,538 683,538 682,538 675,538 675,537 675,536 674,535 674,533 673,532 673,531 666,531 665,532 660,532 660,533 658,533 657,532 654,532 653,531 636,531 635,532 606,532 606,533 603,533 603,532 598,532 598,531 586,531 586,531 581,531 581,530 576,530 576,530 575,530 575,529 575,528 576,528 576,528 577,528 578,527 581,527 581,526 587,526 588,526 594,526 595,525 596,525 597,525 600,525 601,524 601,524 601,523 602,523 630,523 630,522 631,521 642,521 643,522 648,522 648,521 658,521 658,516 657,516 657,515 656,514 655,514 653,512 653,510 653,510 653,508 652,508 634,508 633,507 633,506 632,505 632,503 631,503 626,503 626,503 601,503 600,504 588,504 588,503 583,503 583,503 570,503 570,503 553,503 552,503 532,503 531,503 493,503 493,503 486,503 486,503 469,503 468,504 463,504 463,503 453,503 453,504 440,504 439,503 435,503 434,503 400,503 400,502 392,502 391,501 390,501 390,501"/>
            <pc:TextLine id="region_0010_line_0001">
                <pc:Coords points="55,888 55,888 53,888 52,889 52,890 51,890 51,893 51,893 48,893 48,898 48,899 48,910 49,910 50,910 50,913 50,913 51,914 64,914 65,913 159,913 160,915 160,916 161,916 161,917 176,917 176,916 180,916 181,916 181,915 182,914 183,914 184,913 191,913 191,914 211,914 211,915 223,915 225,913 248,913 249,913 266,913 266,912 268,911 268,911 269,910 276,910 276,910 276,909 276,894 268,894 266,893 251,893 250,892 250,892 249,891 249,891 248,890 248,889 227,889 226,888 226,888 207,888 206,888 203,888 202,889 195,889 195,890 194,889 184,889 183,888 171,888 170,890 160,890 160,891 159,891 158,891 158,892 158,893 135,893 135,893 134,893 133,892 133,891 132,891 132,889 131,888 118,888 118,889 118,890 117,891 98,891 97,890 97,890 96,889 96,888 73,888 72,888 55,888"/>
                <pc:TextEquiv conf="0.996653318405151">
                    <pc:Unicode>Kulturkampfes halten.</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0002">
                <pc:Coords points="655,859 654,860 622,860 622,863 621,865 611,865 610,865 583,865 581,863 581,863 580,863 573,863 572,861 572,861 571,860 558,860 558,863 556,865 556,865 555,865 516,865 515,865 515,865 513,863 512,863 511,863 504,863 503,861 474,861 473,863 473,863 472,864 472,865 471,866 412,866 411,865 411,862 390,862 389,863 389,865 386,867 300,867 298,866 298,863 276,863 276,864 276,865 276,866 275,867 270,867 269,868 256,868 256,867 252,867 251,866 251,863 233,863 233,863 231,865 223,865 223,866 216,866 215,866 213,866 213,867 210,867 208,865 208,862 185,862 184,863 157,863 157,865 155,866 153,866 152,866 146,866 145,865 130,865 129,866 129,867 128,868 88,868 88,868 87,868 86,867 86,866 86,866 86,863 50,863 50,863 49,863 49,868 48,868 48,868 48,885 49,885 50,885 50,886 51,888 51,888 53,889 65,889 66,888 92,888 93,888 116,888 116,888 190,888 190,888 202,888 203,888 240,888 240,888 253,888 253,888 260,888 261,888 287,888 288,888 292,888 293,887 415,887 415,888 415,888 416,889 416,890 417,891 431,891 431,891 431,890 432,889 432,888 433,887 435,887 435,886 457,886 458,886 475,886 475,886 501,886 501,886 583,886 583,885 595,885 596,885 597,885 598,885 625,885 626,885 646,885 647,884 656,884 656,885 687,885 688,884 689,884 689,881 690,881 690,861 689,861 683,861 683,860 670,860 670,860 667,860 666,859 655,859"/>
                <pc:TextEquiv conf="0.99672269821167">
                    <pc:Unicode>haben, weil ſie ihn für einen Gegner Bismarcks und des</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0003">
                <pc:Coords points="541,835 540,836 540,837 538,839 528,839 527,840 482,840 481,839 468,839 468,840 466,841 443,841 443,841 410,841 410,841 401,841 400,842 384,842 383,841 360,841 358,840 358,838 330,838 330,838 329,839 329,840 328,841 328,841 327,841 313,841 312,842 308,842 308,843 298,843 298,842 284,842 283,841 274,841 273,840 273,840 268,840 267,838 267,838 266,837 211,837 210,838 198,838 197,837 181,837 180,838 160,838 159,839 159,841 158,842 158,843 158,843 156,843 155,843 123,843 122,841 122,840 121,839 121,838 91,838 91,839 90,840 89,840 89,841 88,842 71,842 71,843 65,843 64,842 63,842 62,841 62,838 49,838 49,843 48,843 47,843 47,855 48,855 48,860 49,860 50,860 50,861 53,861 54,862 92,862 93,863 107,863 108,862 123,862 123,863 156,863 157,863 159,863 160,864 180,864 181,863 195,863 195,862 211,862 212,863 234,863 235,863 246,863 248,862 310,862 310,863 310,863 311,864 311,865 311,866 351,866 351,865 352,865 352,864 354,862 388,862 389,861 433,861 435,863 435,864 451,864 451,863 451,863 453,861 456,861 456,861 469,861 470,860 475,860 475,861 500,861 501,860 539,860 540,861 551,861 552,860 560,860 560,860 612,860 613,859 620,859 621,860 636,860 637,859 674,859 675,858 686,858 688,856 690,856 690,840 688,840 687,840 686,840 685,838 685,838 684,838 671,838 671,838 666,838 666,839 641,839 640,837 640,836 618,836 615,840 580,840 579,840 577,840 576,840 568,840 567,838 567,836 566,835 541,835"/>
                <pc:TextEquiv conf="0.995206952095032">
                    <pc:Unicode>bereiteten Feierlichkeiten zeigten, mag darin ſeinen Grund</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0004">
                <pc:Coords points="405,810 404,811 404,811 403,812 386,812 386,811 381,811 380,811 346,811 346,811 318,811 318,812 318,815 316,816 307,816 306,816 255,816 255,817 254,816 217,816 216,815 215,815 215,815 201,815 200,815 199,815 198,815 198,815 197,814 197,813 196,813 196,812 196,811 176,811 176,812 163,812 163,815 162,815 162,816 161,817 98,817 97,816 95,816 93,815 89,815 87,813 87,812 53,812 52,813 51,813 51,813 50,814 50,815 50,816 50,817 49,818 48,818 48,834 50,834 50,835 50,836 51,836 52,836 53,837 69,837 70,838 75,838 75,838 88,838 89,838 95,838 95,837 121,837 122,838 123,838 123,838 123,839 124,840 137,840 138,839 138,838 138,838 142,838 143,837 143,838 160,838 160,838 161,838 162,839 183,839 184,838 197,838 198,837 248,837 249,838 249,838 263,838 265,837 282,837 283,838 283,839 297,839 297,838 298,836 346,836 347,837 359,837 360,836 394,836 395,836 416,836 416,835 516,835 516,835 537,835 538,835 577,835 578,835 624,835 625,834 651,834 652,835 653,835 653,835 653,836 654,836 667,836 667,836 668,835 668,835 668,834 670,834 671,833 688,833 689,833 689,832 690,831 690,815 689,815 688,815 688,814 683,814 683,813 646,813 646,813 630,813 630,812 612,812 611,813 607,813 606,814 583,814 581,811 581,811 580,810 563,810 561,812 561,813 561,813 561,814 560,815 528,815 528,814 516,814 516,815 513,815 513,815 483,815 482,815 481,815 478,812 478,810 431,810 430,811 418,811 418,810 405,810"/>
                <pc:TextEquiv conf="0.986474871635437">
                    <pc:Unicode>Schwarzen ſich weniger zurückhaltend bei den dem Kronprinzen</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0005">
                <pc:Coords points="657,783 656,784 626,784 626,785 626,786 626,788 625,788 608,788 607,789 595,789 594,790 579,790 578,789 578,789 577,788 577,785 533,785 531,786 531,789 531,790 530,790 529,790 524,790 523,791 512,791 511,790 496,790 495,790 465,790 465,789 465,786 451,786 451,787 451,788 443,788 442,788 437,788 436,788 426,788 426,788 425,788 424,788 424,786 406,786 406,788 405,788 405,789 405,790 403,790 402,790 400,790 399,791 396,791 395,790 383,790 383,791 370,791 369,790 367,790 366,790 365,790 364,789 364,785 346,785 346,786 318,786 318,786 313,786 313,789 312,790 306,790 305,790 289,790 288,790 282,790 281,791 268,791 267,790 266,790 266,790 263,790 263,789 253,789 252,790 250,790 250,790 249,791 248,791 230,791 230,791 211,791 210,791 195,791 195,791 188,791 187,792 173,792 172,791 151,791 149,790 148,790 147,788 147,788 142,788 141,787 141,786 108,786 107,787 105,787 105,788 98,788 98,788 92,788 91,788 80,788 79,789 78,788 78,786 65,786 65,787 62,787 61,788 61,789 61,790 60,790 60,790 58,791 48,791 48,808 49,808 50,808 50,811 50,811 98,811 99,812 120,812 120,811 175,811 176,811 192,811 193,811 220,811 221,811 225,811 226,811 250,811 251,812 251,813 251,813 251,815 267,815 268,814 271,814 272,813 282,813 283,812 298,812 299,811 312,811 313,811 321,811 321,811 322,811 323,813 353,813 354,812 366,812 366,811 367,811 368,811 368,811 369,810 407,810 408,811 428,811 429,810 436,810 437,811 440,811 441,811 464,811 465,811 483,811 485,812 485,813 500,813 500,813 501,812 501,811 503,810 506,810 506,809 508,809 509,810 563,810 565,811 580,811 582,810 591,810 591,809 597,809 598,808 621,808 622,809 628,809 629,810 642,810 643,808 655,808 656,808 688,808 688,807 690,807 690,788 684,788 683,788 681,788 680,787 680,783 657,783"/>
                <pc:TextEquiv conf="0.997538805007935">
                    <pc:Unicode>allenthalben einen ſympathiſchen Empfang. Daß auch die</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0006">
                <pc:Coords points="570,758 570,758 565,758 564,759 564,762 563,763 557,763 556,764 555,764 554,765 532,765 531,764 516,764 514,762 514,761 500,761 500,761 500,762 488,762 486,764 472,764 471,765 463,765 462,764 446,764 445,763 423,763 421,765 406,765 404,763 404,761 403,760 387,760 386,761 382,761 381,761 381,763 381,764 381,765 380,765 335,765 335,765 334,765 333,764 333,761 320,761 319,762 319,764 318,765 318,765 317,765 293,765 293,766 251,766 249,763 245,763 244,763 239,763 238,762 236,762 236,761 217,761 216,762 213,762 211,763 211,764 210,765 179,765 178,766 178,766 177,766 151,766 151,766 149,766 148,765 148,765 146,764 146,763 135,763 134,764 129,764 128,763 103,763 101,762 98,762 98,761 98,761 53,761 52,761 49,761 49,765 48,766 47,766 47,782 49,782 50,783 50,786 90,786 91,787 103,787 104,786 113,786 114,787 126,787 127,786 130,786 130,786 218,786 219,786 255,786 255,786 296,786 297,785 321,785 322,786 335,786 336,785 408,785 409,785 420,785 421,785 423,785 425,787 430,787 431,788 445,788 446,787 446,786 448,785 480,785 480,784 495,784 495,785 507,785 508,784 523,784 523,783 577,783 578,784 580,784 580,785 580,785 592,785 593,784 600,784 600,783 605,783 605,784 619,784 620,785 646,785 647,785 650,785 651,783 654,783 655,783 674,783 675,782 686,782 686,781 687,781 687,781 688,780 690,780 690,763 687,763 686,763 686,763 684,761 684,760 672,760 672,761 671,761 670,761 669,761 654,761 653,761 653,761 653,759 652,758 652,758 640,758 639,759 624,759 623,758 605,758 605,758 570,758"/>
                <pc:TextEquiv conf="0.988161742687225">
                    <pc:Unicode>Reiches. der in Bahern mehrere Truppenrevüen abhielt, fand</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0007">
                <pc:Coords points="640,732 640,733 639,733 632,733 631,735 631,736 630,736 629,736 628,736 613,736 611,735 611,735 610,734 609,734 608,733 591,733 590,734 589,734 588,735 574,735 573,735 551,735 551,738 550,738 528,738 527,738 511,738 510,737 493,737 492,738 491,738 491,738 481,738 480,739 467,739 466,738 463,738 461,736 461,735 443,735 442,736 442,736 440,738 439,738 438,739 430,739 429,738 421,738 420,738 420,738 418,736 418,736 417,736 416,736 416,735 415,735 414,735 400,735 400,735 398,735 398,736 397,736 396,736 396,737 395,738 395,739 394,740 353,740 353,740 349,740 348,739 336,739 335,740 291,740 290,739 290,735 250,735 250,735 221,735 221,735 221,738 220,739 220,740 220,740 198,740 197,740 176,740 176,740 176,741 156,741 155,740 155,737 133,737 133,738 133,738 132,739 132,740 131,741 110,741 108,739 108,737 98,737 98,736 81,736 81,740 80,741 80,741 79,741 55,741 55,742 47,742 47,758 48,758 48,759 48,761 49,761 68,761 68,761 91,761 91,761 109,761 110,761 175,761 176,761 176,763 176,763 196,763 196,763 198,761 209,761 210,760 215,760 216,761 238,761 240,763 253,763 254,762 254,761 255,761 255,761 256,760 295,760 296,760 368,760 369,760 397,760 398,760 423,760 424,759 438,759 439,760 453,760 453,759 503,759 504,758 516,758 517,759 534,759 535,760 535,760 535,761 535,761 548,761 548,760 549,760 549,759 550,758 551,758 552,758 639,758 640,759 666,759 668,757 688,757 689,756 689,755 690,754 690,738 681,738 681,737 666,737 665,736 665,732 640,732"/>
                <pc:TextEquiv conf="0.992116212844849">
                    <pc:Unicode>gendſten Truppeninſpektionen vor. Der Kronprinz des Deutſchen</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0008">
                <pc:Coords points="530,707 530,708 525,708 524,708 519,708 518,709 516,709 515,708 472,708 472,710 471,710 468,710 468,711 468,711 468,712 467,713 466,713 466,713 448,713 447,714 433,714 433,713 406,713 406,714 385,714 384,713 372,713 371,714 366,714 366,715 365,715 364,714 350,714 350,713 347,713 346,713 334,713 334,713 333,714 328,714 327,715 303,715 301,712 300,712 299,711 278,711 278,711 278,710 277,709 246,709 245,710 235,710 234,710 233,710 232,710 217,710 216,710 201,710 201,711 201,711 201,713 200,714 200,715 200,715 171,715 170,714 170,713 170,713 170,711 152,711 151,711 131,711 131,712 110,712 110,711 98,711 97,710 50,710 49,711 49,714 48,715 47,715 47,731 48,731 49,731 49,736 50,737 50,738 81,738 81,737 83,735 152,735 153,736 171,736 172,735 238,735 238,736 261,736 262,735 263,735 263,735 278,735 278,735 305,735 306,735 395,735 396,734 408,734 408,733 409,734 430,734 430,733 464,733 465,734 470,734 471,735 471,735 472,735 472,736 473,737 496,737 497,736 508,736 508,736 510,733 526,733 527,734 531,734 532,735 532,736 561,736 562,736 562,735 563,735 563,734 563,733 565,733 566,733 581,733 581,732 626,732 626,733 643,733 644,732 661,732 661,731 680,731 680,730 680,730 690,730 690,713 680,713 678,712 646,712 646,711 646,711 645,710 645,708 629,708 629,710 628,710 628,711 628,711 622,711 621,712 610,712 610,713 595,713 594,712 593,712 592,711 592,708 570,708 570,710 569,710 568,710 568,710 550,710 550,711 550,712 549,712 548,711 548,708 548,707 530,707"/>
                <pc:TextEquiv conf="0.981864213943481">
                    <pc:Unicode>ſich des veſten Wohlſeins und nimmt noch häufig die anſtren—</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0009">
                <pc:Coords points="516,683 516,683 501,683 500,684 495,684 494,685 494,686 492,688 485,688 484,687 476,687 475,686 475,686 474,686 473,686 473,685 472,685 471,685 471,685 470,684 454,684 453,685 453,685 451,686 451,687 450,688 425,688 425,688 423,688 423,687 422,687 421,686 421,685 421,685 421,684 409,684 408,685 403,685 402,684 366,684 366,685 365,685 365,685 365,686 365,687 364,688 355,688 354,688 345,688 343,686 343,685 342,685 322,685 320,686 320,687 320,688 320,689 319,690 302,690 302,691 301,691 269,691 268,691 268,691 266,690 242,690 241,689 241,689 240,688 240,686 227,686 226,685 226,685 211,685 210,685 189,685 188,686 184,686 183,686 173,686 171,685 155,685 155,686 154,685 134,685 134,688 132,690 110,690 110,691 98,691 98,690 77,690 75,688 75,686 49,686 49,688 48,689 48,690 48,691 47,691 47,707 48,707 50,708 50,711 78,711 79,710 98,710 98,710 120,710 121,710 156,710 156,710 170,710 171,710 176,710 177,711 178,711 178,711 191,711 192,711 193,711 194,710 214,710 215,711 215,711 216,711 226,711 226,710 227,709 268,709 268,708 268,708 311,708 311,706 312,706 320,706 321,706 321,707 321,708 322,708 323,708 323,708 324,709 325,709 326,710 338,710 339,709 386,709 386,708 415,708 415,708 418,708 419,708 431,708 431,708 439,708 440,708 456,708 456,709 470,709 470,708 478,708 479,708 501,708 503,709 523,709 523,708 530,708 531,708 560,708 560,707 575,707 576,708 591,708 591,707 608,707 608,706 630,706 630,707 646,707 646,706 687,706 688,706 688,706 688,705 689,704 690,704 690,688 688,688 687,687 687,686 650,686 649,686 648,686 648,685 648,684 646,684 646,683 636,683 636,685 634,686 622,686 621,687 613,687 612,688 609,688 608,687 592,687 591,686 591,686 590,686 590,683 567,683 566,683 549,683 548,683 543,683 542,684 540,684 540,683 536,683 536,683 516,683"/>
                <pc:TextEquiv conf="0.989803791046143">
                    <pc:Unicode>ſehen und begünſtigen. Se. M. der Deutſche Kaiſer erfreut</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0010">
                <pc:Coords points="561,657 561,659 560,660 560,661 560,661 556,661 556,662 546,662 546,663 523,663 521,661 521,661 521,660 521,658 505,658 505,661 504,661 500,661 499,661 481,661 481,661 480,661 479,662 474,662 473,663 464,663 463,661 463,661 462,660 462,658 441,658 439,660 439,661 438,662 431,662 431,663 415,663 413,661 413,659 413,658 413,658 398,658 397,658 368,658 367,659 367,660 366,661 366,662 365,663 359,663 358,664 330,664 329,663 326,663 325,663 324,663 323,661 323,660 322,660 322,659 255,659 255,660 254,660 253,660 253,660 253,661 252,661 252,662 251,663 251,663 251,664 221,664 220,663 220,662 220,661 220,660 219,659 176,659 176,660 165,660 165,663 164,664 163,664 162,665 151,665 150,665 145,665 144,666 126,666 126,665 119,665 118,665 85,665 84,663 84,662 83,661 83,660 48,660 48,661 48,665 47,666 47,682 48,682 48,683 48,684 50,686 51,686 52,687 84,687 86,685 142,685 143,686 143,686 181,686 182,685 183,685 183,685 188,685 188,685 201,685 201,685 207,685 208,685 221,685 222,685 223,685 223,684 281,684 281,685 282,685 283,685 295,685 296,684 318,684 318,683 364,683 365,684 400,684 401,685 414,685 415,683 416,683 416,683 445,683 446,683 460,683 461,683 481,683 481,683 482,683 483,684 486,684 486,685 515,685 515,684 516,683 535,683 535,682 539,682 540,683 540,683 541,684 556,684 557,685 570,685 571,683 583,683 584,683 585,683 596,683 597,683 598,683 598,682 605,682 606,681 614,681 615,682 631,682 632,683 632,684 633,685 646,685 646,684 646,683 648,683 648,683 649,683 649,682 650,681 650,681 651,681 686,681 687,680 688,680 688,680 689,679 690,679 690,663 688,663 688,662 688,661 679,661 678,661 663,661 662,661 651,661 650,662 621,662 620,663 619,663 618,662 606,662 606,661 605,661 604,660 604,658 603,657 561,657"/>
                <pc:TextEquiv conf="0.992499232292175">
                    <pc:Unicode>höheren geiſtlichen Behörden ſolche Vorpoftengefechte gerne</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0011">
                <pc:Coords points="656,631 655,632 646,632 645,633 635,633 635,632 633,632 632,631 618,631 618,634 617,635 617,635 616,636 616,636 615,636 608,636 608,637 589,637 588,636 561,636 560,637 515,637 513,636 513,635 500,635 500,635 498,637 465,637 464,638 409,638 408,638 407,638 406,636 406,634 376,634 376,635 375,635 375,636 374,638 372,638 371,638 345,638 344,638 343,638 342,637 330,637 328,638 311,638 310,638 305,638 303,636 283,636 282,636 264,636 264,636 263,637 263,638 263,638 261,638 261,639 183,639 183,638 183,638 170,638 169,638 168,638 167,638 163,638 162,637 153,637 153,636 140,636 138,639 124,639 123,640 121,640 120,639 112,639 111,638 111,636 91,636 91,636 90,636 90,637 88,637 88,638 87,640 61,640 60,639 48,639 48,640 47,640 47,656 48,657 48,659 81,659 81,660 118,660 119,660 119,662 135,662 135,661 135,661 135,660 136,660 152,660 153,659 161,659 162,658 163,658 163,659 175,659 176,658 181,658 182,659 220,659 220,660 236,660 237,659 240,659 241,658 312,658 313,658 363,658 363,658 387,658 388,658 427,658 428,659 431,659 431,660 443,660 443,659 445,658 448,658 449,657 531,657 532,656 538,656 538,657 562,657 563,656 578,656 579,657 591,657 591,656 611,656 612,656 613,656 614,656 646,656 646,656 685,656 685,655 688,655 688,653 688,653 690,653 690,636 688,636 687,636 680,636 678,634 678,631 656,631"/>
                <pc:TextEquiv conf="0.987055063247681">
                    <pc:Unicode>der Tagesordnung und werden ſolange vorkommen, als die</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0012">
                <pc:Coords points="518,606 518,606 503,606 503,609 503,610 503,610 501,611 501,611 500,612 480,612 480,613 443,613 441,611 441,608 439,608 438,608 406,608 406,609 405,610 405,611 404,612 397,612 396,613 368,613 366,611 366,611 366,610 366,609 363,609 363,608 346,608 346,609 345,610 340,610 339,610 337,610 336,610 326,610 325,609 325,609 324,608 306,608 305,609 296,609 295,608 276,608 276,609 275,609 274,610 274,611 272,613 271,613 270,613 250,613 250,613 249,613 248,612 248,611 248,611 248,608 217,608 217,608 216,609 189,609 189,611 188,611 188,612 187,613 181,613 181,614 176,614 175,615 147,615 146,614 78,614 77,615 73,615 72,614 64,614 63,613 63,613 62,613 61,613 60,611 48,611 48,612 47,613 47,631 48,633 48,636 64,636 65,636 65,635 65,635 66,635 66,634 120,634 121,635 121,637 136,637 136,636 138,634 170,634 171,635 171,636 171,637 183,637 183,636 183,636 184,636 184,635 185,634 200,634 201,633 223,633 223,634 224,634 225,635 225,636 248,636 248,636 248,635 250,634 250,634 251,633 275,633 277,636 277,636 278,637 311,637 311,636 312,636 315,636 315,635 315,635 316,635 316,634 343,634 343,633 344,633 345,633 385,633 386,632 404,632 406,634 414,634 415,635 415,635 416,635 438,635 438,635 441,635 443,633 444,633 445,632 506,632 506,633 507,633 508,634 512,634 513,635 516,635 516,635 564,635 565,635 565,634 567,631 568,631 568,631 615,631 616,631 621,631 622,632 623,632 623,633 635,633 636,631 637,631 638,631 643,631 643,630 657,630 658,631 686,631 686,628 686,628 690,628 690,611 689,611 688,611 688,606 676,606 676,608 676,608 676,610 675,610 663,610 663,611 639,611 638,610 637,610 635,608 635,608 635,607 619,607 619,609 617,611 575,611 573,610 573,609 573,608 573,607 562,607 561,606 553,606 552,606 518,606"/>
                <pc:TextEquiv conf="0.996099174022675">
                    <pc:Unicode>ſperrungen zelotiſcher Hetzkapläne ſtehen auch jetzt noch auf</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0013">
                <pc:Coords points="640,581 639,581 639,583 638,584 638,585 638,585 601,585 600,584 600,581 564,581 564,583 563,583 563,585 562,586 514,586 513,586 513,586 512,585 500,585 500,586 499,587 453,587 452,588 439,588 438,587 436,587 435,586 435,583 396,583 395,583 395,584 395,585 395,586 393,588 346,588 345,588 332,588 331,587 330,587 330,586 330,584 306,584 306,585 305,585 305,586 305,587 304,587 303,588 293,588 292,588 283,588 283,589 269,589 268,588 246,588 246,588 234,588 233,588 200,588 198,586 198,584 192,584 191,583 181,583 181,583 145,583 145,583 135,583 135,584 129,584 128,583 107,583 106,584 103,584 103,585 96,585 95,584 75,584 75,585 61,585 61,584 48,584 47,585 47,608 48,609 60,609 61,610 77,610 78,611 91,611 92,610 93,610 93,609 95,609 96,610 116,610 116,609 143,609 143,610 144,610 145,610 145,611 156,611 156,610 158,609 168,609 168,610 178,610 179,610 191,610 193,608 199,608 200,608 231,608 231,607 236,607 236,608 265,608 266,608 281,608 282,610 332,610 333,609 333,608 333,608 336,608 337,607 392,607 393,606 394,606 395,607 401,607 401,608 414,608 415,607 455,607 456,608 456,610 471,610 472,609 481,609 481,608 482,608 486,608 486,607 496,607 497,606 503,606 503,606 606,606 607,605 642,605 643,606 656,606 657,605 678,605 679,604 679,603 680,602 690,602 690,586 678,586 678,585 667,585 666,585 666,585 665,584 665,583 665,583 665,581 640,581"/>
                <pc:TextEquiv conf="0.993716180324554">
                    <pc:Unicode>d. h. Nachrichten von größerem Belange, denn kleine Ein—</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0014">
                <pc:Coords points="346,557 345,558 333,558 333,558 333,559 332,560 332,561 331,562 330,562 329,563 296,563 295,562 295,562 293,561 293,560 293,559 269,559 268,558 254,558 254,561 252,563 226,563 226,562 225,562 224,561 223,561 223,561 223,560 211,560 211,561 210,561 203,561 202,562 197,562 196,563 171,563 170,563 140,563 140,563 139,563 138,562 138,561 138,561 138,560 137,559 135,559 135,558 130,558 129,558 82,558 81,558 70,558 69,559 47,559 47,583 48,584 118,584 119,585 131,585 132,583 169,583 170,583 206,583 206,582 232,582 233,583 255,583 256,583 268,583 269,583 271,583 273,585 273,586 285,586 285,585 286,584 286,583 288,582 315,582 315,583 328,583 330,584 331,584 331,585 333,585 333,585 351,585 352,584 352,583 353,582 378,582 378,581 400,581 401,582 427,582 428,581 433,581 434,582 464,582 465,581 465,581 466,582 467,582 468,583 468,583 468,584 468,585 485,585 485,584 485,583 486,581 521,581 522,581 558,581 559,580 570,580 571,581 583,581 585,583 601,583 601,582 603,581 606,581 607,580 636,580 636,580 668,580 668,580 683,580 683,578 683,578 690,578 690,561 681,561 680,560 669,560 668,560 657,560 656,559 644,559 643,560 642,560 641,560 589,560 588,561 523,561 521,560 510,560 508,561 500,561 500,561 491,561 491,561 485,561 485,560 483,560 483,560 481,560 480,558 480,558 465,558 464,558 455,558 455,559 438,559 438,560 438,560 436,561 418,561 416,559 400,559 400,560 400,561 399,561 398,560 378,560 378,561 376,561 375,560 375,558 375,558 368,558 368,557 346,557"/>
                <pc:TextEquiv conf="0.993075370788574">
                    <pc:Unicode>Nachrichten in der letzten Zeit etwas ſparſamer geworden,</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0015">
                <pc:Coords points="658,528 658,529 655,529 655,530 630,530 629,529 597,529 596,530 596,532 595,533 595,535 595,535 555,535 555,535 555,531 531,531 531,530 503,530 502,531 502,533 501,533 493,533 491,535 485,535 485,536 462,536 461,535 461,532 461,531 460,531 460,531 446,531 445,531 445,531 444,532 444,535 443,536 441,536 441,536 406,536 405,536 404,536 403,535 403,532 403,531 359,531 358,533 358,535 357,536 356,536 355,537 309,537 308,536 306,536 305,536 305,531 273,531 271,533 270,533 270,533 263,533 263,534 263,535 262,535 261,536 258,536 258,536 258,537 226,537 225,536 211,536 210,536 210,537 210,538 177,538 176,537 175,537 175,536 162,536 160,538 142,538 141,538 124,538 123,536 122,536 121,536 121,533 121,532 104,532 103,533 101,533 101,534 100,535 97,535 96,536 95,536 95,537 94,538 94,539 93,540 93,554 103,554 103,555 103,555 104,556 105,556 105,556 131,556 132,557 132,558 160,558 161,557 162,557 163,558 205,558 205,557 261,557 262,558 271,558 273,560 273,560 305,560 305,560 308,557 318,557 319,556 323,556 323,557 340,557 341,556 358,556 359,557 397,557 398,556 423,556 424,556 440,556 440,556 490,556 491,558 491,559 492,560 520,560 520,559 520,558 521,556 522,556 523,556 539,556 540,555 594,555 595,556 607,556 608,555 635,555 635,555 645,555 646,554 655,554 656,555 681,555 681,554 686,554 686,553 686,551 687,550 690,550 690,536 689,535 689,535 688,534 678,534 677,533 677,529 671,529 671,528 658,528"/>
                <pc:TextEquiv conf="0.975517272949219">
                    <pc:Unicode>Von den deutfchen Cultur-Kampfſtätten ſind die</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextLine id="region_0010_line_0016">
                <pc:Coords points="549,500 548,500 543,500 542,501 516,501 515,501 515,502 515,503 514,503 514,503 513,504 513,505 513,505 510,505 510,506 482,506 481,505 480,505 480,505 479,505 478,504 466,504 465,505 465,505 464,506 451,506 450,505 436,505 435,504 435,501 404,501 403,501 390,501 389,501 375,501 375,501 349,501 348,502 348,505 348,506 345,506 345,506 340,506 340,523 346,523 346,523 346,528 360,528 361,527 386,527 387,526 406,526 406,527 418,527 419,526 420,526 420,526 460,526 461,525 467,525 468,526 487,526 488,525 571,525 571,526 583,526 584,525 598,525 598,525 656,525 656,522 656,521 663,521 663,511 662,511 662,510 661,509 660,509 658,507 658,505 656,505 655,505 646,505 645,504 636,504 636,503 636,502 635,501 635,500 622,500 621,500 621,500 620,501 620,503 619,504 596,504 596,503 591,503 591,503 588,503 588,502 578,502 577,501 570,501 568,500 549,500"/>
                <pc:TextEquiv conf="0.9727663397789">
                    <pc:Unicode>Roſenheim, den 5. September.</pc:Unicode>
                </pc:TextEquiv>
            </pc:TextLine>
            <pc:TextEquiv>
                <pc:Unicode>Kulturkampfes halten.
haben, weil ſie ihn für einen Gegner Bismarcks und des
bereiteten Feierlichkeiten zeigten, mag darin ſeinen Grund
Schwarzen ſich weniger zurückhaltend bei den dem Kronprinzen
allenthalben einen ſympathiſchen Empfang. Daß auch die
Reiches. der in Bahern mehrere Truppenrevüen abhielt, fand
gendſten Truppeninſpektionen vor. Der Kronprinz des Deutſchen
ſich des veſten Wohlſeins und nimmt noch häufig die anſtren—
ſehen und begünſtigen. Se. M. der Deutſche Kaiſer erfreut
höheren geiſtlichen Behörden ſolche Vorpoftengefechte gerne
der Tagesordnung und werden ſolange vorkommen, als die
ſperrungen zelotiſcher Hetzkapläne ſtehen auch jetzt noch auf
d. h. Nachrichten von größerem Belange, denn kleine Ein—
Nachrichten in der letzten Zeit etwas ſparſamer geworden,
Von den deutfchen Cultur-Kampfſtätten ſind die
Roſenheim, den 5. September.</pc:Unicode>
            </pc:TextEquiv>

training facilities and more region types

It would be awesome if some or all models used throughout eynollah's workflow could be adapted to other domains by providing the tools for training. Ideally this would be complemented with some documentation – but I assume you will publish academically on this approach sooner or later?

Specifically, it would be great if one could integrate detection of additional types of regions (provided there's some suitable structural GT), like:

  • vertical text
  • tables
  • handwriting
  • signatures
  • stamps
  • music scores
  • maps
  • image subclasses (figures, illustrations, photographs)

Or is there perhaps going to be some way of running incremental annotation here (i.e. masking certain areas of the page, so these can be segmented externally)? (Or is this already possible via API?)

Cannot run the file giving error.

Traceback (most recent call last):
File "/usr/bin/pytest", line 11, in
load_entry_point('pytest==3.3.2', 'console_scripts', 'pytest')()
File "/usr/local/lib/python2.7/dist-packages/pkg_resources/init.py", line 489, in load_entry_point
return get_distribution(dist).load_entry_point(group, name)
File "/usr/local/lib/python2.7/dist-packages/pkg_resources/init.py", line 2852, in load_entry_point
return ep.load()
File "/usr/local/lib/python2.7/dist-packages/pkg_resources/init.py", line 2443, in load
return self.resolve()
File "/usr/local/lib/python2.7/dist-packages/pkg_resources/init.py", line 2449, in resolve
module = import(self.module_name, fromlist=['name'], level=0)
File "/usr/lib/python2.7/dist-packages/pytest.py", line 13, in
from _pytest.fixtures import fixture, yield_fixture
File "/usr/lib/python2.7/dist-packages/_pytest/fixtures.py", line 842, in
class FixtureFunctionMarker(object):
File "/usr/lib/python2.7/dist-packages/_pytest/fixtures.py", line 844, in FixtureFunctionMarker
params = attr.ib(convert=attr.converters.optional(tuple))
TypeError: attrib() got an unexpected keyword argument 'convert'

After running pytest tests it givens me this kind of error. Can anyone help me out with this ?

TextLine coordinates too coarse

This is the same issue reported for the predecessor sbb_textline_detection before: region contours are fine-grained, tight polygons, but text lines are coarse bounding boxes that expand way beyond their parent region and overlap between neighbours.

Example:
FILE_0002_ORIGINAL_pageviewer-detail

Now, I understood the author as saying that this will not happen in eynollah. But he also said there's a performance trade-off. So can I enable the fine-grained text line contours somewhere?

Although I have not looked into the code yet, maybe my previous fix would apply here, too? (Yes, I do know extracting text line contours is more involved than just intersecting with the parent – especially separating neighbours. Still...)

Cant run eynollah

I just installed eynollah and I want to rty it on a jpg image.

I type in my cli the following eynollah -i repport_6.jpg -o eynollah

But the following error keep coming regardless of my output foler

  File "D:\Anaconda\envs\eynollah\Scripts\eynollah-script.py", line 33, in <module>
    sys.exit(load_entry_point('eynollah', 'console_scripts', 'eynollah')())
  File "D:\Anaconda\envs\eynollah\lib\site-packages\click\core.py", line 1128, in __call__
    return self.main(*args, **kwargs)
  File "D:\Anaconda\envs\eynollah\lib\site-packages\click\core.py", line 1053, in main
    rv = self.invoke(ctx)
  File "D:\Anaconda\envs\eynollah\lib\site-packages\click\core.py", line 1395, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "D:\Anaconda\envs\eynollah\lib\site-packages\click\core.py", line 754, in invoke
    return __callback(*args, **kwargs)
  File "C:\Users\eynollah\qurator\eynollah\cli.py", line 148, in main
    headers_off=headers_off,
  File "C:\Users\\eynollah\qurator\eynollah\eynollah.py", line 137, in __init__
    self.model_dir_of_enhancement = dir_models + "/model_enhancement.h5"
TypeError: unsupported operand type(s) for +: 'NoneType' and 'str'

Im on windows10 working on a conda environement

Does someone know where the issue come from and how to fix it?

No GPU acceleration

When running eynollah on an GPU-enabled server there is no GPU acceleration.
Below is when eynollah was run on a server without GPU (123.5 seconds)
eynollah_cpu_only

And this is when eynollah was run on a GPU-enabled server (133.4 seconds)
eynollah_with_gpu

Result of running nvidia-smi command
gpu_usage

Please guide me, did I miss something during installation or do I need to change certain settings.
Thank you.

Running results in OCR-D

Hello again. :)

In this closed issue , @kba kindly recommended the following workflow to use eynollah results in an OCR-D workflow:

ocrd workspace init
ocrd workspace add -G IMG -i IMG_1 -g page1 image1.png
ocrd workspace add -G SEG -i SEG_1 -g page1 image1.xml
ocrd-tesserocr-recognize -P segmentation_level none -P textequiv_level line

I'm having some challenges implementing this. It may just have to do with folders and paths, or maybe some "blanks" I failed to fill in...

Everything goes smoothly until the last line. (I believe it wants an input parameter?) The output is:

        Input fileGrp[@USE='INPUT'] not in METS!

If I try adding -I SEG, output includes the following:

Traceback (most recent call last):
  File "/home/scott/src/github/OCR-D/ocrd_all/venv/lib/python3.6/site-packages/ocrd/workspace.py", line 111, in download_file
    raise Exception("Not already downloaded, moving on")
Exception: Not already downloaded, moving on

and
FileNotFoundError: File path passed as 'url' to download_to_directory does not exist: C:/Users/Scott/Desktop/Python2/K/eyn_test2/F073r.jpg
and
FileNotFoundError: File path passed as 'url' to download_to_directory does not exist: /mnt/c/users/scott/desktop/python2/k/eyn_test2/C:/Users/Scott/Desktop/Python2/K/eyn_test2/F073r.jpg
and
Exception: Already tried prepending baseurl '/mnt/c/users/scott/desktop/python2/k/eyn_test2'. Cannot retrieve '/mnt/c/users/scott/desktop/python2/k/eyn_test2/C:/Users/Scott/Desktop/Python2/K/eyn_test2/F073r.jpg'

If I try adding -I SEG_1, the output is:

        Input fileGrp[@USE='SEG_1'] not in METS!

Any suggestions welcome and appreciated!

Unable to process document due to ValueError: attempt to get argmax of an empty sequence

For this workspace:

PPN729186350.zip

I get the following error:


Traceback (most recent call last):
  File "/usr/local/bin/ocrd-eynollah-segment", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 829, in __call__
    return self.main(*args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 782, in main
    rv = self.invoke(ctx)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 1066, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 610, in invoke
    return callback(*args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/ocrd_cli.py", line 8, in main
    return ocrd_cli_wrap_processor(EynollahProcessor, *args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/ocrd/decorators/__init__.py", line 91, in ocrd_cli_wrap_processor
    run_processor(processorClass, ocrd_tool, mets, workspace=workspace, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/ocrd/processor/helpers.py", line 72, in run_processor
    processor.process()
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/processor.py", line 57, in process
    Eynollah(**eynollah_kwargs).run()
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/eynollah.py", line 1744, in run
    contours_biggest = contours_only_text_parent[np.argmax(areas_cnt_text)]
  File "<__array_function__ internals>", line 6, in argmax
  File "/usr/local/lib/python3.6/dist-packages/numpy/core/fromnumeric.py", line 1186, in argmax
    return _wrapfunc(a, 'argmax', axis=axis, out=out)
  File "/usr/local/lib/python3.6/dist-packages/numpy/core/fromnumeric.py", line 61, in _wrapfunc
    return bound(*args, **kwds)
ValueError: attempt to get argmax of an empty sequence

Command line used:

ocrd-eynollah-segment --overwrite -I OCR-D-IMG-BIN -O OCR-D-SEG-LINE -P models /var/lib/eynollah

Memory usage explosion with very narrow images (e.g. book spine)

With this document (PPN894261851.zip) we experienced an OOM error. Further investigation revealed this memory usage (measured using procpath):

eynollah vs Buchrücken drawio

The culprit seems to be this "page" from the document - an image of a book spine:

FILE_0017_MAX tif

Relevant parts from the log output:

18:25:30.757 INFO eynollah - INPUT FILE PHYS_0017 (17/18)
18:25:30.780 INFO eynollah - resize and enhance image
18:25:30.780 INFO eynollah - Detected 25 DPI
18:25:40.756 INFO eynollah - Found 5 columns ([[4.1955504e-01 1.7818451e-13 2.7631987e-21 7.5972243e-22 5.8044493e-01
  0.0000000e+00]])
18:31:39.449 INFO eynollah - Image is enhanced
18:31:40.369 INFO eynollah - Enhancing took 369.5891568660736s
18:31:47.043 INFO eynollah - Image dimensions: 448x672
18:43:35.935 INFO eynollah - Image dimensions: 224x448
18:52:07.638 INFO eynollah - Image dimensions: 448x672
19:01:28.031 INFO eynollah - Textregion detection took 1787.6620445251465s
19:01:36.604 INFO eynollah - Graphics detection took 8.571088552474976s
19:01:36.604 INFO eynollah - cont_page [array([[  519,   445],
       [ 4404,   445],
       [ 4404, 27685],
       [  519, 27685]])]
19:01:41.160 INFO eynollah - Image dimensions: 448x672
19:08:15.645 INFO eynollah - textline detection took 399.04073786735535s
19:26:32.295 INFO eynollah - slope_deskew: -90.0
19:26:32.451 INFO eynollah - deskewing took 1096.8060252666473s
19:26:33.040 INFO eynollah - detection of marginals took 0.5885534286499023s
19:26:55.466 INFO eynollah - Image dimensions: 896x896
19:27:51.663 INFO eynollah - Image dimensions: 896x896
19:34:22.576 INFO eynollah - areas_cnt_text [1.60449940e-05 3.67248936e-05 4.69396395e-05 1.78734430e-05
 6.68446924e-05 1.59316018e-05 2.67794541e-05 3.35782605e-05
 2.04153178e-05 1.02601028e-04 1.49299709e-05 2.50974700e-05
 1.09640792e-04 4.56729543e-04 1.69521315e-05 7.82122588e-05
 9.06334276e-05 2.25603199e-04 1.58796304e-05 4.07455914e-05
 1.44858515e-05 1.97103964e-04 3.92242463e-05 2.14925435e-05
 2.01601854e-05 1.57520642e-05 1.14313495e-04 2.90331237e-05
 1.44291554e-04 2.15615238e-04 3.12064739e-05 4.46585667e-04
 2.03675986e-04 4.18700639e-05 2.75817038e-04 2.86669615e-04
 4.78515016e-05 1.76816212e-04 2.13172581e-04 2.02211337e-04
 3.27372684e-05 1.72403366e-05 1.62434303e-05 3.26522243e-05
 2.49226571e-05 1.41551243e-05 2.55297777e-04 2.39352001e-05
 1.48591008e-05 1.77080794e-05 1.41844173e-04 7.28828262e-05
 1.27079565e-04 1.09125803e-04 5.03886517e-05 1.61253135e-05
 2.59356273e-04 3.43578317e-05 1.49417826e-04 1.00711158e-04
 1.49819423e-05 5.42553252e-04 2.48706857e-05 2.26875554e-03
 4.71257916e-04 8.13966893e-05 7.39080805e-05 4.21195267e-04
 3.22033802e-05 2.35572262e-04 2.46580753e-05 2.20656465e-04
 2.95670119e-05 1.99759231e-05 4.83650737e-04 2.61520173e-04
 1.14686745e-04 5.78111151e-05 1.14729267e-04 1.89081467e-05
 1.68529133e-04 1.66998339e-04 1.72875834e-05 2.23552691e-04
 1.04831546e-03 6.28268293e-04 5.47693697e-04 1.98365452e-04
 2.78094331e-05 6.26397322e-05 5.01098959e-05 1.08133621e-04
 9.64258784e-05 5.27179162e-05 6.81203545e-05 1.25246392e-04
 7.48104933e-04 8.99908719e-05 6.32440181e-04 1.75379911e-05
 9.17437261e-05 3.56807405e-05 3.17781595e-05 2.56077349e-05
 1.14162306e-04 3.40275770e-04 1.91113077e-05 2.73133423e-05
 2.53143326e-04 4.32118714e-05 1.93848663e-04 3.59594963e-05
 1.95918070e-04 1.34687236e-03 1.60180634e-04 2.35761249e-05
 6.63717525e-04 4.14731913e-05 1.89790168e-05 1.82136195e-05
 1.86530142e-05 2.08773909e-04 2.22569958e-04 3.77780235e-04
 4.02589500e-05 5.98474497e-05 1.02081314e-04 3.75233635e-05
 4.72098908e-04 5.47306274e-05 1.23058868e-04 1.49281755e-03
 8.34802707e-05 1.13349662e-04 2.02093220e-04 2.57681376e-03
 2.15686108e-04 5.79150579e-05 4.43079958e-05 2.98197820e-04
 2.61132750e-05 8.44677276e-05 5.68189335e-05 3.62051794e-05
 7.14342410e-04 1.95589233e-03 1.87621542e-04 2.56549816e-05
 1.75568898e-05 1.43630100e-05 9.49763483e-04 5.73769175e-04
 3.36840932e-04 1.75474405e-05 1.04953916e-04 6.89329984e-05
 6.42224981e-05 2.66504705e-04 6.18412623e-05 5.68283828e-05
 2.05906032e-04 1.20568964e-04 2.07554943e-05 2.06421021e-05
 6.66509807e-05 3.16127959e-05 1.37913244e-05 7.39458779e-05
 3.90399840e-05 2.61038257e-05 2.60187815e-05 2.02953110e-04
 4.78609509e-05 1.26876404e-04 8.87908046e-05 4.99917791e-05
 2.68890665e-04 4.74404549e-05 1.45269562e-04 1.67092832e-04]
19:43:02.340 INFO eynollah - Job done in 4651.560835599899s

This log output is not from the OOM, but another run I did on a different machine to investigate the problem. If I interpret the cont_page part correctly, the image is blown up to [ 4404, 27685], which would certainly explain the OOM error on the other machine.

Reproduce with ocrd-eynollah-segment -I MAX -O TEST-SEGMENT -P models /path/to/models.

PAGE-XML coordinates can have self-intersections

On this image, eynollah produces polygons that are invalid:

ERROR processor.ExtractPages - Page "PHYS_0002" ImageRegion "r91" Self-intersection[2151 3197]
ERROR processor.ExtractPages - Page "PHYS_0002" ImageRegion "r92" Self-intersection[1605 99]
The incriminated data is here

    <ImageRegion id="r91">
      <Coords points="2631,3129 2630,3130 2628,3130 2627,3131 2625,3131 2623,3133 2622,3133 2621,3134 2620,3134 2619,3135 2618,3135 2616,3137 2613,3137 2612,3138 2610,3138 2608,3140 2607,3140 2606,3141 2605,3141 2604,3142 2602,3142 2601,3143 2598,3143 2597,3144 2594,3144 2593,3145 2592,3145 2590,3147 2589,3147 2588,3148 2587,3148 2586,3149 2585,3149 2584,3150 2583,3150 2582,3151 2581,3151 2579,3153 2575,3153 2574,3154 2569,3154 2568,3155 2565,3155 2564,3156 2558,3156 2557,3157 2555,3157 2554,3158 2552,3158 2550,3160 2546,3160 2545,3161 2543,3161 2542,3162 2540,3162 2539,3163 2535,3163 2534,3164 2530,3164 2529,3165 2525,3165 2524,3166 2521,3166 2520,3167 2519,3167 2518,3168 2515,3168 2514,3169 2510,3169 2509,3170 2508,3170 2507,3171 2501,3171 2500,3172 2489,3172 2488,3171 2479,3171 2478,3172 2476,3172 2475,3173 2472,3173 2471,3174 2470,3174 2468,3176 2462,3176 2461,3177 2459,3177 2458,3178 2452,3178 2451,3179 2437,3179 2436,3180 2422,3180 2421,3181 2414,3181 2413,3182 2406,3182 2405,3183 2401,3183 2400,3184 2380,3184 2379,3185 2365,3185 2364,3186 2357,3186 2356,3187 2311,3187 2310,3188 2270,3188 2269,3189 2266,3189 2265,3190 2263,3190 2262,3191 2258,3191 2257,3192 2236,3192 2235,3191 2224,3191 2223,3192 2221,3192 2220,3193 2218,3193 2217,3194 2216,3193 2205,3193 2204,3194 2198,3194 2197,3195 2193,3195 2192,3196 2186,3196 2185,3195 2177,3195 2176,3196 2169,3196 2168,3197 2151,3197 2630,3197 2631,3196 2632,3197 2690,3197 2690,3166 2682,3166 2681,3165 2668,3165 2665,3162 2664,3162 2662,3160 2657,3160 2655,3158 2655,3157 2653,3155 2653,3153 2651,3153 2650,3152 2650,3140 2649,3139 2649,3138 2648,3138 2647,3137 2646,3137 2645,3136 2645,3129 2631,3129"/>
    </ImageRegion>
    <ImageRegion id="r92">
      <Coords points="1628,90 1628,91 1633,91 1634,92 1634,93 1632,95 1631,95 1630,96 1625,96 1624,97 1623,97 1622,98 1609,98 1608,99 1605,99 1608,99 1609,98 1627,98 1628,97 1629,97 1630,98 1648,98 1649,99 1673,99 1674,98 1723,98 1724,99 1744,99 1745,98 1756,98 1757,99 1773,99 1774,98 1784,98 1785,99 1936,99 1937,100 1953,100 1954,99 1986,99 1987,100 2003,100 2004,99 2035,99 2036,100 2054,100 2055,99 2102,99 2103,100 2114,100 2115,99 2141,99 2142,98 2169,98 2170,97 2181,97 2182,98 2256,98 2257,97 2262,97 2263,96 2269,96 2270,97 2278,97 2279,96 2317,96 2318,95 2351,95 2352,94 2422,94 2423,93 2443,93 2444,92 2463,92 2464,91 2475,91 2476,90 2485,90 2486,91 2496,91 2497,90 2504,90 1628,90"/>
    </ImageRegion>

Eynollah process is crashing and exiting with -di flag

When eynollah is run with -di flag, in some instances it crashes and stops processing.

The command used:
eynollah -di ./issue_images/000___2020___sustainability_report___ed8c9421412e74c2ad60da1f702dafa2__174456655 -m ./models/ -o ./page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/ -light -fl -ipe -tab

The images were generated from pdfs using pdf2image python package.

Server specifications are as follows:

Server | GPUs| CPUs| RAM| GPU Memory
g3s.xlarge | 1 | 4 | 30.5 | 8

Tesla M60 GPU

I've attached the images of one pdfs, and also attached 10 sample pdfs that are causing the issue. Please note that some of the documents are not in English, would that cause any issue?
000___2020___sustainability_report___ed8c9421412e74c2ad60da1f702dafa2__174456655.zip
pdfs

It was recommended earlier to use -ipe flag to avoid another issue occurring: https://github.com/qurator-spk/eynollah/issues/77

The issue does not appear when running with the -i flag. But because we have a lot of documents to process, we need the -di flag.

The stack trace is:

eynollah -di ./issue_images/000___2020___sustainability_report___ed8c9421412e74c2ad60da1f702dafa2__174456655 -m ./models/ -o ./page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/ -light -fl -ipe -tab
[Errno 2] No such file or directory: 'identify': 'identify'
17:23:21.814 INFO eynollah - Resizing and enhancing image...
17:23:21.814 INFO eynollah - Detected 230 DPI
17:23:26.879 INFO eynollah - Found 1 columns ([[9.9999678e-01 3.2104797e-06 9.3880003e-18 2.2546431e-17 3.9680694e-18
  4.1701970e-16]])
17:23:26.893 INFO eynollah - Image was enhanced.
17:23:26.903 INFO eynollah - Enhancing took 5.2s 
17:23:27.190 INFO eynollah - Image dimensions: 224x448
17:23:32.681 INFO eynollah - Image dimensions: 448x672
17:23:38.579 INFO eynollah - Image dimensions: 448x672
17:23:51.908 INFO eynollah - slope_deskew: 0.22727272727272663
17:24:11.386 INFO eynollah - detection of marginals took 0.2s
17:24:14.449 INFO eynollah - Image dimensions: 896x896
17:24:22.544 INFO eynollah - Image dimensions: 896x896
17:24:33.125 INFO eynollah - Job done in 71.4s
17:24:33.125 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page15.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:24:33.270 INFO eynollah - Resizing and enhancing image...
17:24:33.270 INFO eynollah - Detected 230 DPI
17:24:33.361 INFO eynollah - Found 2 columns ([[8.2128608e-06 9.9959677e-01 3.6321968e-08 3.7463789e-04 8.4602252e-06
  1.1857667e-05]])
17:24:33.375 INFO eynollah - Image was enhanced.
17:24:33.386 INFO eynollah - Enhancing took 0.2s 
17:24:33.692 INFO eynollah - Image dimensions: 224x448
17:24:35.564 INFO eynollah - Image dimensions: 448x672
17:24:37.337 INFO eynollah - Image dimensions: 448x672
17:24:47.917 INFO eynollah - slope_deskew: 0.22727272727272663
17:25:08.496 INFO eynollah - detection of marginals took 0.2s
17:25:11.471 INFO eynollah - Image dimensions: 896x896
17:25:13.299 INFO eynollah - Image dimensions: 896x896
17:25:22.837 INFO eynollah - Job done in 49.7s
17:25:22.838 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page60.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:25:22.961 INFO eynollah - Resizing and enhancing image...
17:25:22.961 INFO eynollah - Detected 230 DPI
17:25:23.047 INFO eynollah - Found 2 columns ([[9.0465095e-04 9.9909532e-01 1.9286886e-11 2.2578611e-11 1.8923075e-12
  1.3957758e-09]])
17:25:23.061 INFO eynollah - Image was enhanced.
17:25:23.067 INFO eynollah - Enhancing took 0.2s 
17:25:23.392 INFO eynollah - Image dimensions: 224x448
17:25:25.271 INFO eynollah - Image dimensions: 448x672
17:25:27.041 INFO eynollah - Image dimensions: 448x672
17:25:37.847 INFO eynollah - slope_deskew: -2.5
17:25:59.285 INFO eynollah - detection of marginals took 0.1s
17:26:02.021 INFO eynollah - Image dimensions: 896x896
17:26:03.874 INFO eynollah - Image dimensions: 896x896
17:26:11.034 INFO eynollah - Job done in 48.2s
17:26:11.034 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page68.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:26:11.164 INFO eynollah - Resizing and enhancing image...
17:26:11.164 INFO eynollah - Detected 230 DPI
17:26:11.253 INFO eynollah - Found 1 columns ([[1.0000000e+00 4.8253853e-15 1.0667150e-13 8.4323712e-19 1.3115061e-21
  2.3552048e-24]])
17:26:11.263 INFO eynollah - Image was enhanced.
17:26:11.269 INFO eynollah - Enhancing took 0.2s 
17:26:11.615 INFO eynollah - Image dimensions: 224x448
17:26:12.909 INFO eynollah - Image dimensions: 448x672
17:26:14.146 INFO eynollah - Image dimensions: 448x672
17:26:24.355 INFO eynollah - slope_deskew: -0.22727272727272663
17:26:47.650 INFO eynollah - detection of marginals took 0.2s
17:26:50.667 INFO eynollah - Image dimensions: 896x896
17:26:53.209 INFO eynollah - Image dimensions: 896x896
17:26:59.795 INFO eynollah - Job done in 48.8s
17:26:59.796 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page4.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:26:59.917 INFO eynollah - Resizing and enhancing image...
17:26:59.918 INFO eynollah - Detected 230 DPI
17:27:00.008 INFO eynollah - Found 3 columns ([[1.7893100e-04 5.9911486e-08 9.9982101e-01 6.1556178e-09 1.0413114e-13
  3.4249886e-08]])
17:27:00.019 INFO eynollah - Image was enhanced.
17:27:00.025 INFO eynollah - Enhancing took 0.2s 
17:27:00.413 INFO eynollah - Image dimensions: 224x448
17:27:03.388 INFO eynollah - Image dimensions: 448x672
17:27:06.196 INFO eynollah - Image dimensions: 448x672
17:27:17.682 INFO eynollah - slope_deskew: -0.22727272727272663
17:27:41.710 INFO eynollah - detection of marginals took 0.0s
17:27:44.726 INFO eynollah - Image dimensions: 896x896
17:27:46.601 INFO eynollah - Image dimensions: 896x896
17:27:57.192 ERROR eynollah - index 77 is out of bounds for axis 0 with size 77
17:27:57.581 INFO eynollah - Job done in 57.8s
17:27:57.581 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page42.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:27:57.705 INFO eynollah - Resizing and enhancing image...
17:27:57.705 INFO eynollah - Detected 230 DPI
17:27:57.793 INFO eynollah - Found 2 columns ([[2.1592376e-01 7.8407621e-01 1.9423045e-10 2.0287512e-09 3.5663394e-09
  5.5609437e-11]])
17:27:57.799 INFO eynollah - Image was enhanced.
17:27:57.805 INFO eynollah - Enhancing took 0.2s 
17:27:58.198 INFO eynollah - Image dimensions: 224x448
17:28:00.243 INFO eynollah - Image dimensions: 448x672
17:28:02.181 INFO eynollah - Image dimensions: 448x672
17:28:12.333 INFO eynollah - slope_deskew: -6.136363636363637
17:28:37.773 INFO eynollah - detection of marginals took 0.2s
17:28:40.800 INFO eynollah - Image dimensions: 896x896
17:28:42.733 INFO eynollah - Image dimensions: 896x896
17:28:50.875 INFO eynollah - Job done in 53.3s
17:28:50.875 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page0.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:28:50.998 INFO eynollah - Resizing and enhancing image...
17:28:50.998 INFO eynollah - Detected 230 DPI
17:28:51.086 INFO eynollah - Found 2 columns ([[8.5421326e-04 9.9914575e-01 4.5472558e-12 1.1925855e-09 2.6132213e-13
  2.1704227e-10]])
17:28:51.093 INFO eynollah - Image was enhanced.
17:28:51.098 INFO eynollah - Enhancing took 0.2s 
17:28:51.510 INFO eynollah - Image dimensions: 224x448
17:28:53.584 INFO eynollah - Image dimensions: 448x672
17:28:55.537 INFO eynollah - Image dimensions: 448x672
17:29:06.026 INFO eynollah - slope_deskew: 0.22727272727272663
17:29:32.607 INFO eynollah - detection of marginals took 0.2s
17:29:35.765 INFO eynollah - Image dimensions: 896x896
17:29:37.700 INFO eynollah - Image dimensions: 896x896
17:29:46.908 INFO eynollah - Job done in 56.0s
17:29:46.908 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page14.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:29:47.027 INFO eynollah - Resizing and enhancing image...
17:29:47.027 INFO eynollah - Detected 230 DPI
17:29:47.140 INFO eynollah - Found 1 columns ([[9.9490535e-01 5.0945967e-03 1.8765526e-08 5.1456077e-08 4.9560528e-10
  2.8305782e-09]])
17:29:47.150 INFO eynollah - Image was enhanced.
17:29:47.155 INFO eynollah - Enhancing took 0.2s 
17:29:47.592 INFO eynollah - Image dimensions: 224x448
17:29:49.270 INFO eynollah - Image dimensions: 448x672
17:29:50.708 INFO eynollah - Image dimensions: 448x672
17:30:00.833 INFO eynollah - slope_deskew: 0.22727272727272663
17:30:29.102 INFO eynollah - detection of marginals took 0.2s
17:30:32.179 INFO eynollah - Image dimensions: 896x896
17:30:34.940 INFO eynollah - Image dimensions: 896x896
17:30:41.077 INFO eynollah - Job done in 54.2s
17:30:41.077 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page70.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:30:41.200 INFO eynollah - Resizing and enhancing image...
17:30:41.201 INFO eynollah - Detected 230 DPI
17:30:41.286 INFO eynollah - Found 2 columns ([[2.0235138e-04 9.9979764e-01 2.1669530e-24 2.6852763e-33 1.0369204e-29
  1.3410287e-24]])
17:30:41.292 INFO eynollah - Image was enhanced.
17:30:41.298 INFO eynollah - Enhancing took 0.2s 
17:30:41.770 INFO eynollah - Image dimensions: 224x448
17:30:43.968 INFO eynollah - Image dimensions: 448x672
17:30:46.034 INFO eynollah - Image dimensions: 448x672
17:30:56.518 INFO eynollah - slope_deskew: 0.22727272727272663
17:31:25.699 INFO eynollah - detection of marginals took 0.1s
17:31:28.954 INFO eynollah - Image dimensions: 896x896
17:31:30.949 INFO eynollah - Image dimensions: 896x896
17:31:41.950 INFO eynollah - Job done in 60.9s
17:31:41.950 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page23.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:31:42.077 INFO eynollah - Resizing and enhancing image...
17:31:42.077 INFO eynollah - Detected 230 DPI
17:31:42.167 INFO eynollah - Found 2 columns ([[1.7589232e-02 9.8240948e-01 3.6152553e-08 3.3549600e-07 1.9289803e-10
  9.5576365e-07]])
17:31:42.176 INFO eynollah - Image was enhanced.
17:31:42.182 INFO eynollah - Enhancing took 0.2s 
17:31:42.696 INFO eynollah - Image dimensions: 224x448
17:31:44.942 INFO eynollah - Image dimensions: 448x672
17:31:47.045 INFO eynollah - Image dimensions: 448x672
17:31:57.927 INFO eynollah - slope_deskew: -0.22727272727272663
17:32:28.396 INFO eynollah - detection of marginals took 0.1s
17:32:31.417 INFO eynollah - Image dimensions: 896x896
17:32:33.438 INFO eynollah - Image dimensions: 896x896
17:32:42.399 INFO eynollah - Job done in 60.4s
17:32:42.400 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page30.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:32:42.527 INFO eynollah - Resizing and enhancing image...
17:32:42.527 INFO eynollah - Detected 230 DPI
17:32:42.626 INFO eynollah - Found 4 columns ([[1.13236457e-02 4.73081786e-03 2.45059677e-03 9.75846648e-01
  5.64821949e-03 1.18253396e-07]])
17:32:42.645 INFO eynollah - Image was enhanced.
17:32:42.664 INFO eynollah - Enhancing took 0.3s 
17:32:43.195 INFO eynollah - Image dimensions: 224x448
17:32:48.472 INFO eynollah - Image dimensions: 448x672
17:32:52.762 INFO eynollah - Image dimensions: 448x672
17:33:27.814 INFO eynollah - slope_deskew: 0.22727272727272663
17:34:02.510 INFO eynollah - detection of marginals took 0.0s
17:34:13.798 INFO eynollah - Image dimensions: 896x896
17:34:16.284 INFO eynollah - Image dimensions: 896x896
17:34:38.955 ERROR eynollah - index 17 is out of bounds for axis 0 with size 17
17:34:39.297 INFO eynollah - Job done in 116.9s
17:34:39.298 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page72.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:34:39.434 INFO eynollah - Resizing and enhancing image...
17:34:39.435 INFO eynollah - Detected 230 DPI
17:34:39.536 INFO eynollah - Found 2 columns ([[5.8579806e-04 9.9939597e-01 4.9447756e-08 1.7291617e-05 6.3068683e-07
  2.2113298e-07]])
17:34:39.543 INFO eynollah - Image was enhanced.
17:34:39.548 INFO eynollah - Enhancing took 0.2s 
17:34:40.090 INFO eynollah - Image dimensions: 224x448
17:34:42.427 INFO eynollah - Image dimensions: 448x672
17:34:44.667 INFO eynollah - Image dimensions: 448x672
17:34:55.640 INFO eynollah - slope_deskew: 0.22727272727272663
17:35:29.373 INFO eynollah - detection of marginals took 0.2s
17:35:32.662 INFO eynollah - Image dimensions: 896x896
17:35:34.696 INFO eynollah - Image dimensions: 896x896
17:35:44.273 INFO eynollah - Job done in 65.0s
17:35:44.273 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page43.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:35:44.394 INFO eynollah - Resizing and enhancing image...
17:35:44.395 INFO eynollah - Detected 230 DPI
17:35:44.506 INFO eynollah - Found 1 columns ([[9.9999988e-01 1.2977694e-07 1.8217089e-19 1.1470287e-18 2.0206059e-19
  2.6257076e-16]])
17:35:44.513 INFO eynollah - Image was enhanced.
17:35:44.519 INFO eynollah - Enhancing took 0.2s 
17:35:45.082 INFO eynollah - Image dimensions: 224x448
17:35:46.970 INFO eynollah - Image dimensions: 448x672
17:35:48.669 INFO eynollah - Image dimensions: 448x672
17:35:58.930 INFO eynollah - slope_deskew: 0.22727272727272663
17:36:33.567 INFO eynollah - detection of marginals took 0.2s
17:36:36.979 INFO eynollah - Image dimensions: 896x896
17:36:39.856 INFO eynollah - Image dimensions: 896x896
17:36:47.622 INFO eynollah - Job done in 63.3s
17:36:47.622 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page63.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:36:47.746 INFO eynollah - Resizing and enhancing image...
17:36:47.747 INFO eynollah - Detected 230 DPI
17:36:47.849 INFO eynollah - Found 1 columns ([[9.36765969e-01 4.50825573e-05 4.65917517e-04 2.67724935e-02
  1.00483385e-05 3.59404497e-02]])
17:36:47.858 INFO eynollah - Image was enhanced.
17:36:47.864 INFO eynollah - Enhancing took 0.2s 
17:36:48.451 INFO eynollah - Image dimensions: 224x448
17:36:50.220 INFO eynollah - Image dimensions: 448x672
17:36:51.875 INFO eynollah - Image dimensions: 448x672
17:37:03.023 INFO eynollah - slope_deskew: 90.68181818181819
17:37:38.752 INFO eynollah - detection of marginals took 0.2s
17:37:42.009 INFO eynollah - Image dimensions: 896x896
17:37:44.793 INFO eynollah - Image dimensions: 896x896
17:37:52.793 INFO eynollah - Job done in 65.2s
17:37:52.793 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page11.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:37:52.923 INFO eynollah - Resizing and enhancing image...
17:37:52.923 INFO eynollah - Detected 230 DPI
17:37:53.032 INFO eynollah - Found 2 columns ([[4.5354588e-12 1.0000000e+00 9.2473356e-21 2.6050010e-20 1.2868872e-18
  2.7973180e-20]])
17:37:53.038 INFO eynollah - Image was enhanced.
17:37:53.043 INFO eynollah - Enhancing took 0.2s 
17:37:53.646 INFO eynollah - Image dimensions: 224x448
17:37:56.177 INFO eynollah - Image dimensions: 448x672
17:37:58.556 INFO eynollah - Image dimensions: 448x672
17:38:09.662 INFO eynollah - slope_deskew: 0.22727272727272663
17:38:47.611 INFO eynollah - detection of marginals took 0.2s
17:38:50.928 INFO eynollah - Image dimensions: 896x896
17:38:53.035 INFO eynollah - Image dimensions: 896x896
17:39:04.130 INFO eynollah - Job done in 71.3s
17:39:04.131 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page3.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:39:04.256 INFO eynollah - Resizing and enhancing image...
17:39:04.256 INFO eynollah - Detected 230 DPI
17:39:04.357 INFO eynollah - Found 2 columns ([[9.8076407e-06 9.9999022e-01 6.2655035e-13 2.4978108e-11 6.4944949e-10
  5.6354559e-15]])
17:39:04.364 INFO eynollah - Image was enhanced.
17:39:04.370 INFO eynollah - Enhancing took 0.2s 
17:39:05.034 INFO eynollah - Image dimensions: 224x448
17:39:07.733 INFO eynollah - Image dimensions: 448x672
17:39:10.201 INFO eynollah - Image dimensions: 448x672
17:39:22.690 INFO eynollah - slope_deskew: 33.40909090909091
17:40:02.293 INFO eynollah - detection of marginals took 0.2s
17:40:05.281 INFO eynollah - Image dimensions: 896x896
17:40:07.485 INFO eynollah - Image dimensions: 896x896
17:40:17.665 INFO eynollah - Job done in 73.5s
17:40:17.666 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page62.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:40:17.791 INFO eynollah - Resizing and enhancing image...
17:40:17.791 INFO eynollah - Detected 230 DPI
17:40:17.896 INFO eynollah - Found 1 columns ([[1.0000000e+00 1.2721689e-14 1.5432566e-23 2.8695236e-22 5.5751865e-24
  2.9005479e-22]])
17:40:17.903 INFO eynollah - Image was enhanced.
17:40:17.908 INFO eynollah - Enhancing took 0.2s 
17:40:18.571 INFO eynollah - Image dimensions: 224x448
17:40:20.468 INFO eynollah - Image dimensions: 448x672
17:40:22.334 INFO eynollah - Image dimensions: 448x672
17:40:32.670 INFO eynollah - slope_deskew: -0.22727272727272663
17:41:12.817 INFO eynollah - detection of marginals took 0.2s
17:41:16.060 INFO eynollah - Image dimensions: 896x896
17:41:19.008 INFO eynollah - Image dimensions: 896x896
17:41:27.155 INFO eynollah - Job done in 69.5s
17:41:27.156 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page59.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:41:27.285 INFO eynollah - Resizing and enhancing image...
17:41:27.285 INFO eynollah - Detected 230 DPI
17:41:27.388 INFO eynollah - Found 1 columns ([[9.1651952e-01 7.7357300e-02 6.1220485e-03 1.4635415e-07 3.6491908e-08
  1.0714779e-06]])
17:41:27.395 INFO eynollah - Image was enhanced.
17:41:27.400 INFO eynollah - Enhancing took 0.2s 
17:41:28.085 INFO eynollah - Image dimensions: 224x448
17:41:30.108 INFO eynollah - Image dimensions: 448x672
17:41:31.964 INFO eynollah - Image dimensions: 448x672
17:41:42.410 INFO eynollah - slope_deskew: 0.22727272727272663
17:42:24.198 INFO eynollah - detection of marginals took 0.2s
17:42:27.480 INFO eynollah - Image dimensions: 896x896
17:42:30.336 INFO eynollah - Image dimensions: 896x896
17:42:38.635 INFO eynollah - Job done in 71.5s
17:42:38.636 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page64.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:42:38.761 INFO eynollah - Resizing and enhancing image...
17:42:38.761 INFO eynollah - Detected 230 DPI
17:42:38.870 INFO eynollah - Found 1 columns ([[1.0000000e+00 5.4450954e-08 1.9040921e-14 1.5524335e-14 2.4731620e-17
  1.8405318e-14]])
17:42:38.879 INFO eynollah - Image was enhanced.
17:42:38.884 INFO eynollah - Enhancing took 0.2s 
17:42:39.580 INFO eynollah - Image dimensions: 224x448
17:42:41.602 INFO eynollah - Image dimensions: 448x672
17:42:43.472 INFO eynollah - Image dimensions: 448x672
17:42:53.292 INFO eynollah - slope_deskew: 0.22727272727272663
17:43:35.945 INFO eynollah - detection of marginals took 0.2s
17:43:39.289 INFO eynollah - Image dimensions: 896x896
17:43:42.284 INFO eynollah - Image dimensions: 896x896
17:43:51.050 INFO eynollah - Job done in 72.4s
17:43:51.051 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page7.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:43:51.181 INFO eynollah - Resizing and enhancing image...
17:43:51.181 INFO eynollah - Detected 230 DPI
17:43:51.288 INFO eynollah - Found 2 columns ([[6.7446349e-06 9.9999321e-01 1.5826074e-21 1.1936957e-19 1.7129364e-22
  9.4692787e-20]])
17:43:51.296 INFO eynollah - Image was enhanced.
17:43:51.301 INFO eynollah - Enhancing took 0.2s 
17:43:52.031 INFO eynollah - Image dimensions: 224x448
17:43:54.823 INFO eynollah - Image dimensions: 448x672
17:43:57.462 INFO eynollah - Image dimensions: 448x672
17:44:08.676 INFO eynollah - slope_deskew: 0.22727272727272663
17:44:52.064 INFO eynollah - detection of marginals took 0.1s
17:44:55.271 INFO eynollah - Image dimensions: 896x896
17:44:57.471 INFO eynollah - Image dimensions: 896x896
17:45:09.898 INFO eynollah - Job done in 78.8s
17:45:09.898 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page31.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:45:10.029 INFO eynollah - Resizing and enhancing image...
17:45:10.030 INFO eynollah - Detected 230 DPI
17:45:10.161 INFO eynollah - Found 2 columns ([[1.2462350e-03 9.9875379e-01 1.0535631e-15 1.9107204e-15 9.0120409e-14
  4.8048621e-11]])
17:45:10.169 INFO eynollah - Image was enhanced.
17:45:10.174 INFO eynollah - Enhancing took 0.3s 
17:45:10.946 INFO eynollah - Image dimensions: 224x448
17:45:14.039 INFO eynollah - Image dimensions: 448x672
17:45:16.685 INFO eynollah - Image dimensions: 448x672
17:45:27.492 INFO eynollah - slope_deskew: 0.22727272727272663
17:46:12.853 INFO eynollah - detection of marginals took 0.2s
17:46:16.312 INFO eynollah - Image dimensions: 896x896
17:46:18.566 INFO eynollah - Image dimensions: 896x896
17:46:29.058 INFO eynollah - Job done in 79.2s
17:46:29.059 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page12.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:46:29.192 INFO eynollah - Resizing and enhancing image...
17:46:29.193 INFO eynollah - Detected 230 DPI
17:46:29.306 INFO eynollah - Found 1 columns ([[9.9098992e-01 8.4519759e-04 5.3344184e-04 7.6302164e-03 1.1421319e-06
  1.5900962e-07]])
17:46:29.314 INFO eynollah - Image was enhanced.
17:46:29.323 INFO eynollah - Enhancing took 0.3s 
17:46:30.088 INFO eynollah - Image dimensions: 224x448
17:46:32.235 INFO eynollah - Image dimensions: 448x672
17:46:34.268 INFO eynollah - Image dimensions: 448x672
17:46:44.769 INFO eynollah - slope_deskew: 0.22727272727272663
17:47:31.873 INFO eynollah - detection of marginals took 0.2s
17:47:35.470 INFO eynollah - Image dimensions: 896x896
17:47:38.544 INFO eynollah - Image dimensions: 896x896
17:47:46.865 INFO eynollah - Job done in 77.8s
17:47:46.865 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page74.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:47:46.996 INFO eynollah - Resizing and enhancing image...
17:47:46.997 INFO eynollah - Detected 230 DPI
17:47:47.112 INFO eynollah - Found 1 columns ([[5.2864259e-01 4.7135681e-01 2.6477535e-08 4.4270399e-07 1.0998244e-08
  1.2951223e-07]])
17:47:47.119 INFO eynollah - Image was enhanced.
17:47:47.124 INFO eynollah - Enhancing took 0.3s 
17:47:47.905 INFO eynollah - Image dimensions: 224x448
17:47:50.026 INFO eynollah - Image dimensions: 448x672
17:47:52.103 INFO eynollah - Image dimensions: 448x672
17:48:02.679 INFO eynollah - slope_deskew: -0.22727272727272663
17:48:50.552 INFO eynollah - detection of marginals took 0.2s
17:48:53.982 INFO eynollah - Image dimensions: 896x896
17:48:56.983 INFO eynollah - Image dimensions: 896x896
17:49:05.721 INFO eynollah - Job done in 78.9s
17:49:05.722 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page47.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:49:05.853 INFO eynollah - Resizing and enhancing image...
17:49:05.853 INFO eynollah - Detected 230 DPI
17:49:05.988 INFO eynollah - Found 2 columns ([[7.6000012e-02 9.2395031e-01 7.0659871e-07 4.8722104e-05 2.4176957e-08
  1.4622682e-07]])
17:49:05.997 INFO eynollah - Image was enhanced.
17:49:06.003 INFO eynollah - Enhancing took 0.3s 
17:49:06.835 INFO eynollah - Image dimensions: 224x448
17:49:10.045 INFO eynollah - Image dimensions: 448x672
17:49:12.873 INFO eynollah - Image dimensions: 448x672
17:49:24.162 INFO eynollah - slope_deskew: 0.22727272727272663
17:50:14.382 INFO eynollah - detection of marginals took 0.2s
17:50:17.908 INFO eynollah - Image dimensions: 896x896
17:50:20.219 INFO eynollah - Image dimensions: 896x896
17:50:31.639 INFO eynollah - Job done in 85.9s
17:50:31.639 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page8.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:50:31.783 INFO eynollah - Resizing and enhancing image...
17:50:31.783 INFO eynollah - Detected 230 DPI
17:50:31.898 INFO eynollah - Found 2 columns ([[8.7143868e-05 9.9986637e-01 8.0918295e-07 4.5673863e-05 7.9836700e-11
  5.7888422e-10]])
17:50:31.904 INFO eynollah - Image was enhanced.
17:50:31.909 INFO eynollah - Enhancing took 0.3s 
17:50:32.769 INFO eynollah - Image dimensions: 224x448
17:50:35.880 INFO eynollah - Image dimensions: 448x672
17:50:38.726 INFO eynollah - Image dimensions: 448x672
17:50:50.079 INFO eynollah - slope_deskew: -0.22727272727272663
17:51:41.755 INFO eynollah - detection of marginals took 0.2s
17:51:45.288 INFO eynollah - Image dimensions: 896x896
17:51:47.743 INFO eynollah - Image dimensions: 896x896
17:51:59.028 INFO eynollah - Job done in 87.4s
17:51:59.028 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page56.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:51:59.168 INFO eynollah - Resizing and enhancing image...
17:51:59.168 INFO eynollah - Detected 230 DPI
17:51:59.289 INFO eynollah - Found 4 columns ([[5.68567120e-05 1.11779338e-03 4.58855592e-10 9.98825252e-01
  1.03730954e-07 2.76266010e-09]])
17:51:59.313 INFO eynollah - Image was enhanced.
17:51:59.334 INFO eynollah - Enhancing took 0.3s 
17:52:00.209 INFO eynollah - Image dimensions: 224x448
17:52:06.558 INFO eynollah - Image dimensions: 448x672
17:52:11.737 INFO eynollah - Image dimensions: 448x672
17:52:42.828 INFO eynollah - slope_deskew: -0.22727272727272663
17:53:37.718 INFO eynollah - detection of marginals took 0.0s
17:53:46.263 INFO eynollah - Image dimensions: 896x896
17:53:49.078 INFO eynollah - Image dimensions: 896x896
17:54:13.775 ERROR eynollah - index 8 is out of bounds for axis 0 with size 8
17:54:14.216 INFO eynollah - Job done in 135.2s
17:54:14.216 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page66.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:54:14.361 INFO eynollah - Resizing and enhancing image...
17:54:14.361 INFO eynollah - Detected 230 DPI
17:54:14.481 INFO eynollah - Found 1 columns ([[1.0000000e+00 7.9131202e-10 4.3179337e-21 7.0015629e-21 8.6972561e-19
  4.6513189e-16]])
17:54:14.491 INFO eynollah - Image was enhanced.
17:54:14.497 INFO eynollah - Enhancing took 0.3s 
17:54:15.382 INFO eynollah - Image dimensions: 224x448
17:54:17.776 INFO eynollah - Image dimensions: 448x672
17:54:20.066 INFO eynollah - Image dimensions: 448x672
17:54:30.048 INFO eynollah - slope_deskew: 0.22727272727272663
17:55:23.608 INFO eynollah - detection of marginals took 0.2s
17:55:27.143 INFO eynollah - Image dimensions: 896x896
17:55:30.352 INFO eynollah - Image dimensions: 896x896
17:55:40.252 INFO eynollah - Job done in 86.0s
17:55:40.253 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page32.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:55:40.386 INFO eynollah - Resizing and enhancing image...
17:55:40.386 INFO eynollah - Detected 230 DPI
17:55:40.508 INFO eynollah - Found 1 columns ([[9.98961449e-01 1.03860244e-03 7.20132883e-13 3.84316907e-14
  1.03801373e-17 1.37641685e-14]])
17:55:40.518 INFO eynollah - Image was enhanced.
17:55:40.523 INFO eynollah - Enhancing took 0.3s 
17:55:41.416 INFO eynollah - Image dimensions: 224x448
17:55:43.894 INFO eynollah - Image dimensions: 448x672
17:55:46.185 INFO eynollah - Image dimensions: 448x672
17:55:56.727 INFO eynollah - slope_deskew: 0.22727272727272663
17:56:57.395 INFO eynollah - detection of marginals took 0.2s
17:57:00.892 INFO eynollah - Image dimensions: 896x896
17:57:04.053 INFO eynollah - Image dimensions: 896x896
17:57:11.800 INFO eynollah - Job done in 91.5s
17:57:11.801 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page71.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:57:11.939 INFO eynollah - Resizing and enhancing image...
17:57:11.939 INFO eynollah - Detected 230 DPI
17:57:12.064 INFO eynollah - Found 2 columns ([[4.1942412e-14 1.0000000e+00 1.9817858e-22 8.2292565e-24 3.4146444e-22
  1.0321113e-19]])
17:57:12.074 INFO eynollah - Image was enhanced.
17:57:12.079 INFO eynollah - Enhancing took 0.3s 
17:57:13.037 INFO eynollah - Image dimensions: 224x448
17:57:16.328 INFO eynollah - Image dimensions: 448x672
17:57:19.378 INFO eynollah - Image dimensions: 448x672
17:57:30.469 INFO eynollah - slope_deskew: -0.22727272727272663
17:58:26.865 INFO eynollah - detection of marginals took 0.2s
17:58:30.563 INFO eynollah - Image dimensions: 896x896
17:58:33.101 INFO eynollah - Image dimensions: 896x896
17:58:45.344 INFO eynollah - Job done in 93.5s
17:58:45.345 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page17.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
17:58:45.482 INFO eynollah - Resizing and enhancing image...
17:58:45.482 INFO eynollah - Detected 230 DPI
17:58:45.608 INFO eynollah - Found 2 columns ([[3.4554201e-01 6.4518327e-01 7.2022890e-09 2.8826059e-06 3.8390746e-07
  9.2713786e-03]])
17:58:45.615 INFO eynollah - Image was enhanced.
17:58:45.620 INFO eynollah - Enhancing took 0.3s 
17:58:46.591 INFO eynollah - Image dimensions: 224x448
17:58:49.997 INFO eynollah - Image dimensions: 448x672
17:58:53.049 INFO eynollah - Image dimensions: 448x672
17:59:04.171 INFO eynollah - slope_deskew: -8.40909090909091
18:00:02.593 INFO eynollah - detection of marginals took 0.2s
18:00:06.444 INFO eynollah - Image dimensions: 896x896
18:00:09.130 INFO eynollah - Image dimensions: 896x896
18:00:21.897 INFO eynollah - Job done in 96.5s
18:00:21.897 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page25.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
18:00:22.040 INFO eynollah - Resizing and enhancing image...
18:00:22.040 INFO eynollah - Detected 230 DPI
18:00:22.173 INFO eynollah - Found 2 columns ([[6.3049205e-02 9.3692261e-01 2.6634392e-08 3.3268186e-09 1.6232895e-09
  2.8214910e-05]])
18:00:22.183 INFO eynollah - Image was enhanced.
18:00:22.189 INFO eynollah - Enhancing took 0.3s 
18:00:23.201 INFO eynollah - Image dimensions: 224x448
18:00:26.646 INFO eynollah - Image dimensions: 448x672
18:00:29.787 INFO eynollah - Image dimensions: 448x672
18:00:41.327 INFO eynollah - slope_deskew: 0.22727272727272663
18:01:41.421 INFO eynollah - detection of marginals took 0.2s
18:01:45.340 INFO eynollah - Image dimensions: 896x896
18:01:47.978 INFO eynollah - Image dimensions: 896x896
18:02:01.805 INFO eynollah - Job done in 99.9s
18:02:01.806 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page55.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
18:02:01.961 INFO eynollah - Resizing and enhancing image...
18:02:01.962 INFO eynollah - Detected 230 DPI
18:02:02.100 INFO eynollah - Found 1 columns ([[9.9999738e-01 2.5688335e-06 7.4584470e-15 3.8716059e-15 4.9351511e-15
  3.1203092e-13]])
18:02:02.111 INFO eynollah - Image was enhanced.
18:02:02.117 INFO eynollah - Enhancing took 0.3s 
18:02:03.200 INFO eynollah - Image dimensions: 224x448
18:02:05.928 INFO eynollah - Image dimensions: 448x672
18:02:08.525 INFO eynollah - Image dimensions: 448x672
18:02:19.475 INFO eynollah - slope_deskew: 0.22727272727272663
18:03:21.299 INFO eynollah - detection of marginals took 0.2s
18:03:24.944 INFO eynollah - Image dimensions: 896x896
18:03:28.280 INFO eynollah - Image dimensions: 896x896
18:03:38.619 INFO eynollah - Job done in 96.8s
18:03:38.619 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page54.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
18:03:38.767 INFO eynollah - Resizing and enhancing image...
18:03:38.767 INFO eynollah - Detected 230 DPI
18:03:38.906 INFO eynollah - Found 2 columns ([[8.510065e-06 9.999610e-01 5.524491e-08 3.035140e-05 8.794814e-09
  6.137044e-08]])
18:03:38.912 INFO eynollah - Image was enhanced.
18:03:38.918 INFO eynollah - Enhancing took 0.3s 
18:03:39.996 INFO eynollah - Image dimensions: 224x448
18:03:43.600 INFO eynollah - Image dimensions: 448x672
18:03:46.819 INFO eynollah - Image dimensions: 448x672
18:03:58.473 INFO eynollah - slope_deskew: 0.22727272727272663
18:05:00.928 INFO eynollah - detection of marginals took 0.2s
18:05:04.798 INFO eynollah - Image dimensions: 896x896
18:05:07.446 INFO eynollah - Image dimensions: 896x896
18:05:21.463 INFO eynollah - Job done in 102.8s
18:05:21.463 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page58.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
18:05:21.615 INFO eynollah - Resizing and enhancing image...
18:05:21.615 INFO eynollah - Detected 230 DPI
18:05:21.748 INFO eynollah - Found 2 columns ([[2.4525495e-03 9.9754745e-01 1.5853483e-13 1.5544596e-13 2.5738819e-14
  2.6027101e-12]])
18:05:21.762 INFO eynollah - Image was enhanced.
18:05:21.767 INFO eynollah - Enhancing took 0.3s 
18:05:22.813 INFO eynollah - Image dimensions: 224x448
18:05:26.399 INFO eynollah - Image dimensions: 448x672
18:05:29.728 INFO eynollah - Image dimensions: 448x672
18:05:41.498 INFO eynollah - slope_deskew: -0.22727272727272663
18:06:44.690 INFO eynollah - detection of marginals took 0.2s
18:06:48.522 INFO eynollah - Image dimensions: 896x896
18:06:51.199 INFO eynollah - Image dimensions: 896x896
18:07:04.526 INFO eynollah - Job done in 103.1s
18:07:04.527 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page51.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
18:07:04.678 INFO eynollah - Resizing and enhancing image...
18:07:04.678 INFO eynollah - Detected 230 DPI
18:07:04.818 INFO eynollah - Found 2 columns ([[6.9712265e-04 9.9930286e-01 3.5636527e-13 2.0413872e-11 1.5612523e-12
  1.5327095e-11]])
18:07:04.824 INFO eynollah - Image was enhanced.
18:07:04.830 INFO eynollah - Enhancing took 0.3s 
18:07:05.931 INFO eynollah - Image dimensions: 224x448
18:07:09.646 INFO eynollah - Image dimensions: 448x672
18:07:12.964 INFO eynollah - Image dimensions: 448x672
18:07:24.695 INFO eynollah - slope_deskew: 0.22727272727272663
18:08:29.768 INFO eynollah - detection of marginals took 0.2s
18:08:33.513 INFO eynollah - Image dimensions: 896x896
18:08:36.225 INFO eynollah - Image dimensions: 896x896
18:08:50.915 INFO eynollah - Job done in 106.4s
18:08:50.915 INFO eynollah.writer - output filename: './page_xml/sustainability_report/2020/ed8c9421412e74c2ad60da1f702dafa2__174456655/page37.xml'
[Errno 2] No such file or directory: 'identify': 'identify'
18:08:51.067 INFO eynollah - Resizing and enhancing image...
18:08:51.068 INFO eynollah - Detected 230 DPI
18:08:51.214 INFO eynollah - Found 2 columns ([[3.9930819e-03 9.9600691e-01 6.7124854e-15 5.5872969e-19 7.7624076e-18
  3.6332231e-14]])
18:08:51.225 INFO eynollah - Image was enhanced.
18:08:51.231 INFO eynollah - Enhancing took 0.3s 
18:08:52.329 INFO eynollah - Image dimensions: 224x448
18:08:56.042 INFO eynollah - Image dimensions: 448x672
18:08:59.439 INFO eynollah - Image dimensions: 448x672
18:09:11.275 INFO eynollah - slope_deskew: -0.22727272727272663
18:10:18.229 INFO eynollah - detection of marginals took 0.2s
18:10:22.078 INFO eynollah - Image dimensions: 896x896
Traceback (most recent call last):
  File "/home/ubuntu/layout_detection/venv/bin/eynollah", line 33, in <module>
    sys.exit(load_entry_point('eynollah', 'console_scripts', 'eynollah')())
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 1128, in __call__
    return self.main(*args, **kwargs)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 1053, in main
    rv = self.invoke(ctx)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 1395, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 754, in invoke
    return __callback(*args, **kwargs)
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/cli.py", line 181, in main
    eynollah.run()
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 2929, in run
    polygons_of_images, img_revised_tab, text_regions_p_1_n, textline_mask_tot_d, regions_without_separators_d, regions_fully, regions_without_separators, polygons_of_marginals, contours_tables = self.run_boxes_full_layout(image_page, textline_mask_tot, text_regions_p, slope_deskew, num_col_classifier, img_only_regions, table_prediction, erosion_hurts)
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 2798, in run_boxes_full_layout
    regions_fully, regions_fully_only_drop = self.extract_text_regions(image_page, True, cols=num_col_classifier)
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 1017, in extract_text_regions
    prediction_regions2 = self.do_prediction(patches, img2, model_region, marginal_of_patch_percent)
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 674, in do_prediction
    label_p_pred = model.predict(img_patch.reshape(1, img_patch.shape[0], img_patch.shape[1], img_patch.shape[2]))
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/keras/engine/training.py", line 1462, in predict
    callbacks=callbacks)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/keras/engine/training_arrays.py", line 324, in predict_loop
    batch_outs = f(ins_batch)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/tensorflow_core/python/keras/backend.py", line 3476, in __call__
    run_metadata=self.run_metadata)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/tensorflow_core/python/client/session.py", line 1472, in __call__
    run_metadata_ptr)
tensorflow.python.framework.errors_impl.ResourceExhaustedError: 2 root error(s) found.
  (0) Resource exhausted: OOM when allocating tensor with shape[1,896,896,67] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
     [[{{node concatenate_5_5/concat}}]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

     [[activation_56_5/truediv/_17093]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

  (1) Resource exhausted: OOM when allocating tensor with shape[1,896,896,67] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
     [[{{node concatenate_5_5/concat}}]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

0 successful operations.
0 derived errors ignored.

Textline Prediction Fine Tuning

Firstly, thank you for this project! It's really cool.

I see in your README that you fine-tuned a previously trained textline detection model resulting in your final "newspaper textline" model. After digging into DH Segment's references, I believe the original model came from READ-BAD/ARU-Net by Grüning et el. Correct? Given this information, could you give me some tips on how I might now be able to fine tune your textline detection model with my own dataset? Do you have any code you might be willing to share?

Thanks!

README overhaul

The README will also need another overhaul.

  • remove warning message when basic refactoring is completed
  • add some example images of segmentation results for visualization purposes
  • clarify inconsistencies, e.g. the -cl flag is once mentioned to stand for contours as lines and further below for curved lines
  • mention Makefile and make options (e.g. make models)
  • improve the flowchart visualization (saved for later)

Multi class Instance Segmentation

Hi,
I would love to test out your pre-trained models on some documents. Could you point me to the right model to perform pixel wise region (text, image, separator) classification. Would be of gr8 help.
Keep up the amazing work!!

AttributeError: 'EynollahProcessor' object has no attribute 'resolve_resource'

% ocrd-eynollah-segment -I OCR-D-IMG -O OCR-D-SEG-LINE -P models /var/lib/eynollah
17:49:26.462 INFO eynollah - INPUT FILE PHYS_0024 (1/1) 
Traceback (most recent call last):
  File "/usr/local/bin/ocrd-eynollah-segment", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 829, in __call__
    return self.main(*args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 782, in main
    rv = self.invoke(ctx)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 1066, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 610, in invoke
    return callback(*args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/ocrd_cli.py", line 8, in main
    return ocrd_cli_wrap_processor(EynollahProcessor, *args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/ocrd/decorators/__init__.py", line 81, in ocrd_cli_wrap_processor
    run_processor(processorClass, ocrd_tool, mets, workspace=workspace, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/ocrd/processor/helpers.py", line 70, in run_processor
    processor.process()
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/processor.py", line 46, in process
    'dir_models': self.resolve_resource(self.parameter['models']),
AttributeError: 'EynollahProcessor' object has no attribute 'resolve_resource'

I am investigating if my core version is old, and report back here

ValueError: attempt to get argmax of an empty sequence

With PPN73399136X.zip, I get the following error, using 0.0.3:

% ocrd-eynollah-segment --overwrite -I OCR-D-IMG-BIN -O OCR-D-SEG-LINE -P models /var/lib/eynollah
07:24:53.121 INFO eynollah - INPUT FILE PHYS_0030 (1/1)
07:24:53.232 INFO eynollah - resize and enhance image
07:24:53.232 INFO eynollah - Detected 300 DPI
Traceback (most recent call last):
  File "/usr/local/bin/ocrd-eynollah-segment", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 829, in __call__
    return self.main(*args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 782, in main
    rv = self.invoke(ctx)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 1066, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/local/lib/python3.6/dist-packages/click/core.py", line 610, in invoke
    return callback(*args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/ocrd_cli.py", line 8, in main
    return ocrd_cli_wrap_processor(EynollahProcessor, *args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/ocrd/decorators/__init__.py", line 91, in ocrd_cli_wrap_processor
    run_processor(processorClass, ocrd_tool, mets, workspace=workspace, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/ocrd/processor/helpers.py", line 72, in run_processor
    processor.process()
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/processor.py", line 57, in process
    Eynollah(**eynollah_kwargs).run()
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/eynollah.py", line 1827, in run
    img_res, is_image_enhanced, num_col_classifier, num_column_is_classified = self.run_enhancement()
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/eynollah.py", line 1647, in run_enhancement
    is_image_enhanced, img_org, img_res, num_col_classifier, num_column_is_classified, img_bin = self.resize_and_enhance_image_with_column_classifier()
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/eynollah.py", line 400, in resize_and_enhance_image_with_column_classifier
    _, page_coord = self.early_page_for_num_of_column_classification(img_bin)
  File "/usr/local/lib/python3.6/dist-packages/qurator/eynollah/eynollah.py", line 650, in early_page_for_num_of_column_classification
    cnt = contours[np.argmax(cnt_size)]
  File "<__array_function__ internals>", line 6, in argmax
  File "/usr/local/lib/python3.6/dist-packages/numpy/core/fromnumeric.py", line 1186, in argmax
    return _wrapfunc(a, 'argmax', axis=axis, out=out)
  File "/usr/local/lib/python3.6/dist-packages/numpy/core/fromnumeric.py", line 61, in _wrapfunc
    return bound(*args, **kwds)
ValueError: attempt to get argmax of an empty sequence

Irritating "Image dimension" log message

I am processing a 4000x6000 image using ocrd-eynollah-segment and get - among other messages - this message:

14:32:10.541 INFO eynollah - Image dimensions: 448x672

Should this read "Patch dimensions" and maybe get a log level of DEBUG?

error while running inference

I'm recieving following error while running test either on my own document image or on the test tiff provided in resource folder

My versions
pip 21.3.1
python3.6

tensorflow-estimator 1.15.1
tensorflow-gpu 1.15.5
keras-mxnet 2.2.4.3

ValueError Traceback (most recent call last)
in
16 headers_off=headers_off,
17 )
---> 18 pcgts = eynollah.run()
19 eynollah.writer.write_pagexml(pcgts)
20

~/ELD/lib/python3.6/site-packages/qurator/eynollah/eynollah.py in run(self)
2305
2306 t0 = time.time()
-> 2307 img_res, is_image_enhanced, num_col_classifier, num_column_is_classified = self.run_enhancement()
2308
2309 self.logger.info("Enhancing took %.1fs ", time.time() - t0)

~/ELD/lib/python3.6/site-packages/qurator/eynollah/eynollah.py in run_enhancement(self)
1988 def run_enhancement(self):
1989 self.logger.info("Resizing and enhancing image...")
-> 1990 is_image_enhanced, img_org, img_res, num_col_classifier, num_column_is_classified, img_bin = self.resize_and_enhance_image_with_column_classifier()
1991 self.logger.info("Image was %senhanced.", '' if is_image_enhanced else 'not ')
1992 K.clear_session()

~/ELD/lib/python3.6/site-packages/qurator/eynollah/eynollah.py in resize_and_enhance_image_with_column_classifier(self)
406 img_bin = None
407
--> 408 _, page_coord = self.early_page_for_num_of_column_classification(img_bin)
409 model_num_classifier, session_col_classifier = self.start_new_session_and_model(self.model_dir_of_col_classifier)
410

~/ELD/lib/python3.6/site-packages/qurator/eynollah/eynollah.py in early_page_for_num_of_column_classification(self, img_bin)
646 else:
647 img = self.imread()
--> 648 model_page, session_page = self.start_new_session_and_model(self.model_page_dir)
649 img = cv2.GaussianBlur(img, (5, 5), 0)
650

~/ELD/lib/python3.6/site-packages/qurator/eynollah/eynollah.py in start_new_session_and_model(self, model_dir)
516 #gpu_options = tf.compat.v1.GPUOptions(per_process_gpu_memory_fraction=7.7, allow_growth=True)
517 session = tf.compat.v1.Session(config=tf.compat.v1.ConfigProto(gpu_options=gpu_options))
--> 518 model = load_model(model_dir, compile=False)
519
520 return model, session

~/ELD/lib/python3.6/site-packages/keras/engine/saving.py in load_model(filepath, custom_objects, compile)
494 f = h5dict(filepath, 'r')
495 try:
--> 496 model = _deserialize_model(f, custom_objects, compile)
497 finally:
498 if opened_new_file:

~/ELD/lib/python3.6/site-packages/keras/engine/saving.py in _deserialize_model(f, custom_objects, compile)
300 raise ValueError('No model found in config.')
301 model_config = json.loads(model_config.decode('utf-8'))
--> 302 model = model_from_config(model_config, custom_objects=custom_objects)
303 model_weights_group = f['model_weights']
304

~/ELD/lib/python3.6/site-packages/keras/engine/saving.py in model_from_config(config, custom_objects)
533 'Sequential.from_config(config)?')
534 from ..layers import deserialize
--> 535 return deserialize(config, custom_objects=custom_objects)
536
537

~/ELD/lib/python3.6/site-packages/keras/layers/init.py in deserialize(config, custom_objects)
53 module_objects=globs,
54 custom_objects=custom_objects,
---> 55 printable_module_name='layer')

~/ELD/lib/python3.6/site-packages/keras/utils/generic_utils.py in deserialize_keras_object(identifier, module_objects, custom_objects, printable_module_name)
143 config['config'],
144 custom_objects=dict(list(_GLOBAL_CUSTOM_OBJECTS.items()) +
--> 145 list(custom_objects.items())))
146 with CustomObjectScope(custom_objects):
147 return cls.from_config(config['config'])

~/ELD/lib/python3.6/site-packages/keras/engine/network.py in from_config(cls, config, custom_objects)
1030 if layer in unprocessed_nodes:
1031 for node_data in unprocessed_nodes.pop(layer):
-> 1032 process_node(layer, node_data)
1033
1034 name = config.get('name')

~/ELD/lib/python3.6/site-packages/keras/engine/network.py in process_node(layer, node_data)
989 # and building the layer if needed.
990 if input_tensors:
--> 991 layer(unpack_singleton(input_tensors), **kwargs)
992
993 def process_layer(layer_data):

~/ELD/lib/python3.6/site-packages/keras/engine/base_layer.py in call(self, inputs, **kwargs)
442 'You can build it manually via: '
443 'layer.build(batch_input_shape)')
--> 444 self.build(unpack_singleton(input_shapes))
445 self.built = True
446

~/ELD/lib/python3.6/site-packages/keras/layers/merge.py in build(self, input_shape)
360 'inputs with matching shapes '
361 'except for the concat axis. '
--> 362 'Got inputs shapes: %s' % (input_shape))
363
364 def _merge_function(self, inputs):

ValueError: A Concatenate layer requires inputs with matching shapes except for the concat axis. Got inputs shapes: [(None, 112, 112, 256), (0, 112, 112, 256)]

Improve timing messages

Could the timing messages be made more human-readable? For example:

18:31:40.369 INFO eynollah - Enhancing took 369.5891568660736s
19:01:28.031 INFO eynollah - Textregion detection took 1787.6620445251465s
19:01:36.604 INFO eynollah - Graphics detection took 8.571088552474976s
19:08:15.645 INFO eynollah - textline detection took 399.04073786735535s
19:26:32.451 INFO eynollah - deskewing took 1096.8060252666473s
19:26:33.040 INFO eynollah - detection of marginals took 0.5885534286499023s

All those only need 1 digit precision after the decimal point.

Spelling

This is for the collection of any spelling and consistency issues found during #2 and that should be targeted in a subsequent PR.

  • fix known spelling cases seperate, rotyate, teilwiese, enhancemnet etc. (check for usages)
  • run codespell
  • image_file or image_path instead of image_dir (where appropriate)
  • model_file or model_path instead of model_dir (where appropriate)
  • consistent naming of model files
  • package name (sbb_newspapers_org_image -> qurator.eynollah)

No output xml on empty pages?

Did process https://digi.ub.uni-heidelberg.de/diglitData/v/testset-ls-20201029.tgz through eynollah, but these 3 xml output files are missing:

< leere_Seite_-_1564_-_drwFronsperger1564_-_054v.tif
< leere_Seite_-_1700_-_oxenstierna1700_-_f.tif
< leere_Seite_-_1775_-_karl_theodor1775b_-_b.tif

There are a lot of ERROR eynollah - cannot convert float NaN to integer, but I'll assume those are not really errors.

Don't know which scan exactly - there are 3× error messages like this in the log:

20:56:53.243 INFO eynollah - resize and enhance image
20:56:53.245 INFO eynollah - Detected 300 DPI
20:57:05.408 INFO eynollah - Found 1 columns ([[9.9999952e-01 4.7599070e-07 1.7426101e-17 7.0970045e-18 3.1075767e-20
  1.6549760e-17]])
20:57:05.428 INFO eynollah - Image is not enhanced
20:57:05.474 INFO eynollah - Enhancing took 12.230977296829224s
20:57:10.428 INFO eynollah - Image dimensions: 448x672
20:57:47.870 INFO eynollah - Image dimensions: 448x672
20:58:21.326 INFO eynollah - Image dimensions: 448x672
20:59:06.024 INFO eynollah - ratio_of_two_models: 97.82006537033959
20:59:06.467 INFO eynollah - Textregion detection took 120.9934618473053s
20:59:12.820 ERROR eynollah - zero-size array to reduction operation minimum which has no identity
20:59:12.820 INFO eynollah - Graphics detection took 6.352412223815918s
20:59:12.820 INFO eynollah - cont_page [array([[ 385,  190],
       [2528,  190],
       [2528, 4049],
       [ 385, 4049]])]
20:59:12.820 INFO eynollah - No columns detected, outputting an empty PAGE-XML
<exif><width>2573</width><height>4374</height><photometricInterpretation>RGB</photometricInterpretation><n_frames>1</n_⸗
frames><compression>raw</compression><photometric_interpretation>None</photometric_interpretation><xResolution>300</xRe⸗
solution><yResolution>300</yResolution><resolutionUnit>inches</resolutionUnit><resolution>300</resolution></exif>
Traceback (most recent call last):
  File "/usr/local/bin/eynollah", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.7/dist-packages/click/core.py", line 829, in __call__
    return self.main(*args, **kwargs)
  File "/usr/local/lib/python3.7/dist-packages/click/core.py", line 782, in main
    rv = self.invoke(ctx)
  File "/usr/local/lib/python3.7/dist-packages/click/core.py", line 1066, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/local/lib/python3.7/dist-packages/click/core.py", line 610, in invoke
    return callback(*args, **kwargs)
  File "/usr/local/lib/python3.7/dist-packages/qurator/eynollah/cli.py", line 135, in main
    pcgts = eynollah.run()
  File "/usr/local/lib/python3.7/dist-packages/qurator/eynollah/eynollah.py", line 1618, in run
    pcgts = self.writer.build_pagexml_no_full_layout([], page_coord, [], [], [], [], [], [], [], [], [], cont_page)
TypeError: build_pagexml_no_full_layout() missing 1 required positional argument: 'cont_page'

allow cropping separately

I know OCR-D integration is still underway, but just wanted to bring to attention that OCR-D currently lacks a neural (or in fact, any useable) cropping tool, and thus it would be extremely helpful if eynollah's (or sbb_textline_detection's) UNet-based model for that would be wrapped independent of the overall segmentation.

Paper reference

Hello, I was wondering if I could refer to this project on my paper.
I would like to know if there is any publicized research papers this project is based on, especially the text line detection part?

Thanks!

Eynollah on Python 3.8

Hi,
Eynollah's requirements include Tensorflow < 2. This option is not suppored on Python 3.8+. It will work on 3.7, but I'd prefer not install a dedicated environment for this. Will it break with a newer version? Do you have plans for upgrading it to TF 2.0+?
Thank you.

How to use the generated PAGE-XML as input to TrOCR?

Thanks for the great library!

I am new to CV/NLP and I am trying to figure out what to do with the generated PAGE-XML (I struggled to find accessible online content about the format).

I am essentially trying to figure out how to extract a set of single sentences I could use as input to TrOCR as mentioned in the following post: microsoft/unilm#451 (comment).

Would it be possible to provide some guidance?

Many thanks!

J

Eynollah crashes while processing image

During processing PDFs, we're facing crashes that stop the Eynollah process. The issue has been identified to be related to a certain type of page, which Eynollah is unable to process and it simply stops working. Below is the image.

page15

I've tested the issue using the following commands:

eynollah -i /home/ubuntu/layout_detection/eynollah_batch_processing/images/000___2014___1d8517527d3613cc180cf7fefe0dac7e__4189328/page15.jpg -m /home/ubuntu/layout_detection/eynollah_batch_processing/models -o /home/ubuntu/layout_detection/eynollah_batch_processing/page_xml/2014/1d8517527d3613cc180cf7fefe0dac7e__4189328 -light -fl

Also using the -di flag

eynollah -di /home/ubuntu/layout_detection/eynollah_batch_processing/images/000___2014___1d8517527d3613cc180cf7fefe0dac7e__4189328 -m /home/ubuntu/layout_detection/eynollah_batch_processing/models -o /home/ubuntu/layout_detection/eynollah_batch_processing/page_xml/2014/1d8517527d3613cc180cf7fefe0dac7e__4189328 -light -fl

The stack trace is below:

[Errno 2] No such file or directory: 'identify': 'identify'
09:56:57.674 INFO eynollah - Resizing and enhancing image...
09:56:57.674 INFO eynollah - Detected 230 DPI
09:57:19.008 INFO eynollah - Found 2 columns ([[1.2393231e-11 1.0000000e+00 1.3864657e-18 1.2094623e-19 1.5120450e-16
  1.8504462e-19]])
09:57:19.023 INFO eynollah - Image was enhanced.
09:57:19.047 INFO eynollah - Enhancing took 21.4s 
09:57:26.472 INFO eynollah - Image dimensions: 224x448
09:57:38.725 INFO eynollah - Image dimensions: 448x672
09:57:50.917 INFO eynollah - Image dimensions: 448x672
09:58:12.733 INFO eynollah - slope_deskew: 0.1212121212121211
09:58:22.525 INFO eynollah - detection of marginals took 0.4s
09:58:30.374 INFO eynollah - Image dimensions: 896x896
09:58:35.116 INFO eynollah - Image dimensions: 896x896
09:58:59.675 ERROR eynollah - index 2 is out of bounds for axis 0 with size 2
Traceback (most recent call last):
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 1949, in do_order_of_regions_full_layout
    order_of_texts_tot.append(int(order_by_con_head[tj1]))
IndexError: index 2 is out of bounds for axis 0 with size 2

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/ubuntu/layout_detection/venv/bin/eynollah", line 33, in <module>
    sys.exit(load_entry_point('eynollah', 'console_scripts', 'eynollah')())
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 1128, in __call__
    return self.main(*args, **kwargs)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 1053, in main
    rv = self.invoke(ctx)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 1395, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/home/ubuntu/layout_detection/venv/lib/python3.7/site-packages/click/core.py", line 754, in invoke
    return __callback(*args, **kwargs)
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/cli.py", line 181, in main
    eynollah.run()
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 3119, in run
    order_text_new, id_of_texts_tot = self.do_order_of_regions(contours_only_text_parent, contours_only_text_parent_h, boxes, textline_mask_tot)
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 2339, in do_order_of_regions
    return self.do_order_of_regions_full_layout(*args, **kwargs)
  File "/home/ubuntu/layout_detection/eynollah/qurator/eynollah/eynollah.py", line 2021, in do_order_of_regions_full_layout
    order_of_texts_tot.append(int(order_by_con_head[tj1]))
IndexError: index 2 is out of bounds for axis 0 with size 2

Flag for OCR-D processor to periodically save mets.xml file (a suggestion)

Hi
I seem to be sporadically crashing eynollah on one of a large number of images when running it as an OCR-D processor. This may happen after a large number of images were processed - which takes many hours to run. Because eynollah currently updates the mets.xml file with the segmentation files created only when the processor completes, all the results from that run are missing from the mets.xml file so an OCR cannot be performed on the successful segmentations. The two alternatives seem to be: 1) debug why eynollah is crashing (or eliminate the image causing the crash) and rerun all the images again, or 2) edit the mets.xml by hand to include the info for the successful segmentations that were done before the crash. Is there another approach that can be used if this case occurs? If not, how about including a flag in the OCR-D processor so that it periodically updates the mets.xml file with the info from the successful segmentations.
Thanks!

Receiving error "TypeError: can't pickle _thread.RLock objects"

Hi
I am excited trying out your code and I installed it on my Windows 10 machine (Ryzen 3700x cpu, Nvidia RTX 2070 Super gpu) under anaconda (python 3.6.15, tensorflow 2.6.2, cudatoolkit 11.2.2) and it gets pretty far along before it crashes.
Here is my command line...

eynollah --image sn98062568_1933-11-18_ed-1_seq-3.png --out test1 --model models_eynollah --save_layout test1 --full-layout --enable-plotting --allow-enhancement --allow_scaling --log-level DEBUG

And I get sn98062568_1933-11-18_ed-1_seq-3_enhanced.png and sn98062568_1933-11-18_ed-1_seq-3_layout_main.png images generated that look reasonable. But here is the output stream just before and including the error...

14:32:25.982 INFO eynollah - detection of marginals took 4.2s
14:32:25.982 DEBUG eynollah - enter run_boxes_full_layout
14:32:26.780 DEBUG eynollah - enter extract_text_regions
14:32:26.894 DEBUG eynollah - enter start_new_session_and_model (model_dir=models_eynollah/model_3up_new_good_no_augmentation.h5)
14:32:28.952 DEBUG eynollah - enter do_prediction
14:32:28.954 DEBUG eynollah - Patch size: 896x896
14:32:32.797 DEBUG eynollah - enter do_prediction
14:32:32.799 DEBUG eynollah - Patch size: 896x896
14:32:41.277 DEBUG eynollah - exit extract_text_regions
14:32:42.255 DEBUG eynollah - enter extract_text_regions
14:32:42.256 DEBUG eynollah - enter start_new_session_and_model (model_dir=models_eynollah/model_no_patches_class0_30eopch.h5)
14:32:44.120 DEBUG eynollah - enter do_prediction
14:32:45.507 DEBUG eynollah - exit extract_text_regions
14:32:46.658 DEBUG eynollah - exit run_boxes_full_layout
14:33:52.914 DEBUG eynollah - enter get_slopes_and_deskew_new
Traceback (most recent call last):
Traceback (most recent call last):
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\runpy.py", line 193, in _run_module_as_main
  File "<string>", line 1, in <module>
    "__main__", mod_spec)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\multiprocessing\spawn.py", line 105, in spawn_main
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\runpy.py", line 85, in _run_code
    exitcode = _main(fd)
    exec(code, run_globals)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\multiprocessing\spawn.py", line 115, in _main
  File "C:\Users\Steve\anaconda3\envs\qurator-spk\Scripts\eynollah.exe\__main__.py", line 7, in <module>
    self = reduction.pickle.load(from_parent)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\site-packages\click\core.py", line 1128, in __call__
EOFError: Ran out of input
    return self.main(*args, **kwargs)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\site-packages\click\core.py", line 1053, in main
    rv = self.invoke(ctx)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\site-packages\click\core.py", line 1395, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\site-packages\click\core.py", line 754, in invoke
    return __callback(*args, **kwargs)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\site-packages\qurator\eynollah\cli.py", line 151, in main
    pcgts = eynollah.run()
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\site-packages\qurator\eynollah\eynollah.py", line 2458, in run
    slopes, all_found_texline_polygons, boxes_text, txt_con_org, contours_only_text_parent, all_box_coord, index_by_text_par_con = self.get_slopes_and_deskew_new(txt_con_org, contours_only_text_parent, textline_mask_tot_ea, image_page_rotated, boxes_text, slope_deskew)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\site-packages\qurator\eynollah\eynollah.py", line 828, in get_slopes_and_deskew_new
    processes[i].start()
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\multiprocessing\process.py", line 105, in start
    self._popen = self._Popen(self)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\multiprocessing\context.py", line 223, in _Popen
    return _default_context.get_context().Process._Popen(process_obj)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\multiprocessing\context.py", line 322, in _Popen
    return Popen(process_obj)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\multiprocessing\popen_spawn_win32.py", line 65, in __init__
    reduction.dump(process_obj, to_child)
  File "c:\users\steve\anaconda3\envs\qurator-spk\lib\multiprocessing\reduction.py", line 60, in dump
    ForkingPickler(file, protocol).dump(obj)
TypeError: can't pickle _thread.RLock objects

Do you have any idea of what the problem may be, and what I can do to fix it?
Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.