belval / textrecognitiondatagenerator Goto Github PK

A synthetic data generator for text recognition

License: MIT License

Python 99.27% Dockerfile 0.73%

synthetic data text-recognition training-set-generator ocr dataset fake text

textrecognitiondatagenerator's Issues

Find better german and spanish dictionaries

The dicts provided with the project for german and spanish are non-utf8. Unfortunately that means encoding errors may arise.

I will therefore try to replace the current dicts.

Can I set the size of the text?

Generating Images similar to Oxford Synthetic Word Dataset

Hi,
I am trying to generate images containing single words similar to that in the Oxford Synthetic Word Dataset. The words will also contain symbols such as colon, percentage etc.
The process to create the Oxford dataset is described in the below image.

I am unsure how to generate such words along with symbols as I get such images below which are very much different from the ones in the Oxford dataset.

Images from Oxford dataset are given below,

run.py crashes (OSError: unknown file format) when generating dataset for new non- latin language

I added support for hebrew with some .fft fonts and a dictionary. The adjusted run.py and datagenerator.py files run and work till they crash. When I put run.py in a for loop it some times works flawlessly (and generates images) and sometimes crashes. Any thoughts?

text_rec_task) galmoore@Gals-MacBook-Pro:~/anaconda/envs/text_rec_task/TextRecognitionDataGenerator/TextRecognitionDataGenerator$ python run_script.py

Missing modules for handwritten text generation.

args count10

100%|██████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 56.40it/s]

Missing modules for handwritten text generation.

args count10

100%|█████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 102.17it/s]

Missing modules for handwritten text generation.

args count10

100%|██████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 67.58it/s]

Missing modules for handwritten text generation.

args count10

100%|█████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 108.27it/s]

Missing modules for handwritten text generation.

args count10

100%|██████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 70.82it/s]

Missing modules for handwritten text generation.

args count10

0%| | 0/10 [00:00<?, ?it/s]multiprocessing.pool.RemoteTraceback:

"""

Traceback (most recent call last):

File "/Users/galmoore/anaconda/envs/text_rec_task/lib/python3.6/multiprocessing/pool.py", line 119, in worker

result = (True, func(*args, **kwds))

File "/Users/galmoore/anaconda/envs/text_rec_task/TextRecognitionDataGenerator/TextRecognitionDataGenerator/data_generator.py", line 23, in generate_from_tuple

cls.generate(*t)

File "/Users/galmoore/anaconda/envs/text_rec_task/TextRecognitionDataGenerator/TextRecognitionDataGenerator/data_generator.py", line 42, in generate

image = computer_text_generator.generate(text, font, text_color, size, orientation, space_width, fit)

File "/Users/galmoore/anaconda/envs/text_rec_task/TextRecognitionDataGenerator/TextRecognitionDataGenerator/computer_text_generator.py", line 7, in generate

return _generate_horizontal_text(text, font, text_color, font_size, space_width, fit)

File "/Users/galmoore/anaconda/envs/text_rec_task/TextRecognitionDataGenerator/TextRecognitionDataGenerator/computer_text_generator.py", line 14, in _generate_horizontal_text

image_font = ImageFont.truetype(font=font, size=font_size)

File "/Users/galmoore/anaconda/envs/text_rec_task/lib/python3.6/site-packages/PIL/ImageFont.py", line 280, in truetype

return FreeTypeFont(font, size, index, encoding, layout_engine)

File "/Users/galmoore/anaconda/envs/text_rec_task/lib/python3.6/site-packages/PIL/ImageFont.py", line 145, in init

layout_engine=layout_engine)

OSError: unknown file format

"""

The above exception was the direct cause of the following exception:

Traceback (most recent call last):

File "run.py", line 376, in

main()

File "run.py", line 364, in main

), total=args.count):

File "/Users/galmoore/anaconda/envs/text_rec_task/lib/python3.6/site-packages/tqdm/_tqdm.py", line 1005, in iter

for obj in iterable:

File "/Users/galmoore/anaconda/envs/text_rec_task/lib/python3.6/multiprocessing/pool.py", line 735, in next

raise value

OSError: unknown file format

Missing modules for handwritten text generation.

args count10

100%|██████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 70.12it/s]

Missing modules for handwritten text generation.

args count10

100%|██████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 44.38it/s]

Missing modules for handwritten text generation.

args count10

100%|██████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 37.21it/s]

Missing modules for handwritten text generation.

args count10

100%|██████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 53.60it/s]

Support for non-latin scripts

People seems to be interested in out of the box support of non latin scripts.

I am currently working on an implementation, please see this branch

chinese font problem

some chinese fonts can not generate good samples(for example ,some word could not be generated),do you have some suggests to solve the problem .thank you in advance

hello, i change the font and have a error

`The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File "/Users/liufengnan/Desktop/TextRecognitionDataGenerator/TextRecognitionDataGenerator/run.py", line 392, in
main()
File "/Users/liufengnan/Desktop/TextRecognitionDataGenerator/TextRecognitionDataGenerator/run.py", line 309, in main
), total=args.count):
File "/usr/local/lib/python3.6/site-packages/tqdm/_tqdm.py", line 979, in iter
for obj in iterable:
File "/usr/local/Cellar/python/3.6.5/Frameworks/Python.framework/Versions/3.6/lib/python3.6/multiprocessing/pool.py", line 735, in next
raise value
OSError: broken file
`

can it generate this type image?

two colors in a char

Support arabic and urdu text

Enhancement, but it would be interesting to add support for arabic and hindi scripts.

I think adding a new font folder and a new dict for both languages would work.

How to make the text bold or italic

Hi ,

is there a way we can make the text bold or italic given any font . This is not an issue , it's just a question.

Can this project generate text image for text detection mission?

for example
https://github.com/ankush-me/SynthText

Text length

Can I generate texts with a fixed length size?

How to generate a fixed length English string？

For example, the length of the string is 10．How to genearte?

Fix dependencies versioning

OpenCV bumped their version from 3.2 to 3.4. Following this change, they removed the 3.2 version from PyPI. This means that right now, someone who clones the repo and tries to install the dependencies with pip install -r requirements it will fail.

Missing modules for handwritten text generation.

run python run.py -w 5 -f 64

Need to change how to name the file

Something embarrassing happened when I was generating the image T.T

FileNotFoundError: [Errno 2] No such file or directory: 'out/튃 귍 쓌 / 혦 찵 컒 뵶 명 똔 톨 눔_124.jpg'

This symbol ‘/’ should be '//'

Some Chinese characters can't be made, it shows a square,just like 口口口口口

I have download some fonts ,but it does not work .could you please tell me how to do?

run.py arguments in README.md

It would be very useful if all the command line arguments were mentioned in the readme file.
Obviously, it could be viewed in the run.py file. Still, a person who is cloning the repo for the first time may not know all the options available.

Thanks!

Missing modules for handwritten text generation.

When I used run.py, it reported this error, indicating lines 388 and 333.

How to generate vertical images

for example we often read words and characters from left to right.
but in Chinese, we sometimes arrange characters from top to bottom.
So I just wonder can this code generate top to bottom configuration of Chinese sentences?

Non-transparent background in handwritten

As seen in the image below, images generated with -hw do not have a transparent background.

This uses matplotlib to "draw" the lines on a canvas.

I got this error ,can anyone help me ?

1 .error:
/data/20180809/TextRecognitionDataGenerator-master/TextRecognitionDataGenerator# python run.py -i "texts/subtitle.txt" -c 100 -w 5 -e png -b 3
Missing modules for handwritten text generation.
31%|#####################################2 | 31/100 [00:00<00:01, 68.48it/s]multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File "/usr/lib/python3.4/multiprocessing/pool.py", line 119, in worker
result = (True, func(*args, **kwds))
File "/data/20180809/TextRecognitionDataGenerator-master/TextRecognitionDataGenerator/data_generator.py", line 22, in generate_from_tuple
cls.generate(*t)
File "/data/20180809/TextRecognitionDataGenerator-master/TextRecognitionDataGenerator/data_generator.py", line 34, in generate
image = ComputerTextGenerator.generate(text, font, text_color)
File "/data/20180809/TextRecognitionDataGenerator-master/TextRecognitionDataGenerator/computer_text_generator.py", line 12, in generate
image_font = ImageFont.truetype(font=font, size=32)
File "/data/20180809/TextRecognitionDataGenerator-master/py3env/lib/python3.4/site-packages/PIL/ImageFont.py", line 261, in truetype
return FreeTypeFont(font, size, index, encoding, layout_engine)
File "/data/20180809/TextRecognitionDataGenerator-master/py3env/lib/python3.4/site-packages/PIL/ImageFont.py", line 144, in init
self.font = core.getfont(font, size, index, encoding, layout_engine=layout_engine)
OSError: unknown file format
"""

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File "run.py", line 290, in
main()
File "run.py", line 278, in main
), total=args.count):
File "/data/20180809/TextRecognitionDataGenerator-master/py3env/lib/python3.4/site-packages/tqdm/_tqdm.py", line 930, in iter
for obj in iterable:
File "/usr/lib/python3.4/multiprocessing/pool.py", line 689, in next
raise value
OSError: unknown file format

2.when i use my own background picture,i got the blurry picture,but i want a clear one.

why they got different width(ps:i use my own texts)
Looking forward to hear from you .
@Belval

Width and height of image

Why is the new_width being calculated in this line if it's not used anywhere else? Is it supposed to be used when resizing the image?

TextRecognitionDataGenerator/TextRecognitionDataGenerator/data_generator.py

Line 77 in de5be6a

 new_width = float(new_text_width + 10) * (float(height) / float(new_text_height + 10)) 

But the variable you are using when resizing is new_text_width.

TextRecognitionDataGenerator/TextRecognitionDataGenerator/data_generator.py

Line 78 in de5be6a

 image_on_background = background.resize((int(new_text_width), height), Image.ANTIALIAS) 

ETA on numbers and symbols

Hi,

Great project, could you please let me know a rough time frame for a release with numbers and symbols included?

Best,
Vishal

TextDataRecognitionGenerator as a python module

The preferred usage had always been through the CLI. Unfortunately, this approach is not frictionless when used in a real machine learning pipeline that might include data augmentations.

The v1 candidate would be giving access to the data generators classes and have an easy to use interface that can be used as seamlessly as the CLI.

A package would be uploaded to pypi for ease of use.

Save generated text to file separately

How can I save generated text to the file alongside with the corresponding image? For example text 'foo' was generated and was put on some background and saved as foo.jpg. Can I save also text 'foo' to some file call it foo.txt where will be only text 'foo'?

who would like to share some chinese ttf fonts?

wordart like this:

or some special fonts like this:

osError: broken file

Traceback (most recent call last):
File "run.py", line 340, in
main()
File "run.py", line 328, in main
), total=args.count):
File "/data/env/anaconda3/lib/python3.6/site-packages/tqdm/_tqdm.py", line 931, in iter
for obj in iterable:
File "/data/env/anaconda3/lib/python3.6/multiprocessing/pool.py", line 735, in next
raise value
OSError: broken file
Exception ignored in: <bound method tqdm.del of 0%| | 21/500000 [00:00<6:24:25, 21.68it/s]>
Traceback (most recent call last):
File "/data/env/anaconda3/lib/python3.6/site-packages/tqdm/_tqdm.py", line 883, in del
File "/data/env/anaconda3/lib/python3.6/site-packages/tqdm/_tqdm.py", line 1088, in close
File "/data/env/anaconda3/lib/python3.6/site-packages/tqdm/_tqdm.py", line 439, in _decr_instances
File "/data/env/anaconda3/lib/python3.6/_weakrefset.py", line 109, in remove
KeyError: <weakref at 0x7f0af5879688; to 'tqdm' at 0x7f0bc8e1d0f0>

How to keep words in order

For example,I want to create a rule that some words must appear before some other words.Some words interval occur.So I guess when I use RNN network may have a better performance.

Create comprehensive test suite

As the number of feature grows I can barely check for regression bugs. Therefore a test suite should be made with a continuous integration like TravisCI.

Tighter cropping

Right now the images have quite a bit of unnecessary padding around the text, which reduces the usability of the generated dataset for specific tasks.

Instead, make text as big as possible and add a padding argument.

How to generate pictures of a black backgroud(black paper) with white characters prited?

@Belval

labels

hello,thanks firstly, is there a labels.txt in the code for the generated images?

为什么生成的都是繁体字啊，我都放进去简体字体了

如题

background generator may cause error

if picture.size[0] < width:
picture = picture.resize([width, int(picture.size[1] * (width / picture.size[0]))],
#what if resized height is still smaller than needed height?
Image.ANTIALIAS)
elif picture.size[1] < height:
picture.thumbnail([int(picture.size[0] * (height / picture.size[1])), height], Image.ANTIALIAS)

Add argument font in run.py

Hi,
I used this repo to generate text with only one personal font.

I would like add a argument font in run.py, if a font is passed in param, it will be the only one used to generate pictures.

I will work in, I saw in readme you want an issue before a PR, so i open it 😊

how to set the distance between words?

Given the generated text lines, I found the gap between words is pretty large, could you tell me how to decrease the gap? Thank you.

how to set de font's size?

how to add other format of the generated images' filename,rather then 0 and 1

fonts 里面的ttf 字体库换一下

Originally posted by @yulinxuanzhi in #21 (comment)

I got this error, can anyone help me, please?

here is the error
python run.py -w 5 -f 64 -l am
0%| | 0/1000 [00:00<?, ?it/s]multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File "/home/test/Anaconda3/envs/py35/lib/python3.5/multiprocessing/pool.py", line 119, in worker
result = (True, func(*args, **kwds))
File "/home/test/Documents/direse/scene/TextRecognitionDataGenerator/TextRecognitionDataGenerator/data_generator.py", line 22, in generate_from_tuple
cls.generate(*t)
File "/home/test/Documents/direse/scene/TextRecognitionDataGenerator/TextRecognitionDataGenerator/data_generator.py", line 36, in generate
image = ComputerTextGenerator.generate(text, font, text_color, size, orientation, space_width)
File "/home/test/Documents/direse/scene/TextRecognitionDataGenerator/TextRecognitionDataGenerator/computer_text_generator.py", line 9, in generate
return cls.__generate_horizontal_text(text, font, text_color, font_size, space_width)
File "/home/test/Documents/direse/scene/TextRecognitionDataGenerator/TextRecognitionDataGenerator/computer_text_generator.py", line 17, in __generate_horizontal_text
image_font = ImageFont.truetype(font=font, size=font_size)
File "/home/test/Anaconda3/envs/py35/lib/python3.5/site-packages/PIL/ImageFont.py", line 261, in truetype
return FreeTypeFont(font, size, index, encoding, layout_engine)
File "/home/test/Anaconda3/envs/py35/lib/python3.5/site-packages/PIL/ImageFont.py", line 144, in init
self.font = core.getfont(font, size, index, encoding, layout_engine=layout_engine)
OSError: unknown file format
"""

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File "run.py", line 342, in
main()
File "run.py", line 330, in main
), total=args.count):
File "/home/test/Anaconda3/envs/py35/lib/python3.5/site-packages/tqdm/_tqdm.py", line 930, in iter
for obj in iterable:
File "/home/test/Anaconda3/envs/py35/lib/python3.5/multiprocessing/pool.py", line 731, in next
raise value
OSError: unknown file format
Exception ignored in: <bound method tqdm.del of 0%| | 0/1000 [00:00<?, ?it/s]>
Traceback (most recent call last):
File "/home/test/Anaconda3/envs/py35/lib/python3.5/site-packages/tqdm/_tqdm.py", line 882, in del
File "/home/test/Anaconda3/envs/py35/lib/python3.5/site-packages/tqdm/_tqdm.py", line 1087, in close
File "/home/test/Anaconda3/envs/py35/lib/python3.5/site-packages/tqdm/_tqdm.py", line 439, in _decr_instances
File "/home/test/Anaconda3/envs/py35/lib/python3.5/_weakrefset.py", line 109, in remove
KeyError: <weakref at 0x7f686b4df598; to 'tqdm' at 0x7f686b53cd30>

Arabic text generator

Hi,

File names generated by the Arabic version of the repo are correct as the word letters are connected. However, text in images has disconnected letters and the words started from left to right. The text in an image should be started from right to left and the letter must be connected. Any suggestion on how to correct these issues?

Thanks

text-color is unable to apply

found a bug in text-color use.
add background = background.convert('RGBA') in data_generator.py at line 89 can fix this problem

How to simulate a photo image of a white paper with black chars printed ?

I've found noise/blur......filters can't simulate the effect image taken by photo, because of the reflect of the camera lighting.

We can easily find many white dots around the character's edge

Adding Dilated convolution layer

How can Dilated convolution be employed in the current setting. ?

Came across this repository doing similar ..
https://github.com/MaybeShewill-CV/CRNN_Tensorflow

hello,how can i generate vertical text like chinese characters arranged vertically?it seems -d and -do settings do not work

The color is also off grey-ish while it should be black. If possible the --text_color parameters should be supported as well.

belval / textrecognitiondatagenerator Goto Github PK

textrecognitiondatagenerator's Issues

Recommend Projects

Recommend Topics

Recommend Org