Thanks for sharing the code! It is nice work! I want to finetune the model for

Can you give an example of an image in the train list file? (deeptextspotter, closed)

michalbusta commented on July 30, 2024

Can you give an example of an image in the train list file?

from deeptextspotter.

Comments (11)

MichalBusta commented on July 30, 2024

Hi,
the train list is just a txt file with the image paths.
Each image in the list should have a corresponding txt file, where each line corresponds to one word.

line example:
0 0.851475423177 0.247500450534 0.128710019264 0.0338691623536 -0.102781031299 CWRU
cls x y w h alpha txt

cls - class - not used
x - bbox center.x normalized by image width
y - bbox center.y normalized by image height
w - bbox width normalized by image diagonal
h - bbox height normalized by image diagonal
alpha - box angle in radians
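
The field definitions above can be sketched as a small converter between pixel-space boxes and annotation lines. This is only an illustration, not code from the repository: the function names `to_annotation_line` and `from_annotation_line` are hypothetical, and the image diagonal is assumed to be sqrt(width^2 + height^2) in pixels.

```python
import math

def to_annotation_line(cx, cy, w, h, alpha, text, img_w, img_h, cls=0):
    """Build one 'cls x y w h alpha txt' line from a box given in pixels."""
    diag = math.hypot(img_w, img_h)  # assumed definition of the image diagonal
    values = (cx / img_w,   # center.x normalized by image width
              cy / img_h,   # center.y normalized by image height
              w / diag,     # width normalized by image diagonal
              h / diag,     # height normalized by image diagonal
              alpha)        # box angle in radians, left unchanged
    return " ".join([str(cls)] + [repr(v) for v in values] + [text])

def from_annotation_line(line, img_w, img_h):
    """Invert to_annotation_line: return (cx, cy, w, h, alpha, text) in pixels."""
    cls, x, y, w, h, alpha, text = line.strip().split(None, 6)
    diag = math.hypot(img_w, img_h)
    return (float(x) * img_w, float(y) * img_h,
            float(w) * diag, float(h) * diag, float(alpha), text)

if __name__ == "__main__":
    line = to_annotation_line(300, 400, 50, 10, -0.1, "CWRU", 600, 800)
    print(line)
    print(from_annotation_line(line, 600, 800))
```

Round-tripping a line through both functions is a quick way to check that the normalization is applied consistently before training.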

ysc703 commented on July 30, 2024

Thanks for your quick response! It's very clear now.

qnkhuat commented on July 30, 2024

Where do we put the file's name?

rohitsaluja22 commented on July 30, 2024

Hi, we tried train.py but are getting the following error:

I0213 04:00:44.575316 17258 net.cpp:242] This network produces output softmax
I0213 04:00:44.575346 17258 net.cpp:255] Network initialization done.
I0213 04:00:44.575435 17258 solver.cpp:56] Solver scaffolding done.
[416, 416]
I0213 04:00:44.581261 17258 solver.cpp:330] Iteration 0, Testing net (#0)
Floating point exception (core dumped)

We are creating a separate .txt file for each .jpeg file. For example, one of the .txt files looks like the following:

0 86.544325 113.70325 61.5000988376 10.8000627304 3.03217953614 calliberation

Please help us resolve the error.

rohitsaluja22 commented on July 30, 2024

Thanks, I got it. I was not normalizing the coordinates. Another mistake: my angle calculation used a right-handed coordinate system, whereas the image follows a left-handed one. That's why my angle was 3 radians.
Also, it's not mentioned here, but I am giving the image path as the first word of each line in train_list.txt. I hope that is right?
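
The angle issue described above can be illustrated with a minimal sketch that measures the angle directly in image coordinates (x to the right, y downward) and wraps it by half turns. Both helper names are hypothetical, and the choice of the range [-pi/2, pi/2) is an assumption made for illustration; the thread does not state which range or sign the training code expects.

```python
import math

def edge_angle_image_coords(x0, y0, x1, y1):
    """Angle of the edge (x0, y0) -> (x1, y1), with y growing downward."""
    return math.atan2(y1 - y0, x1 - x0)

def wrap_half_turn(alpha):
    """Wrap an angle into [-pi/2, pi/2), assuming a box rotated by pi is the same box."""
    return (alpha + math.pi / 2) % math.pi - math.pi / 2

if __name__ == "__main__":
    # The angle from the failing annotation above, wrapped by half a turn.
    print(wrap_half_turn(3.03217953614))
    # An edge that rises toward the right on screen has a negative angle here.
    print(edge_angle_image_coords(0, 0, 10, -1))
```

Under this assumption, an angle near 3 radians maps to a small negative angle of the same order as the one in the example line.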

rohitsaluja22 commented on July 30, 2024

File names are to be put in train_list.txt and valid_list.txt. Each image listed there should have a matching image_name.txt file in the folder "img", wherever your training image folder is located.
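
As a rough sketch of that layout, the list file could be generated by scanning the image folder and keeping only images that have an annotation file next to them. The function name `write_image_list`, the one-path-per-line layout, and the accepted extensions are assumptions for illustration, not something confirmed in this thread.

```python
from pathlib import Path

def write_image_list(image_dir, list_path, exts=(".jpg", ".jpeg", ".png")):
    """Write one image path per line, skipping images without a same-named .txt."""
    kept = []
    for img in sorted(Path(image_dir).iterdir()):
        has_annotation = img.with_suffix(".txt").is_file()
        if img.suffix.lower() in exts and has_annotation:
            kept.append(str(img))
    Path(list_path).write_text("\n".join(kept) + ("\n" if kept else ""))
    return kept

if __name__ == "__main__":
    # "img" and "train_list.txt" mirror the names used in the comment above.
    if Path("img").is_dir():
        print(len(write_image_list("img", "train_list.txt")), "images listed")
```

Skipping images without annotations keeps the list consistent with the rule that every listed image needs its own txt file.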

rohitsaluja22 commented on July 30, 2024

format: cls x y w h alpha txt
Can the txt field contain spaces?

ustczhouyu commented on July 30, 2024

Hi, MichalBusta, so we don't use class, just x,y,w,h, alpha and txt?

MichalBusta commented on July 30, 2024

Hi, yes, the class was used to predict the language of the text (but it did not work so well; we also think that the language identification step is not needed: https://arxiv.org/abs/1801.09919)

ustczhouyu commented on July 30, 2024

Thank you very much for your kind reply. I am training the model now. If there are any problems, I will ask you again; please don't feel disturbed.

(In reply to the message from "Michal Busta" <[email protected]> of 2018-10-23 16:35:13.)

liuqifan67 commented on July 30, 2024

Hi, I just want to ask: is the single txt file just a list with x, y, w, h and alpha? I tried that before, but it didn't work. The error is RuntimeError: bad lexical cast: source type value could not be interpreted as target. Could you give me some advice?
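
That message is the standard text of Boost's bad_lexical_cast exception, which is raised when a string cannot be converted to the requested type. One plausible cause, and this is an assumption rather than something confirmed in the thread, is a line that does not follow the full "cls x y w h alpha txt" layout. A minimal checker, with the hypothetical name `check_annotation_line`, might look like this:

```python
def check_annotation_line(line):
    """Return a list of problems found in one 'cls x y w h alpha txt' line."""
    fields = line.split()
    if len(fields) < 7:
        return ["expected 7 fields, got {}".format(len(fields))]
    problems = []
    values = {}
    for name, raw in zip(["cls", "x", "y", "w", "h", "alpha"], fields):
        try:
            values[name] = float(raw)
        except ValueError:
            problems.append("{} is not a number: {!r}".format(name, raw))
    # x, y, w, h are normalized, so values outside [0, 1] are suspicious.
    for name in ("x", "y", "w", "h"):
        if name in values and not 0.0 <= values[name] <= 1.0:
            problems.append("{} = {} is outside [0, 1]".format(name, values[name]))
    return problems

if __name__ == "__main__":
    print(check_annotation_line("0.5 0.5 0.1 0.1 0.0"))
    print(check_annotation_line(
        "0 0.851475423177 0.247500450534 0.128710019264 "
        "0.0338691623536 -0.102781031299 CWRU"))
```

Running it over every line of every annotation file before training should flag lines with missing fields or unnormalized coordinates.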
