tongkunguan / siga
[CVPR2023] Self-supervised Implicit Glyph Attention for Text Recognition
Hello, when I load the model for testing, the following error is shown:
Traceback (most recent call last):
  File "/content/drive/MyDrive/SRresaerch/SIGA/SIGA_R/test.py", line 223, in <module>
    test(opt)
  File "/content/drive/MyDrive/SRresaerch/SIGA/SIGA_R/test.py", line 127, in test
    model.load_state_dict(pretrained_state_dict['net'])
  File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/module.py", line 2041, in load_state_dict
    raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for DataParallel:
    Unexpected key(s) in state_dict: "module.model_one.Transformation.GridGenerator.inv_delta_C", "module.model_one.Transformation.GridGenerator.P_hat".
What might be causing this problem?
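One possible workaround, offered only as a sketch: the error says the checkpoint contains two entries (inv_delta_C and P_hat) that the current model does not declare, so the extra entries can be filtered out before loading. The helper name drop_unexpected_keys below is hypothetical and not part of the SIGA code. Whether it is safe to ignore these two entries is an assumption: it holds only if the model recomputes them itself when it is constructed.

```python
from typing import Any, Dict, Iterable, List, Tuple


def drop_unexpected_keys(
    checkpoint_state: Dict[str, Any],
    model_keys: Iterable[str],
) -> Tuple[Dict[str, Any], List[str]]:
    """Split a checkpoint into entries the model expects and entries it does not.

    Hypothetical helper, not part of the SIGA repository.
    """
    expected = set(model_keys)
    # Keep only the entries whose names the model actually declares.
    kept = {k: v for k, v in checkpoint_state.items() if k in expected}
    # Record what was dropped so it can be inspected before trusting the result.
    dropped = [k for k in checkpoint_state if k not in expected]
    return kept, dropped


# Intended use in test.py (assumes `model` and `pretrained_state_dict`
# are the objects named in the traceback above):
#
#   filtered, dropped = drop_unexpected_keys(
#       pretrained_state_dict['net'], model.state_dict().keys())
#   print("ignored checkpoint entries:", dropped)
#   model.load_state_dict(filtered)
```

PyTorch also accepts model.load_state_dict(state_dict, strict=False), which skips mismatched keys and returns the lists of missing and unexpected keys. Filtering explicitly has the advantage that the dropped names are visible, and any missing key still raises an error.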
Hello,
After reading your paper, I would like to use the segmentation method in my own scene text recognition work. However, I find that generating the image masks during the training stage takes a long time and increases the overall training time. My first idea was to generate the masks in advance on a local machine, but the Synth dataset has 15M images, so generating all the masks would also take many days. May I ask how you dealt with this problem during training?
Have you released the code for the Transformer architecture? Please forgive my ignorance, but I can't seem to find it.
Additionally, I couldn't find the Glyph Pseudo-label Construction (GPC), Glyph Attention Network (GLAN), and Attention-based Character Fusion Module (ACFM) in the code when I searched for their abbreviations. I guess they are spread across several files under the modules folder. Could you offer more information about where each of them is in the code, and how I can find and use these three modules for ablation studies? Again, forgive my ignorance, I'm really a budding nerd. Thank you so much.
Hello, thanks for your work. I thoroughly enjoyed reading the paper. I have a couple of questions regarding text mask generation.
The first link in the dataset seems to be inaccessible. Can it be fixed?
Also, I would like to ask about Tables 2 and 3, where some datasets in the first row are annotated with two numbers, such as "IC13-857" and "IC13-1015". Do these numbers represent the number of samples in the test set?