Giter VIP home page Giter VIP logo

Comments (10)

rayush7 avatar rayush7 commented on August 22, 2024 2

@bgshih If I want the model to recognize words and multi-digit numbers both then will fine-tuning the model (already trained on VGG Synthetic word dataset) on dataset like Street View House Number (SVHN) gonna help? Or I need to retrain the model from the scratch?

from crnn.

bgshih avatar bgshih commented on August 22, 2024 1

@rayush7 I strongly feel fine-tuning be a better choice than starting from scratch.

from crnn.

bgshih avatar bgshih commented on August 22, 2024

The model is trained on a dataset that is heavily biased towards a-z characters. Training on a balanced dataset would help.

from crnn.

rayush7 avatar rayush7 commented on August 22, 2024

Thanks @bgshih

from crnn.

rayush7 avatar rayush7 commented on August 22, 2024

@bgshih Could you please explain the format of the single/multi digit labels for creating lmdb datasets?
Example if there is a word "Apple" in the image then setting its groundtruth label as the string 'Apple' is working for me. I can convert it into lmdb format and create my train and validation set and train my own model.
But if I have numerical labels for example number 350 in the image then by setting its label to string '350' is giving me error in the ascii2label function (in utilities.lua). Even though in ascii2label function if conditions are present to take into account the numerical digit values but still its throwing the error. Please explain what should be the groundtruth format of the single/multi digit labels for creating lmdb datasets?

from crnn.

bgshih avatar bgshih commented on August 22, 2024

@rayush7 This is not an expected behavior. What was the error message and where was it thrown?

from crnn.

rayush7 avatar rayush7 commented on August 22, 2024

@bgshih Hey the problem was in my lmdb dataset and not in code. The label string was in unreadable form. I resolved this problem. Now giving the '350' as a label for a image with number 350 in it works like a charm! Thanks @bgshih

from crnn.

rayush7 avatar rayush7 commented on August 22, 2024

@bgshih I am trying to finetune the crnn model (pretrained on vgg synthetic dataset) on svhn dataset. The images (32x32) I am using look like these

my_svhn_sample val_img_99 val_img_1668 val_img_6654 val_img_11111 val_img_20040 val_img_23389

While finetuning, the training loss is showing random patterns and no legitimate inference can be drawn because of that. Also all the validation set images are classified as "1". I am unable to figure out if this is a problem due to wrong choice of optimization parameters or crnn is unfit for training on such images. I am using the default config.lua file with adadelta as the optimizer. Any suggestions on what mistake I am making and how can I rectify that?

from crnn.

ahmedmazari-dhatim avatar ahmedmazari-dhatim commented on August 22, 2024

@bgshih , @rayush7 can you tell me the steps you followed to fine tune the pre-trained mode with your own dataset. l'm newbie on fine tuning pre-trained model. l want to try it for the first time .

Thank you

from crnn.

alexiskattan avatar alexiskattan commented on August 22, 2024

@rayush7 Did you train the model with digits? If so, could you post it online?

from crnn.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.