Giter VIP home page Giter VIP logo

Comments (4)

hasanirtiza avatar hasanirtiza commented on May 25, 2024

What do you mean by you cannot repeat ? did you mean that you download the pre-trained model provided in the repo and simply ran evaluation script on CityPersons and you cannot get the performance as stated in the benchmarking table of the README.md ? or you are trying to train a model by yourself using a pre-trained model ?

Regrading the first part, downloading this model and simply running
python ./tools/test_city_person.py configs/elephant/cityperson/cascade_hrnet.py ./models_pretrained/epoch_ 5 6 --out result_citypersons.json --mean_teacher should give you LAMR Reasonable : 7.5 and Heavy : 27.4. This has been tested several times and on multiple machines.

If you are trying to train it yourself (second part), please read the ArXiv pre-print version, section 6.1 and 6.2 to understand how to train.

from pedestron.

yitang-hnu avatar yitang-hnu commented on May 25, 2024

Thanks for your reply!
I can get the best result by running the evaluation script on CityPersons, but I want to train the model by myself to get your best result in CityPerson. From section 6.2 in your paper, did you first pretrain the Cascade R-CNN in the Wider Pedestrian, and then pretrain again in ECP, and finally finetune in CityPerson? If so, I want to know the details about the learning rate and the epoch during pretraining and finetuning.

from pedestron.

hasanirtiza avatar hasanirtiza commented on May 25, 2024

If so, I want to know the details about the learning rate and the epoch during pretraining and finetuning.

Yes, as stated in the paper, we trained it using progressive training pipeline. You can start by using the same learning rate as provided in the configs (for example in the case of Cascade R-CNN 0.02 ->8 gpus with 2 imgs/gpu). For the first part of the pipeline(pre-training on say WiderPedestrian) 20 epochs are usually enough, during the middle part of the pipeline(ECP) you can usually increase the number of epochs (have a look at the validation error and see where it converges). Finally, in the last part, usually 5-7 epochs are enough for the model to converge. You can also experiment with the learning rate if you want, especially in the last part.

from pedestron.

yitang-hnu avatar yitang-hnu commented on May 25, 2024

@hasanirtiza. Thanks for your detailed explanation for pre-training and fin-tuning! It really works, and I got the result of 8.53 MR^-2 in CityPerson, but it is still 1 MR-^2 point behind your best result. I have tried a series of learning rate(from 0.0025 to 0.0005 for 1 img and 1 GPU) for fine-tuning, and only got the result of 8.53 MR^-2 in CityPerson. So, I think it is not very well of my pre-trained model on Wilder Pedestrian and ECP before fine-tune. Could you provide the model that you pre-trained on Wilder Pedestrian and ECP before fine-tuning step in your paper Sec 6.2?

BTW, from my experiments on CityPerson datasets, the model pre-trained on Wilder Pedestrian only is much better than the model pre-trained on WIDER PERSON and WIDER Pedestrian that you provided on CityPerson datasets. If it is convenient, could you provide the pre-trained model only on WIDER Pedestrian? I think it is also much helpful for generic human detection.

from pedestron.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.