Giter VIP home page Giter VIP logo

Comments (8)

athus1990 avatar athus1990 commented on June 17, 2024

Or we could start from FAST RCNN and then denote the changes required for Faster RCNN

from faster_rcnn.

duygusar avatar duygusar commented on June 17, 2024

@athus1990 Can you elaborate? I have taken a leap of faith to faster rcnn without working with fast rcnn, I took a look at the paper. The experiments there also initializes from pretrained networks right? Is it only for time concern (or well better objectness)? Do you suggest that I train a fast rcnn model and then add the layers introduced in faster rcnn using the weights from the initial fast rcnn? Or can I just use do_proposal_train and then do_fast_rcnn_train with those proposals for 1st and 2nd stages? - edit:that is where I stumble do_proposal_train initializes with the pretrained models, trying to go over proposal_train to see

from faster_rcnn.

duygusar avatar duygusar commented on June 17, 2024

update: the vgg16 on imagenet comes from caffe model zoo so I think it is this one: https://gist.github.com/ksimonyan/211839e770f7b538e2d8 The ZF one is said to be trained on MSRA, I think the architecture should be Zeiler & Fergus paper they refer to, so it seems like they are not random models. However since I have one class and it is not any of the objects in those datasets I need to somehow skip initializing

from faster_rcnn.

Astrosun avatar Astrosun commented on June 17, 2024

@duygusar
May I know how you deal with the image mean part? So for my own data set, do we need to compute the image means of our own images? Do we still need to normalize the input?

Thanks a lot!

from faster_rcnn.

duygusar avatar duygusar commented on June 17, 2024

@Astrosun
You don't "have to" but it is recommended that you do for better performance. I have written a script to calculate the mean of all training data and I used that. To be honest, since the training dataset is big, it doesn't make much difference if you assign just some average numbers, around say 100 to 120 for each channel (rgb). It does help when you subtract it from the image, so at least use those random numbers even if you don't calculate the mean. Well take this advice with a grain of salt if you have a peculiar dataset :)

from faster_rcnn.

Astrosun avatar Astrosun commented on June 17, 2024

@duygusar

Thanks so much for your valuable advice and it will be the greatest help!

By the way, one more question, when you calculate the mean of your training data using your own script, do you add them up together then calculate the average value? I am wondering that whether it will cost a lot of time?

Thanks again!

from faster_rcnn.

duygusar avatar duygusar commented on June 17, 2024

@Astrosun
It was a long time ago, so I don't exactly remember but I did read in a list on images so I must have done it with a loop, since it is a simple computation, it didn't take that long. You also need to change the order of rgb and reshape the output though.

from faster_rcnn.

Astrosun avatar Astrosun commented on June 17, 2024

@duygusar

Thanks a lot for your valuable advice and I will try it!

from faster_rcnn.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.