Giter VIP home page Giter VIP logo

Comments (4)

he-y avatar he-y commented on June 14, 2024

Your observation is right. The operation of the gradient of FPGM is different from that of SFP.
For FPGM, the gradients of the pruned filters are zero, and for SFP, they are non-zero, refer to this link.

Several reasons for this:

  1. The nonzero gradient of SFP would enlarge the model capacity, which is the main point of SFP. But the contribution of FPGM is about the filter selection criterion, so we would like the eliminate the effect of model capacity to compare fairly.

  2. This kind of operation is also different from HFP (Please refer to HFP in my IJCAI paper). Although the selected filters are pruned and kept as zero, the connections of them remain.

Imagine the first case, we use 500 neurons to learn a training set for one epoch (This is HFP).
In the second case, we use 1000 neurons to learn and delete 500 of them. For every epoch, 500 neurons are different (This is SFP).
The third case, we use 1000 neurons to learn and delete 500 of them. For every epoch, 500 neurons are the same (This is something between SFP and HFP).

from filter-pruning-geometric-median.

yyx1107 avatar yyx1107 commented on June 14, 2024

Got it! Thank you~

from filter-pruning-geometric-median.

xingyueye avatar xingyueye commented on June 14, 2024

@he-y What if only change the filter selection method and make the selected filters different for every epoch?In other words, maintaining the advantages of both the model capability enlarge of SFP and the filter selection of FPGM?

from filter-pruning-geometric-median.

he-y avatar he-y commented on June 14, 2024

@he-y What if only change the filter selection method and make the selected filters different for every epoch?In other words, maintaining the advantages of both the model capability enlarge of SFP and the filter selection of FPGM?

I think the performance would be slightly better.
As discussed above, the original FPGM is "FPGM + half SFP". So the improvement of "FPGM + SFP" over "FPGM + half SFP" may be less than that of "SFP" over "HFP".

from filter-pruning-geometric-median.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.