Giter VIP home page Giter VIP logo

fractalnet's People

Contributors

dylansun avatar gustavla avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

fractalnet's Issues

Unrecognized command: define-eval

when i run crox train.crox.prototxt > train.prototxt, error happen:

ERROR globals.crox:2 Unrecognized command: define-eval

why? can anybody help me?

error: ‘FractalJoinParameter’ was not declared in this scope

src/caffe/layers/fractal_join_layer.cpp: In member function ‘virtual void caffe::FractalJoinLayer::LayerSetUp(const std::vectorcaffe::Blob<Dtype_>&, const std::vectorcaffe::Blob<Dtype_>&)’:
src/caffe/layers/fractal_join_layer.cpp:11:3: error: ‘FractalJoinParameter’ was not declared in this scope
FractalJoinParameter param = this->layer_param().fractal_join_param();
^
src/caffe/layers/fractal_join_layer.cpp:11:24: error: expected ‘;’ before ‘param’
FractalJoinParameter param = this->layer_param().fractal_join_param();
^
src/caffe/layers/fractal_join_layer.cpp:14:13: error: ‘param’ was not declared in this scope
std::copy(param.drop_path_ratio().begin()

reproduce

[Hi,]
when i reproduce this model on cifar10, i find that the validation error is very unstable. For the first 200 epochs,the top1 error is fluctuating between 0.15 to 0.35. Then it drops to 0.1 to 0.14, because of the smaller learningrate , during the 200 to 300 epochs .And finally ,it keeps stable at 0.08 , for the last 100 epochs. Does anyone see the similar things? how can i make the validation error smooth ?
The result as below.

net-train.pdf

Maybe there is a bug?

drops_ dose not be reset after Forward and Backward, it still keeps the value of last Forward.
I suggest to reset drops_ at the beginning of Forward.
this is my implementation of FractalNet with global and local drop in caffe:https://github.com/KangGrandesty/fractalnet.
It may be useful for you.

How can I implement the global_drop_path?

Hi, I conducted some experiments on cifar-10 dataset, and I only used the local_drop_path, I've tried lots of drop_path_ratio and dropout_ratio, the full-net's result was closed to yours , but the results of single columns were Not as satisfactory as expected. Was this owing to the global_drop_path which I didn't use? If so, how can I implement this global_drop_path? I'm not quite sure about the meaning of 'tying the weight'.
Hope to hear from you soon, thx!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.