
Comments (3)

cbalint13 commented on September 26, 2024
  • You cannot train the same layer on two CPUs, and you cannot gain speed by placing two adjacent layers on two separate GPUs, since the second layer has to wait for the first layer's output. The only way to gain speed on two or more GPUs is if your NN architecture is itself split, e.g. like this architecture: http://benanne.github.io/images/imagenet.png, where you can take advantage of two GPU cards. There are many more situations where you can train this kind of parallel architecture on multiple GPUs, e.g. training two separate input layers that join each other at some point.
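To make the point about split architectures concrete, here is a minimal pure-Python sketch. It is not convnet code and does not touch any GPU; the names `dense`, `run_branch` and `forward_split` and the tiny weight matrices are made up for illustration. It only shows the dependency structure: layers inside one branch must run in order, while two branches that share no intermediate data can run at the same time until they are joined.

```python
from concurrent.futures import ThreadPoolExecutor


def dense(inputs, weights):
    """Fully connected layer with ReLU: one output value per weight row."""
    return [max(0.0, sum(w * x for w, x in zip(row, inputs))) for row in weights]


def run_branch(inputs, layers):
    """Layers inside a branch are sequential: each one needs the previous output."""
    activations = inputs
    for weights in layers:
        activations = dense(activations, weights)
    return activations


def forward_split(inputs, branch_a, branch_b, joint_weights):
    """Run two independent branches concurrently, then join their outputs."""
    # The workers stand in for two devices (an assumption for illustration only).
    with ThreadPoolExecutor(max_workers=2) as pool:
        future_a = pool.submit(run_branch, inputs, branch_a)
        future_b = pool.submit(run_branch, inputs, branch_b)
        # The join is the synchronization point: both branches must be finished.
        joined = future_a.result() + future_b.result()
    return dense(joined, joint_weights)


# Hypothetical toy weights: one layer per branch, then a joint layer.
branch_a = [[[1.0, 0.0], [0.0, 1.0]]]
branch_b = [[[1.0, 1.0], [1.0, -1.0]]]
joint = [[1.0, 1.0, 1.0, 1.0]]

output = forward_split([2.0, 3.0], branch_a, branch_b, joint)
print(output)
```

Python threads will not make this toy example faster; the sketch is only meant to show where the independent work is, which is what a two-GPU split of the model would exploit.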

from convnet.

Robert0812 commented on September 26, 2024

Thank you, I will try to split the model.


Robert0812 commented on September 26, 2024

I tested model splitting with ImageNet data on two K40 GPU boards. Training 20 minibatches (each minibatch has 128 images of size 224x224) takes 32.2 seconds for AlexNet [Alex Krizhevsky NIPS 2012] on a single GPU, and 21.1 seconds when I split the model across the two GPUs following the AlexNet model structure. So I do get an improvement in efficiency. But with the same structure and the same batch size, Caffe achieves around 19 seconds per 20 minibatches even on a single GPU. Has anyone else tested this? Tell me if my testing results are wrong.
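For anyone comparing their own numbers, the timings above can be turned into throughput and speedup with a few lines of Python. This is only arithmetic on the figures quoted in this comment; the labels and helper names are made up, and the Caffe figure is the approximate "around 19 seconds" value.

```python
MINIBATCHES = 20
BATCH_SIZE = 128  # images per minibatch, as reported above

# Seconds per 20 minibatches, taken from the comment above.
timings_s = {
    "convnet, 1 GPU": 32.2,
    "convnet, 2 GPUs (split model)": 21.1,
    "Caffe, 1 GPU (approximate)": 19.0,
}


def images_per_second(seconds):
    """Throughput for one timed run of MINIBATCHES minibatches."""
    return MINIBATCHES * BATCH_SIZE / seconds


def speedup(baseline_seconds, other_seconds):
    """How many times faster the other run is than the baseline."""
    return baseline_seconds / other_seconds


for label, seconds in timings_s.items():
    print(f"{label}: {images_per_second(seconds):.1f} images/s")

split_speedup = speedup(timings_s["convnet, 1 GPU"],
                        timings_s["convnet, 2 GPUs (split model)"])
print(f"split-model speedup over 1 GPU: {split_speedup:.2f}x")
```

With these figures the two-GPU split is roughly 1.5x faster than the single-GPU run, which is well short of the 2x an ideal split would give.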

