Comments (3)
- You cannot train same layer on two CPU or cannot gain speed with two different adjacent layers on two separate GPU. The only way to gain speed on two or more GPU if e.g you NN architecture is split e.g. like this architecture: http://benanne.github.io/images/imagenet.png where you can take advantage from two GPU cards. There are many more situations where you can train sort of parallel architectures on multiple GPU e.g train two separate input layers that at some point will join each other.
from convnet.
Thank you, I will try to split the model.
from convnet.
I tested the model splitting with imagenet data on two K40 GPU boards. The 20-minibatch (each minibatch has 128 images of size 224x224) training time is 32.2 seconds for the AlexNet [Alex Krizhevsky NIPS 2012] using single GPU, while the 20-minibatch training time is 21.1 seconds if I split the model into two GPU memories following the model structure of AlexNet. I do get improvement in efficiency. But if I use the same structure with same batch size in training, Caffe can achieve around 19 seconds per 20 minibatches even on single GPU. Anyone else tested it? Tell me if my testing results are wrong.
from convnet.
Related Issues (20)
- How to cite it? HOT 1
- meaning of variables in RESPONSE_NORM type edge HOT 1
- imagenet case
- Assertion error about targets.shape[3] during Backprop. HOT 1
- about 'partial_sum'
- Warning : Could not set up P2P, GPU-to-GPU communication will be slow. CUDA error
- Step 1Invalid start / end 0 0 HOT 2
- video2hdf5.o error HOT 2
- Comipling the CPU-only feature extractor HOT 3
- Bug in LoadChunk(DataIterator& it, Matrix& mat, vector<int>& random_rows) HOT 1
- Unable to compile: undefined references HOT 1
- attempting to build cpu-only on OSX 10.10 HOT 1
- Wrong post
- GetName method undefined in pbtxt2dot.py
- Windows support
- Can we get some walk through on how to train CNN? HOT 1
- Assertion failed when run py/run_convnet.py
- Display localization
- How to train the convnet on multiGPUs?
- /usr/bin/ld: cannot find -leigenmat HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from convnet.