Comments (9)
Please check if you disable cudnn of bn correctly, or you can disable cudnn globally using CUDNN.ENABLED=False in your yaml config file.
from human-pose-estimation.pytorch.
Our code are tested using pytorch0.4.0 or 0.4.1, I did not test it using 0.5.0. Please try pytorch 0.4.0 or 0.4.1, and following the readme to disable cudnn.
from human-pose-estimation.pytorch.
I have changed to pytorch0.4.0, while the result still seems not right as below:
2018-10-16 20:17:50,715 Epoch: [139][0/696] Time 1.198s (1.198s) Speed 26.7 samples/s Data 0.906s (0.906s) Loss 0.00049 (0.00049) Accuracy 0.856 (0.856)
2018-10-16 20:18:19,306 Epoch: [139][100/696] Time 0.286s (0.295s) Speed 111.8 samples/s Data 0.000s (0.009s) Loss 0.00045 (0.00048) Accuracy 0.859 (0.854)
2018-10-16 20:18:47,906 Epoch: [139][200/696] Time 0.285s (0.290s) Speed 112.3 samples/s Data 0.000s (0.005s) Loss 0.00046 (0.00048) Accuracy 0.848 (0.851)
2018-10-16 20:19:16,534 Epoch: [139][300/696] Time 0.289s (0.289s) Speed 110.6 samples/s Data 0.000s (0.003s) Loss 0.00056 (0.00048) Accuracy 0.815 (0.851)
2018-10-16 20:19:45,108 Epoch: [139][400/696] Time 0.285s (0.288s) Speed 112.3 samples/s Data 0.000s (0.003s) Loss 0.00046 (0.00048) Accuracy 0.854 (0.852)
2018-10-16 20:20:13,669 Epoch: [139][500/696] Time 0.285s (0.288s) Speed 112.2 samples/s Data 0.000s (0.002s) Loss 0.00054 (0.00048) Accuracy 0.805 (0.852)
2018-10-16 20:20:42,283 Epoch: [139][600/696] Time 0.285s (0.287s) Speed 112.3 samples/s Data 0.000s (0.002s) Loss 0.00051 (0.00048) Accuracy 0.837 (0.852)
2018-10-16 20:21:10,280 Test: [0/93] Time 1.022 (1.022) Loss 0.0015 (0.0015) Accuracy 0.014 (0.014)
2018-10-16 20:21:24,151 | Arch | Head | Shoulder | Elbow | Wrist | Hip | Knee | Ankle | Mean | [email protected] |
2018-10-16 20:21:24,152 |---|---|---|---|---|---|---|---|---|---|
2018-10-16 20:21:24,152 | 256x256_pose_resnet_50_d256d256d256 | 0.000 | 0.238 | 0.222 | 1.598 | 6.941 | 0.625 | 0.142 | 1.452 | 0.075 |
2018-10-16 20:21:24,155 => saving checkpoint to output/mpii/pose_resnet_50/256x256_d256x3_adam_lr1e-3
2018-10-16 20:21:24,917 saving final model state to output/mpii/pose_resnet_50/256x256_d256x3_adam_lr1e-3/final_state.pth.tar
Is it also related to the version of python and opencv? Or something else?
from human-pose-estimation.pytorch.
After setting CUDNN.ENABLED=False, it seems everything works well now.
Test: [0/93] Time 1.235 (1.235) Loss 0.0004 (0.0004) Accuracy 0.934 (0.934)
Arch | Head | Shoulder | Elbow | Wrist | Hip | Knee | Ankle | Mean | [email protected] |
---|---|---|---|---|---|---|---|---|---|
256x256_pose_resnet_50_d256d256d256 | 96.623 | 94.803 | 88.018 | 81.772 | 87.017 | 82.450 | 77.869 | 87.525 | 33.690 |
=> saving checkpoint to output/mpii/pose_resnet_50/256x256_d256x3_adam_lr1e-3 |
But without cudnn, the training/testing process becomes very slow. Is there a Dockerfile which can reproduce the environment of your experiments? If there is, cloud you share it to me? Thank you : )
from human-pose-estimation.pytorch.
so I guesses that you may not disable cudnn of bn correctly. Please follow our steps to do it.
from human-pose-estimation.pytorch.
In the feature, I will add the Dockerfile.
from human-pose-estimation.pytorch.
I have changed the line below with "False":
return torch.batch_norm(
input, weight, bias, running_mean, running_var,
training, momentum, eps, False
)
and set the environment variable PYTORCH, is there anything I missed?
from human-pose-estimation.pytorch.
Please make sure that the runtime pytorch is what you changed, not other pytorch version.
from human-pose-estimation.pytorch.
Thank you for your patient reply. It's helpful. I finally found the problem.
from human-pose-estimation.pytorch.
Related Issues (20)
- 关于数据增强的疑惑。
- Help with Transfer Learning
- 模型的速度问题
- 关于数据处理,请大神解惑。 HOT 1
- 检测部分的代码 HOT 1
- 训练结果低于作者给的结果 HOT 3
- I have an error when I load image file in data/coco/images/val2017 HOT 2
- About the rotation of test_set in MPII
- cpu_nms HOT 1
- TypeError: _update_dict() missing 1 required positional argument: 'code' HOT 2
- experiments and core.config
- 只能实现单人姿态估计?
- TypeError: unhashable type: 'slice'
- flip_pairs?
- Which part of this code that explains joint propagation using optical flow?
- This repo is missing important files HOT 1
- lib make error HOT 1
- Support inference in C++
- Pretrained 2d pose estimation with resnet50 backbone
- How to I run demo in video.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from human-pose-estimation.pytorch.