Comments (5)
from asv-subtools.
Hello, please check your PyTorch version first. For now we suggest not using anything above 1.10; the recommended version is 1.7.
…
On November 16, 2021, at 2:18 PM, songfuture @.***> wrote: First of all, thank you for xmuspeech's subtools. I have a question: when I run the command subtools/runPytorchLauncher.sh run-resnet34-fbank-81-benchmark.py --gpu-id=0,1 --stage=3 --endstage=3 , which is equivalent to python3 -m torch.distributed.launch --nproc_per_node=2 run-resnet34-fbank-81-benchmark.py --gpu-id=0,1 --port 2345 --stage=3 --endstage=3 , the following warning and error appear. Could a problem with the environment, or something else, be causing multi-GPU initialization to fail?
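The version check suggested in the reply can be sketched as below. This is only an illustration: the helper names `parse_major_minor` and `matches_recommended` are hypothetical and are not part of subtools, and the target of 1.7 is taken from the recommendation in the reply above.

```python
import re

# Assumption: 1.7 is the version recommended in the reply above.
RECOMMENDED = (1, 7)


def parse_major_minor(version):
    """Extract (major, minor) from a version string such as '1.9.0+cu111'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError("unrecognized version string: %r" % version)
    return int(match.group(1)), int(match.group(2))


def matches_recommended(version):
    """Return True when the version has the recommended major.minor pair."""
    return parse_major_minor(version) == RECOMMENDED


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment")
    else:
        # torch.__version__ is a string like '1.7.1' or '1.9.0+cu111'.
        print(torch.__version__, matches_recommended(torch.__version__))
```

Running the script inside the training environment prints the installed version and whether it matches 1.7.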
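Thanks for the reply. Following your suggestion, I downgraded PyTorch from 1.9 to 1.7 and it now runs without problems. One more question: for multi-node multi-GPU training, should I run a command like the following on each machine: "python3 -m torch.distributed.launch --nnodes=2 --nproc_per_node=2 --master_addr ***** --node_rank=0 run-resnet152-fbank-81-attention.py --stage=3 --endstage=3"? Or does subtools have its own mechanism for switching from single-node multi-GPU to multi-node multi-GPU? subtools/pytorch/libs/support/utils.py indicates that it can easily be extended to multiple nodes, but I could not find a parameter for switching to multi-node in subtools/runPytorchLauncher.sh, the script that runs single-node multi-GPU training. Or is this configured in some other script?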
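To make the question concrete, here is a minimal sketch of how the per-machine commands would differ under `torch.distributed.launch`: every node gets the same `--nnodes`, `--nproc_per_node`, `--master_addr` and `--master_port`, and only `--node_rank` changes. The helper `build_launch_command` is hypothetical, and the address `192.0.2.1` and port `29500` are placeholders standing in for the elided master address. Whether the subtools training scripts work unchanged under such a launch is not confirmed anywhere in this thread.

```python
import shlex


def build_launch_command(node_rank, nnodes, nproc_per_node,
                         master_addr, master_port, script, script_args):
    """Build the torch.distributed.launch command line for one node."""
    return [
        "python3", "-m", "torch.distributed.launch",
        "--nnodes=%d" % nnodes,
        "--node_rank=%d" % node_rank,          # the only per-node difference
        "--nproc_per_node=%d" % nproc_per_node,
        "--master_addr=%s" % master_addr,
        "--master_port=%d" % master_port,
        script,
    ] + list(script_args)


if __name__ == "__main__":
    # Placeholder values (assumptions): replace with the real master address.
    for rank in range(2):
        cmd = build_launch_command(
            node_rank=rank, nnodes=2, nproc_per_node=2,
            master_addr="192.0.2.1", master_port=29500,
            script="run-resnet152-fbank-81-attention.py",
            script_args=["--stage=3", "--endstage=3"],
        )
        print("node %d: %s" % (rank, " ".join(shlex.quote(a) for a in cmd)))
```

Each printed line is the command to run on the corresponding machine.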
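Hello, may I ask: does subtools/pytorch/libs/support/utils.py only support single-node multi-GPU training? For multi-node multi-GPU training, do I need to modify the initialization and related code myself?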
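As background for the initialization question, the sketch below shows the rank arithmetic that `torch.distributed.launch` applies when it spans several nodes: the global rank of a process is `node_rank * nproc_per_node + local_rank`, and the world size is `nnodes * nproc_per_node`. The function names are hypothetical, and this does not describe what subtools' utils.py actually does. It only illustrates why an initialization that uses the local GPU index as the rank would need to change for multi-node runs.

```python
def global_rank(node_rank, nproc_per_node, local_rank):
    """Global rank of a process, as computed by torch.distributed.launch."""
    return node_rank * nproc_per_node + local_rank


def world_size(nnodes, nproc_per_node):
    """Total number of processes across all nodes."""
    return nnodes * nproc_per_node


if __name__ == "__main__":
    nnodes, nproc_per_node = 2, 2  # two machines with two GPUs each
    print("world size:", world_size(nnodes, nproc_per_node))
    for node in range(nnodes):
        for local in range(nproc_per_node):
            print("node %d, local rank %d -> global rank %d"
                  % (node, local, global_rank(node, nproc_per_node, local)))
```

On a single node the global rank equals the local rank, so the two are easy to conflate. With more than one node they diverge on every machine except the first.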
Thanks for the answer.