Comments (7)
Is it wrong to train an i-vector or x-vector model using language-id instead of speaker-id in lid? Because if we see language id as speaker id in baseline system, we get a better results than using the original speaker id to train the model.
from asv-subtools.
from asv-subtools.
Thanks a lot!
from asv-subtools.
Hi @Snowdar,
Thanks for this killing toolkit. I have a followup question:
For the OLR2020 baseline and LID task, the training dataset includes multiple spks(each spk has multiple utterances and one language label). And the test dataset includes multiple spks which not been seen in the training dataset before.
From my understanding based on your answer, I see that the difference between i-vector/-vector SRE task and i-vector/-vector LID task is use either spk-id or languge-id as label.
For the LID task, it is true that we should use language-id as the label. For example, if we do an LID task on such dataset which has 6 different languages, each language has 1000 spks, each spk has only one utterance. It is easy to explain that trained model successfully learned the language-related feature not the spk-related.
But if we do an LID task on such dataset which has 6 languages, each language has 30 spks, each spk has 300 utterances. If we use language-id as label here, 24 spk for train and 6 for test. How do we know the trained model can successfully predict the language label because it learned the language-related features or spk-related features?
I am confused about this part.
Thanks in advance
from asv-subtools.
from asv-subtools.
Hi @Snowdar,
Thanks for your reply.
For the training loss and validation loss in the training process, how we split the training dataset in the training part and validation part. Do we split the training dataset by different spk? In such case, the spks in the validation part do not appeared in the training part.
Or we shuffled the whole training dataset first, and then i.e., use the first 90% portion as the training part and the rest as the validation part? In such case, the spks in the validation part could appeared in the training part as well.
Thanks
from asv-subtools.
from asv-subtools.
Related Issues (20)
- 多gpu运行 HOT 5
- 百度云盘的voxceleb recipe链接已经失效了,可以重新分享么?
- run Voxceleb Recipe [Speaker Recognition] HOT 3
- subtools/scoreSets.sh中191行特征提取错误数统计值errorNum HOT 1
- 多卡运行报错join() HOT 1
- the num_targets and the max label in train.egs.csv are not equal HOT 1
- Removing deprecated has_key function
- 多卡GPU运行失败 HOT 2
- 想完成一个说话人确认系统,应该怎么开始? HOT 2
- Are there any plans to add the onnx conversion module to the model?
- rirs 和 musan 在哪里下载的?
- train.py,# Recover checkpoint 没有将loss的weight的保存,仅load模型,是不是有问题? HOT 1
- 当使用ResNet模型进行迁移的问题 HOT 1
- subtools/pytorch/pipeline/export_jit_model.sh 导出模型报错 HOT 2
- runtime中的processor打分模块能否开源一下
- 是否准备提供onnx 导出功能的支持 HOT 2
- 发现了一个逻辑漏洞/found a bug
- online训练提示标签越界问题:Assertion `idx_dim >= 0 && idx_dim < index_size && "index out of bounds"`
- subtools/pytorch/model/resnet_xvector.py文件152行多写了一个self x = self.self.cmvn_(x)
- extras/check_dependencies.sh: 13: function: not found HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from asv-subtools.