Comments (10)
from emotivoice.
Congratulations on your prompt actions regarding voice cloning. It would be helpful if you could provide more details, such as the data you are using, the number of training steps you have completed, and so on.
from emotivoice.
1.我按这个步骤中所说的数据进行训练的:https://github.com/netease-youdao/EmotiVoice/tree/main/data/DataBaker。
其中这两项没执行。
2.此外我改为本地模型训练,修改成以下代码:
train_dataset = Dataset_PromptTTS_JETS(config.train_data_path, config, style_encoder)
#data_sampler = DistributedSampler(train_dataset)
#train_loader = torch.utils.data.DataLoader(
# train_dataset,
# num_workers=8,
# shuffle=False,
# batch_size=config.batch_size,
# collate_fn=train_dataset.TextMelCollate,
# sampler = data_sampler,
#)
# 不再使用 DistributedSampler
train_loader = torch.utils.data.DataLoader(
train_dataset,
#num_workers=8,
num_workers=1,
shuffle=True, # 使用shuffle而不是sampler
batch_size=config.batch_size,
collate_fn=train_dataset.TextMelCollate
)
3.训练了5000步和10000步是一样的效果,没有正常的声音
4.4000步val
from emotivoice.
1.我按这个步骤中所说的数据进行训练的:https://github.com/netease-youdao/EmotiVoice/tree/main/data/DataBaker。 其中这两项没执行。 2.此外我改为本地模型训练,修改成以下代码: train_dataset = Dataset_PromptTTS_JETS(config.train_data_path, config, style_encoder) #data_sampler = DistributedSampler(train_dataset) #train_loader = torch.utils.data.DataLoader( # train_dataset, # num_workers=8, # shuffle=False, # batch_size=config.batch_size, # collate_fn=train_dataset.TextMelCollate, # sampler = data_sampler, #) # 不再使用 DistributedSampler train_loader = torch.utils.data.DataLoader( train_dataset, #num_workers=8, num_workers=1, shuffle=True, # 使用shuffle而不是sampler batch_size=config.batch_size, collate_fn=train_dataset.TextMelCollate )
For DataBaker, it should work fine. I have attached the results from 5000 steps for your reference.
DataBaker-g_00005000.zip
And I will attempt to replicate the issue based on the modifications you have made.
from emotivoice.
谢谢你的及时回复。我想我要重装下python环境试下,你用哪个版本的pytorch和cuda?
from emotivoice.
Some setups for your reference:
cuda 11.8 torch 2.1.1 python 3.10
cuda 11.7 torch 1.13 python 3.8
from emotivoice.
我安装了cuda 11.8 torch 2.1.1 python 3.10,重装环境也不行,我的是windows11,无法用分布式训练。用预训练模型都比训练的好,至少可以出来声音。:(
from emotivoice.
想请问一下你的硬件配置是?10000步大概训练了多久?我的训练了快10个小时了还没出结果。
from emotivoice.
训练出来,完全不可用。像哑巴一样,呀呀........,是哈原因差这么远
请问您解决了吗?我遇到了一样的问题,使用https://github.com/netease-youdao/EmotiVoice/tree/main/data/LJspeech提供的步骤,没有执行MFA,微调结果基本没有声音。
from emotivoice.
from emotivoice.
Related Issues (20)
- 运行网页生成语音没有问题,运行api命令的时候,报了一个No module named 'torch'
- CorpusError
- 如何在句子中插入停顿? HOT 1
- 用自己的数据制定音色
- 能够使用小样本英文进行微调吗? HOT 1
- 使用命令行推理报错,张量维度不一致 HOT 1
- 一次推理的句子最大长度是多少呢? HOT 1
- 请问推理时的sp0,sp1是什么作用,最大的sp是多少? HOT 1
- 推理时,生成的音频最后一个字或者最后一个音几乎听不到 HOT 1
- 请问一下,新增语音情感,StyleEncoder 模型需要重训练吗?怎么训练这个模型呢? HOT 2
- API接口如何处理多音字比如”还(hai2)不还(huan2)钱“ HOT 2
- 'sp' can not control pause as expected.
- Is dialect-languages-finetuning possible on EmotiVoice?
- Noised data for Finetune
- webdemo里面怎么标记多音字
- 请问这个参数gta = False指的是什么?有什么作用吗
- 求助!mfa执行第6步报错,老板等着要demo,求救求救 HOT 1
- 试了里面中文名字的语音,女:4519、6865、7143,男:7556、964这几个尚且能用 HOT 2
- 报错RuntimeError: CUDA error,
- 结合对话上下文给出符合人类理解的带情绪的回复
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from emotivoice.