Giter VIP home page Giter VIP logo

Comments (10)

tangflash avatar tangflash commented on June 2, 2024 2

from emotivoice.

syq163 avatar syq163 commented on June 2, 2024

Congratulations on your prompt actions regarding voice cloning. It would be helpful if you could provide more details, such as the data you are using, the number of training steps you have completed, and so on.

from emotivoice.

tangflash avatar tangflash commented on June 2, 2024

1.我按这个步骤中所说的数据进行训练的:https://github.com/netease-youdao/EmotiVoice/tree/main/data/DataBaker。
其中这两项没执行。
image
2.此外我改为本地模型训练,修改成以下代码:
train_dataset = Dataset_PromptTTS_JETS(config.train_data_path, config, style_encoder)
#data_sampler = DistributedSampler(train_dataset)
#train_loader = torch.utils.data.DataLoader(
# train_dataset,
# num_workers=8,
# shuffle=False,
# batch_size=config.batch_size,
# collate_fn=train_dataset.TextMelCollate,
# sampler = data_sampler,
#)
# 不再使用 DistributedSampler
train_loader = torch.utils.data.DataLoader(
train_dataset,
#num_workers=8,
num_workers=1,
shuffle=True, # 使用shuffle而不是sampler
batch_size=config.batch_size,
collate_fn=train_dataset.TextMelCollate
)

3.训练了5000步和10000步是一样的效果,没有正常的声音
4.4000步val
val_4000_melspec_0
val_4000_melspec_1

from emotivoice.

syq163 avatar syq163 commented on June 2, 2024

1.我按这个步骤中所说的数据进行训练的:https://github.com/netease-youdao/EmotiVoice/tree/main/data/DataBaker。 其中这两项没执行。 image 2.此外我改为本地模型训练,修改成以下代码: train_dataset = Dataset_PromptTTS_JETS(config.train_data_path, config, style_encoder) #data_sampler = DistributedSampler(train_dataset) #train_loader = torch.utils.data.DataLoader( # train_dataset, # num_workers=8, # shuffle=False, # batch_size=config.batch_size, # collate_fn=train_dataset.TextMelCollate, # sampler = data_sampler, #) # 不再使用 DistributedSampler train_loader = torch.utils.data.DataLoader( train_dataset, #num_workers=8, num_workers=1, shuffle=True, # 使用shuffle而不是sampler batch_size=config.batch_size, collate_fn=train_dataset.TextMelCollate )

3.训练了5000步和10000步是一样的效果,没有正常的声音 4.4000步val val_4000_melspec_0 val_4000_melspec_1

For DataBaker, it should work fine. I have attached the results from 5000 steps for your reference.
DataBaker-g_00005000.zip
And I will attempt to replicate the issue based on the modifications you have made.

from emotivoice.

tangflash avatar tangflash commented on June 2, 2024

谢谢你的及时回复。我想我要重装下python环境试下,你用哪个版本的pytorch和cuda?

from emotivoice.

syq163 avatar syq163 commented on June 2, 2024

Some setups for your reference:
cuda 11.8 torch 2.1.1 python 3.10
cuda 11.7 torch 1.13 python 3.8

from emotivoice.

tangflash avatar tangflash commented on June 2, 2024

我安装了cuda 11.8 torch 2.1.1 python 3.10,重装环境也不行,我的是windows11,无法用分布式训练。用预训练模型都比训练的好,至少可以出来声音。:(

from emotivoice.

ttPrivacy avatar ttPrivacy commented on June 2, 2024

想请问一下你的硬件配置是?10000步大概训练了多久?我的训练了快10个小时了还没出结果。

from emotivoice.

set-path avatar set-path commented on June 2, 2024

训练出来,完全不可用。像哑巴一样,呀呀........,是哈原因差这么远

请问您解决了吗?我遇到了一样的问题,使用https://github.com/netease-youdao/EmotiVoice/tree/main/data/LJspeech提供的步骤,没有执行MFA,微调结果基本没有声音。

from emotivoice.

tangflash avatar tangflash commented on June 2, 2024

from emotivoice.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.