openvpi / singingvocoders Goto Github PK
View Code? Open in Web Editor NEWA collection of neural vocoders suitable for singing voice synthesis tasks.
License: MIT License
A collection of neural vocoders suitable for singing voice synthesis tasks.
License: MIT License
如题。
我按照文档的描述在自己的数据集上微调NSF-HifiGAN声码器,查看tblog时,发现在声谱高频部分会有异常的横纹出现:
静音处显得更为明显:
说明:我使用ft_hifigan.yaml配置文件,除了数据集之外没有改变其他任何参数,使用的数据集大小~2小时。
对此有任何建议吗?thx~
Hi~,我使用了Releases中发布的两个版本的声码器,发现两者都不能很好的还原下面音色。
原始录音:
XYH(原始录音).zip
声码器重建波形:
XYH(2022.12).wav.zip
XYH(2024.02).wav.zip
原始录音中有明显的怒音特征,但是声码器重建的波形听起来则是嘶哑偏破音的状态。此外,即便是使用该音色的录音数据对声码器进行少量步数的针对性微调也没能完全解决这个问题(有所改善)。
我正在想办法看能否解决,对此您有任何想法吗?Thx~
我想请问博主,如果我想把其中的一个声码器用作so-vits-svc中浅扩散模型训练的声码器中,该如何操作,因为我试着导入了HIFIGAN总是失败,看能不能试试这个
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.