Comments (10)
@LEECHOONGHO I have published my model here https://huggingface.co/patriotyk/vocos-mel-hifigan-compat-44100khz
Sounds great, and there is metrics.
@Mahmoud-ghareeb My model has been trained on 800+ hours of audio. Vocoder doesn't require text transcripts so you can easily use audio books for training. You even don't need to cut it by silence because vocos anyway internally splits provided audios to smaller segments.
from vocos.
Do you have a standard tensorboard logs? It is interesting to compare.
from vocos.
@patriotyk Sorry, I've change the code to log on WandB server. I have no local logging files nor tensorboard logs.
from vocos.
What is your validation loss on the last checkpoint? It is encoded in to the checkpoint file name. I am training 44100 for an almost a week already and loss still goes down.
from vocos.
Training Loss, Generated Outputs.
I hope this will be a reference for model training.
TKS for your work,could your share 32k model training detail like:
your encodec model(i found pretrained models :24k and 48k,so i guess 32k resample to 24k or 48k for encodec pretrained model,then resample to 32k ??)
from vocos.
Training Loss, Generated Outputs.
I hope this will be a reference for model training.
https://api.wandb.ai/links/xi-speech-team/k0kdfwchTKS for your work,could your share 32k model training detail like: your encodec model(i found pretrained models :24k and 48k,so i guess 32k resample to 24k or 48k for encodec pretrained model,then resample to 32k ??)
I'm sry for your confuse.
I just trained Mel Vocoder not for encodec's decoder.
But I have plans to train Mel-Encodec?(Mel Spectrogram to RVQ Encoder, and Vocos Decoder for Various Speech data) in the future.
from vocos.
Do you have a standard tensorboard logs? It is interesting to compare.
What is your validation loss on the last checkpoint? It is encoded in to the checkpoint file name. I am training 44100 for an almost a week already and loss still goes down.
I estimated mel loss, and Generator loss with newly gained dataset. and each was 0.0942 and 2.82.
Because of the dataset's Size, estimating Eval loss with eval dataset have no difference with sampled train data.
how about your model output's quality? any artifacts?
from vocos.
Do you have a standard tensorboard logs? It is interesting to compare.
What is your validation loss on the last checkpoint? It is encoded in to the checkpoint file name. I am training 44100 for an almost a week already and loss still goes down.
I estimated mel loss, and Generator loss with newly gained dataset. and each was 0.0942 and 2.82. Because of the dataset's Size, estimating Eval loss with eval dataset have no difference with sampled train data.
how about your model output's quality? any artifacts?
I am still training(third week). It is very slow. I will update with my results when finish.
from vocos.
how much data do we need for training
from vocos.
Great work! @patriotyk, Thank you so much
from vocos.
Related Issues (20)
- Is Vocos suitable for singing?
- about the install problems HOT 1
- combine with superresolution HOT 2
- Training error, help needed!
- how to convert custom ckpt to bin? HOT 3
- Bark+Vocos.ipynb fails on saving mp3 files with error about FFmpeg backend
- error
- Export to ONNX HOT 14
- Compatibility with Matcha TTS HOT 7
- "error: No module named 'encodec'" while training a vocos
- MPS support HOT 2
- Why spectogram power is picket as 1?
- Bark + Vocos for longer text to speech ?
- Debug in vscode
- Training vocos on a single speaker dataset
- Feature maps from 1st layer of each discriminator not included
- About the VISQOL
- COLA == Training Instability?
- How to use customized trained models?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vocos.