Comments (3)
@sbkim052, no problem.
The basic idea is to train a speaker verification/classification model to learn an embedding space for speaker identity. Then, instead of conditioning the decoder on a fixed speaker id (like I did in this repo), you condition on the learned embeddings. At test time you can get the embedding for a new unseen speaker and condition the decoder to generate speech in that voice. For more info, you can take a look at this paper. They use a text-to-speech model instead of an autoencoder but the general idea is the same.
from zerospeech.
Hi @sbkim052,
Yeah, it should work with unseen speech as the input. All the examples here are converted from unseen speech.
If you want to convert to an unseen speaker, you'd have to retrain the model. You could also look into conditioning on x-vectors or other speaker embeddings if you want to do zero-shot conversion.
from zerospeech.
Hi @sbkim052,
Yeah, it should work with unseen speech as the input. All the examples here are converted from unseen speech.
If you want to convert to an unseen speaker, you'd have to retrain the model. You could also look into conditioning on x-vectors or other speaker embeddings if you want to do zero-shot conversion.
Thank you for answering:)
I have an additional question about your answer.
What do you mean by conditioning on x-vectors or other embeddings for zero-shot conversion?
Could you explain it in more detail on doing the zero-shot conversion?
from zerospeech.
Related Issues (17)
- pooch seems to be also a required dependency HOT 2
- keyerror when preprocess data HOT 6
- Can it run on cpu for real time voice conversion HOT 1
- How to perform Acoustic Unit Discovery?
- Vector dimension does not match other files HOT 14
- Time pre epoch
- About sampling rate 8 kHz
- Why is batch_size = 52 instead of 32 or 64?
- the speaker id
- How to train the model without using apex amp (automatic mixed precision)?
- Running on multiple GPUs
- Use as Universal Vocoder HOT 3
- How did you installing Apex? HOT 3
- I want to train this model in vctk dataset, but i don't know how to generate the json files provided by you? HOT 7
- ImportError when trying to compute ABX score HOT 4
- RuntimeWarning: invalid value encountered in log HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from zerospeech.