
Comments (47)

enk100 avatar enk100 commented on July 17, 2024 2

try both:
checkpoint/(name of expName)/bestmodel.pth
checkpoint/(name of expName)/lastmodel.pth

from loop.

enk100 avatar enk100 commented on July 17, 2024 1
  1. What is the total duration of these 140 files? I think you should train with more data; in the vctk experiments each speaker has 20-25 minutes. Alternatively, you can fit the model to the new speaker as we wrote in the new version of the paper (just note that you need to use a model trained on a large number of speakers).
  2. Good.
  3. You can check it on the logger (see the plotting sketch below).
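
In case it helps with point 3, here is a rough sketch of plotting a learning curve from a training log. The log path and the "loss: <number>" line format are assumptions about what the logger writes, so adjust both to your setup.

```python
import re
import matplotlib.pyplot as plt

# Hedged sketch: pull loss values out of a training log and plot them.
# Both the log path and the regex are assumptions -- adapt them to
# whatever your logger actually writes per step/epoch.
losses = []
with open('checkpoints/blizzard_init/main.log') as f:   # hypothetical path
    for line in f:
        m = re.search(r'loss[:=]\s*([0-9.]+)', line, re.IGNORECASE)
        if m:
            losses.append(float(m.group(1)))

plt.plot(losses)
plt.xlabel('logged step')
plt.ylabel('training loss')
plt.title('learning curve')
plt.savefig('learning_curve.png')
```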

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

The output is very different from my orig.wav file.
output.zip

from loop.

enk100 avatar enk100 commented on July 17, 2024

Did you use the Blizzard 2011 dataset?

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

@enk100 no, I used my own dataset.

from loop.

enk100 avatar enk100 commented on July 17, 2024

sj_017.gen_0.wav is Blizzard.
Are you sure you trained it on your data?
Did you change the data path to your own dataset?

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Did you change the data path to your own dataset?

Which part do you mean? During training?

from loop.

enk100 avatar enk100 commented on July 17, 2024

yes, on train.py

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Here's what I did, @enk100:

  1. Extracted my own dataset using extract_feats.py
  2. Overwrote data/blizzard/* with the dataset extracted in step 1
  3. Ran the first training stage: python train.py --noise 1 --expName blizzard_init --seq-len 1600 --max-seq-len 1600 --data data/blizzard --nspk 1 --lr 1e-5 --epochs 10
  4. Ran the second training stage: python train.py --noise 1 --expName blizzard --seq-len 1600 --max-seq-len 1600 --data data/blizzard --nspk 1 --lr 1e-4 --checkpoint checkpoints/blizzard_init/bestmodel.pth --epochs 90
  5. Generated with: python generate.py --npz data/blizzard/numpy_features_valid/sj_017.npz --checkpoint models/blizzard/bestmodel.pth

from loop.

enk100 avatar enk100 commented on July 17, 2024

Are you sure you didn't mix your dataset with Blizzard?
Can you look into data/blizzard/ and check that it contains only your dataset?

It is very odd that you hear Blizzard when you didn't train on Blizzard... maybe you started from a checkpoint of the Blizzard model?

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

data/blizzard only contains my dataset. I use model/blizzard for training. Is that okay, or do I need to create a model from my dataset?

from loop.

enk100 avatar enk100 commented on July 17, 2024

You need to train the model from scratch.
Did the '--checkpoint' argument in train.py stay an empty string, or did you insert the Blizzard model checkpoint?

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

In the first stage of training, --checkpoint is empty. In the second stage of training, the --checkpoint I use is checkpoints/blizzard_init/bestmodel.pth.

from loop.

enk100 avatar enk100 commented on July 17, 2024

Please check the '--checkpoint' argument in train.py. If it contains a Blizzard checkpoint, then the first stage trains on a pretrained Blizzard model.

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

I'm sorry, I'm confused by this statement:

If it contains a Blizzard checkpoint, then the first stage trains on a pretrained Blizzard model.

from loop.

enk100 avatar enk100 commented on July 17, 2024

For example, if the 'default' of the argument in train.py is set like this:
parser.add_argument('--checkpoint', default='checkpoints/blizzard_init/bestmodel.pth', metavar='C', type=str, help='Checkpoint path')
then your training is initialized with the Blizzard model.

If the 'default' argument is empty, then it is OK:
parser.add_argument('--checkpoint', default='', metavar='C', type=str, help='Checkpoint path')
and you start training your model from scratch.

Somehow your model got Blizzard samples, so you should search for a Blizzard data leak (see the check below).
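
A minimal sketch of such a check, assuming .npz feature files live under data/blizzard as in the commands in this thread (the numpy_features training folder name is an assumption alongside the numpy_features_valid folder you already use): it counts the file-name prefixes in the training folders, so anything besides your own recordings stands out.

```python
from collections import Counter
from pathlib import Path

# Hedged sketch: count which file-name prefixes (speaker/utterance ids) are
# present in the feature folders used for training.  If prefixes other than
# your own recordings appear, Blizzard data is still leaking in.
for split in ('numpy_features', 'numpy_features_valid'):
    files = list(Path('data/blizzard', split).glob('*.npz'))
    prefixes = Counter(f.name.split('_')[0] for f in files)
    print(split, dict(prefixes))
```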

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

OK, I've done that. But what about the 2nd stage of training? Do I need to execute it?

python train.py --noise 1 --expName blizzard --seq-len 1600 --max-seq-len 1600 --data data/blizzard --nspk 1 --lr 1e-4 --checkpoint checkpoints/blizzard_init/bestmodel.pth --epochs 90

from loop.

enk100 avatar enk100 commented on July 17, 2024

Yes, you should execute it with the checkpoint argument:
--checkpoint checkpoints/blizzard_init/bestmodel.pth

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

So it should give me a generated file with the same voice as my dataset, right?

from loop.

enk100 avatar enk100 commented on July 17, 2024

yes, of course

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Thank you so much for the clarification @enk100

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Hi @enk100, I trained on the data and generated an output, but the generated wav file has no sound. See the attachment below.
output2.zip

from loop.

enk100 avatar enk100 commented on July 17, 2024

1/ How many files do you have in your dataset for each speaker?
2/ Are you sure you extracted the features correctly? You can check it by generating from the npz files.
3/ How long did you train? Did you see convergence? Can you share the learning curve?

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024
  1. How many files do you have in your dataset for each speaker?
    A: I have 140 wav files of 1 speaker in my dataset, and 140 txt files.

  2. Are you sure you extracted the features correctly? You can check it by generating from the npz files.
    A: Yes, I extracted them correctly. You can hear the generated npz in the zip file I attached (the file ending with 'orig.wav'). See also the sanity check below.

  3. How long did you train? Did you see convergence? Can you share the learning curve?
    A: First stage of training, 10 epochs; second stage of training, 90 epochs. Where can I see the convergence and the learning curve?
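
Regarding the sanity check in point 2, one quick way (a sketch, not the repo's own tooling) is to open an extracted .npz directly and print the array names and shapes it contains; empty or zero-length arrays would explain silent output.

```python
import numpy as np

# Hedged sketch: list the arrays stored in one extracted feature file.
# The key names inside the archive depend on extract_feats.py, so the
# snippet just prints whatever is there.
feats = np.load('data/blizzard/numpy_features_valid/sj_017.npz')
for key in feats.files:
    print(key, feats[key].shape, feats[key].dtype)
```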

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024
  1. The total duration is 23 mins. What do you mean by "just note that you need to use a model trained on a large number of speakers"? Does that mean I don't have to train from scratch and can just use the model from your paper instead?

Thanks.

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Hi @enk100, I used the data from the vctk corpus for a single speaker. After generation there is no sound.

from loop.

enk100 avatar enk100 commented on July 17, 2024

Hi, you can choose one of these:

  1. Combine your data with the vctk data and train the model from scratch
  2. Take the vctk model and fine-tune it to your new identity - add an embedding vector for your new speaker (see the sketch below)

Good luck.
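
For option 2, here is a minimal sketch of growing the speaker-embedding table of a trained checkpoint by one row before fine-tuning. The checkpoint layout and the parameter name 'encoder.lut_s.weight' are assumptions; inspect your checkpoint's state_dict to find the real name of the speaker-embedding weight.

```python
import torch

# Hedged sketch (not the repo's official procedure): append one row to the
# speaker-embedding table of a trained multi-speaker checkpoint so a new
# speaker id can be fine-tuned on top of it.
ckpt = torch.load('models/vctk/bestmodel.pth', map_location='cpu')
state = ckpt if isinstance(ckpt, dict) else ckpt.state_dict()  # adapt to your checkpoint layout

key = 'encoder.lut_s.weight'                  # ASSUMED name of the speaker-embedding weight
old_emb = state[key]                          # shape: [n_speakers, embedding_dim]
new_row = old_emb.mean(dim=0, keepdim=True)   # start the new voice near the "average" speaker
state[key] = torch.cat([old_emb, new_row], dim=0)

torch.save(state, 'models/vctk/bestmodel_plus_new_spkr.pth')
# Fine-tune from this file with --nspk increased by one, ideally updating
# mainly the new embedding row at first.
```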

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

You mean train it as multi-speaker?

from loop.

enk100 avatar enk100 commented on July 17, 2024

Yes, train it on vctk with the 22 speakers + your data.

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

So I have to run extract_feats.py on the 22 speakers + my data, right?

from loop.

lvenoxi avatar lvenoxi commented on July 17, 2024

@enk100 if I train it on vctk with the 22 speakers + my data, should I set --nspk to 23 in train.py?

from loop.

enk100 avatar enk100 commented on July 17, 2024

@lvenoxi - yes.
@jaxlinksync - no, run extract_feats.py only on your data and then combine the vctk22 features with your data (see the sketch below).
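
A small sketch of the combine step, assuming the repo's layout of .npz feature folders (numpy_features / numpy_features_valid); the source folder data/sj for the new speaker's extracted features is hypothetical.

```python
import shutil
from pathlib import Path

# Hedged sketch: copy the new speaker's extracted .npz files into the vctk
# feature folders so train.py sees all 23 speakers in one dataset.
src = Path('data/sj')    # hypothetical output folder of extract_feats.py for the new speaker
dst = Path('data/vctk')

for split in ('numpy_features', 'numpy_features_valid'):
    (dst / split).mkdir(parents=True, exist_ok=True)
    for npz in (src / split).glob('*.npz'):
        shutil.copy(npz, dst / split / npz.name)
```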

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

What about the norm.dat of the extracted data? Do I have to add it to the norm_info directory as well, and can I name it anything? In my case I named it sj_norm.dat.
So inside my norm_info directory there is:

  1. norm.dat (included when downloading the data in voiceloop)
  2. sj_norm.dat (the norm file generated after extracting my dataset)

from loop.

enk100 avatar enk100 commented on July 17, 2024

It is only relevant when you are going to generate samples. So when you generate vctk, use the vctk norm.dat; when you generate sj, use sj_norm.dat.

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Hi @enk100, thank you so much for your help. One last thing.
By generating samples, do you mean this command?
python generate.py --npz data/vctk/numpy_features_valid/p318_212.npz --spkr 13 --checkpoint models/vctk/bestmodel.pth
How can I pass sj_norm.dat as a parameter?

from loop.

enk100 avatar enk100 commented on July 17, 2024

https://github.com/facebookresearch/loop/blob/c866e8df9b7afdc58460bcae060a3bc0e11a8987/generate.py#L86

Modify this line or add a new argument to the function (see the sketch below).
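
A hedged sketch of that change, adding a new flag instead of editing the hard-coded path; the flag name --norm-path and its default are illustrative, not existing options of generate.py.

```python
import argparse

# Hedged sketch of the extra argument (names and default path are assumptions):
parser = argparse.ArgumentParser(description='generate.py (excerpt)')
parser.add_argument('--npz', type=str, default='', help='Input npz features')
parser.add_argument('--spkr', type=int, default=0, help='Speaker id')
parser.add_argument('--checkpoint', type=str, default='', help='Model checkpoint')
parser.add_argument('--norm-path', type=str, default='norm_info/norm.dat',
                    help='norm.dat matching the dataset you generate from')
args = parser.parse_args()

# In generate.py itself, the value hard-coded around line 86 would then come
# from args.norm_path, e.g. (paths below are illustrative):
#   python generate.py --npz data/sj/numpy_features_valid/sj_017.npz \
#       --spkr 21 --checkpoint checkpoints/vctk/bestmodel.pth \
#       --norm-path norm_info/sj_norm.dat
norm_path = args.norm_path
```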

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Thank you so much @enk100

from loop.

enk100 avatar enk100 commented on July 17, 2024

You're welcome!

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

By the way @enk100, how do I know which speaker ID my new speaker is?

from loop.

enk100 avatar enk100 commented on July 17, 2024

print self.speakers
in https://github.com/facebookresearch/loop/blob/c866e8df9b7afdc58460bcae060a3bc0e11a8987/data.py#L94

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Hi @enk100, you're awesome 😄 thanks.

One last thing: when I generate the voice, which checkpoint should I use?
a. models/vctk/bestmodel.pth
b. checkpoint/(name of expName)/bestmodel.pth

Thank you so much for your help.

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Thank you so much @enk100

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

After I generate the data, this is what I get:
output.zip
The generated output does not match the original wav file.
Here's the command I used to generate:
sudo python generate.py --npz data/vctk/numpy_features_valid/sj_014.npz --spkr 21 --checkpoint checkpoints/vctk_noise_2/bestmodel.pth
The same goes for lastmodel.pth.

Did I miss something? The other speakers are OK, but not ours.

from loop.

enk100 avatar enk100 commented on July 17, 2024

Are you sure your speaker is 21? I guess it should be 22, as vctk has 22 speakers.
Can you get more data for your speaker?

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

Hi @enk100, I tried spkr 22 but it said that speaker did not exist. So I printed the list of speakers as per your suggestion above and got this:

(screenshot of the printed speaker list)

As you can see, the sj speaker is 21.

from loop.

jaxlinksync avatar jaxlinksync commented on July 17, 2024

@enk100 can you please confirm whether our dataset is valid? Please PM me at [email protected] so that I can send you a link to our corpus, if that's OK with you.

from loop.

melaanya avatar melaanya commented on July 17, 2024

Hi @jaxlinksync! Can you please give me some advice: did you succeed in fine-tuning an existing vector to your new identity?

from loop.
