Comments (35)
@jackchinor You will need to uncomment this line: https://gist.github.com/kastnerkyle/cc0ac48d34860c5bb3f9112f4d9a0300#file-extract_feats-py-L1034
You will see that the missing matrices are also commented, so you will need to uncommented too. The only one that needs to remain commented is the code2speaker.
from loop.
solved by:
Install ncurses lib
sudo apt-get install libncurses5-dev libncursesw5-dev
Ubuntu 16
from loop.
@jaxlinksync I solved the problem by manually make and install the package tool.
from loop.
Hi again, Also I check .npz files and they don't contain several files that are in your Vctk data:
- audio_norminfo
- code2char
- text_features
- code2phone
- text_norminfo
Do you know how to build it?
from loop.
Uncomment the required arrays here
from loop.
@jdbermeol @jayavanth did you find an answer to the original question:
In order to combine both datasets, should I have to run the script for each speaker and them combine somehow the norms file, or should I put all data in one folder and process it?
from loop.
@karandwivedi42 No, I don't know the answer yet. Also, the script used to work but now I get this error:
Traceback (most recent call last):
File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/run_merlin.py", line 1175, in
main_function(cfg)
File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/run_merlin.py", line 693, in main_function
acoustic_worker.prepare_nn_data(in_file_list_dict, nn_cmp_file_list, cfg.in_dimension_dict, cfg.out_dimension_dict)
File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/frontend/acoustic_base.py", line 122, in prepare_nn_data
self.prepare_data(in_file_list_dict, out_file_list, in_dimension_dict, out_dimension_dict)
File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/frontend/acoustic_composition.py", line 126, in prepare_data
features, frame_number = io_funcs.load_binary_file_frame(in_file_name, in_feature_dim)
File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/io_funcs/binary_io.py", line 64, in load_binary_file_frame
fid_lab = open(file_name, 'rb')
IOError: [Errno 2] No such file or directory: '/home/ubuntu/loop/preprocessing/latest_features/merlin/egs/build_your_own_voice/s1/experiments/my_new_voice/acoustic_model/data/mgc/*.mgc'
+ echo 'All successfull!! Your demo voice is ready :)'
All successfull!! Your demo voice is ready :)
Feature extraction complete!
Traceback (most recent call last):
File "extract_features.py", line 1411, in
save_numpy_features()
File "extract_features.py", line 853, in save_numpy_features
shutil.copy2(audio_norm_source, audio_norm_dest)
File "/home/ubuntu/miniconda2/envs/loop/lib/python2.7/shutil.py", line 130, in copy2
copyfile(src, dst)
File "/home/ubuntu/miniconda2/envs/loop/lib/python2.7/shutil.py", line 82, in copyfile
with open(src, 'rb') as fsrc:
IOError: [Errno 2] No such file or directory: '/home/ubuntu/loop/preprocessing/latest_features/final_acoustic_data/norm_info_mgc_lf0_vuv_bap_63_MVN.dat'
from loop.
@adampolyak, @ytaigman. Hi, So I have been able to run the extract_features script for a speaker on the VCTK dataset. However, each run is going to create a norm_infor folder. So I go back to my original question. Is there a way to combine the output of each norm_info folder. Or should I create a big folder with all samples and run the script using that folder?
from loop.
-
norm_info_mgc_lf0_vuv_bap_63_MVN.dat is indeed the correct norm file.
-
Both ways are valid. The norm file contains mean and std of the dataset -
see the generation code. You can merge the statistics or just run the script on the merged folder.
from loop.
@jdbermeol I want to train my own data set, but I found it too complicate to do it. Should I firstly run the install_tts.py? and then run the extract_features script? when I run the install_tts.py , it occurred an error as follow:
Traceback (most recent call last):
File "install_tts.py", line 174, in
pe(untar_cmd)
File "install_tts.py", line 114, in pe
for line in execute(cmd, shell=shell):
File "install_tts.py", line 107, in execute
raise subprocess.CalledProcessError(return_code, cmd)
subprocess.CalledProcessError: Command '['tar', 'xzf', '/tmp/kastner/speech_synthesis/speech_tools-2.4-release.tar.gz']' returned non-zero exit status 2
I don't know how to fix it . Could you help me with it? really appreciate
from loop.
@jackchinor in the install_tts.py, you can find you need to download kk_all_deps.tar.gz or install some tools first
from loop.
@dengbingfeng I created the kk_all_deps.tar.gz, but run the install_tts.py file, it doesn't work, an error occured
from loop.
@jdbermeol finally ,I generate .npz files , but some of the .npy files are not contained, just the same with you.
1.audio_norminfo
2.code2char
3.text_features
4.code2phone
5.text_norminfo
Can you tell me how to build them?Really appreciate...
from loop.
@jdbermeol I see, thank you so much.
from loop.
I am trying to run extract_feats.py
on the complete VCTK dataset by following the advice from this comment above, to put all the wav files and text in one big directory and run the script on them.
Before I get to the part of save_dict
saving all features, I'm getting the following error.
Feature extraction complete!
Traceback (most recent call last):
File "extract_feats.py", line 1440 in <module>
save_numpy_features()
File "extract_feats.py", line 1020 in save_numpy_features
assert phonemes[0] == 'pau'
IndexError: tuple index out of range
My best guess is that the phonemes tuple is not being created properly. Any suggestions or am I making some common mistake?
from loop.
hi @ankitmishra262, great question, same happens to me, I could not solve it, I have to restrict my self to the subsample of speaker Facebook team use in the paper.
from loop.
I'm having a problem in running this https://gist.github.com/kastnerkyle/001a58a58d090658ee5350cb6129f857
to run extract_feats.py. Any ideas where I can download the kk_all_deps.tar.gz?
from loop.
@jackchinor how did you solve this?
from loop.
I created my own kk_all_deps.tar.gz but now encountered a new error.
Makefile:102: recipe for target 'ch_lab' failed
config/rules/targets.mak:55: recipe for target 'main' failed
Traceback (most recent call last):
File "install_tts.py", line 183, in <module>
pe(make_cmd)
File "install_tts.py", line 114, in pe
for line in execute(cmd, shell=shell):
File "install_tts.py", line 107, in execute
raise subprocess.CalledProcessError(return_code, cmd)
subprocess.CalledProcessError: Command '['make', '-j', '4']' returned non-zero exit status 2
from loop.
from loop.
Hi @ankitmishra262, what ubuntu are you using and the GPU?
mine is ubuntu 16, and cuda 8.0
from loop.
I'm running ubuntu 14.04 with cuda 8.0 @jaxlinksync
from loop.
Hi @jdbermeol,
how did you solve this? #11 (comment)
I now came up with the same error.
from loop.
Hi @ankitmishra262
Everything went smoothly until I run the extract_feats.py. I have this error
Traceback (most recent call last):
File "extract_feats.py", line 1406, in <module>
save_numpy_features()
File "extract_feats.py", line 853, in save_numpy_features
shutil.copy2(audio_norm_source, audio_norm_dest)
File "/usr/lib/python2.7/shutil.py", line 130, in copy2
copyfile(src, dst)
File "/usr/lib/python2.7/shutil.py", line 82, in copyfile
with open(src, 'rb') as fsrc:
IOError: [Errno 2] No such file or directory: '/home/jax/latest_features/final_acoustic_data/norm_info_mgc_lf0_vuv_bap_63_MVN.dat'
from loop.
@jaxlinksync , may I know how you fix the below make issue when running install_tts.py?
Makefile:102: recipe for target 'ch_lab' failed
config/rules/targets.mak:55: recipe for target 'main' failed
Traceback (most recent call last):
File "install_tts.py", line 183, in
pe(make_cmd)
File "install_tts.py", line 114, in pe
for line in execute(cmd, shell=shell):
File "install_tts.py", line 107, in execute
raise subprocess.CalledProcessError(return_code, cmd)
subprocess.CalledProcessError: Command '['make', '-j', '4']' returned non-zero exit status 2
when I try to execute 'make' in the dir directly , some error happened:
Making in directory ./siod ...
Making in directory ./rxp ...
Making in directory ./audio ...
Making in directory ./utils ...
Making in directory ./base_class ...
Making in directory base_class/string ...
Making in directory base_class/inst_tmpl ...
Making in directory ./ling_class ...
Making in directory ./speech_class ...
Making in directory ./sigpr ...
Making in directory sigpr/pda ...
Making in directory ./stats ...
Making in directory stats/wagon ...
Making in directory stats/kalman_filter ...
Making in directory ./grammar ...
Making in directory grammar/scfg ...
Making in directory grammar/wfst ...
Making in directory grammar/ngram ...
Making in directory ./intonation ...
Making in directory intonation/tilt ...
Making in directory ./lib ...
Making in directory ./main ...
gcc -O3 -Wall -o ch_lab ch_lab_main.o -L../lib -lestools -L../lib -lestbase -L../lib -leststring -lcurses -ldl -lncurses -lm -lstdc++
/usr/bin/ld: cannot find -lcurses
/usr/bin/ld: cannot find -lncurses
collect2: error: ld returned 1 exit status
Makefile:102: recipe for target 'ch_lab' failed
make[1]: *** [ch_lab] Error 1
config/rules/targets.mak:55: recipe for target 'main' failed
make: *** [main] Error 2
from loop.
@jdbermeol , @ankitmishra262 ,
Have you successfully extract the features from vctk data and train a speakable model?
I use extrct_feats.py to deal with the same data set got from download_data.sh , but the shape of feature['audio_feature'] is different with the ones downloaded.
when I used the feature files extracted locally to train a model , it could not speak, only output silences. Do you have the same problem?
from loop.
@hepower I was able to make it work, but it was not easy. Beside of doing manually all the setup of libraries. I had to use the same voice actors as they use in the download_data.sh, when I try to do it with the complete VCTK dataset it did not work.
from loop.
@jdbermeol are you training the speakable model with the feature files downloaded with download_data.sh or you create them yourself locally with extract_feats.py?
When I extract the same actors locally, I notice that the shape of the features are not same with the downloaded one.
About your issue. I suggest you do not deal with the complete VCTK dataset, you may change some actors to see if it works.
from loop.
I have the exact same issue as in #11 (comment)
somehow the norm_info does not get generated, but I can't find out why. I can't even find where it would be generated.
@jaxlinksync did you get it to work?
@jackchinor what do you mean by make the package tools manually. Which tools do you mean SPTK,WORLD or merlin?
(I am on mac btw)
from loop.
@jdbermeol how did you solve this problem: #11 (comment) ??
from loop.
@ghostcow I left this project a few months, and my memory is really bad, something important I remember is that Facebook team was not using all the speakers on the Dataset. When I restrict myself to the subsample of speaker Facebook team use in the paper I was able to fix some of the issues. Not sure if exactly that one, but it was helpfull.
from loop.
@jaxlinksync how did you fixed below issue?
Traceback (most recent call last):
File "extract_feats.py", line 1406, in <module>
save_numpy_features()
File "extract_feats.py", line 853, in save_numpy_features
shutil.copy2(audio_norm_source, audio_norm_dest)
File "/usr/lib/python2.7/shutil.py", line 130, in copy2
copyfile(src, dst)
File "/usr/lib/python2.7/shutil.py", line 82, in copyfile
with open(src, 'rb') as fsrc:
IOError: [Errno 2] No such file or directory: '/home/jax/latest_features/final_acoustic_data/norm_info_mgc_lf0_vuv_bap_63_MVN.dat'
from loop.
@jaxlinksync how did you solve the following error you metioned above, I also tried run make on terminal, but it didn't work
Makefile:102: recipe for target 'ch_lab' failed
config/rules/targets.mak:55: recipe for target 'main' failed
Traceback (most recent call last):
File "install_tts.py", line 183, in
pe(make_cmd)
File "install_tts.py", line 114, in pe
for line in execute(cmd, shell=shell):
File "install_tts.py", line 107, in execute
raise subprocess.CalledProcessError(return_code, cmd)
subprocess.CalledProcessError: Command '['make', '-j', '4']' returned non-zero exit status 2
from loop.
I had the subprocess.CalledProcessError: Command '['make', '-j', '4']' returned non-zero exit status 2
error and it seems that installing ncurses with sudo apt-get install libncurses5-dev libncursesw5-dev
(per above) solved it. Now in running install_tts.py
I get the following error instead:
Traceback (most recent call last):
File "install_tts.py", line 362, in <module>
pe(configure_cmd)
File "install_tts.py", line 116, in pe
for line in execute(cmd, shell=shell):
File "install_tts.py", line 107, in execute
raise subprocess.CalledProcessError(return_code, cmd)
subprocess.CalledProcessError: Command '['./configure', '--with-fest-search-path=/home/paperspace/voiceloop/test/speech_synthesis/festival/examples', '--with-sptk-search-path=/home/paperspace/voiceloop/test/speech_synthesis/SPTK-3.9/bin', '--with-hts-search-path=/home/paperspace/voiceloop/test/speech_synthesis/htk/HTKTools', '--with-hts-engine-search-path=/home/paperspace/voiceloop/test/speech_synthesis/hts_engine_API-1.10/bin']' returned non-zero exit status 1
from loop.
Command had to be with sudo
(sudo python2 install_tts.py
)
from loop.
Related Issues (20)
- Out of memory in validation step HOT 1
- Parameters for dataset in the wild HOT 12
- Main Readme wav files are missing and first instruction doesn't work HOT 1
- Using pre-trained model for new speaker?
- No matching distribution found for phonemizer (from -r scripts/requirements.txt (line 5)) HOT 1
- Issue running install_tts.py to preprocess data HOT 1
- Error running train.py HOT 1
- Error when 'make' HTK-3.4.1 and hts_core.
- Train VCTK dataset for all speakers
- bash scripts/download_tools.sh failed on Mac OS
- ERROR: Failed to find norm file. HOT 7
- ImportError: No module named torch
- How this repo compared to Merlin?
- Understanding feat tensor dimensions HOT 1
- Look like it fails on '!' character.
- Strange fail on "The quick brown fox jumps over the lazy dog."
- TBPTTIter.split_length() error HOT 1
- Batch
- hello world text
- Block on preprocessing
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from loop.