Comments (3)
Okay, I've made some headway:
CONFIG_PATH=configs/config_ljs_radtts.json
RADTTS_PATH=data/radtts/radtts++ljs-dap.pt
HG_PATH=data/hifigan_libritts100360_generator0p5.pt.zip
HG_CONFIG_PATH=data/hifigan_libritts/hifigan_22khz_config.json
TEXT_PATH=test.txt
python inference.py -c $CONFIG_PATH -r $RADTTS_PATH \
-v $HG_PATH -k $HG_CONFIG_PATH -t $TEXT_PATH -s ljs \
--speaker_attributes ljs --speaker_text ljs -o results/
But there's a mismatch:
  File "/home/jaggz/venv/ttsrad/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1671, in load_state_dict
    raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for RADTTS:
	size mismatch for context_lstm.weight_ih_l0: copying a param with shape torch.Size([2080, 1044]) from checkpoint, the shape in current model is torch.Size([2080, 1040]).
...
Full output is:
/home/jaggz/opt/src/tts/radtts/nvidia/common.py:391: UserWarning: torch.qr is deprecated in favor of torch.linalg.qr and will be removed in a future PyTorch release.
The boolean parameter 'some' has been replaced with a string parameter 'mode'.
Q, R = torch.qr(A, some)
should be replaced with
Q, R = torch.linalg.qr(A, 'reduced' if some else 'complete') (Triggered internally at ../aten/src/ATen/native/BatchLinearAlgebra.cpp:2349.)
W = torch.qr(torch.FloatTensor(c, c).normal_())[0]
/home/jaggz/venv/ttsrad/lib/python3.9/site-packages/torch/functional.py:1682: UserWarning: torch.lu is deprecated in favor of torch.linalg.lu_factor / torch.linalg.lu_factor_ex and will be removed in a future PyTorch release.
LU, pivots = torch.lu(A, compute_pivots)
should be replaced with
LU, pivots = torch.linalg.lu_factor(A, compute_pivots)
and
LU, pivots, info = torch.lu(A, compute_pivots, get_infos=True)
should be replaced with
LU, pivots, info = torch.linalg.lu_factor_ex(A, compute_pivots) (Triggered internally at ../aten/src/ATen/native/BatchLinearAlgebra.cpp:1915.)
return torch._lu_with_info(A, pivot=pivot, check_errors=(not get_infos))
Loading vocoder: data/hifigan_libritts100360_generator0p5.pt.zip
Applying spectral norm to text encoder LSTM
Applying spectral norm to context encoder LSTM
Traceback (most recent call last):
  File "/home/jaggz/opt/src/tts/radtts/nvidia/inference.py", line 203, in <module>
    infer(args.radtts_path, args.vocoder_path, args.config_vocoder,
  File "/home/jaggz/opt/src/tts/radtts/nvidia/inference.py", line 97, in infer
    radtts.load_state_dict(state_dict, strict=False)
  File "/home/jaggz/venv/ttsrad/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1671, in load_state_dict
    raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for RADTTS:
	size mismatch for context_lstm.weight_ih_l0: copying a param with shape torch.Size([2080, 1044]) from checkpoint, the shape in current model is torch.Size([2080, 1040]).
	size mismatch for context_lstm.weight_ih_l0_reverse: copying a param with shape torch.Size([2080, 1044]) from checkpoint, the shape in current model is torch.Size([2080, 1040]).
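Note that the traceback shows load_state_dict being called with strict=False. That flag only tolerates missing or unexpected keys; a parameter that exists on both sides with a different shape still raises. Both mismatched tensors have 4 more input columns in the checkpoint (1044) than in the model built from the config (1040), which suggests, though the log alone does not prove it, that the config file and the checkpoint describe slightly different models. A minimal sketch for listing such differences before loading is below. The helper name find_shape_mismatches is hypothetical and not part of RADTTS; the shapes are copied from the error above, and the sketch uses plain tuples so it runs without PyTorch.

```python
from typing import List, Mapping, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    checkpoint_shapes: Mapping[str, Shape],
    model_shapes: Mapping[str, Shape],
) -> List[str]:
    """Return one report line per parameter present in both mappings
    whose shape differs. Hypothetical helper, not part of RADTTS."""
    report = []
    # Only names present on both sides can produce a size mismatch.
    for name in sorted(checkpoint_shapes.keys() & model_shapes.keys()):
        ckpt_shape = checkpoint_shapes[name]
        model_shape = model_shapes[name]
        if ckpt_shape != model_shape:
            report.append(
                f"{name}: checkpoint {ckpt_shape} vs model {model_shape}"
            )
    return report


# Shapes copied from the traceback above. With PyTorch available, such a
# mapping could be built from any state dict as
#   {k: tuple(v.shape) for k, v in state_dict.items()}
checkpoint_shapes = {
    "context_lstm.weight_ih_l0": (2080, 1044),
    "context_lstm.weight_ih_l0_reverse": (2080, 1044),
}
model_shapes = {
    "context_lstm.weight_ih_l0": (2080, 1040),
    "context_lstm.weight_ih_l0_reverse": (2080, 1040),
}

mismatches = find_shape_mismatches(checkpoint_shapes, model_shapes)
for line in mismatches:
    print(line)
```

If every reported line differs by the same amount in the same dimension, as here, a config option that changes the width of the context LSTM input is a reasonable first thing to check.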
Did you ever get this solved? I'm seeing the same thing.
Never mind. This looks like a duplicate of #6. Solution here: #6 (comment)