A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
and could U pls share the the structure of directory “datasets” ,it's differece between your script dataset_path = osp.join(datasets_path, dataset_dir) wavfile_path = osp.join(dataset_path, "wavs") melspec_path = osp.join(dataset_path, "mels")
and office data of BiaoBei PhoneLabeling ProsodyLabeling Wave
Traceback (most recent call last):
File "/home/gaol/codes/Voices/FCH-TTS/train-parallel.py", line 69, in
loggers=loggers
File "/home/gaol/codes/Voices/FCH-TTS/helpers/trainer.py", line 319, in fit
valid_losses = self._validate(valid_loader)
File "/home/gaol/codes/Voices/FCH-TTS/helpers/trainer.py", line 419, in _validate
loss.item(), l1_loss.item(), ssim_loss.item(), drn_loss.item()
AttributeError: 'float' object has no attribute 'item'