Hi, how can I instantiate an object of the SpeechT5 model in a Pytorch code file, and

How to load the pretrained models in pytorch about speecht5 HOT 5 CLOSED

microsoft commented on May 17, 2024

How to load the pretrained models in pytorch

from speecht5.

Comments (5)

Ajyy commented on May 17, 2024 2

Hi,

I have updated the steps to instantiate the model and load the checkpoint in here. Thanks.

from speecht5.

Ajyy commented on May 17, 2024 1

Hi, I'm glad that it helps you.

Yes, if you just want to load the model, you only need to put the dictionary under the paths. More concretely, you need to put the text dictionary under data and the pseudo-code dictionary under hubert_label_dir since they are needed to set up the task in here.

The pseudo-code dictionary can be created by the code here, where n_clusters is 500. The text dictionary can be downloaded in here.

You may need to follow the dataset code for preparing some dummy inputs and doing forward passes.

Thanks!

from speecht5.

ayushtues commented on May 17, 2024

Hi, thanks for the quick reply and for providing the instructions! I had a few more questions

In the updated code we need access to hubert_label_dir, and data here to create the task object which is used while defining the model architecture:

checkpoint['cfg']['task'].t5_task = 'pretrain'
checkpoint['cfg']['task'].hubert_label_dir = "/path/to/hubert_label"
checkpoint['cfg']['task'].data = "/path/to/tsv_file"

task = SpeechT5Task.setup_task(checkpoint['cfg']['task'])
model = T5TransformerModel.build_model(checkpoint['cfg']['model'], task)

Are there small dummy files which can be used here, or a way to define the model architecture without these files?

I just want to load the model using the SpeechT5 Base pretrained weights provided in the Readme (here) to inspect it, and maybe do some forward passes on dummy inputs, is it necessary to download the data for this (which is pretty huge)?

Thanks in advance!

from speecht5.

ayushtues commented on May 17, 2024

Thanks a lot, this helped me load the model!

The pseudo-code dictionary code is here for future reference for anyone, the link above was referring to the task code.

from speecht5.

Ajyy commented on May 17, 2024

Oh yes, sorry for the mistake. If you have further problems, please tell me.

from speecht5.

Recommend Projects

How to load the pretrained models in pytorch about speecht5 HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent