I was running the Pretrained Version in Pytorch, by downloading the files in googl

Hello. I would propose that this (<a class="issue-link js-issue-link" data-error-text=

Error in Pretrained Version about arae HOT 2 OPEN

jakezhaojb commented on August 26, 2024

Error in Pretrained Version

from arae.

Comments (2)

kellywzhang commented on August 26, 2024

Hello, so I replicated this issue when using the newest version of PyTorch. The issue seems to be just differences in the default dimensions (whether they're kept or squeezed). I didn't try it with the pre-trained model, but changing the generate method in model.py to the following fixed it for me:

    def generate(self, hidden, maxlen, sample=True, temp=1.0):
            """Generate through decoder; no backprop"""

            batch_size = hidden.size(0)

            if self.hidden_init:
                    # initialize decoder hidden state to encoder output
                    state = (hidden.unsqueeze(0), self.init_state(batch_size))
            else:
                    state = self.init_hidden(batch_size)

            # <sos>
            self.start_symbols.data.resize_(batch_size)
            self.start_symbols.data.fill_(1)

            embedding = self.embedding_decoder(self.start_symbols)
            inputs = torch.cat([embedding, hidden], 1).unsqueeze(1)

            # unroll
            all_indices = []
            for i in range(maxlen):
                    output, state = self.decoder(inputs, state)
                    overvocab = self.linear(output.squeeze(1))

            if not sample:
                    vals, indices = torch.max(overvocab, 1)
            else:
                    # sampling
                    probs = F.softmax(overvocab/temp)
                    indices = torch.multinomial(probs, 1).squeeze(1)

            all_indices.append(indices.unsqueeze(1))

            embedding = self.embedding_decoder(indices)
            inputs = torch.cat([embedding, hidden], 1).unsqueeze(1)

            max_indices = torch.cat(all_indices, 1)

            return max_indices`

I'm thinking to make an update to the code at some point, but not yet as I don't want to change something that would lead to problems when running with the old version of PyTorch.

Let me know if you run into other issues. Best!

from arae.

JulesGM commented on August 26, 2024

Hello. I would propose that this (#8 (comment)) be pushed to the main version, if you so desire, because it's been a while.
thanks for the help!

from arae.

Recommend Projects

Error in Pretrained Version about arae HOT 2 OPEN

Comments (2)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent