

Runtime error (arae issue, CLOSED)

jakezhaojb commented on June 25, 2024

Runtime error

from arae.

Comments (4)

wj926 commented on June 25, 2024

I solved this problem by using --batch_size 300


kellywzhang commented on June 25, 2024

Hello! I've run into the same issue, and it seems to depend on the version of PyTorch you're using. The code at line 202 in models.py, hidden = torch.div(hidden, norms.expand_as(hidden)), worked in the last major release of PyTorch. For the newest release of PyTorch I had to change it to hidden = torch.div(hidden, norms.unsqueeze(1).expand_as(hidden)).

Basically, this section of the code computes the L2 norm of each hidden vector (code) in the batch, then divides each hidden vector by its L2 norm to turn it into a unit vector. The problem is purely one of PyTorch syntax: the norm tensor needs an extra dimension before it can be expanded to the shape of hidden for the division.
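The normalization described here can be sketched as follows. This is a minimal illustration rather than the actual code in models.py: the helper name normalize_rows, the example values, and the way norms is computed (torch.norm over dimension 1) are assumptions, and only the torch.div line is taken from the comment above.

```python
import torch


def normalize_rows(hidden: torch.Tensor) -> torch.Tensor:
    """Scale each row of a (batch, hidden_size) tensor to unit L2 norm."""
    # Assumption: norms are taken per row. Without keepdim, the result
    # has shape (batch,) on PyTorch 0.2.0 and later.
    norms = torch.norm(hidden, p=2, dim=1)
    # unsqueeze(1) restores the reduced dimension, giving (batch, 1),
    # which can then be expanded to (batch, hidden_size) for the division.
    return torch.div(hidden, norms.unsqueeze(1).expand_as(hidden))


# Hypothetical example values: two hidden vectors of size 2.
hidden = torch.tensor([[3.0, 4.0], [0.0, 2.0]])
unit = normalize_rows(hidden)
```

Each row of unit should have an L2 norm of 1, which is a quick sanity check to run after applying the fix.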

Let me know if you have any more issues related to this.


shaform commented on June 25, 2024

Thanks, this indeed solved the problem.
For reference, in v0.1.12 torch.norm always keeps the reduced dimension:

The output Tensor is of the same size as input except in the dimension dim where it is of size 1.
http://pytorch.org/docs/0.1.12/torch.html?highlight=norm#torch.norm

But in v0.2.0, the behavior was changed:

If keepdim is true, the output Tensor is of the same size as input except in the dimension dim where it is of size 1. Otherwise, dim is squeezed.
http://pytorch.org/docs/0.2.0/torch.html?highlight=norm#torch.norm
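The shape difference quoted above can be checked with a small sketch, assuming PyTorch 0.2.0 or later, where torch.norm accepts keepdim. The tensor sizes are arbitrary illustration values and are not taken from the arae code.

```python
import torch

x = torch.ones(4, 3)  # hypothetical batch of 4 vectors of size 3

# Default (keepdim=False): the reduced dimension is squeezed away.
squeezed = torch.norm(x, p=2, dim=1)            # shape (4,)

# keepdim=True: the reduced dimension is kept with size 1,
# matching the v0.1.12 behavior quoted above.
kept = torch.norm(x, p=2, dim=1, keepdim=True)  # shape (4, 1)

# A (4, 1) tensor broadcasts against (4, 3), so no unsqueeze is needed.
normalized = x / kept

# A (4,) tensor cannot be expanded to (4, 3): sizes are matched from the
# last dimension, so 4 is compared against 3 and the call fails.
try:
    squeezed.expand_as(x)
    expand_failed = False
except RuntimeError:
    expand_failed = True
```

One consequence of the same alignment rule: when the batch size happens to equal the vector size, expanding the squeezed norms raises no error, but each norm then lines up with a column rather than a row, so the division no longer normalizes per vector.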


JulesGM commented on June 25, 2024

#9 (comment) I think this should be uncommented by default now.. it's been almost a year

