azadyasar / neuralmachinetranslation
PyTorch implementation of NMT models along with custom tokenizers, models, and datasets
The wrong test file is specified in the README:
python -m nmt evaluate --test_dataset ../data/test.csv
=>
python -m nmt evaluate --test_dataset ../data/eng-tur-test.csv
The correct file is given in https://towardsdatascience.com/neural-machine-translation-inner-workings-seq2seq-and-transformers-229faff5895b.
Could you please tell me how to generate the .model files? And what is the en_sp.vocab file for? Thanks.
I was wondering: how does the decoder work at inference time? Since the transformer takes in an entire batch (tensor) and therefore outputs an entire tensor, it seemed to me that, unlike an RNN, it can't consume its own predictions without re-running the decoder every time it generates a token (it first takes the start token and generates its first prediction, then takes the start token plus that prediction, and so on). This process didn't seem vectorized, which worried me. Regardless, is this how it's done (e.g. in PyTorch)?
I guess that's how it's done since your code does it:
and so does the official pytorch tutorial code:
def evaluate(eval_model, data_source):
    eval_model.eval()  # Turn on the evaluation mode
    total_loss = 0.
    src_mask = eval_model.generate_square_subsequent_mask(bptt).to(device)
    with torch.no_grad():
        for i in range(0, data_source.size(0) - 1, bptt):
            data, targets = get_batch(data_source, i)
            if data.size(0) != bptt:
                src_mask = eval_model.generate_square_subsequent_mask(data.size(0)).to(device)
            output = eval_model(data, src_mask)
            output_flat = output.view(-1, ntokens)
            total_loss += len(data) * criterion(output_flat, targets).item()
    return total_loss / (len(data_source) - 1)
https://pytorch.org/tutorials/beginner/transformer_tutorial.html
So, as of 2021, is there no way to vectorize this?
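For reference, greedy decoding with a standard encoder-decoder transformer is inherently sequential over the output length (each step conditions on the tokens generated so far), but every step is fully vectorized over the batch, and the encoder runs only once. A minimal sketch with torch.nn.Transformer (the BOS/EOS ids, model sizes, and length cap here are illustrative assumptions, not the repo's actual values):

```python
import torch
import torch.nn as nn

BOS, EOS, VOCAB, D = 1, 2, 32, 16  # illustrative token ids and sizes
model = nn.Transformer(d_model=D, nhead=4, num_encoder_layers=1,
                       num_decoder_layers=1, batch_first=True)
embed = nn.Embedding(VOCAB, D)
proj = nn.Linear(D, VOCAB)
model.eval()

with torch.no_grad():
    src = torch.randint(0, VOCAB, (1, 5))   # (batch, src_len)
    memory = model.encoder(embed(src))      # encode once, reuse every step

    ys = torch.tensor([[BOS]])              # start with the BOS token
    for _ in range(10):                     # re-run the decoder per new token
        mask = model.generate_square_subsequent_mask(ys.size(1))
        out = model.decoder(embed(ys), memory, tgt_mask=mask)
        # Only the logits at the last position matter for the next token.
        next_tok = proj(out[:, -1]).argmax(-1, keepdim=True)
        ys = torch.cat([ys, next_tok], dim=1)
        if next_tok.item() == EOS:
            break
    print(ys)
```

The loop itself can't be collapsed (step t depends on step t-1's output), but batching many sentences, caching the encoder memory, and with extra bookkeeping caching the decoder's key/value states all keep the per-step work vectorized.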
btw, thanks for sharing your awesome code and blog! :)
related: https://www.quora.com/unanswered/Do-transformers-use-their-own-output-during-inference-time