The deeplearning_hw2 from nodiff-229

deeplearning_hw2's Introduction

We provide the start code under ./src folder for this homework. You need to modify model.py and main.py to construct and train your model. For model.py, we have provided the definition of RNN encoder and decoder. You need to add your RNN and Transformer model codes. For main.py, we have provided the dataloader to pre-process text data into tokens. You can use the data_loader.get_batch() to get a mini-batch of data. Before using the loader, you need to set the train and valid flags. Details are in data.py. You need to write model building code, the evaluation function and train function in main.py.

You also need to complete mha.py and you can check if you implement multi-head attention correctly by directly running:

python mha.py

Recommend Projects

nodiff-229 / deeplearning_hw2 Goto Github PK

deeplearning_hw2's Introduction

deeplearning_hw2's People

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent