Workflow:
-
Manually remove data-bin and checkpoints.
-
Preprocess by running the following:
$ bash preprocess.sh spa.txt
-
Manually assure that all files don't have empty lines.
-
Train by running:
$ bash train.sh -a <EncoderEmbedDim> -b <DecoderEmbedDim> -c <DecoderOutEmbedDim> -d <EncoderHiddenSize> -e <DecoderHiddenSize>
-
Generate predictions by running:
$ bash generate.sh <predictions_DATE.txt>
-
Run an error analysis by running:
$ python error_analysis.py <predictions_DATE.txt> <errors_DATE.txt>