This project implements the Tranformer network as created by Vaswani et al. (2017).
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin (2017). Attention Is All You Need
The Transformer network is then used for named-entity recognition (NER) and for question answering (QA).