PyTorch implementation of the Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning by Lu et. al.
Work in Progress.
- Implement normal LSTM-Attention
- Greedy + Sampling Decoder
- Layer Normalization
- [] Debug Sentinel-LSTM
- [] Implement Beam Search
- [] Tune hyperparameters