Yuanmin's Projects
Bilinear attention networks for visual question answering
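Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
A roundup of 2020 CS summer camps for recommendation-based graduate admission (保研). Everyone is welcome to share summer camp information. How about supporting the open spirit of the internet?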
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
PyTorch implementation of Image captioning with Bottom-up, Top-down Attention
A library for Multilingual Unsupervised or Supervised word Embeddings
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
Automatic image captioning model based on Caffe, using features from bottom-up attention.
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".