Yaya Shi's Projects
3D ResNets for Action Recognition (CVPR 2018)
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Automatic video description generation with GPU training
Code for the ACL paper "No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling"
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Improving Convolutional Networks via Attention Transfer (ICLR 2017)
A curated list of image captioning and related area resources. :-)
Reading list for research topics in Masked Image Modeling
BERT score for text generation
Scene Graph Parsing as Dependency Parsing
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Caffe
An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)
Python code for CIDEr - Consensus-based Image Caption Evaluation
Contrastive Language-Image Pretraining
Two coco_caption evaluation implementations that can be used with Python 3
Data Release for VALUE Benchmark
Repo for "Can GCNs Go as Deep as CNNs?"
A paper list of object detection using deep learning.
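Deep Learning 500 Questions: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in a question-and-answer format, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued. For collaboration, contact [email protected]. All rights reserved; infringement will be pursued. Tan 2018.06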
This repository contains implementations and illustrative code to accompany DeepMind publications
Official TensorFlow implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" (CVPR 2018), with code, model, and prediction results.
Implementation of the paper https://arxiv.org/abs/2210.04559
Code for "Discriminability objective for training descriptive captions" (CVPR 2018)