Light

zhjohnchan / awesome-image-captioning Goto Github PK

View Code? Open in Web Editor NEW

1.0K 1.0K 186.0 126 KB

A curated list of image captioning and related area resources. :-)

awesome-image-captioning's Introduction

Hi there. I'm Zhihong 👋

awesome-image-captioning's People

Contributors

Stargazers

Watchers

Forkers

stoensin iamal1 canadalynx nemonameless dreadlord1984 benjamesbabala jdc08161063 awesome-archive guanlongtianzi jordan-5i wanjinchang czhxiaohuihui yuhaoyesteve jason08 xinlongxiao wlybug song-heng quartz0714 cfh3c fireae bcui6611 amirunpri2018 mymuli lingyuliu liyazhou96 forence stc-cqupt happynewya kldcr pgsrv angleboy8 lifegwt terima-tang yinglv1106 phamvanlinh143 tong888 yujingmarkjiang akrusher pinglmlcv alibabadoufu runcode90 dong-jinkim y78h11b09 primekun zhangbinxy spartag117 kevin23916 wenhengqiu azuredsky stevenji tophk zwq2018 nightfury0126 zhangbei123 willamjie shibinzhou lumen2018 kdongyi liutengjun seaofocean autogyro rmfkxhxh sangminwoo acodec sszbuu taaccoo-beta junleecus bmei1314 michael-hsu qujundong share424 minglangqiao qasimyu cinkkkyo kanji95 hxh2h jiyali wanggangalanhe formzero karenyyy yongsongh gladcolor t-mac-curry dr-zhuang ibrahim85 wenjiaxu sue2415535899 nikolausn chagmgang hellomickey 2samgu2 cnrblm wonnerky haiboku233 forrest-gan veb-101 maodong2056 ankitshah009 mitjanikolaus mingshuangwu

awesome-image-captioning's Issues

Code link for 2019 paper

The 11th paper in 2019 actually has the GitHub code and Dataset link.
https://github.com/furkanbiten/GoodNews

some papers missing

Hi.....Thanks for this awesome page!
There are some papers in 2018 from ECCV which are not included. Perhaps you can add them.

Exploring Visual Relationship for Image Captioning
Boosted Attention: Leveraging Human Attention for Image Captioning
Stylized Image Captioning with Adaptive Learning and Attention

In Fact, the first one is the state-of-art.

Regards

Informative Image Captioning with External Sources of Information

I really need this paper's code. Could you please help me? Thank you very much!!

Chinese image caption， In the result, multiple words of the same type appear

Hello, I am using the COCO dataset,
A two-layer LSTM model, one layer for top-down attention, and one layer for language models.

Extracting words with jieba
I used all the words in the picture description that occurred more than 3 times as a dictionary file, and a total of 14,226 words.
words = [w for w in word_freq.keys () if word_freq [w]> 3]

After training the model, when using it, multiple words of the same type appear in the result, such as:

Note notebook laptop computer on bed
A little girl little girl girl standing together

How can I solve this problem?

Learning like a Child with wrong publisher

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images is published in ICCV 2015, not CVPR2015

adding papers

Proposed updates for 2021 and 2022

Proposed updates for 2021 and 2022
You can refer to another paper collation work in the field of image registration 😄

live/webcam captioning?

Do you know any recent working projects that allow live/webcam captioning? I'm an artist and would like to use such a project to allow people to play with, and explore machine learning.

The link to the 2020 paper is useless

The link to the 2020 paper is useless

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.