Light

orangeyhchen / tipcb Goto Github PK

View Code? Open in Web Editor NEW

32.0 1.0 9.0 31 KB

Python 100.00%

tipcb's Introduction

TIPCB

Prerequisites

Pytorch 1.1
cuda 9.0
python 3.6
GPU Memory>=12G

Datasets

We evaluate our method on CUHK-PEDES. Please visit Here.

Usage

You need to generate tokens to "/data/BERT_encode/" by running "BERT_token_64.py" or downloading from Here.
If you want to train the network, you can run our code with the following commands:

python train.py --max-length 64 --batch-size 64 --num-epoches 80 --adam-lr 0.003 --gpus 0

You can download our trained model from Here and the trained log from Here.

Evaluate

Top-1	Top-5	Top-10
63.63	82.81	89.01

tipcb's People

Contributors

Stargazers

Watchers

Forkers

y-claw tragedyn liuliu408 liviust yqgao716 zhangweifeng1218 galaxfyos runtome zhguoqing

tipcb's Issues

Unable to get the results in ICFG-PEDES

When using TIPCB network to train ICFG-PEDES dataset, top-1=51.01%, which is three points lower than that reported in TIPCB. I was wondering if there was something wrong with the way I handled the raw dataset. So can you provide me your BERT_id_train_64_new.npz and BERT_id_test_64_new.npz files in ICFG-PEDES dataset and provide details of parameters? Thank you.
My email is [email protected]

Bug exists in evaluation

TIPCB/test_model.py

Lines 40 to 42 in 93aa7c5

 # we input the two times of images, so we need to select half of them 

 ac_top1_t2i, ac_top5_t2i, ac_top10_t2i, mAP = test_map(text_bank, labels_bank, images_bank[::2], labels_bank[::2]) 

 return ac_top1_t2i, ac_top5_t2i, ac_top10_t2i, mAP

Not all images in the test split are loaded two times. Your process is arbitrary.

In other words, there are 3074 unique images and 6156 unique sentences in the test split. It is obivous 3074 * 2 != 6156....

Your reported results may a little bit higher than the actual ones.

Please point out if I were wrong.

Cannot reach the reported performance

Thanks for your code!

I run the code using the default configurations on cuda=10.2 and pytorch=1.9.0, and only obtain the following results:
Rank@1:0.626545 Rank@5:0.825960 Rank@10:0.885491 mAP:0.534096.
Is it due to the cuda and pytorch version?

The detailed configurations are as follows:
CMPM: true
adam_alpha: 0.9
adam_beta: 0.999
adam_lr: 0.003
batch_size: 64
checkpoint_dir: ./log
dataset: CUHKPEDES
dir: ./datasets/
droprate: 0
embedding_type: BERT
epoches_decay: '50'
epsilon: 1.0e-08
feature_size: 2048
gpus: '1'
height: 384
log_dir: ./log
lr_decay_ratio: 0.1
lr_decay_type: MultiStepLR
max_length: 64
name: Origin
num_classes: 11003
num_epoches: 10000000
optimizer: adam
pretrained: true
resume: false
seed: 0
warm_epoch: 10
wd: 4.0e-05
width: 128

About BERT_token_64.py

no train.csv, val.csv, test.csv.
how did you split the dataset exactly?

Unable to get the results of the paper.

Thank you very much for your excellent work, it makes my thinking suddenly clear.
I encountered a problem when I repeat the experiment.
I used your parameters and experimental configuration that it in the paper.
python train.py --max-length 64 --batch-size 64 --num-epoches 80 --adam-lr 0.003 --gpus 0
But LOSS got NAN. I did not make other changes.
Since I didn't have a divided CSV file, I directly used the bert file you provided.
Please refer to the attached log.

train_log.txt

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

	# we input the two times of images, so we need to select half of them
	ac_top1_t2i, ac_top5_t2i, ac_top10_t2i, mAP = test_map(text_bank, labels_bank, images_bank[::2], labels_bank[::2])
	return ac_top1_t2i, ac_top5_t2i, ac_top10_t2i, mAP