Giter VIP home page Giter VIP logo

tipcb's Introduction

TIPCB

Prerequisites

  • Pytorch 1.1
  • cuda 9.0
  • python 3.6
  • GPU Memory>=12G

Datasets

We evaluate our method on CUHK-PEDES. Please visit Here.

Usage

  • You need to generate tokens to "/data/BERT_encode/" by running "BERT_token_64.py" or downloading from Here.
  • If you want to train the network, you can run our code with the following commands:

python train.py --max-length 64 --batch-size 64 --num-epoches 80 --adam-lr 0.003 --gpus 0

  • You can download our trained model from Here and the trained log from Here.

Evaluate

Top-1 Top-5 Top-10
63.63 82.81 89.01

tipcb's People

Contributors

orangeyhchen avatar

Stargazers

Hogan avatar lihuran1998 avatar WX1999 avatar Fei Shen avatar Sotaro YAMAGUCHI avatar  avatar Candy avatar  avatar Lin Min avatar  avatar Aotle avatar  avatar runcongma avatar silence avatar Liviu-Daniel avatar  avatar  avatar Weifeng Zhang avatar yangqu avatar M1ng avatar Zhicheng avatar  avatar  avatar orbits avatar Feng Chen avatar  avatar Brandon Han avatar Minghui Chen avatar  avatar  avatar Li Zheng avatar  avatar

Watchers

 avatar

tipcb's Issues

Unable to get the results in ICFG-PEDES

When using TIPCB network to train ICFG-PEDES dataset, top-1=51.01%, which is three points lower than that reported in TIPCB. I was wondering if there was something wrong with the way I handled the raw dataset. So can you provide me your BERT_id_train_64_new.npz and BERT_id_test_64_new.npz files in ICFG-PEDES dataset and provide details of parameters? Thank you.
My email is [email protected]

Bug exists in evaluation

TIPCB/test_model.py

Lines 40 to 42 in 93aa7c5

# we input the two times of images, so we need to select half of them
ac_top1_t2i, ac_top5_t2i, ac_top10_t2i, mAP = test_map(text_bank, labels_bank, images_bank[::2], labels_bank[::2])
return ac_top1_t2i, ac_top5_t2i, ac_top10_t2i, mAP

Not all images in the test split are loaded two times. Your process is arbitrary.

In other words, there are 3074 unique images and 6156 unique sentences in the test split. It is obivous 3074 * 2 != 6156....

Your reported results may a little bit higher than the actual ones.

Please point out if I were wrong.

Cannot reach the reported performance

Thanks for your code!

I run the code using the default configurations on cuda=10.2 and pytorch=1.9.0, and only obtain the following results:
Rank@1:0.626545 Rank@5:0.825960 Rank@10:0.885491 mAP:0.534096.
Is it due to the cuda and pytorch version?

The detailed configurations are as follows:
CMPM: true
adam_alpha: 0.9
adam_beta: 0.999
adam_lr: 0.003
batch_size: 64
checkpoint_dir: ./log
dataset: CUHKPEDES
dir: ./datasets/
droprate: 0
embedding_type: BERT
epoches_decay: '50'
epsilon: 1.0e-08
feature_size: 2048
gpus: '1'
height: 384
log_dir: ./log
lr_decay_ratio: 0.1
lr_decay_type: MultiStepLR
max_length: 64
name: Origin
num_classes: 11003
num_epoches: 10000000
optimizer: adam
pretrained: true
resume: false
seed: 0
warm_epoch: 10
wd: 4.0e-05
width: 128

Unable to get the results of the paper.

Thank you very much for your excellent work, it makes my thinking suddenly clear.
I encountered a problem when I repeat the experiment.
I used your parameters and experimental configuration that it in the paper.
python train.py --max-length 64 --batch-size 64 --num-epoches 80 --adam-lr 0.003 --gpus 0
But LOSS got NAN. I did not make other changes.
Since I didn't have a divided CSV file, I directly used the bert file you provided.
Please refer to the attached log.

train_log.txt

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.