Giter VIP home page Giter VIP logo

siamese-rpn-tensorflow's Introduction

Siamese-RPN-tensorflow

Code for reproducing the results in the following paper:

Environment

  • python=3.6
  • tensorflow=1.10
  • cuda=9.0

Downloading VOT2013 Data

Downloading YouTube-bb Data

Downloading ILSVRC 2015-VID Data

Performance

The red box is for tracing

Visualization for debug

bbox in detection

  • red -- the groundtruth

  • black -- bbox with highest score

  • other colors -- bbox with scores from second to tenth.

Training and Evaluation

If your data format is the same as VOT 2013, you can run the code directly. If not, you need to change the utils/image_reader.py or convert the data format to VOT format.

To train Siamese-RPN:

python train.py

If you want to see if the training is reasonable in the course of training, you can choose to turn on debug.Just change the init() in train.py

self.is_debug=True

This will result in a debug folder where you can see pictures of the training process, with groundtruth in red and box in top 10 scores in other colors.

To test Siamese-RPN:

To test series of images like VOT format

If you want to test a series of images captured from the video, you need to assign new values img_pathand img_label in config.py, which are the files of your image's path and label, respectively. Then execute the following commands

python test.py

This command will automatically synthesize videos from image sequences, and also synthesize videos from processed images, which are saved in. / data / vedio

To test a vedio

If you are testing a video, you need to put the video in./data/vedio. You can run the following command and select the object you want to track in the first frame according to the program prompt at the beginning.

python vedio_test.py test.mp4

The 'test.mp4' is the name of your vedio

Model

I will provide the well-trained model in the next few days

siamese-rpn-tensorflow's People

Contributors

makalo avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

siamese-rpn-tensorflow's Issues

nan loss

I use your implemented loss module to my SiamesRPN++. It was working fine after some first epochs, but the loss is getting to 'nan' after that. Do you know why does it happen?
Btw, Can you implementation work with batch size is higher than 1?

step=224,loss=0.21591535210609436,cls_loss=0.22106705605983734,reg_loss=0.04982298985123634,lr=0.0010000000474974513,time=0.13663458824157715 step=225,loss=0.23805415630340576,cls_loss=0.18039944767951965,reg_loss=0.018808992579579353,lr=0.0010000000474974513,time=0.14361572265625 step=226,loss=0.22827929258346558,cls_loss=0.22993800044059753,reg_loss=0.027202464640140533,lr=0.0010000000474974513,time=0.14561033248901367 step=227,loss=0.18403905630111694,cls_loss=0.20121710002422333,reg_loss=0.019027791917324066,lr=0.0010000000474974513,time=0.1216745376586914 step=228,loss=nan,cls_loss=nan,reg_loss=nan,lr=0.0010000000474974513,time=0.17054390907287598 step=229,loss=nan,cls_loss=nan,reg_loss=nan,lr=0.0010000000474974513,time=0.12932848930358887 step=230,loss=nan,cls_loss=nan,reg_loss=nan,lr=0.0010000000474974513,time=0.12520909309387207 step=231,loss=nan,cls_loss=nan,reg_loss=nan,lr=0.0010000000474974513,time=0.13763189315795898

You might make a mistake in image_reader_cuda.py, line 78, index_d might be out of range
index_d=[tf.cond(tf.greater(index_t-interval,node_min),lambda:index_t-interval,lambda:index_t+interval),tf.cond(tf.less(index_t+interval,node_max),lambda:index_t+interval,lambda:index_t-interval)]
For example, if length of an input array is 120, index_t = 60 (random between 0 and 120), interval = 98 (random between 30 and 100) => index_d = [158, -38] would be wrong

my bbox have some problem

I try to train this net by myself.I just used VOT and when I test ,I found the problem that all the boxes are the same size,so the accuracy is not well.It is caused by the error in code or something else?

关于网络训练

你好,请问youtube-BB全部视频下载下来有多大呢?

原论文中是读取了预训练模型的参数,微调后两层,但您的代码里好像是直接全部层的参数都自己训练了?效果怎么样呢?

The tracker is getting crazy

Hi,

I'm trying to run the demo, I select a bounding box, but instead of tracking the person, the bounding box just moves from one corner of the screen to another. Do I need to resize my video or do any other preprocessing steps?

Thanks,
Zvi

pre-trained model

Hello, I really appreciate your logical implementation of this complex network, I’ve learned a lot. But the training set is too big. I'm wondering if you could please share the pre-trained model, my email address is 17863135811@163. com, thanks a lot!

loss=nan

按照您所说的方式,采用vot数据进行训练,出现loss=nan的情况
step = 0,loss = nan,cls_loss = 0.694078922272,reg_loss = nan,lr = 0.0010000000475,time = 2.76396989822

Pre-trained model

Hi @makalo - Thanks for your implementation. Is a trained model already available? If so, how do I download it?

data aug

I don't find any data_augmentation in this code. ex:scale、 shift、motion blur; which are very important in training!
Am I miss some code?
Thank you fot this project!

pretrained model

Hello, thank you very much for your work. Could you please provide the pre-training model? Thank you!

预训练模型

你好,youtubb数据集太大了,你们能提供一下预训练的模型吗?嘿嘿

训练问题

训练过程权重变成nan,但是loss是下降的,没有异常,您遇到这种情况吗?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.