Giter VIP home page Giter VIP logo

understand_videobased_reid's Introduction

understand_videobased_reid's People

Contributors

asuradayuci avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

understand_videobased_reid's Issues

Does the AP computed is P ?

# compute average precision 计算平均精度 # reference: https://en.wikipedia.org/wiki/Evaluation_measures_(information_retrieval)#Average_precision num_rel = orig_cmc.sum() # 所有元素求和 tmp_cmc = orig_cmc.cumsum() # 累加和,不改变数据形状 tmp_cmc = [x / (i+1.) for i, x in enumerate(tmp_cmc)] # enumerate() 函数,返回数据下标和数据(i,x) # 计算top_i的cmc ..x / (i+1.) tmp_cmc = np.asarray(tmp_cmc) * orig_cmc # np.asarray(tmp_cmc),数据类型转换为数组, 只保留正确的匹配 , 错误的值为0 AP = tmp_cmc.sum() / num_rel # 平均精度 = 正确匹配的元素求和 除以原来所有元素的总和 all_AP.append(AP)

你好,有个问题请教一下。代码里的 AP的计算公式对应的是 P=TP/TP+NP吧,感觉怎么像是在计算精确度,不是用PR曲线的面积来求解的。能麻烦您讲解一下吗?

注释中的一点小疑问

在ResNet.py文件中,有如下注释:

wx20190304-144115

我阅读过代码后,觉得这个传入的x维度应该是[batch_size,seq_len,channels,height,width]
这样的话,下面这句代码才合理:
x = x.view(bt, x.size(2), x.size(3), x.size(4))
否则,没有理由将x resize成b
t,batch_size*channels,很莫名其妙

About the train

Hi, when i run the main_video_person_reid.py at the Epoch 50 steps run test. I got an error --------> AttributeError: 'tuple' object has no attribute 'view' at the code features = features.view(n, -1).
Can you tell me how to fix this error.
Thank you

sample_method dense mean all ?

elif self.sample == 'dense': """ Sample all frames in a video into a list of clips, each clip contains seq_len frames, batch_size needs to be set to 1. 将视频中的所有帧采样到一系列的clips,每个clip包含seq_len个帧,批次大小需要设置为1 This sampling strategy is used in test phase. 在测试阶段采用密集采样策略. """

你好,我想问一下 dense的取样方式 是不是类似于 把 整个tracklet 划分成 多个 seq_length长的clip ?

关于 Mars 数据集 在测试阶段的疑惑

你好,我想问一下,在main文件中设定50个epoch进行一次test, 是相当于使用了整个测试集的数据了吗? 在训练完成后,进行测试的时候,测试的数据集用还是用原来的query_idx 吗? 谢谢

About train the model

Hi, I learn from you project, and I can see the steps is:
1-----> I download the Mars data,
2-----> run the main_video_person_reid.py with pre_trained model----resnet-50-kinetic.pth
3-----> get the trained model to run test.py
of course change some path in the code.

Am I right?

Need assistance in understanding methodology to train video reid datasets.

Hi I am trying to understand different methodologies and architecture to train video reid datasets. My question is that can you please tell me what are the best hyperparameters combination, i.e. learning rate, margin for triplet loss, batch size train, batch size test, sequence length for video reid dataset training etc.

Also will you please tell me that in one of your questions asked in the TCLNet repository, you asked that while training for the mars dataset, your result was not similar to those in the research paper. The author said that try to use all frames while testing instead of 4 (the default value) will you please tell me what does it actually meant by this? Which value should be replaced instead of 4 in test_frames of argument parser?

I have also attached a screenshot for the reference
Screenshot 2021-11-04 194044

Some puzzled questions about details in realizing triplet loss

  # Compute pairwise distance, replace by the official when merged
        dist = torch.pow(inputs, 2).sum(dim=1, keepdim=True).expand(n, n)
        dist = dist + dist.t()
        dist.addmm_(1, -2, inputs, inputs.t())
        dist = dist.clamp(min=1e-12).sqrt()  # for numerical stability

你好,在损失函数的实现部分,TripletLoss(nn.Module)里面的上述代码我不太理解,能麻烦你解读一下吗? 非常非常感谢。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.