Giter VIP home page Giter VIP logo

ap3d's People

Contributors

guxinqian avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

ap3d's Issues

could you provide the following files for training?

    self.train_name_path = osp.join(self.root, 'info/train_name.txt')
    self.test_name_path = osp.join(self.root, 'info/test_name.txt')
    self.track_train_info_path = osp.join(self.root, 'info/tracks_train_info.mat')
    self.track_test_info_path = osp.join(self.root, 'info/tracks_test_info.mat')
    self.query_IDX_path = osp.join(self.root, 'info/query_IDX.mat')

could you provide the info folder?

Effect of inflate_* functions

I require a small clarification on my understanding of inflate_* functions.
If time_dim = 1, the inflate_* functions still behave like 2D functions.
i.e., inflate_conv will make the conv2d layer to conv3d layer, but still behave like conv2d layer, since time_dim=1.
Is that correct?

Or Is there any other advantage in inflate_* functions?

Thanks in advance!

关于目录

python train.py --root /home/guxinqian/data/ -d mars --arch ap3dres50 --gpu 0,1 --save_dir log-mars-ap3d
这个mar文件夹是用来存放什么的。

Details about the Deformable 3D Conv in Table 2

Excellent work! I noticed that you have compared different approaches for temporal information modeling in Table 2. I wonder how did you perform Deformable 3D Conv? Is it identical to our recently published D3Dnet? (https://github.com/XinyiYing/D3Dnet)

D3D is an effective approach for motion-aware spatio-temporal modeling and works well for video super-resolution. Did it fail in the Video-based Person ReID task?

Duke数据集

请问一下为什么duke数据集的结果 非常低?

Question about the implementation of contrastive attention

Thanks for your great work!
I am studying your code, and I find that in the implementation of contrastive attention, you use a detach() trick like:

x_att = self.x_mapping(x.unsqueeze(3).expand(-1, -1, -1, N-1, -1, -1).contiguous().view(b, c, (N-1)*t, h, w).detach())
n_att = self.n_mapping(neighbor_new.detach())
contrastive_att = self.contrastive_att_net(x_att * n_att)
neighbor_new = neighbor_new * contrastive_att

which indicates that you don't want the gradients to be broadcast. I want to know the exact reason of the detach() usage. And have you tried to train and test your network without detach(), how about the performance under such condition? Thank you very much!

如何理解这部分

x_norm_expand = x_norm.unsqueeze(3).expand(-1, -1, -1, N-1, -1, -1).permute(0, 2, 3, 4, 5, 1)

虽然能够理解利用拷贝进行相关矩阵的计算的操作,但是这里的扩维和expand操作是为什么呢,在之后的contrastive_att中也是,因为之前接触其它方向的时候没见过6维Tensor

Replicate only I3D results

Hi Xinqian,

To replicate only I3D results, do we need to change anything else except AP3D.APP3DC ---> AP3D.I3D in L168 of ResNet.py? If I only do this change, I get 3% mAP difference according to Table 1.

为什么我train中的pids恒为相等值

如题,下载了您的源代码和MARS数据集,进行源代码的运行时

for batch_idx, (vids, pids, _) in enumerate(trainloader):

中总是满足
if (pids-pids[0]).sum() == 0:
# can't compute triplet loss
continue
即pids总是相等的一组值。这是为什么

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.