Giter VIP home page Giter VIP logo

Comments (8)

yihongXU avatar yihongXU commented on July 29, 2024

Hi,
Thank you for your interest!
DHN data is easy to be generated. I was taking gt bboxes and detections provided by MOT15,16 and MOT17, and I calculated the distances (same as dist during training,https://gitlab.inria.fr/yixu/deepmot/-/blob/master/train_tracktor/src/tracktor/tracker.py line 392) between them -> a distance matrix is made. I augumented by thresholding them with different values.
As for ground-truth labels, we input these distances into Hungarian algorithm, and get the ground-truth binary assignment for each distance.

You can surely make your custom data, and define your own distance.

Hope that can help.

from deepmot.

 avatar commented on July 29, 2024

@yihongXU
Hi.
Thanks for your reply!
I am not sure what your words "augument by thresholding them with different values" mean.
Does it mean that zero out the value below the threshold and keep the value above the threshold?
e.g.
The distance matrix is [0.3, 0.5, 0.2], the stochastic threshold is [0.6, 0.4, 0.1], then the distance matrix is augmented to [0.0, 0.5, 0.2].
Is that right?

from deepmot.

yihongXU avatar yihongXU commented on July 29, 2024

Hi,

Suppose that your distance matrix values are [0,1], thresholding means that you threshold out those large distance that cannot be considered as a match. In your example, you threshold the distance matrix [0.3, 0.5, 0.2] by a value [0,1], let say 0.4, then it will become [0.3, inf, 0.2]. Inf referring to a big value ( like in https://github.com/cheind/py-motmetrics, you set it to np.nan) in order not to assign when you input it to Hungarian Algorithm.

from deepmot.

 avatar commented on July 29, 2024

Thanks! I got it.
The output shape of https://gitlab.inria.fr/yixu/deepmot/-/blob/master/train_tracktor/src/tracktor/tracker.py is [N, M], where N is the number of the objects and M is the number of the detections on this frame.
But the shape DHN_data/*/*.npy is all (10, 9, 15). What do the three dimensions “10, 9, 15” represent separately?
And the names of subdirectories are all in the format of “x_y”. What do x, y represent here?

from deepmot.

yihongXU avatar yihongXU commented on July 29, 2024

Hi,

10 is the batch size, I grouped the matrices having same size to a batch, just to accelerate a bit the training, otherwise you need to train it one by one since the input size varies.

x_y represents h,w of the matrix, as I said I regrouped the matrices by their sizes.

from deepmot.

 avatar commented on July 29, 2024

Thanks! I got it.
But I am still a little confused.
There are train_DHN/DHN_data/train/1_6/2DMOT2015_1_6_7_m.npy and rain_DHN/DHN_data/train/1_6/2DMOT2015_1_6_12_m.npy.
What does that number (7 and 12 here) represent?

from deepmot.

yihongXU avatar yihongXU commented on July 29, 2024

Hi,

7 and 12 are just indexes for numbering the matrices. The reason why they are not continuous is that I picked randomly some of the matrices to the validation set.

from deepmot.

972821054 avatar 972821054 commented on July 29, 2024

@magamig @yihongXU Hello, have you successfully made the data for DHN? I would appreciate if you could inform me about the process of making DHN!

from deepmot.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.