Giter VIP home page Giter VIP logo

Comments (9)

liangxuy avatar liangxuy commented on May 26, 2024

These two files are used in function prior_info of https://github.com/DirtyHarryLYL/HAKE-Action/blob/Image-level-HAKE-Action/scripts/obj_mask.py to boost the performance.
Take object.csv as an example, it is an matrix of size (9,658, 600). 9,658 is the size of testing set and 600 is the number of HOI categories. The value is non-zero only when the HOI index of the related object is detected in this image, and the value is the object detection score. For example, if the value of index <ride bicycle> is non-zero, then bicycle must be detected in this image.

from hake-action.

whqwill avatar whqwill commented on May 26, 2024

But these two files are used directly for testing? I am confused. So how to get these two files? You must use some code to generate these two files.

from hake-action.

whqwill avatar whqwill commented on May 26, 2024

BTW, what is the meaning of "related", how to get the related object? why are they related?

from hake-action.

whqwill avatar whqwill commented on May 26, 2024

然后就是 hp_list 以及 pvp_ankle2、pvp_knee2等等这些标注是做什么用的?

from hake-action.

liangxuy avatar liangxuy commented on May 26, 2024

But these two files are used directly for testing? I am confused. So how to get these two files? You must use some code to generate these two files.

BTW, what is the meaning of "related", how to get the related object? why are they related?

The object.csv is only based on object detection results. To make it clear, given an image, if the object bicycle is detected, then in the 600-d array, the values of bicycle-related HOIs (such as carry bicycle, ride bicycle, park bicycle, ...) are non-zero.
The verb.csv is based on the results of verb classification. We directly expand 117-d verb results to 600-d arrays to serve as a kind of prior knowledge.

from hake-action.

liangxuy avatar liangxuy commented on May 26, 2024

然后就是 hp_list 以及 pvp_ankle2、pvp_knee2等等这些标注是做什么用的?

Pvp_ankle2, pvp_knee2, ... are the labelings of the part-states of a person's 10 body parts. hp_list is a 10-d array corresponds to a person's 10 body parts. If the body part has no-interaction with the object, the vaule is 0, else the value is 1.

from hake-action.

liangxuy avatar liangxuy commented on May 26, 2024

Please see our paper PaStaNet: Toward Human Activity Knowledge Engine for better understanding.

from hake-action.

whqwill avatar whqwill commented on May 26, 2024

然后就是 hp_list 以及 pvp_ankle2、pvp_knee2等等这些标注是做什么用的?

Pvp_ankle2, pvp_knee2, ... are the labelings of the part-states of a person's 10 body parts. hp_list is a 10-d array corresponds to a person's 10 body parts. If the body part has no-interaction with the object, the vaule is 0, else the value is 1.

why dose ankle have 6 labels and hand have 23 labels? Does that mean hands can have 23 different states? Which is not mentioned in the paper. Also, I understand the mearning of hp_list now, but there are three persons and only one hp_list?

from hake-action.

liangxuy avatar liangxuy commented on May 26, 2024

This is the setting of our dataset, and we provide the full list of our part-state definitions under here. If we have multiple persons, then we use "OR" operation to fuse the three hp_lists.

from hake-action.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.