Giter VIP home page Giter VIP logo

Comments (4)

DoublePan-Oh avatar DoublePan-Oh commented on June 23, 2024

Can the ground-truth label be modified, for example, to 1?1 means that the predicted label matches the ground-truth, otherwise there is no match. I now want to modify it into such a requirement, do you think it is feasible?

from rtd-action.

sparkstj avatar sparkstj commented on June 23, 2024

We take the last label entry as background. Since it's a binary classification problem, the first label entry is action. So yes, the label 0 represents action and the label 1 represents background. It's just a matter of notations, so it would be feasible to swap the labels for action and background.

from rtd-action.

DoublePan-Oh avatar DoublePan-Oh commented on June 23, 2024

This classification task should be a binary classification. According to the detr, it should be num_class=2 when setting num_class, but your code setting is num_class=1. This is where I don't quite understand.

image

from rtd-action.

sparkstj avatar sparkstj commented on June 23, 2024

In the COCO dataset and many other datasets, labels are expected to start at 1, so DETR leaves class 0 untouched and reserves max_id + 1 for no-object class. That's why they decide num_classes = max_id + 1.

In our case, we index classes from 0 to create more compact class embedding. We assign action to class 0, and the background/no-object class to class 1. That is to say, our max_id is 0 and therefore num_classes = 1.

You could also check this discussion for detailed explanations.

from rtd-action.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.