Giter VIP home page Giter VIP logo

Comments (6)

yelantf avatar yelantf commented on May 27, 2024

Hi, we did not experiment on the RTX2070. On our GPU RTX 2080 Ti, the speed is about 10 fps. Condisering that RTX 2080 Ti is more powerful, the speed on your machine should be reasonable.

from alphaction.

semchan avatar semchan commented on May 27, 2024

Hi, we did not experiment on the RTX2070. On our GPU RTX 2080 Ti, the speed is about 10 fps. Condisering that RTX 2080 Ti is more powerful, the speed on your machine should be reasonable.

thanks very much for your reply. I have one more question for the speed. I noted that there is a parameter of "--detect-rate" , is it an action detection interval number? If set this parameter higher, the speed should be faster, am I correct? but I found I set this parameter higher, it seems no change for the speed.

from alphaction.

yelantf avatar yelantf commented on May 27, 2024

The detect-rate is the rate at which we update the action labels. For example, when detect-rate=4, it means that in one second, the action labels of each person are updated four times. In each update, the model will use a short clip of 64 frames as input and predict the new action labels. To process a 64-frame clip, our ResNet-101 takes about 0.2 seconds (on RTX 2080 TI), which means detect-rate>5 will takes more than one second to provide labels for a one-second clip. Thus, using higher detect-rate should slow down the speed.

However, we also use a tracking model to track each actors. The speed of tracking model is about 10 fps, which is a bottleneck when detect-rate is low.

from alphaction.

semchan avatar semchan commented on May 27, 2024

The detect-rate is the rate at which we update the action labels. For example, when detect-rate=4, it means that in one second, the action labels of each person are updated four times. In each update, the model will use a short clip of 64 frames as input and predict the new action labels. To process a 64-frame clip, our ResNet-101 takes about 0.2 seconds (on RTX 2080 TI), which means detect-rate>5 will takes more than one second to provide labels for a one-second clip. Thus, using higher detect-rate should slow down the speed.

However, we also use a tracking model to track each actors. The speed of tracking model is about 10 fps, which is a bottleneck when detect-rate is low.

Noted and thank you very much for your detailed explanation.If only this algorithm could be processed in real-time.

from alphaction.

yelantf avatar yelantf commented on May 27, 2024

Actually, if we use a low detect-rate with a realtime tracking method, this algorithm could be real-time. However, we did not find a robust realtime tracking method. Even the tracking model we use now is not very robust in the fast-motion scene. There is a tradeoff between speed and performance. Maybe in the future, a much stronger tracking method will make it possible.

from alphaction.

semchan avatar semchan commented on May 27, 2024

Actually, if we use a low detect-rate with a realtime tracking method, this algorithm could be real-time. However, we did not find a robust realtime tracking method. Even the tracking model we use now is not very robust in the fast-motion scene. There is a tradeoff between speed and performance. Maybe in the future, a much stronger tracking method will make it possible.

Noted with thanks. I will do some test based on your code to know more about your wonderful research

from alphaction.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.