Giter VIP home page Giter VIP logo

acm-net's Introduction

ACM-Net: Weakly-Supervised-Action-Localization

The official repository of our paper "ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization" .

Motivation

Traditional methods mainly focus on foreground and background frames separation with only a single attention branch and class activation sequence. However, we argue that apart from the distinctive foreground and background frames there are plenty of semantically ambiguous action context frames. It does not make sense to group those context frames to the same background class since they are semantically related to a specific action category. Consequently, it is challenging to suppress action context frames with only a single class activation sequence. To address this issue, in this paper, we propose an action-context modeling network termed ACM-Net, which integrates a three-branch attention module to measure the likelihood of each temporal point being action instance, context, or non-action background, simultaneously. Then based on the obtained three-branch attention values, we construct three-branch class activation sequences to represent the action instances, contexts, and non-action backgrounds, individually.

Requirements

Required packages are listed in requirements.txt. You can install by running:

pip install -r requirements.txt

Dataset

We evaluate our ACM-Net on two popular benchmark datasets THUMOS-14 and ActivityNet-1.3. We provide extracted features for

Before running the code, please download the target dataset and unzip it under the data/ folder.

Running

You can train your own model by running:

# For the THUMOS-14 datasets.
python main_thu.py --batch_size 16

# For the ActivityNet-1.3 datasets.
python main_act.py --batch_size 64

You can configure your own hyper-parameters in config/model_config.py

To test your model, you can run following command:

# For the THUMOS-14 datasets.
python main_thu.py --test --checkpoint $checkpoint_path

# For the ActivityNet-1.3 datasets.
python main_act.py --test --checkpoint $checkpoint_path

Note that we apply the wandb client to log the experiments, if you don't want to use this tool, you can disable it in the command with --without_wandb like

python main_thu.py --without_wandb

Citation

If you find our code or our paper useful for your research, please cite our work:

@article{qu_2021_acmnet,
  title={ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization},
  author={Sanqing Qu, Guang Chen, Zhijun Li, Lijun Zhang, Fan Lu, Alois Knoll},
  journal={arXiv preprint arXiv:2104.02967},
  year={2021}
}

Acknowledgement

We referenced the repos below for the code

Contact

If you have any question or comment, please contact the first author of this paper -- Sanqing Qu ([email protected])

acm-net's People

Contributors

sanqingqu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

acm-net's Issues

results reproduce on THUMOS-14

Hi there,

Thanks for your great work! I'm really interested and trying to follow your work.
However, during reproducing resutls on THUMOS-14, I only got 0.336 after 1000 epochs using GPU 3090.
I just clone the code from this repo and changed nothing. I downloaded data followed the link provided in this repo.
Do you have any clue on this?
Looking forward to yoru reply.

BW

Good work!

Good work! THUMOS-14 can be reproduced, but ActivityNet-1.3 can't? Any Suggestions?

Current test_mAP:0.2228, Current Best test_mAP:0.2342 Current Epoch:954/1000

训练THUMOS-14过程中出现train_act_inst_cls_loss和trian_loss为nan的情况

感谢分享优秀的工作,我没做任何参数修改的情况下在THUMOS-14上复现了论文中的精度。但是训练过程中出现了train_act_inst_cls_loss=nan和trian_loss=nan的情况,想请问一下这是什么情况?有什么解决的办法吗?

### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ###

image

### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ### ###

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.