Giter VIP home page Giter VIP logo

cfed-hanet's Introduction

Hierarchical Augmentation Networks(HANet)

Code and dataset for lrec-coling 2024 paper: "Continual Few-shot Event Detection via Hierarchical Augmentation Networks"

The paper is now available on (https://aclanthology.org/2024.lrec-main.342/)

Requirements

Dependencies can be installed by running the following code:

pip install -r requirements.txt

Quick Start for implementation

By the following codes, you can run the default setting of HANet on the CFED-MAVEN dataset:

bash MAVEN_all_fwUCL+TCL.sh

Detailed configurations can be seen in configs.py

Hyperparameters used in the experiments

Some hyperparameters used in the experiments are not fully stated in the MAVEN_all_fwUCL+TCL.sh. We show these parameters as follows:

Random Seed

We randomly evaluated each method with random seed ``1, 2, 3, 4, 42''

Dataset Permutation

The permutation used in the dataset can be found in data_incremental/{dataset}/perm{i}

Other parameters:

The --aug-repeat-times is 5 and the --joint-da-loss is 'none'.

Citation

Please cite our paper if you use HANet in your work:

@inproceedings{zhang-etal-2024-continual-shot,
    title = "Continual Few-shot Event Detection via Hierarchical Augmentation Networks",
    author = "Zhang, Chenlong  and
      Cao, Pengfei  and
      Chen, Yubo  and
      Liu, Kang  and
      Zhang, Zhiqiang  and
      Sun, Mengshu  and
      Zhao, Jun",
    editor = "Calzolari, Nicoletta  and
      Kan, Min-Yen  and
      Hoste, Veronique  and
      Lenci, Alessandro  and
      Sakti, Sakriani  and
      Xue, Nianwen",
    booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
    month = may,
    year = "2024",
    address = "Torino, Italia",
    publisher = "ELRA and ICCL",
    url = "https://aclanthology.org/2024.lrec-main.342",
    pages = "3868--3880",
    abstract = "Traditional continual event detection relies on abundant labeled data for training, which is often impractical to obtain in real-world applications. In this paper, we introduce continual few-shot event detection (CFED), a more commonly encountered scenario when a substantial number of labeled samples are not accessible. The CFED task is challenging as it involves memorizing previous event types and learning new event types with few-shot samples. To mitigate these challenges, we propose a memory-based framework: Hierarchical Augmentation Network (HANet). To memorize previous event types with limited memory, we incorporate prototypical augmentation into the memory set. For the issue of learning new event types in few-shot scenarios, we propose a contrastive augmentation module for token representations. Despite comparing with previous state-of-the-art methods, we also conduct comparisons with ChatGPT. Experiment results demonstrate that our method significantly outperforms all of these methods in multiple continual few-shot event detection tasks.",
}

cfed-hanet's People

Contributors

chenlong-clock avatar

Stargazers

Zhongyan Yi avatar Nguyễn Hoàng Anh avatar

Watchers

 avatar

cfed-hanet's Issues

About accuracy of the result

I apologize for any inconvenience. I have executed the code using the hyperparameters as specified in the paper and the accompanying code. However, the accuracy results I am obtaining consistently fall short of the benchmarks provided in the paper. For instance, the microF1 score during the second stage of the 2-way 5-shot evaluation on the ACE dataset is lower by 10-20 percentage points compared to the values reported in the paper. Could there be a reason for this discrepancy? Is it possible that some code or hyperparameters have not been updated or are not in sync with the latest findings?
I have uploaded the log file for the 'ACE/none/10class/5shot/shuffle_r1/perm0' experiment in the attachment. Could you please review it? I would appreciate your insights on any potential issues or discrepancies that might be affecting the accuracy of the results.
2024-06-27-22-17-20.log

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.