cfed-hanet's Introduction

Hierarchical Augmentation Networks (HANet)

Code and dataset for the LREC-COLING 2024 paper "Continual Few-shot Event Detection via Hierarchical Augmentation Networks".

The paper is available at https://aclanthology.org/2024.lrec-main.342/

Requirements

Install the dependencies by running:

pip install -r requirements.txt

Quick Start

To run HANet with its default settings on the CFED-MAVEN dataset, execute:

bash MAVEN_all_fwUCL+TCL.sh

See configs.py for the detailed configuration options.

Hyperparameters used in the experiments

Some hyperparameters used in the experiments are not fully specified in MAVEN_all_fwUCL+TCL.sh. They are listed below.

Random Seed

We evaluated each method with the random seeds 1, 2, 3, 4, and 42.

Dataset Permutation

The permutations used for each dataset can be found in data_incremental/{dataset}/perm{i}, where {dataset} is the dataset name and {i} is the permutation index.
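
As a small illustration of the path template above, the following sketch fills in {dataset} and {i}. The helper name permutation_path is hypothetical and not part of the repository, and using "MAVEN" as the directory name is an assumption; check the data_incremental folder for the actual names and for how many permutations exist.

```python
from pathlib import PurePosixPath


def permutation_path(dataset: str, index: int, root: str = "data_incremental") -> PurePosixPath:
    """Fill in the data_incremental/{dataset}/perm{i} template.

    Hypothetical helper for illustration only; it does not check
    that the path exists on disk.
    """
    return PurePosixPath(root) / dataset / f"perm{index}"


# "MAVEN" as a directory name is an assumption, not confirmed by the README.
print(permutation_path("MAVEN", 0))  # data_incremental/MAVEN/perm0
```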

Other parameters:

--aug-repeat-times is set to 5 and --joint-da-loss is set to 'none'.
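
To keep track of which runs a full reproduction involves, the settings in this section can be collected into a simple run grid. This is only a sketch: it assumes every seed is paired with every permutation, which this README does not state, and the number of permutations is left as a parameter because it is not given here. The names run_grid, SEEDS, and FIXED are hypothetical, and the dictionary keys are illustrative labels rather than the repository's actual argument names.

```python
from itertools import product

# Seeds listed under "Random Seed".
SEEDS = (1, 2, 3, 4, 42)

# Values listed under "Other parameters"; the keys are illustrative labels.
FIXED = {"aug_repeat_times": 5, "joint_da_loss": "none"}


def run_grid(num_permutations: int) -> list[dict]:
    """Enumerate one settings dict per (permutation, seed) pair.

    Assumes each seed is combined with each permutation index
    0..num_permutations-1; adjust if the experiments pair them differently.
    """
    return [
        {"seed": seed, "perm": perm, **FIXED}
        for perm, seed in product(range(num_permutations), SEEDS)
    ]


grid = run_grid(num_permutations=2)
print(len(grid))  # 10
print(grid[0])
```

Each dictionary can then be mapped by hand onto the options defined in configs.py.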

Citation

Please cite our paper if you use HANet in your work:

@inproceedings{zhang-etal-2024-continual-shot,
    title = "Continual Few-shot Event Detection via Hierarchical Augmentation Networks",
    author = "Zhang, Chenlong  and
      Cao, Pengfei  and
      Chen, Yubo  and
      Liu, Kang  and
      Zhang, Zhiqiang  and
      Sun, Mengshu  and
      Zhao, Jun",
    editor = "Calzolari, Nicoletta  and
      Kan, Min-Yen  and
      Hoste, Veronique  and
      Lenci, Alessandro  and
      Sakti, Sakriani  and
      Xue, Nianwen",
    booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
    month = may,
    year = "2024",
    address = "Torino, Italia",
    publisher = "ELRA and ICCL",
    url = "https://aclanthology.org/2024.lrec-main.342",
    pages = "3868--3880",
    abstract = "Traditional continual event detection relies on abundant labeled data for training, which is often impractical to obtain in real-world applications. In this paper, we introduce continual few-shot event detection (CFED), a more commonly encountered scenario when a substantial number of labeled samples are not accessible. The CFED task is challenging as it involves memorizing previous event types and learning new event types with few-shot samples. To mitigate these challenges, we propose a memory-based framework: Hierarchical Augmentation Network (HANet). To memorize previous event types with limited memory, we incorporate prototypical augmentation into the memory set. For the issue of learning new event types in few-shot scenarios, we propose a contrastive augmentation module for token representations. Despite comparing with previous state-of-the-art methods, we also conduct comparisons with ChatGPT. Experiment results demonstrate that our method significantly outperforms all of these methods in multiple continual few-shot event detection tasks.",
}

cfed-hanet's Issues

About accuracy of the result

I apologize for any inconvenience. I have executed the code using the hyperparameters as specified in the paper and the accompanying code. However, the accuracy results I am obtaining consistently fall short of the benchmarks provided in the paper. For instance, the microF1 score during the second stage of the 2-way 5-shot evaluation on the ACE dataset is lower by 10-20 percentage points compared to the values reported in the paper. Could there be a reason for this discrepancy? Is it possible that some code or hyperparameters have not been updated or are not in sync with the latest findings?
I have uploaded the log file for the 'ACE/none/10class/5shot/shuffle_r1/perm0' experiment in the attachment. Could you please review it? I would appreciate your insights on any potential issues or discrepancies that might be affecting the accuracy of the results.
2024-06-27-22-17-20.log

chenlong-clock / cfed-hanet
