Giter VIP home page Giter VIP logo

hidialog's Introduction

HiDialog

Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention, ICLR 2023, Tiny Papers

https://arxiv.org/abs/2305.00262


This is the official code repository of our ICLR 2023 Tiny paper. In this paper, we proposed a simple but effective Hierarchical Dialogue Understanding model, HiDialog. we first insert multiple special tokens into a dialogue and propose the turn-level attention to learn turn embeddings hierarchically. Then, a heterogeneous graph module is leveraged to polish the learned embeddings.

Requirements

Our experiments are conducted with following core packages:

  • PyTorch 1.11.0
  • CUDA 11.6
  • dgl-cuda11.3 0.8.2
  • sklearn

Experiments

Main Results

Reproducibility

To reproduce our training process in main experiments on DialogRE,

  • download RoBERTa and unzip it to pre-trained_model/RoBERTa/.
  • download config.json, merges.txt and vocab.json from here, put them to pre-trained_model/RoBERTa/
  • download DialogRE
  • copy the *.json files into datasets/DialogRE
  • run bash dialogre.sh


To reproduce our training process in main experiments on MELD,

  • download RoBERTa and unzip it to pre-trained_model/RoBERTa/.
  • download config.json, merges.txt and vocab.json from here, put them to pre-trained_model/RoBERTa/
  • download MELD
  • copy the *.json files into datasets/MELD
  • run python MELD.py
  • run bash meld.sh

Credits

This project is expanded upon from a course project at NUS [Course Page]. The code repository is based on following projects:

  • ACL-20, "Dialogue-Based Relation Extraction" [github]
  • AAAI-21, "GDPNet: Refining Refining Latent Multi-View Graph for Relation Extraction" [github]
  • EMNLP-21, "Graph Based Network with Contextualized Representations of Turns in Dialogue" [github]

Thanks for their amazing work.

Reference

@Article{liu2023hierarchical,
  author  = {Xiao Liu and Jian Zhang and Heng Zhang and Fuzhao Xue and Yang You},
  title   = {Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention},
  journal = {arXiv preprint arXiv:2305.00262},
  year    = {2023},
}

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.