Giter VIP home page Giter VIP logo

spokencse's Introduction

Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding

image

Requirements

pip install -r requirements.txt

NOTE:

  • you may need additional setting for phonemizer if you want to do phoneme-related data preprocessing

Usage

  1. Data Preparation For SLURP, The preprocessed dataset is datasets/slurp/slurp_with_oracle_test.json. The preprocessed dataset without filtering and separating test sets is datasets/slurp/slurp.json The data preprocessing includes multiple operations including:
    • Derive ASR hypothesis
    • Generate phoneme sequences by phonemizer
    • Preprocess the dataset (1st version)
      • Scripts in prepare_data would you understand the process:
      • first run make_golden_dataset read only from data provided in SLURP repo
      • and then make_dataset would need transcriptions from different systems
    • Fine-tune roberta-base models on the 1st version dataset
    • Collect predictions and sub-sample the dataset with agreed pseudo label

For ATIS/TREC6 from PhonemeBERT, You can just clone their repo and unzip the dataset.

  1. Contrastive Pretraining
python contrastive_pretraining.py
  1. Fine-tuning
python finetune_on_slurp.py

or on the phonemebert datasets:

python finetune_on_phonemebert.py

Training and evaluation are both included in these two scripts. Adjust the arguments as you need.

Reference

Please cite the following paper:

@article{chang2022contrastive,
  title={Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding},
  author={Chang, Ya-Hsin and Chen, Yun-Nung},
  journal={arXiv preprint arXiv:2205.00693},
  year={2022}
}

spokencse's People

Contributors

zfrank7777 avatar yvchen avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.