parals_legacy's Introduction

ParaLS: Paraphraser-based Lexical Substitution

Lexical substitution (LS) aims at finding appropriate substitutes for a target word in a sentence. Recently, BERT-based LS methods have made remarkable progress, which generates substitute candidates for a target word directly based on its context. However, it overlooks the substitution's impact on the overall meaning of the sentence. In this paper, we try to generate the substitute candidates from a paraphraser. Considering the generated paraphrases from a paraphraser contain variations in word choice and preserve the sentence's meaning, we propose a simple decoding method that focuses on the variations of the target word during decoding, and leverage it to propose a new LS approach ParaLS. Experimental results show that ParaLS improves the F1 score from 18.4 to 28.7 on the up-to-date benchmark compared with the state-of-the-art BERT-based LS method.

Requirements and Installation

Our code is based on Fairseq version=10.2
PyTorch version = 1.8.0
Python version >= 3.8
For training new models, you'll also need an NVIDIA GPU and NCCL

Step 1: Downlaod the pretrained paraphraser modeling

You need to download the paraphraser from here, and put it into folder "checkpoints/⁨para⁩/transformer/⁩"

Step 2: Run our code

(1) run ParaLS for lexical substitute dataset SWORDS

input "sh run_Swords.sh"

(2) run ParaLS for lexical simplification dataset

input "sh run_LSPara_NNSeval.sh"

(3) run ParaLS for one example

input "python Paraphraser.py"

Citation

Please cite as:

Recommend Projects

qiang2100 / parals_legacy Goto Github PK

parals_legacy's Introduction

ParaLS: Paraphraser-based Lexical Substitution

Requirements and Installation

Step 1: Downlaod the pretrained paraphraser modeling

Step 2: Run our code

Citation

parals_legacy's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent