adacse's Introduction

AdaCSE

Training

For each existing model to be improved, the samples that the model has difficulty recognizing are first selected as training data from source data/chatgpt_pair.txt or source data/llama_pair.txt and saved in . /data.

Training scripts

We provide training scripts ./run.sh. Training scripts call ./train.py for training. The script runs with the following code,

bash run.sh

Evaluation

Before evaluation, please download the evaluation datasets by running,

cd SentEval/data/downstream/
bash download_dataset.sh

Our evaluation code for sentence embeddings is based on a modified version of SentEval. It evaluates sentence embeddings on semantic textual similarity (STS) tasks and downstream transfer tasks. For STS tasks, our evaluation takes the "all" setting and reports Spearman's correlation. You can evaluate any transformers-based pre-trained models using our evaluation code. For example,

python evaluation.py --model_name_or_path `Replace with your model or path` --pooler cls_before_pooler --task_set sts --mode test

Recommend Projects

wsa-dhu / adacse Goto Github PK

adacse's Introduction

AdaCSE

Training

Training scripts

Evaluation

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent