

MetaEval

MetaEval is a collection of tasks for meta-learning and extreme multitask learning. We gather a large collection of classification tasks (single sentence or sentence pair) and align their formats for convenient multi-task learning. We derive a standard train/validation/test split for all datasets (when no test set is available, we use half of the validation set as the test set), and map all label and text keys to common names.

A MetaEval dataset has train/validation/test splits, a label key, and text keys that are either sentence or [sentence1, sentence2].
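The test-split fallback described above can be sketched in plain Python. This is only an illustration, not MetaEval's actual implementation: the function name derive_test_split is hypothetical, and the choice of which half of the validation set becomes the test set is an assumption.

```python
def derive_test_split(splits):
    """Return a copy of `splits` that always has train/validation/test keys.

    `splits` maps split names to lists of examples.
    Assumption: when no test split exists, the second half of the
    validation set becomes the test set.
    """
    splits = dict(splits)  # shallow copy so the caller's mapping is untouched
    if "test" not in splits:
        validation = splits["validation"]
        half = len(validation) // 2
        splits["validation"] = validation[:half]
        splits["test"] = validation[half:]
    return splits


# Toy data with no test split (hypothetical examples).
raw = {
    "train": [{"sentence": "a", "label": 0}, {"sentence": "b", "label": 1}],
    "validation": [{"sentence": str(i), "label": i % 2} for i in range(4)],
}
aligned = derive_test_split(raw)
print({name: len(rows) for name, rows in aligned.items()})
# prints {'train': 2, 'validation': 2, 'test': 2}
```

Datasets that already ship a test split pass through unchanged.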

Install with pip install metaeval, or directly from the GitHub repository with pip install git+https://github.com/sileod/metaeval.git

Listing available English tasks

from metaeval import tasks_mapping, load_and_align
tasks_mapping.head(3)

returns

             task_tuple                          text_fields             label_fields  split_keys                       num_labels
health_fact  ['health_fact', 'default']          ['claim', 'main_text']  ['label']     ['test', 'train', 'validation']  4
commonsense  ['metaeval/ethics', 'commonsense']  ['text']                ['label']     ['test', 'train', 'validation']  2
deontology   ['metaeval/ethics', 'deontology']   ['text']                ['label']     ['test', 'train', 'validation']  2

Loading a dataset in the unified format

dataset = load_and_align('health_fact')

returns a Hugging Face DatasetDict in the unified format

DatasetDict({
    train: Dataset({
        features: ['sentence1', 'sentence2', 'label'],
        num_rows: 9832
    })
    test: Dataset({
        features: ['sentence1', 'sentence2', 'label'],
        num_rows: 1235
    })
    validation: Dataset({
        features: ['sentence1', 'sentence2', 'label'],
        num_rows: 1225
    })
})

All datasets can then be used interchangeably with standard code that can handle sentence or sentence1,sentence2 text keys.
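As a minimal sketch of such standard code, the helpers below pick the right text keys from an example's fields. The names text_keys and extract_texts are hypothetical and not part of the metaeval package; the examples are plain dictionaries standing in for rows of an aligned dataset.

```python
def text_keys(column_names):
    """Return the text keys used by a unified-format dataset."""
    if "sentence1" in column_names and "sentence2" in column_names:
        return ["sentence1", "sentence2"]
    if "sentence" in column_names:
        return ["sentence"]
    raise ValueError(f"No unified text keys found in {list(column_names)}")


def extract_texts(example):
    """Return the text fields of one example as a tuple of length 1 or 2."""
    return tuple(example[key] for key in text_keys(example.keys()))


# Hypothetical rows: one sentence-pair task and one single-sentence task.
pair_example = {"sentence1": "a claim", "sentence2": "supporting text", "label": 1}
single_example = {"sentence": "some text", "label": 0}

print(extract_texts(pair_example))    # ('a claim', 'supporting text')
print(extract_texts(single_example))  # ('some text',)
```

Because the result is a tuple of one or two strings, it can be unpacked into any function that takes a text and an optional second text, so the same training loop can serve both task types.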

Citation

@inproceedings{sileo2021analysis,
      title={Analysis and Prediction of NLP Models Via Task Embeddings},
      author={Damien Sileo and Marie-Francine Moens},
      booktitle={Proceedings of the 12th Language Resources and Evaluation Conference},
      year={2022},
}
