Crowd-Kit: Computational Quality Control for Crowdsourcing

Crowd-Kit is a powerful Python library that implements commonly-used aggregation methods for crowdsourced annotation and offers the relevant metrics and datasets. We strive to implement functionality that simplifies working with crowdsourced data.

Currently, Crowd-Kit contains:

implementations of commonly-used aggregation methods for categorical, pairwise, textual, and segmentation responses;
metrics of uncertainty, consistency, and agreement with aggregate;
loaders for popular crowdsourced datasets.

Also, the learning subpackage contains PyTorch implementations of deep learning from crowds methods and advanced aggregation algorithms.

Installing

To install Crowd-Kit, run the following command: pip install crowd-kit. If you also want to use the learning subpackage, type pip install crowd-kit[learning].

If you are interested in contributing to Crowd-Kit, use Pipenv to install the library with its dependencies: pipenv install --dev. We use pytest for testing.

Getting Started

This example shows how to use Crowd-Kit for categorical aggregation using the classical Dawid-Skene algorithm.

First, let us do all the necessary imports.

from crowdkit.aggregation import DawidSkene
from crowdkit.datasets import load_dataset

import pandas as pd

Then, you need to read your annotations into Pandas DataFrame with columns task, worker, label. Alternatively, you can download an example dataset:

df = pd.read_csv('results.csv')  # should contain columns: task, worker, label
# df, ground_truth = load_dataset('relevance-2')  # or download an example dataset

Then, you can aggregate the workers' responses using the fit_predict method from the scikit-learn library:

aggregated_labels = DawidSkene(n_iter=100).fit_predict(df)

More usage examples

Implemented Aggregation Methods

Below is the list of currently implemented methods, including the already available (✅) and in progress (🟡).

Categorical Responses

Method	Status
Majority Vote	✅
One-coin Dawid-Skene	✅
Dawid-Skene	✅
Gold Majority Vote	✅
M-MSR	✅
Wawa	✅
Zero-Based Skill	✅
GLAD	✅
KOS	✅
MACE	✅
BCC	🟡

Multi-Label Responses

Method	Status
Binary Relevance	✅

Textual Responses

Method	Status
RASA	✅
HRRASA	✅
ROVER	✅

Image Segmentation

Method	Status
Segmentation MV	✅
Segmentation RASA	✅
Segmentation EM	✅

Pairwise Comparisons

Method	Status
Bradley-Terry	✅
Noisy Bradley-Terry	✅

Learning from Crowds

Method	Status
CrowdLayer	✅
CoNAL	✅

Citation

Ustalov D., Pavlichenko N., Tseitlin B. Learning from Crowds with Crowd-Kit. 2023. arXiv: 2109.08584 [cs.HC].

@misc{CrowdKit,
  author    = {Ustalov, Dmitry and Pavlichenko, Nikita and Tseitlin, Boris},
  title     = {{Learning from Crowds with Crowd-Kit}},
  year      = {2023},
  publisher = {arXiv},
  eprint    = {2109.08584},
  eprinttype = {arxiv},
  eprintclass = {cs.HC},
  url       = {https://arxiv.org/abs/2109.08584},
  language  = {english},
}

Questions and Bug Reports

To report a bug, post an issue on the Toloka/bugreport page.
To find answers to common questions or start a new discussion, join our English-speaking Slack community.

artinmajdi / crowd-kit Goto Github PK

crowd-kit's Introduction

Crowd-Kit: Computational Quality Control for Crowdsourcing

Installing

Getting Started

Implemented Aggregation Methods

Categorical Responses

Multi-Label Responses

Textual Responses

Image Segmentation

Pairwise Comparisons

Learning from Crowds

Citation

Questions and Bug Reports

License

crowd-kit's People

Contributors

Stargazers

Watchers

Recommend Projects

Recommend Topics

Recommend Org