Korean Movie Review Emotion (KMRE) Dataset

We develop a large-scale emotion-labeled dataset called the Korean Movie Review Emotion (KMRE) dataset. The method for constructing this dataset is described in detail in our paper, which was accepted at LREC 2020.

@inproceedings{lee2020korean,
  title={Korean-Specific Emotion Annotation Procedure Using N-Gram-Based Distant Supervision and Korean-Specific-Feature-Based Distant Supervision},
  author={Lee, Young-Jun and Lim, Chae-Gyun and Choi, Ho-Jin},
  booktitle={Proceedings of The 12th Language Resources and Evaluation Conference},
  pages={1603--1610},
  year={2020}
}

Overview

We construct the KMRE dataset, annotated with six types of emotions, by applying our proposed procedure to the Korean-language Naver Sentiment Movie Corpus (NSMC). Details of the NSMC are described here. The procedure combines n-gram-based distant supervision with Korean-specific-feature-based distant supervision.

Statistics

The statistics of the KMRE dataset are as follows:

Split         # of sentences   Anger   Disgust   Fear   Happiness   Sadness   Surprise
Training             119,995   29.88      9.84   8.42       20.36     23.93       7.57
Development           29,999   29.86       9.6   8.44       20.35     24.17       7.58
Testing               49,997   29.82      9.93   8.32       20.35      24.0       7.58

The table shows the number of sentences in each split and the emotion distribution (%) of the KMRE dataset.
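Because the table reports percentages rather than counts, a small script can turn them into approximate per-emotion sentence counts and show how the corpus is divided across splits. This is a minimal sketch: the numbers are copied from the table above, the helper names `approx_counts` and `split_shares` are hypothetical, and the counts are estimates only, since the published percentages are rounded.

```python
# Split sizes and emotion percentages copied from the statistics table.
EMOTIONS = ["Anger", "Disgust", "Fear", "Happiness", "Sadness", "Surprise"]
SPLITS = {
    "Training": (119_995, [29.88, 9.84, 8.42, 20.36, 23.93, 7.57]),
    "Development": (29_999, [29.86, 9.6, 8.44, 20.35, 24.17, 7.58]),
    "Testing": (49_997, [29.82, 9.93, 8.32, 20.35, 24.0, 7.58]),
}


def approx_counts(n_sentences, percentages):
    """Estimate per-emotion sentence counts from rounded percentages."""
    return {
        emotion: round(n_sentences * pct / 100)
        for emotion, pct in zip(EMOTIONS, percentages)
    }


def split_shares(splits):
    """Return each split's share (%) of all sentences in the dataset."""
    total = sum(n for n, _ in splits.values())
    return {name: 100 * n / total for name, (n, _) in splits.items()}


if __name__ == "__main__":
    shares = split_shares(SPLITS)
    for name, (n, pcts) in SPLITS.items():
        print(f"{name}: {n} sentences ({shares[name]:.1f}% of the corpus)")
        for emotion, count in approx_counts(n, pcts).items():
            print(f"  {emotion}: ~{count}")
```

The three splits come out at roughly 60%, 15%, and 25% of the corpus, and the emotion percentages in each row sum to 100.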

Features

  • The KMRE dataset has six types of emotions and no neutral class, because there were no neutral reviews in the NSMC

    • Anger, Disgust, Fear, Happiness, Sadness, Surprise
  • Each file is serialized with Python's pickle module and can be loaded as follows:

import pickle

# Load the training split; pickle files must be opened in binary mode.
with open('data/kmre_train', 'rb') as f:
    kmre_train_data = pickle.load(f)
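The layout of the unpickled object is not documented here, so a cautious first step is to check what kind of container comes back before writing any processing code. The sketch below assumes only the `data/kmre_train` path from the snippet above; `load_split` and `describe` are hypothetical helper names, not part of the dataset release. Note that `pickle.load` can execute arbitrary code, so only unpickle files obtained from a source you trust.

```python
import pickle
from pathlib import Path


def load_split(path):
    """Load one pickled KMRE split from disk."""
    with Path(path).open("rb") as f:
        return pickle.load(f)


def describe(obj):
    """Return the object's type name and its length (None if it has no length)."""
    size = len(obj) if hasattr(obj, "__len__") else None
    return type(obj).__name__, size


if __name__ == "__main__":
    path = Path("data/kmre_train")  # path taken from the loading snippet above
    if path.exists():
        kmre_train_data = load_split(path)
        print(describe(kmre_train_data))
    else:
        print(f"{path} not found; download the dataset files first")
```

If the reported length matches the number of training sentences in the statistics table, the object most likely holds one entry per sentence.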

Contact

If you have any questions or comments, please email one of the contacts listed below.
