Giter VIP home page Giter VIP logo

cider's Introduction

CIDER: Commonsense Inference for Dialogue Explanation and Reasoning

This repository contains the dataset and the pytorch implementations of the models from the paper CIDER: Commonsense Inference for Dialogue Explanation and Reasoning. CIDER has been accepted to appear at SIGDIAL 2021. The preprint version can be found here.

Alt text

Commonsense inference to understand and explain human language is a fundamental research problem in natural language processing. Explaining human conversations poses a great challenge as it requires contextual understanding, planning, inference, and several aspects of reasoning including causal, temporal, and commonsense reasoning. In this work, we introduce CIDER -- a manually curated dataset that contains dyadic dialogue explanations in the form of implicit and explicit knowledge triplets inferred using contextual commonsense inference. Extracting such rich explanations from conversations can be conducive to improving several downstream applications. The annotated triplets are categorized by the type of commonsense knowledge present (e.g., causal, conditional, temporal). We set up three different tasks conditioned on the annotated dataset: Dialogue-level Natural Language Inference, Span Extraction, and Multi-choice Span Selection. Baseline results obtained with transformer-based models reveal that the tasks are difficult, paving the way for promising future research.

Dataset

The original annotated dataset can be found in the json files in the data folder.

Data Format

Each instance in the JSON file is a dictionary of the following items:

Key Value
id Id of the dialogue in DailyDialog, DREAM, or MuTual.
utterances Utterances of the dialogue spoken by speaker A or B.
triplets List of annotated triplets.
head Head span of the triplet.
relation Relation of the triplet.
tail Tail span of the triplet.

Example format in JSON:

{
        "id": "daily-dialogue-1063",
        "utterances": "A: Gordon , you're ever so late .    B: Yes , I am sorry . I missed the bus .    A: But there's a bus every ten minutes , and you are over 1 hour late .    B: Well , I missed several buses .    A: How on earth can you miss several buses ?    B: I , ah ... , I got have late .    A: Oh , come on , Gordon , it's the afternoon now . Why were you late really ?    B: Well , I ... I lost my wallet , and I ...    A: Have you got it now ?    B: Yes , I found it again .    A: When ?    B: This morning . I mean ...    A: This tardiness causes embarrassment every time . ",
        "triplets": [
            {
                "head": "missed the bus",
                "relation": "Causes",
                "tail": "late"
            },
            {
                "head": "lost my wallet",
                "relation": "Causes",
                "tail": "late"
            },
            {
                "head": "bus",
                "relation": "HappensIn",
                "tail": "every ten minutes"
            },
            {
                "head": "missed several buses",
                "relation": "Causes",
                "tail": "over 1 hour late"
            },
            {
                "head": "lost my wallet",
                "relation": "Before",
                "tail": "found it again"
            },
            {
                "head": "tardiness",
                "relation": "causes",
                "tail": "embarrassment"
            }
        ]
    }

Citation

CIDER: Commonsense Inference for Dialogue Explanation and Reasoning. Deepanway Ghosal and Pengfei Hong and Siqi Shen and Navonil Majumder and Rada Mihalcea and Soujanya Poria. SIGDIAL 2021.

cider's People

Contributors

deepanwayx avatar emrys-hong avatar soujanyaporia avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.