Giter VIP home page Giter VIP logo

sugar-conversational-dataset's Introduction

SUGAR: A Textual Dataset for Situated Proactive Response Selection

This repository contains data for the paper "Otani et al. 2023. A Textual Dataset for Situated Proactive Response Selection."

SUGAR (a dataset of SitUatated, Goal-Aware, and proactive Responses) contains 1,760 examples of single-turn English conversations in help-seeking scenarios. Each example includes a user request anchored by an implicit goal, a reference response, two extra response candidates, and 12 sentences of situational information. All response candidates are annotated with three-point appropriateness ratings to develop and evaluate response selection systems. The use of SUGAR is not limited to the development/evaluation of response selection systems. For instance, you can train situated response generation systems using SUGAR (See also (Otani et al., NLP4ConvAI 2023)).

Key Characteristics

introduction

  1. Situational information plays an important role in conversations as we illustrate in the figure. The example shows two response candidates to a user utterance "Can you open the window for me?" Although both candidates here sound helpful, their appropriateness varies depending on context (the "Situation" boxes). We represent situational information of six semantic categories (Location, Possession, Time, Date, Behavior, and Environment) in free-form English texts.

  2. While often being unspoken, underlying goals ("to cool off" and "to air the room" in the example) provide important semantic connections among context and utterances on many occasions particularly when language is indirect. We used goal information as a stimulus for soliciting naturalistic and proactive responses from human annotators in data collection.

Citation

Naoki Otani, Jun Araki, HyeongSik Kim, and Eduard Hovy. 2023. A Textual Dataset for Situated Proactive Response Selection. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3856โ€“3874, Toronto, Canada. Association for Computational Linguistics.

@inproceedings{otani-etal-2023-textual,
    title = "A Textual Dataset for Situated Proactive Response Selection",
    author = "Otani, Naoki  and
      Araki, Jun  and
      Kim, HyeongSik  and
      Hovy, Eduard",
    booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = jul,
    year = "2023",
    address = "Toronto, Canada",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.acl-long.214",
    doi = "10.18653/v1/2023.acl-long.214",
    pages = "3856--3874",
}

sugar-conversational-dataset's People

Contributors

notani avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.