Giter VIP home page Giter VIP logo

onestop-qa's Introduction

OneStopQA

OneStopQA is a multiple choice reading comprehension dataset annotated according to the STARC (Structured Annotations for Reading Comprehension) scheme. The reading materials are Guardian articles taken from the OneStopEnglish corpus. Each article comes in three difficlty levels, Elementary, Intermediate and Advanced. Each paragraph is annotated with three multiple choice reading comprehension questions. The reading comprehension questions can be answered based on any of the three paragraph levels.

STARC Annotation Structure

Answer Description Textual Span Annotation Tag
a Correct answer. Critical Span A
b Incorrect answer. A miscomprehension of the critical span. Critical Span A
c Incorrect answer. Refers to an additional span. Distractor Span D
d Incorrect answer. Has no textual support. - -

Example

Leading water scientists have issued one of the sternest warnings yet about global food supplies, saying that the world’s population may have to switch almost completely to a vegetarian diet by 2050 to avoid catastrophic shortages. <D>Humans derive about 20% of their protein from animal-based products now, but this may need to drop to just 5% to feed the extra two billion people expected to be alive by 2050,</D> according to research by some of the world’s leading water scientists. <A>“There will not be enough water available on current croplands to produce food for the expected nine-billion population in 2050 if we follow current trends and changes towards diets common in western nations,” the report by Malik Falkenmark and colleagues at the Stockholm International Water Institute (SIWI) said.</A>

Q: According to Malik Falkenmark’s report, what will happen if the world adopts the current diet trends of western nations?
a: There will not be sufficient water to grow enough food for everyone
b: By 2050, nine billion people will not have enough drinking water
c: By 2050, animal-based protein consumption will reduce from 20% to 5%
d: Obesity rates around the world will rise

Statistics

Aricles: 30
Paragraphs: 162
Questions: 486
Question-Paragraph Level pairs: 1,458

Citation

STARC: Structured Annotations for Reading Comprehension

@inproceedings{starc2020,  
      author    = {Berzak, Yevgeni and Malmaud, Jonathan and Levy, Roger},  
      title     = {STARC: Structured Annotations for Reading Comprehension},  
      booktitle = {ACL},  
      year      = {2020},  
      publisher = {Association for Computational Linguistics} 
      }

License

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

onestop-qa's People

Contributors

omershubi avatar berzak avatar saeub avatar

Stargazers

Samuel avatar Deborah N. Jakobi avatar David R. Reich avatar  avatar  avatar Sweta Agrawal avatar Riku Higashimura (Liku) avatar Asahi Ushio avatar Matthew Durward avatar  avatar zero alpha avatar Sian Gooding avatar Valentina Pyatkin avatar  avatar Fariz Ikhwantri avatar Simone Azeglio avatar qhd0081 avatar  avatar  avatar  avatar Xi Ye avatar  avatar 爱可可-爱生活 avatar Chenglei avatar

Watchers

Roger Levy avatar  avatar Chenglei avatar

onestop-qa's Issues

Missing characters in JSON

Several texts in the JSON file seem to be cut off by 1-2 characters in the beginning or end. For example, in article 14, "Love Hormone Helps Autistic Children Bond with Others", level Adv:

{"context": "nasal spray laced with ..."}
{"context": "utism is a developmental ..."}
{"context": "surprising finding, however, ..."}

In the TXT files, the text is complete.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.