Giter VIP home page Giter VIP logo

liveqa_medicaltask_trec2017's Introduction

--------------------------------------------------
TREC-2017 LiveQA: Medical Question Answering Task
--------------------------------------------------

The LiveQA'17 medical task focuses on consumer health question answering. We use consumer health questions received by the U.S. National Library of Medicine (NLM). 
We constructed medical question-answer pairs for training and testing, with additional annotations that can be used to develop question analysis and question answering systems.  

Please refer to our overview paper for more information about the constructed datasets and the LiveQA Track: 

	- Asma Ben Abacha, Eugene Agichtein, Yuval Pinter & Dina Demner-Fushman. Overview of the Medical Question Answering Task at TREC 2017 LiveQA. TREC, Gaithersburg, MD, 2017.  

A summary of the constructed medical datasets is below.   

If you use these datasets, please cite our paper: 

	@inproceedings{LiveMedQA2017,
	  author    = {Asma {Ben Abacha} and Eugene Agichtein and Yuval Pinter and Dina Demner{-}Fushman}, 
	  title     = {Overview of the Medical Question Answering Task at TREC 2017 LiveQA}, 
	  booktitle = {TREC 2017},
	  year      = {2017}
	} 

======================
Medical Training Data
====================== 

We provide 634 question-answer pairs for training: 

	1) TREC-2017-LiveQA-Medical-Train-1.xml => 388 question-answer pairs corresponding to 200 NLM questions. 
	Each question is divided into one or more subquestion(s). Each subquestion has one or more answer(s). 
	These question-answer pairs were constructed automatically and validated manually.

	2) TREC-2017-LiveQA-Medical-Train-2.xml => 246 question-answer pairs corresponding to 246 NLM questions.
	Answers were retrieved manually by librarians. 

The datasets are not exhaustive with regards to subquestions, i.e., some subquestions might not be annotated. We also provide additional annotations for both (i) the Focus and (ii) the Question Type used to define each subquestion. 23 question types were considered (e.g. Treatment, Cause, Diagnosis, Indication, Susceptibility, Dosage) related to four focus categories: Disease, Drug, Treatment and Exam. 

==================
Medical Test Data
==================

Test questions cover 26 question types associated with five focus categories. Each question includes one or more subquestion(s) and at least one focus and one question type. Reference answers were selected from trusted resources and validated by medical experts. We provide at least one reference answer for each test question, its URL and relevant comments. Question paraphrases were created by NIST assessors and used with the reference answers to judge the participants' answers.  

Additional annotations for each test question are provided, including: 

  1) List of Foci and their categories:   
	Example: <FOCUS fid="F1" fcategory="Problem">Beckwith-Wieddeman Syndrome</FOCUS>
  2) List of relevant keywords and their categories: 
	Example: <KEYWORD kid="K1" kcategory="Anatomy">ear</KEYWORD> 
  3) List of question types and associated focus/foci and keyword(s):
	Example: <TYPE tid="T1" hasFocus="F1,F2">INTERACTION</TYPE>  

---------------Simplified annotation of a test question-----------
<NLM-QUESTION qid=""> 
 <Original-Question qfile="">
	<SUBJECT></SUBJECT>
	<MESSAGE></MESSAGE>
 </Original-Question>
 <NIST-PARAPHRASE></NIST-PARAPHRASE>
 <ANNOTATIONS>
	<FOCUS fid="F1" fcategory=""></FOCUS>
	<TYPE tid="T1" hasFocus="F1" hasKeyword="K1"></TYPE>
	<KEYWORD kid="K1" kcategory=""></KEYWORD>
 </ANNOTATIONS>	 
 <ReferenceAnswers> 
	<ReferenceAnswer aid="">
		<ANSWER></ANSWER>
		<AnswerURL></AnswerURL>
		<COMMENT></COMMENT>
	</ReferenceAnswer>	
 </ReferenceAnswers>   
</NLM-QUESTION> 
---------------------------------------------------------------------

Contact Information
-------------------
- Asma Ben Abacha: [email protected]
- Dina Demner-Fushman: [email protected]

liveqa_medicaltask_trec2017's People

Contributors

abachaa avatar

Watchers

James Cloos avatar prince avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.