Existing Medical QA Datasets

Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems

*** Two Main Tasks: Medical Question Answering (QA) & Visual Question Answering (VQA) ***

-- This list is not exhaustive. You can email me links and references to relevant medical QA datasets and systems, and I will update the list as soon as possible. Also, several challenge-related datasets are no longer publicly available; you can contact the organizers to request the data.

I) Medical QA Datasets:

  1. Corpus for Evidence Based Medicine Summarization (Mollá, 2010): https://sourceforge.net/projects/ebmsumcorpus
  2. CLEF QA4MRE Alzheimer’s task (Peñas et al., 2012).
  3. TREC LiveQA-Med (Ben Abacha et al., 2017): https://github.com/abachaa/LiveQA_MedicalTask_TREC2017
  4. MEDIQA @ ACL-BioNLP (Ben Abacha et al., 2019): https://github.com/abachaa/MEDIQA2019
  5. MedQuAD Collection (Ben Abacha and Demner-Fushman, 2019): https://github.com/abachaa/MedQuAD
  6. Medication QA Collection (Ben Abacha et al., 2019): https://github.com/abachaa/Medication_QA_MedInfo2019
  7. BioASQ datasets (2012-2020): http://bioasq.org/participate/challenges

II) Medical VQA Datasets (Radiology):

  1. VQA-RAD (Lau et al., 2018): https://osf.io/89kps
  2. VQA-Med 2018 (Hasan et al., 2018): https://www.aicrowd.com/challenges/imageclef-2018-vqa-med
  3. VQA-Med 2019 (Ben Abacha et al., 2019): https://github.com/abachaa/VQA-Med-2019
  4. VQA-Med 2020 (Ben Abacha et al., 2020): https://www.aicrowd.com/challenges/imageclef-2020-vqa-med

III) Online QA Systems:

-- I searched and tested several systems (e.g. AskHERMES, MiPACQ, SimQ). This list includes only the systems that are still maintained.

  1. CHiQA (Consumer Health Question Answering System): chiqa.nlm.nih.gov
  2. Neural Covidex: covidex.ai

IV) Medical Datasets Relevant to QA:

  1. i2b2 shared tasks (2006-2016): www.i2b2.org/NLP
  2. n2c2 NLP clinical challenges (2018-2019): https://n2c2.dbmi.hms.harvard.edu and https://dbmi.hms.harvard.edu/programs/national-nlp-clinical-challenges-n2c2
  3. TREC Medical Records Track (2012-2013).
  4. TREC Clinical Decision Support Track (2014-2016): http://www.trec-cds.org
  5. TREC Precision Medicine Track (2017-2019): http://www.trec-cds.org
  6. Consumer Health Question Summarization: https://github.com/abachaa/MeQSum
  7. CLEF eHealth (2013-2020): https://clefehealth.imag.fr
  8. COVID dataset (CORD-19): https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge

V) Medical Datasets Relevant to VQA:

  1. ImageCLEF Medical Automatic Image Annotation (2008-2009): https://www.imageclef.org/2008/medaat and https://www.imageclef.org/2009/medanno
  2. ImageCLEF Medical User-oriented Image Retrieval Task (2011): https://www.imageclef.org/2011/medicaluseroriented
  3. ImageCLEF Medical Retrieval Task (2008-2012): https://www.imageclef.org/2012/medical
  4. ImageCLEF AMIA: Medical task (2013): https://www.imageclef.org/2013/medical
  5. ImageCLEFmed: Medical classification (2015): https://www.imageclef.org/2015/medical
  6. ImageCLEF Medical Clustering (2015): https://www.imageclef.org/2015/clustering
  7. ImageCLEFmed (2016): https://www.imageclef.org/2016/medical
  8. ImageCLEFcaption (2017-2020): https://www.imageclef.org/2017/caption
  9. ImageCLEFmedical tasks (2019-2020): https://www.imageclef.org/2019/medical and https://www.imageclef.org/2020/medical

Last update on April 23, 2020.


Contact:

asma.benabacha at nih.gov
