medical-data's People
Forkers
little1tow ml-lab kaiping wellwang clear-datacenter wanghaisheng zuijiawoniu benjamesbabala dachylong zhaochaocs algpower countlessmelons jamonglab-osi leezqcst ericshijian amoliu zhang11wu4 limingdeng younghai donghyunlee sunarker daiyl ieee820 madms kormilitzin lyk125 sigmaquan chop2 zhiyu-chen karthiknrao saurabh23 qureai pengzhao001 aridian1842 7ucky jjsong george-wu509 chenmingqiang heqing-psychology jdc08161063 fireae jaedukseo yancz1989 kevinmtian haroldss yejunbin coocoky tozammel aracthon euler1983 sandeep-krishnamurthy wen036 daobinhuang somebodyus rajat1994 bin913 lechenhao jiandanjinxin francis05 knishimura785 latim jiamery skrish13 jidiazhernandez montecarlo1 pi-null-mezon eizoflexscan ratulghosh htwmedia waldren oppa3109 ykwon0407 skyer9 saswat likebullet86 hariom-yadaw sfikas robi56 alexeyantonov directorscut82 hkfoxok kimkanna12 aojjang leehoy bchalamayya kitisak hbcbh1999 esskay0000 yj-yu ajaytalati guokr1991 caowencai poemlin tmhssk1 domenicosolazzo cuhkeehwang xxwudi508 burgersmoke liviust saurabhmathur96
medical-data's Issues
image registration datasets
Do you know the Continuous Registration Challenge and Histology datasets?
Link Dead - Lung Image Database Consortium (LIDC)
The link for this dataset is dead.
Another UCI dataset
Under UCI datasets, I notice that Breast Cancer and Lymphography datasets are included, but the third one in the group is omitted for some reason. Please add the Primary Tumor data set: https://archive.ics.uci.edu/ml/datasets/primary+tumor
Some more links
Maybe these will be useful:
data type of embedding file for Clinical Concept Embeddings Learned from Massive Sources of Medical Data
Hi, I downloaded the pre-trained embedding file. The file extension says it's a CSV, but the file is actually binary. I tried to read it into a Python dictionary, but I get an error.
I have also tried loading the embeddings with gensim's KeyedVectors, but I get an error:
from gensim.models import KeyedVectors
# changed the name of the file to emb.csv
word_vectors = KeyedVectors.load_word2vec_format('__MACOSX/emb.csv', binary=True)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf9 in position 37: invalid start byte
So could you tell me what tool is needed to open this file?
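One thing that may be worth checking, given the `__MACOSX/` prefix in the path above: when a zip archive is created on macOS, the `__MACOSX` folder holds AppleDouble metadata files rather than the data itself, so the file being loaded might not be the real embedding file. The sketch below inspects the first bytes of a file to guess what it is. The function name `sniff_file` and its return labels are hypothetical helpers made up for this illustration, not part of gensim or of the embedding release.

```python
import codecs
from pathlib import Path

# Magic numbers for a few common container formats.
APPLEDOUBLE_MAGIC = bytes.fromhex("00051607")  # macOS AppleDouble metadata file
GZIP_MAGIC = b"\x1f\x8b"
ZIP_MAGIC = b"PK\x03\x04"


def sniff_file(path):
    """Guess the kind of file at `path` from its location and first bytes.

    Hypothetical helper: returns one of 'macos-metadata', 'gzip', 'zip',
    'text' or 'binary'. It is a heuristic, not a full format detector.
    """
    path = Path(path)
    with path.open("rb") as fh:
        head = fh.read(512)

    # Files under __MACOSX are metadata written by the macOS archiver.
    if "__MACOSX" in path.parts or head.startswith(APPLEDOUBLE_MAGIC):
        return "macos-metadata"
    if head.startswith(GZIP_MAGIC):
        return "gzip"
    if head.startswith(ZIP_MAGIC):
        return "zip"

    # Incremental decoding tolerates a multi-byte character cut off at the
    # end of the 512-byte sample, but still rejects invalid bytes.
    decoder = codecs.getincrementaldecoder("utf-8")()
    try:
        decoder.decode(head, final=False)
    except UnicodeDecodeError:
        return "binary"
    return "text"
```

If this reports `macos-metadata`, the actual embedding file is probably elsewhere in the extracted archive. If it reports `text`, loading with `binary=False` in `load_word2vec_format` may be the better fit, assuming the file follows the word2vec text layout.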
EEG datasets?
Would love to see EEG data included in the mega-list.
Add melanoma database
Hi :)
There's a database of classified skin lesions, the "ISIC Archive".
It can be accessed here:
https://www.isic-archive.com/#!/topWithHeader/onlyHeaderTop/gallery
I've also written a script to download it:
https://github.com/GalAvineri/ISIC-Archive-Downloader
Can you please add these to your collection? :)
Add Stanford EchoNet ultrasound database
Imaging addition: Dartmouth Lung Cancer Histology Dataset
Dataset: https://bmirds.github.io/LungCancer/
This dataset comprises 143 hematoxylin and eosin (H&E)-stained formalin-fixed paraffin-embedded (FFPE) whole-slide images of lung adenocarcinoma. The dataset is de-identified and released with permission from Dartmouth-Hitchcock Health (D-HH) Institutional Review Board (IRB). All whole-slide images are labeled according to the consensus opinion of three pathologists at Dartmouth-Hitchcock Medical Center (DHMC) for the predominant pattern of lung adenocarcinoma. For more information about this dataset, please refer to “Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks”.
Cell counting dataset
Are there any cell counting datasets available?
Some Spelling, Grammatical Errors, and Invalid Dataset Link
The following mistakes in README.md need to be fixed:
- Spelling
- Grammatical
- Repeated words
The following dataset links need to be updated:
- DRIVE
- LIDC-IDRI
- Belarus TB dataset
- DDSM
- SCR Database
- VISCERAL
- Coding4Cancer
- Parkinson Speech Dataset with Multiple Types of Sound Recordings Data Set
- Parkinson's Disease Classification Data Set
The following paper link needs to be updated:
- Learning Low-Dimensional Representations of Medical Concept
The following dataset link should be added:
- SCMR Consensus, Sunnybrook Cardiac Data
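Stale links like the ones listed above could be caught with a small status check run against the README's URLs. The following is a minimal sketch using only the Python standard library. The function names `http_status` and `find_dead_links` are hypothetical, and treating any status of 400 or above as "dead" is an assumption rather than a rule the repository uses.

```python
import urllib.error
import urllib.request


def http_status(url, timeout=10):
    """Return the HTTP status code for `url`, or None if no response arrives."""
    request = urllib.request.Request(
        url, method="HEAD", headers={"User-Agent": "link-check-sketch"}
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404.
        return err.code
    except (urllib.error.URLError, OSError, ValueError):
        # DNS failure, refused connection, timeout, or malformed URL.
        return None


def find_dead_links(urls, fetch=http_status):
    """Return (url, status) pairs for links that fail or answer with >= 400.

    `fetch` is injectable so the logic can be exercised without network access.
    """
    dead = []
    for url in urls:
        status = fetch(url)
        if status is None or status >= 400:
            dead.append((url, status))
    return dead
```

Some servers reject `HEAD` requests, so a fallback to `GET` may be needed in practice. A status check also cannot detect a domain that still answers but now serves unrelated content, so those cases need a manual look.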
DocGraph
CMS has released additional info since the FOIA request by Fred. You can find the other years in an FAQ on the CMS site.
https://questions.cms.gov/faq.php?faqId=7977
If you accept pull requests, I may have some additional suggested edits/additions.
Add dataset for LOINC embeddings?
Hi,
I shared a set of Word2Vec embeddings of LOINC codes via GitHub. I trained them from lab orders at my organization (City of Hope National Medical Center). I wonder if you could include it in your list. The markdown with description and related links is below.
Thank you!
Lorenzo
Evaluation of Embeddings of Laboratory Test Codes for Patients at a Cancer Center
200 dimensional Word2Vec embeddings of 1098 laboratory test codes (LOINCs) trained from 8,280,238 lab orders for 79,081 patients at City of Hope National Medical Center
Paper: https://arxiv.org/abs/1907.09600
Data: https://github.com/elleros/DSHealth2019_loinc_embeddings
visceral.eu is now Best New Online UK Casinos
Add ECG data
There may be some wheat in the chaff https://datasetsearch.research.google.com/search?query=ECG&docid=sU%2FLa8M%2FP9nUeWMAAAAAAA%3D%3D
Data
TREC-MED
Dear Drew,
In 2011 there was TREC-MED: https://trec.nist.gov/pubs/trec21/papers/MED12OVERVIEW.pdf. Perhaps you might want to add that to your list too :-)
spine dataset
NLP Addition: Clinical NLP Challenges
Add the Clinical NLP Challenges datasets from Filannino and Uzuner's "Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks". While some are already included (e.g. TREC), others could be added (e.g. CLEF/eHealth). This may require a follow-up checklist for tracking, since a number of them are listed in Table 1.
DREAM challenges
It could be worth mentioning the DREAM Challenges on this page, either as a single entry for all of them or as one entry per challenge.
In particular:
The Digital Mammography DREAM Challenge.
June 29, 2016 - Feb. 20, 2017 (open)
With generous support from the Laura and John Arnold Foundation, this $1.2 million Challenge, one of two large-prize Coding4Cancer Challenges, seeks to improve the accuracy of breast cancer detection and reduce the current rate of patient callbacks.
Please include the link of...?
If possible, please include links to the following:
- Emergency Tele-Orthopedics X-ray Digital Library.
- IMT Segmentation
- Needle EMG MUAP Time Domain Features
- Interactive Tool of Clinical Concept Embeddings Learned from Massive Sources of Medical Data
- DocGraph
- PICO elements (data + website)
- EMOTASS
- Autism Sub-Challenge (Paper + Link)