Dataset,Papers Used,PHI,Public,Category,Link (if public),Domain
self-collected,55,,,,,
MIMIC,18,1,1,EHR,,EHR
simulated data,9,,,,,
ADNI,4,1,1,Primarily Imaging,,
SEER Cancer data,2,1,1,,,
e-ICU,2,1,1,EHR,https://eicu-crd.mit.edu/about/eicu/,
i2b22010 challenge,2,0,1,,,
LIDC,2,0,1,,,
UCI EEG,2,0,1,Waveform Data,,
Digital Database for Screening of Mammography (DDSM),2,0,1,Primarily Imaging,,
Meta-analysis Global Group in Chronic heart failure database,1,1,0,,,
United Network for Organ Sharing (UNOS) database,1,1,0,,,
MUSIC,1,1,0,,http://musicurology.com/data-management/,
Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE),1,1,0,,,
American College of Surgeons’ NSQIP,1,1,0,,https://www.facs.org/quality-programs/about/cqi/internetresources/databases,
Precision Medicine Vocabulary (PMV),1,0,0,Primarily Notes,,
Cerner HealthFacts database,1,1,0,,,
GoViral,1,1,0,,,
FluWatch,1,1,0,,,
Hong Kong,1,1,0,,,
Hutterite,1,1,0,,,
MERLIN-TIMI,1,1,0,,,
Oakden Ryder Radiological Hip Fracture Dataset,1,1,0,, https://arxiv.org/pdf/1711.06504.pdf,
Clue app,1,1,0,Wearables/App Data,,
Flatiron Health database,1,1,0,,,
Project CLEAR,1,0,0,,,
Healthy Aging Picture Description(HAPD),1,0,0,,,
Healthy Aging Fluency & Paragraph tasks(HAFP),1,0,0,,,
Famous People,1,0,0,,,
The Photo Tourism dataset,1,0,1,Non-Medical,,Non-Medical
WeiboNER,1,0,1,Non-Medical,,Non-Medical
SighanNER,1,0,1,Non-Medical,,Non-Medical
TwitterNER,1,0,1,Non-Medical,,Non-Medical
CoNLL 2003 English NER,1,0,1,Non-Medical,,Non-Medical
UK Biobank,1,1,1,,,
2017 Physionet challenge,1,0,1,,,
acute myeloid leukemia (AML),1,0,1,,,
bone marrow mononuclear cells (BMMC),1,0,1,,,
UK Cystic Fibrosis Trust,1,1,1,,,
MIT-BIH Arrhythmia Dataset,1,0,1,,,
PhysioNet Challenge 2015 Dataset,1,0,1,,,
HyperGEN,1,1,1,,https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000379.v1.p1,
French national health insurance database (SNIIRAM),1,1,1,,,
NCBI-diseas,1,0,1,,,
BC2GM,1,0,1,,,
JNLPBA,1,0,1,,,
BioCreative V Chemical Disease Relation Extraction (BC5CDR) task,1,0,1,,,
Philadelphia Neurodevelopmental Cohort (PNC) study,1,1,1,,https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000607.v3.p2,
PRAEGNANT study network,1,1,1,,http://www.praegnant.org/,
2015 TREC CDS track dataset,1,0,1,,,
CLPsych shared task data,1,0,1,,,
Synthetic Derivative (SD),1,1,1,,https://www.vumc.org/dbmi/synthetic-derivative,
NIH Human Microbiome Project,1,0,1,,https://www.hmpdacc.org/HMASM/,
Heart Sound & Murmur Library,1,0,1,,,
Classifying Heart Sounds Challenge,1,0,1,,,
Physionet Challenge 2016 dataset,1,0,1,,,
"incident reports from Imperial College Healthcare NHS Trust, London",1,0,1,Other (Medical),https://report.nrls.nhs.uk/nrlsreporting/,
DAIC-WOZ,1,1,1,,,
ISBI-ISIC 2017 melanoma classification challenge,1,1,1,,,
TARGET-NBL,1,1,1,,https://portal.gdc.cancer.gov/projects/TARGET-NBL,
METABRIC microarray,1,0,1,,,
TCGA microarray,1,0,1,,,
TCGA RNA-seq,1,0,1,,,
Z-Alizadeh Sani,1,1,1,,,
Breast Cancer,1,1,1,,,
SPECTF Heart,1,1,1,,,
Arrhythmia,1,1,1,,,
Heart Disease,1,1,1,,,
LFPW,1,0,1,Non-Medical,,Non-Medical
Helen,1,0,1,Non-Medical,,Non-Medical
CK+,1,0,1,Non-Medical,,Non-Medical
iBUG,1,0,1,Non-Medical,,Non-Medical
AFW,1,0,1,Non-Medical,,Non-Medical
UNBC-McMaster Shoulder Pain Expression Archive,1,0,1,,,
OASIS Brains Datasets,1,1,1,,,
UCI data sets - Bikeshare,1,0,1,Non-Medical,,Non-Medical
UCI data sets (Magic),1,0,1,Non-Medical,,Non-Medical
a Loan risk scoring data set from an online lending company,1,1,1,Non-Medical,,Non-Medical
the 2018 FICO Explainable ML Challenge’s credit data set,1,1,1,Non-Medical,,Non-Medical
pneumonia data set,1,0,1,,,
GWAS summary association statistics,1,0,1,,ftp://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/,
Dermnet SkinDisease Atlas,1,0,1,,http://www.dermnet.com/,
NLST,1,0,1,,https://wiki.cancerimagingarchive.net/display/NLST/National+Lung+Screening+Trial,
MNIST,1,0,1,Non-Medical,,Non-Medical
Camelyon 2016 challenge dataset,1,0,1,,,
the PhysioNet polysomnography dataset,1,0,1,,,
CHB-MIT Scalp EEG dataset,1,0,1,,,
PTB Diagnostic ECG Database,1,0,1,,https://physionet.org/physiobank/database/ptbdb/,
Framingham Heart Study (FHS),1,1,1,,https://www.framinghamheartstudy.org/fhs-for-researchers/research-application-overview/,
MEDLINE,1,0,1,,,
Predicting Language Outcome RecoveryAfter Stroke (PLORAS) database,1,1,1,,https://www.ucl.ac.uk/ploras/,
Dementia Bank,1,0,1,,,
BindingDB,1,0,1,,,
PanCancer Analysis of Whole Genomes dataset (PCAWG),1,0,1,,,
PsychENCODE dataset,1,0,1,,,
"AustralianImaging, Biomarker and Lifestyle Flagship Study of Aging (AIBL)",1,1,1,,,
K562 Cell Image Dataset,1,0,1,Primarily Imaging,,
MIMIC-CXR,1,1,1,Primarily Imaging,,
-,1,,,,,