Dataset,Papers Used,PHI,Public,Category,Link (if public),Domain self-collected,55,,,,, MIMIC,18,1,1,EHR,,EHR simulated data,9,,,,, ADNI,4,1,1,Primarily Imaging,, SEER Cancer data,2,1,1,,, e-ICU,2,1,1,EHR,https://eicu-crd.mit.edu/about/eicu/, i2b22010 challenge,2,0,1,,, LIDC,2,0,1,,, UCI EEG,2,0,1,Waveform Data,, Digital Database for Screening of Mammography (DDSM),2,0,1,Primarily Imaging,, Meta-analysis Global Group in Chronic heart failure database,1,1,0,,, United Network for Organ Sharing (UNOS) database,1,1,0,,, MUSIC,1,1,0,,http://musicurology.com/data-management/, Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE),1,1,0,,, American College of Surgeons’ NSQIP,1,1,0,,https://www.facs.org/quality-programs/about/cqi/internetresources/databases, Precision Medicine Vocabulary (PMV),1,0,0,Primarily Notes,, Cerner HealthFacts database,1,1,0,,, GoViral,1,1,0,,, FluWatch,1,1,0,,, Hong Kong,1,1,0,,, Hutterite,1,1,0,,, MERLIN-TIMI,1,1,0,,, Oakden Ryder Radiological Hip Fracture Dataset,1,1,0,, https://arxiv.org/pdf/1711.06504.pdf, Clue app,1,1,0,Wearables/App Data,, Flatiron Health database,1,1,0,,, Project CLEAR,1,0,0,,, Healthy Aging Picture Description(HAPD),1,0,0,,, Healthy Aging Fluency & Paragraph tasks(HAFP),1,0,0,,, Famous People,1,0,0,,, The Photo Tourism dataset,1,0,1,Non-Medical,,Non-Medical WeiboNER,1,0,1,Non-Medical,,Non-Medical SighanNER,1,0,1,Non-Medical,,Non-Medical TwitterNER,1,0,1,Non-Medical,,Non-Medical CoNLL 2003 English NER,1,0,1,Non-Medical,,Non-Medical UK Biobank,1,1,1,,, 2017 Physionet challenge,1,0,1,,, acute myeloid leukemia (AML),1,0,1,,, bone marrow mononuclear cells (BMMC),1,0,1,,, UK Cystic Fibrosis Trust,1,1,1,,, MIT-BIH Arrhythmia Dataset,1,0,1,,, PhysioNet Challenge 2015 Dataset,1,0,1,,, HyperGEN,1,1,1,,https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000379.v1.p1, French national health insurance database (SNIIRAM),1,1,1,,, NCBI-diseas,1,0,1,,, BC2GM,1,0,1,,, JNLPBA,1,0,1,,, BioCreative V Chemical Disease Relation Extraction (BC5CDR) task,1,0,1,,, Philadelphia Neurodevelopmental Cohort (PNC) study,1,1,1,,https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000607.v3.p2, PRAEGNANT study network,1,1,1,,http://www.praegnant.org/, 2015 TREC CDS track dataset,1,0,1,,, CLPsych shared task data,1,0,1,,, Synthetic Derivative (SD),1,1,1,,https://www.vumc.org/dbmi/synthetic-derivative, NIH Human Microbiome Project,1,0,1,,https://www.hmpdacc.org/HMASM/, Heart Sound & Murmur Library,1,0,1,,, Classifying Heart Sounds Challenge,1,0,1,,, Physionet Challenge 2016 dataset,1,0,1,,, "incident reports from Imperial College Healthcare NHS Trust, London",1,0,1,Other (Medical),https://report.nrls.nhs.uk/nrlsreporting/, DAIC-WOZ,1,1,1,,, ISBI-ISIC 2017 melanoma classification challenge,1,1,1,,, TARGET-NBL,1,1,1,,https://portal.gdc.cancer.gov/projects/TARGET-NBL, METABRIC microarray,1,0,1,,, TCGA microarray,1,0,1,,, TCGA RNA-seq,1,0,1,,, Z-Alizadeh Sani,1,1,1,,, Breast Cancer,1,1,1,,, SPECTF Heart,1,1,1,,, Arrhythmia,1,1,1,,, Heart Disease,1,1,1,,, LFPW,1,0,1,Non-Medical,,Non-Medical Helen,1,0,1,Non-Medical,,Non-Medical CK+,1,0,1,Non-Medical,,Non-Medical iBUG,1,0,1,Non-Medical,,Non-Medical AFW,1,0,1,Non-Medical,,Non-Medical UNBC-McMaster Shoulder Pain Expression Archive,1,0,1,,, OASIS Brains Datasets,1,1,1,,, UCI data sets - Bikeshare,1,0,1,Non-Medical,,Non-Medical UCI data sets (Magic),1,0,1,Non-Medical,,Non-Medical a Loan risk scoring data set from an online lending company,1,1,1,Non-Medical,,Non-Medical the 2018 FICO Explainable ML Challenge’s credit data set,1,1,1,Non-Medical,,Non-Medical pneumonia data set,1,0,1,,, GWAS summary association statistics,1,0,1,,ftp://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/, Dermnet SkinDisease Atlas,1,0,1,,http://www.dermnet.com/, NLST,1,0,1,,https://wiki.cancerimagingarchive.net/display/NLST/National+Lung+Screening+Trial, MNIST,1,0,1,Non-Medical,,Non-Medical Camelyon 2016 challenge dataset,1,0,1,,, the PhysioNet polysomnography dataset,1,0,1,,, CHB-MIT Scalp EEG dataset,1,0,1,,, PTB Diagnostic ECG Database,1,0,1,,https://physionet.org/physiobank/database/ptbdb/, Framingham Heart Study (FHS),1,1,1,,https://www.framinghamheartstudy.org/fhs-for-researchers/research-application-overview/, MEDLINE,1,0,1,,, Predicting Language Outcome RecoveryAfter Stroke (PLORAS) database,1,1,1,,https://www.ucl.ac.uk/ploras/, Dementia Bank,1,0,1,,, BindingDB,1,0,1,,, PanCancer Analysis of Whole Genomes dataset (PCAWG),1,0,1,,, PsychENCODE dataset,1,0,1,,, "AustralianImaging, Biomarker and Lifestyle Flagship Study of Aging (AIBL)",1,1,1,,, K562 Cell Image Dataset,1,0,1,Primarily Imaging,, MIMIC-CXR,1,1,1,Primarily Imaging,, -,1,,,,,