Published July 18, 2020 | Version 1.0
Dataset Open

Transfer learning enables prediction of CYP2D6 haplotype function

  • 1. Stanford University

Description

This data here were used to train the models described in the manuscript "Transfer learning enables prediction of CYP2D6 haplotype function".  The deep learning model described predicts metabolic function of CYP2D6 star alleles.  It uses two pretraining steps, first with simulated data, then with sequence data collected from liver microsomes, and finally using sequence data for CYP2D6 star alleles.

 

simulated_cyp2d6_diplotypes.tar.gz  - This file contains sequence data and labels for simulated CYP2D6 data used in the first training step

dalton_2019_cyp2d6_microsomes.txt  - This file contains summary statistic data for liver microsome data used in the second pretraining step (originally from https://doi.org/10.1111/cts.12695)

star_samples.vcf - This file contains sequence data for CYP2D6 star alleles derived from PharmVar (https://www.pharmvar.org/gene/CYP2D6) used in the final training step.

Files

dalton_2019_cyp2d6_microsomes.txt

Files (22.9 MB)

Name Size Download all
md5:295e26942b6cf248ede82783d89536f3
7.1 kB Preview Download
md5:da140347af4a1f3c06020f6d4c5445e9
22.4 MB Download
md5:84a12bdb2c2872f3a395877915685c38
468.2 kB Download

Additional details

Related works

Cites
Journal article: 10.1111/cts.12695 (DOI)