Published November 4, 2024 | Version v2
Dataset Open

Personalized genomes for DL models supporting data

Description

Archive of models and data associated with our manuscript "Training deep learning models on personalized genomic sequences improves variant effect prediction".

Code for training and benchmarking LCL models is available at https://github.com/Danko-Lab/clipnet_ablation, whereas code for training and benchmarking K562 models is available at https://github.com/Danko-Lab/clipnet_k562/.

Model files & metadata:

  • n{i}_run{j}.tar
    • CLIPNET LCL models trained on i individuals
  • subsample_individuals_ids.tar
    • text files containing lists of the individuals used to train the above models.
  • reference_models.tar
    • CLIPNET LCL model trained on data from 67 PRO-cap libraries, but using hg38 sequences instead of personal genomes.
  • clipnet_k562_reference.tar
    • hg38-trained model described above transfer learned to K562.

Benchmark data:

Files

Files (23.8 GB)

Name Size Download all
md5:44b3c2ec5001c9a45fbfae3d275007ea
33.6 MB Download
md5:37d4d3196e9ae51cd45404740591400c
870.8 MB Download
md5:9210d8a9451f84a65ed1b1d7800d4814
265.3 MB Download
md5:7bf83aaac96a69659bda9ace79aca69a
871.3 MB Download
md5:db8ea43b384ece94b3e1ad573cd87348
871.3 MB Download
md5:17bd1e7079faa06de18ccccc1db44e62
871.3 MB Download
md5:61355cf90cbe68f18b6c3321b42d2ceb
871.3 MB Download
md5:5cdd6797a1596ef891a31fc2011d9bca
871.3 MB Download
md5:ce04560197362884c0a3c04dd9d55172
871.4 MB Download
md5:bf3815042f26ded00735f02dda5bdd6b
871.4 MB Download
md5:a378d2e2ea33a5bb16360356363ff34f
871.4 MB Download
md5:1925709f0276b64b9a7385f78a9e8f3a
871.4 MB Download
md5:f89455ad616c2747ac386c2ca365a484
871.4 MB Download
md5:0f82912630886207e5ddacda3b1f2a44
871.4 MB Download
md5:57972d01471c26116b3f7c7125093176
871.4 MB Download
md5:591cc1f4afbf21f05d58ca6ed942d4fd
871.4 MB Download
md5:4c82eaf73fec079175277674de0d39b8
871.4 MB Download
md5:b06b21468331d6f72a13f70e70a9fb01
871.4 MB Download
md5:5a8db54bd2699fa03c5857047658d8d6
871.4 MB Download
md5:43b23c27ed9bba86072194d7020788ee
871.4 MB Download
md5:e649a96bab1220c327b99931652e1ae1
871.4 MB Download
md5:34b49a2ae15d0421be018810cefe762a
871.3 MB Download
md5:5aca643542609bbe8831aca7b36dc051
871.4 MB Download
md5:27321c6882252e1001aca7b1e63c4f17
871.3 MB Download
md5:28402f685f87a078e7b30fd7371350c3
871.3 MB Download
md5:aaf2acdc2d15581fb62af50ccf839f2f
871.3 MB Download
md5:1059c06c83c0eb24a2ba9924fad57e26
871.3 MB Download
md5:534d19b1aec40b00b4026f18d82f3013
871.3 MB Download
md5:174c004b8aed31379640354c8d04ef5d
3.4 MB Download
md5:0d5c155b35b68766c55a388e8c42c635
871.4 MB Download
md5:306407866f0e508af5219339c5109ab7
30.7 kB Download