Published December 19, 2023 | Version v3
Dataset Open

Repeat expansions associated with human disease are present in diverse organisms

Description

These data are associated with the Repeat expansions associated with human disease are present in diverse organisms pub from Arcadia Science. The code associated with production of these data is available on the GitHub repos linked in the above publication. Version 3 is the correct version of this record.

koala_repeat_length_outliers.xlsx and full_koala_results.txt are tables containing results of repeat expansion counting from koala population sequencing data. 

full_results_20230802_foldseekandblast.csv is a table of the hits with sequence or structural similarity to human repeat expansion proteins. 

foldseek_aa_fasta.zip contains amino acid fasta files for foldseek hits of dRE proteins.

All other tables are described in the NCBI_taxid_to_lineage_and_barchart_tree_plotting.ipynb and profile_repeats.ipynb jupyter notebooks here.

 

Files

aacountingreults.zip

Files (269.5 MB)

Name Size Download all
md5:c5264fd1c30343d36abbc0cf018177a9
169.7 MB Preview Download
md5:d6808fa9432b0a35db0fdcfa484bb501
21.4 kB Preview Download
md5:7f5fd9f1080c5458c89dac80981457e0
69.0 kB Preview Download
md5:33120eeb8cbac5e03ef04c2ce8321a47
995 Bytes Preview Download
md5:6304b82bcf902eff43ffe3b149128abe
1.6 kB Preview Download
md5:c9df0e2dc3ef730ee94567f53b32a6bb
1.7 kB Preview Download
md5:ffc491c82f3dff3a453ce61116adc379
10.5 MB Preview Download
md5:bfe6256b748f75dbdb69a8a4520967d0
61.9 kB Preview Download
md5:30077bd63aab91b58ba0c9ba0c69a2a5
44.8 MB Preview Download
md5:a94f98c756d55991ebee09e9833cfa3d
9.4 kB Download
md5:f966f58b75c4ec42fa10b1eb8db60bc6
655 Bytes Preview Download
md5:e437d46cbe98272baa30f7172fd17407
1.5 kB Preview Download
md5:0062b98a7b7753318a360431ea6cd759
7.4 kB Preview Download
md5:8ec1ccedc5cdd4cb20fb6b87d7940a09
28.3 kB Preview Download
md5:1c3bac2487338a5b969847b443f084ab
44.3 MB Preview Download