Repeat expansions associated with human disease are present in diverse organisms
Description
These data are associated with the Repeat expansions associated with human disease are present in diverse organisms pub from Arcadia Science. The code associated with production of these data is available on the GitHub repos linked in the above publication. Version 3 is the correct version of this record.
koala_repeat_length_outliers.xlsx and full_koala_results.txt are tables containing results of repeat expansion counting from koala population sequencing data.
full_results_20230802_foldseekandblast.csv is a table of the hits with sequence or structural similarity to human repeat expansion proteins.
foldseek_aa_fasta.zip contains amino acid fasta files for foldseek hits of dRE proteins.
All other tables are described in the NCBI_taxid_to_lineage_and_barchart_tree_plotting.ipynb and profile_repeats.ipynb jupyter notebooks here.
Files
aacountingreults.zip
Files
(269.5 MB)
Name | Size | Download all |
---|---|---|
md5:c5264fd1c30343d36abbc0cf018177a9
|
169.7 MB | Preview Download |
md5:d6808fa9432b0a35db0fdcfa484bb501
|
21.4 kB | Preview Download |
md5:7f5fd9f1080c5458c89dac80981457e0
|
69.0 kB | Preview Download |
md5:33120eeb8cbac5e03ef04c2ce8321a47
|
995 Bytes | Preview Download |
md5:6304b82bcf902eff43ffe3b149128abe
|
1.6 kB | Preview Download |
md5:c9df0e2dc3ef730ee94567f53b32a6bb
|
1.7 kB | Preview Download |
md5:ffc491c82f3dff3a453ce61116adc379
|
10.5 MB | Preview Download |
md5:bfe6256b748f75dbdb69a8a4520967d0
|
61.9 kB | Preview Download |
md5:30077bd63aab91b58ba0c9ba0c69a2a5
|
44.8 MB | Preview Download |
md5:a94f98c756d55991ebee09e9833cfa3d
|
9.4 kB | Download |
md5:f966f58b75c4ec42fa10b1eb8db60bc6
|
655 Bytes | Preview Download |
md5:e437d46cbe98272baa30f7172fd17407
|
1.5 kB | Preview Download |
md5:0062b98a7b7753318a360431ea6cd759
|
7.4 kB | Preview Download |
md5:8ec1ccedc5cdd4cb20fb6b87d7940a09
|
28.3 kB | Preview Download |
md5:1c3bac2487338a5b969847b443f084ab
|
44.3 MB | Preview Download |