There is a newer version of the record available.

Published November 3, 2022 | Version v1
Journal article Open

Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions

Description

Accompanying files to the manuscript  "Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions" by Dembech et al.
The files enclosed here contain the full results of the co-transition (cotr) pipeline and the input files required to reproduce the results.
The code required to reproduce the results as well as python and R notebook to reproduce analyses and figures in the manuscript are available to the github repository: https://github.com/lab83bio/Cotransitions

Main files

=================
FILES DESCRIPTION 
=================

Files                                               Description
--------------------------------------------------  ------------------------------------------------------------
Eukaryota.raxml.NL.transitions.annotated            orthogroup pairs and scores (tab-separated)
Eukaryota.raxml.LL.transitions.annotated            orthogroup pairs and scores (tab-separated)
Eukaryota.raxml.RL.transitions.annotated            orthogroup pairs and scores (tab-separated)
intersection.Eukaryota.raxml.RL.clusters            mcl orthogroup clusters tab-separated (one cluster per line)
intersection.Eukaryota.raxml.RL.clusters.annotated  mcl clusters in human-readable format

=============================================================
COLUMNS DESCRIPTION (Eukaryota.raxml.*.transitions.annotated)
=============================================================

  #  Name         Description
---  -----------  -------------------------
  1  Orthogroup1  (Orthogroup1)
  2  Orthogroup2  (Orthogroup2)
  3  org          (number of organisms)
  4  t1           (Orthogroup1 transitions)
  5  t2           (Orthogroup1 transitions)
  6  c            (concordant)
  7  d            (discordant)
  8  k            (concordant - discordant)
  9  k_score      (cotr score)
 10  p            (p-value)
 11  p.adj        (p-value adjusted)

Additional files

=================
FILES DESCRIPTION
=================

Files                                                                      Description
-------------------------------------------------------------------------  -----------------------------------------------------
Eukaryota.phylip.tree                                                      ncbi species tree in phylip format
Eukaryota.phylip.data                                                      presence/absence data in phylip format
Eukaryota.RData                                                            presence/absence tables in R data format

RAxML_log.Eukaryota
RAxML_result.Eukaryota
RAxML_bestTree.Eukaryota                                                   RaxML fully resolved tree (phylip format)
RAxML_info.Eukaryota

Eukaryota.raxml.RL.nexus                                                   right-ladderized raxml tree in nexus format
Eukaryota.raxml.LL.nexus                                                   left-ladderized raxml tree in nexus format
Eukaryota.raxml.NL.nexus                                                   left-ladderized raxml tree in nexus format

Eukaryota.raxml.RL.csv                                                     presence/absence table (tab-separated) with gene acc.
Eukaryota.raxml.LL.csv                                                     presence/absence table (tab-separated) with gene acc.
Eukaryota.raxml.NL.csv                                                     presence/absence table (tab-separated) with gene acc.

Eukaryota.raxml.RL.csv.num_m                                               gene numbers table (tab-separated)
Eukaryota.raxml.LL.csv.num_m                                               gene numbers table (tab-separated)
Eukaryota.raxml.NL.csv.num_m                                               gene numbers table (tab-separated)

Eukaryota.raxml.RL.csv.num                                                 binary presence/absence table (tab-separated)
Eukaryota.raxml.LL.csv.num                                                 binary presence/absence table (tab-separated)
Eukaryota.raxml.NL.csv.num                                                 binary presence/absence table (tab-separated)

Eukaryota.raxml.RL.transitions.annotated.odb.pickle                        orthogroup info from orthodb API
Eukaryota.raxml.RL.transitions.annotated.unigo                             orthogroups mapped in Uniprot GO
Eukaryota.raxml.RL.ogs.annotated.unigo.experimental.slim                   unigo file slimmed by owltools
Eukaryota.raxml.RL.ogs.annotated.unigo.experimental.slim.boxcox.P.ranking  slim file with normalized pvalues
Eukaryota.raxml.RL.ogs.annotated.unigo.experimental.slim.boxcox.F.ranking  slim file with normalized pvalues
Eukaryota.raxml.RL.ogs.annotated.unigo.experimental.slim.boxcox.C.ranking  slim file with normalized pvalues

=============================================================
VARIABLES DESCRIPTION (Eukaryota.RData)
=============================================================

  #  Name     Description
---  -------  ---------------------------------------------------------------------------
  1  Gst      Table with orthogroup genes in genomes
  2  tr       NCBI tree
  3  Ngroups  Genome table with GenomeID, number of orthogroups (N), TaxID, species names
  4  cl_d     data.table with taxonomic NCBI classification

Files

Eukaryota.raxml.LL.csv

Files (4.7 GB)

Name Size Download all
md5:c99a0aa99cda6bae2211c6ecc7e8cc15
77.0 MB Download
md5:e43d5236e32ef8b456d203e02711e63c
44.3 kB Download
md5:348270e0bd050abb350090b21ee98a39
314.5 MB Preview Download
md5:d6802b68178a9be03d75e6a8d147f1e5
154.8 MB Download
md5:5703129428568e8fb845bc0198c569a4
154.9 MB Download
md5:9bdd21d2cce72ac3deb9ee41d22ceb69
165.0 kB Download
md5:0e23767d35b62d6a9d6152fef272c3de
435.9 MB Download
md5:9987b0b623e45766c2f1f2ebd74d1e68
314.5 MB Preview Download
md5:c3a13fd950f3711836dc30b6b6620040
154.8 MB Download
md5:3dd3181a3bed83124e21df1af17a0d63
154.9 MB Download
md5:b9253a47cf63b8a0640d881608b29267
165.0 kB Download
md5:d190b7ce4271d9d844e8a63316d41d9b
355.4 MB Download
md5:b5c80cfb8afbdce7bcf293761c035f2d
314.5 MB Preview Download
md5:08d9c070ef69adccd4bd8848097426b2
154.8 MB Download
md5:927c18d3672db2300b27a4ff19036a4d
154.9 MB Download
md5:93c9fd08748e93a23415ff6baaf69add
165.0 kB Download
md5:90aae72f1f093168dc431e83b5f2e0be
564.6 MB Download
md5:eb5dff906c7af9836e3dddf38d293fca
3.1 MB Download
md5:e9c63f015d1ff001386f1b4a378ab0fa
2.0 MB Download
md5:aebd27136b4273f6122f66b231949dea
3.0 MB Download
md5:1025de650d87c5c50617d7cefdbc56ff
321.3 MB Download
md5:ed015eec1628ee5e0c0da4c715a1f2fa
52.0 MB Download
md5:50203367c80d9e210fd7fbd36e34ebcc
917.4 MB Download
md5:5b29cca82300212c8b4c855a69006ac7
76.1 MB Download
md5:a0c1b6e59981a4f3e12df98f5e61bf4e
106.9 kB Download
md5:661fac8b057cc33a84eafd0dc7b7079e
410.8 kB Download
md5:5a4126c3da942f6db23e51c54b53c98b
1.0 MB Download
md5:0a11ec6a103f8985a274b1454b92644a
1.7 MB Download
md5:128c00e9ac76562da5802779c121496f
71.4 kB Download
md5:fc47a2640353245133c2867197b324d2
2.2 kB Download
md5:2b9a61fe2c91bda4c9bfeaea0bc95dcc
71.4 kB Download