Published January 20, 2023 | Version V1.4
Dataset Open

Supplementary data of the paper 'Adaptive trends of sequence compositional complexity over pandemic time in the SARS-CoV-2 coronavirus'

  • 1. Department of Genetics, Faculty of Sciences, University of Granada, 18071, Granada, Spain
  • 2. Department of Applied Physics II and Institute Carlos I for Theoretical and Computational Physics, University of Málaga, 29071, Málaga, Spain
  • 3. Dipartimento di Scienze della Terra, dell'Ambiente e delle Risorse, Università di Napoli Federico II, 80126, Napoli, Italy
  • 4. Centro de Investigaciones sobre Desertificación, Consejo Superior de Investigaciones Científicas (CSIC), University of València and Generalitat Valenciana, 46113, Valencia, Spain
  • 5. Institute of Integrative Systems Biology (I2Sysbio), University of València and Consejo Superior de Investigaciones Científicas (CSIC), 46980, Valencia, Spain

Description

Supplement of the paper
"Adaptive trends of sequence compositional complexity over pandemic time in the SARS-CoV-2 coronavirus"
During the spread of the COVID-19 pandemic, the SARS-CoV-2 coronavirus underwent mutation and recombination events that altered its genome compositional structure, thus providing an unprecedented opportunity to observe an evolutionary process in real time. The mutation rate is known to be lower than expected for neutral evolution, suggesting natural selection and convergent evolution. We begin by summarizing the compositional heterogeneity of each viral genome by computing its Sequence Compositional Complexity (SCC). To analyze the full range of SCC diversity, we select random samples of high-quality coronavirus genomes covering the full span of the pandemic. We then search for evolutionary trends that could inform us about the adaptive process of the virus to its human host by computing the phylogenetic ridge regression of SCC against time (i.e., the collection date of each viral isolate). In early samples, we find no statistical support for any trend in SCC values, although the viral genome appears to evolve faster than expected under Brownian motion (BM). However, in samples taken after the emergence of high-fitness variants, and despite the brief time span elapsed, a driven decreasing trend for SCC and an increasing one for its absolute evolutionary rate are detected, pointing to a role for selection in the evolution of SCC in the coronavirus. We conclude that the higher fitness of variant genomes may have led to adaptive trends of SCC over pandemic time in the coronavirus.
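
As an illustration of the kind of quantity being summarized, here is a minimal sketch that assumes SCC is taken as the length-weighted Jensen-Shannon divergence among the compositional domains of a genome, i.e. SCC = H(S) − Σ (n_i/N) H(S_i). This formulation and the function names `shannon_entropy` and `scc` are assumptions made for the example, not details given in this record. The segmentation step that finds the domain boundaries is not shown, so the boundaries are passed in as a hypothetical input.

```python
import math
from collections import Counter


def shannon_entropy(seq: str) -> float:
    """Shannon entropy (in bits) of the symbol composition of seq."""
    counts = Counter(seq)
    total = len(seq)
    # An empty sequence has no symbols, so the sum is empty and the result is 0.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def scc(seq: str, boundaries: list[int]) -> float:
    """Entropy of the whole sequence minus the length-weighted entropy of its segments.

    `boundaries` holds the cut positions (0-based, ascending) produced by some
    segmentation procedure; that procedure is assumed, not implemented here.
    """
    cuts = [0, *boundaries, len(seq)]
    segments = [seq[a:b] for a, b in zip(cuts, cuts[1:])]
    weighted = sum(len(s) / len(seq) * shannon_entropy(s) for s in segments)
    return shannon_entropy(seq) - weighted
```

Under this assumed definition, a sequence made of two internally uniform but mutually different halves, such as `scc("AAAATTTT", [4])`, gives 1 bit, while a sequence whose segments all share the composition of the whole gives 0.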

Supplementary files

File: Description

  • SupplementaryTables S1-S19.zip: Excel supplementary tables with the strain name, the collection date, and the SCC values for each analyzed genome.
  • nextstrain_ncov_open_global_timetree.nwk: ML phylodynamic tree for the Nextstrain sample.
  • SupplementaryTable S20.pdf: A complete list acknowledging the authors and the originating and submitting laboratories of the genetic sequences we used for the analysis of the Nextstrain sample.
  • Nextstrain_sample_fasta_3059.zip: Nextstrain sample (sequences in FASTA format).
  • PhylogeneticTimetrees_NewickFormat.zip: Phylogenetic timetrees (Newick format).
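
For readers who want to load the FASTA sequences programmatically, the following is a minimal sketch using only the Python standard library. It assumes every member of the archive is a plain-text FASTA file encoded as UTF-8; the internal layout of the archive is not described in this record, and the helper names `read_fasta` and `fasta_records_in_zip` are hypothetical.

```python
import io
import zipfile


def read_fasta(handle):
    """Yield (header, sequence) pairs from a text handle in FASTA format."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def fasta_records_in_zip(zip_path):
    """Yield FASTA records from every file in a zip archive.

    Assumes all archive members are FASTA text; directories are skipped.
    """
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue
            with archive.open(name) as raw:
                yield from read_fasta(io.TextIOWrapper(raw, encoding="utf-8"))
```

A call such as `records = list(fasta_records_in_zip("Nextstrain_sample_fasta_3059.zip"))` would then collect the headers and sequences in memory, provided the assumption about the archive contents holds.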

 

Notes

This project was funded by grants from the Spanish Ministry of Science, Innovation and Universities (former Spanish Ministry of Economy and Competitiveness) to J.L.O. (Project AGL2017-88702-C2-2-R) and A.M. (Project PID2019-105969GB-I00), and by a grant from Generalitat Valenciana to A.M. (Project Prometeo/2018/A/133), and was co-financed by the European Regional Development Fund (ERDF). The most time-demanding computations were done on the servers of the Laboratory of Bioinformatics, Dept. of Genetics & Institute of Biotechnology, Center of Biomedical Research, 18100, Granada, Spain.

Files

NewickFormatTrees.zip

Files (4.4 MB)

Checksum (size)

  • md5:f0e712d6215a25ec9fbb6549492d115b (677.3 kB)
  • md5:3495c087b9297620e39c1c60776cffc8 (178.0 kB)
  • md5:0fb827ad5355a07f03543e37eb164c5a (1.3 MB)
  • md5:d43280f496bcdbebde586d24f28d84d2 (2.2 kB)
  • md5:a9d588d7b8c39c28201c5fb3ff271641 (679.4 kB)
  • md5:30f5dfed39d7604897469c2398478a6a (1.5 MB)
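
The md5 values listed above can be used to check that a download is intact. The sketch below is one way to do that with the Python standard library; the function names `md5_of_file` and `matches_listing` are hypothetical, and since the listing does not show which file each checksum belongs to, the pairing of a file with a checksum is left to the reader.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read until fh.read returns an empty bytes object (end of file).
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a checksum written as 'md5:<hex digest>'."""
    return md5_of_file(path) == listed.removeprefix("md5:").lower()
```

For example, `matches_listing(local_path, "md5:d43280f496bcdbebde586d24f28d84d2")` returns `True` only if the file at `local_path` (a placeholder for wherever the download was saved) has that digest. `str.removeprefix` requires Python 3.9 or later.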