Retrospective BLAST Analysis Detects RaTG13/Ra4991 in the Earliest SARS-CoV-2 Genome Submitted to NCBI GenBank (December 28, 2019)
Description
On December 28, 2019, Dr. Lili Ren submitted a near-complete genome sequence of a novel coronavirus (BetaCoV/Wuhan/IPBCAMS-WH-01/2019) to NCBI GenBank, the earliest known SARS-CoV-2 sequence sent outside China. At the time, GenBank already held an embargoed partial RdRp sequence of RaTG13/Ra4991 (accession MH615898.1), deposited in July 2018. This strain would later be recognized as the closest known relative of SARS-CoV-2.
This retrospective BLAST analysis demonstrates that a standard nucleotide search using Dr. Ren’s December 28, 2019 sequence as the query would have returned a strong match (97.79% identity) to the embargoed 2018 fragment. A subsequent keyword search for “Ra4991” would have rapidly linked the emerging virus to the Mojiang mine and the 2012 miner outbreak. These findings strongly suggest that a clear early warning signal was detectable in late December 2019 using routine bioinformatics tools and data already held in the NIH GenBank system.
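For readers unfamiliar with how an identity figure such as 97.79% is derived, the following is a minimal sketch of the usual definition: identical columns divided by alignment length, with gap columns counted as non-matching. The function name `percent_identity` and the two short sequences are hypothetical, made-up illustrations. They are not real viral sequence and are not taken from the report, and the sketch does not reproduce the BLAST search itself.

```python
def percent_identity(aligned_query: str, aligned_subject: str) -> float:
    """Return percent identity over the columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, '-' marks a gap).
    A column counts as identical only if both residues match and neither
    is a gap, so gap columns lower the identity.
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_query:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for q, s in zip(aligned_query.upper(), aligned_subject.upper())
        if q == s and q != "-"
    )
    return 100.0 * matches / len(aligned_query)


# Toy, made-up aligned fragments (assumption: for illustration only).
toy_query = "ATGGCTAGCTAGGATCCGTA"
toy_subject = "ATGGCTAGTTAGGATC-GTA"

# 18 identical columns out of 20 (one substitution, one gap).
print(f"{percent_identity(toy_query, toy_subject):.2f}% identity")
```

Applied to a real alignment, the same ratio is what a BLAST hit table reports in its percent-identity column. The value quoted above comes from the report, not from this sketch.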
Files
| Name | Size |
|---|---|
| Retrospective BLAST Analysis Detects RaTG13:Ra4991 in the Earliest SARS-CoV-2 Genome Submitted to NCBI GenBank (December 28, 2019).pdf (md5:004bf3466025f83db732bf1d509a420b) | 379.8 kB |
Additional details
Related works
- Is cited by
- Report: https://energycommerce.house.gov/posts/e-and-c-investigation-uncovers-earliest-known-sars-co-v-2-sequence-released-outside-of-china (URL)
- Is supplemented by
- Dataset: https://raw.githubusercontent.com/jbloom/SARS2_28Dec2019_Genbank_submission/main/results/SARS-CoV-2_28Dec2019_submission.fa (URL)
References
- House Committee on Energy and Commerce (2024). E&C Investigation Uncovers Earliest Known SARS-CoV-2 Sequence Released Outside of China. https://energycommerce.house.gov/posts/e-and-c-investigation-uncovers-earliest-known-sars-co-v-2-sequence-released-outside-of-china
- Ge, X. Y., Li, J. L., Yang, X. L., et al. (2013). Coexistence of multiple coronaviruses in several bat colonies in an abandoned mineshaft. Virologica Sinica, 28(4), 252–259. https://doi.org/10.1007/s12250-013-3405-7
- Rahalkar, M. C., & Bahulikar, R. A. (2020). Lethal Pneumonia Cases in Mojiang Miners (2012) and the Mineshaft Could Provide Important Clues to the Origin of SARS-CoV-2. Frontiers in Public Health, 8, 581569. https://doi.org/10.3389/fpubh.2020.581569
- NCBI GenBank (2018). MH615898.1: Bat SARS-like coronavirus strain RaTG13_Yunnan RNA-dependent RNA polymerase (RdRp) gene, partial cds. https://www.ncbi.nlm.nih.gov/nuccore/MH615898.1
- National Center for Biotechnology Information (NCBI). (2026). Nucleotide BLAST. https://blast.ncbi.nlm.nih.gov/Blast.cgi
- Zhou, P., Yang, X.-L., Wang, X.-G., et al. (2020). A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature, 579(7798), 270–273. https://doi.org/10.1038/s41586-020-2012-7