Published March 23, 2025 | Version v1
Preprint Open

The GenBank Sequence for Bat Virus BtSy2 Cannot Be Assembled from the Purported Bat Fecal Sample SRA File

Authors/Creators

Description

The apparent emergence of SARS-like coronavirus BtSY2 has garnered significant global attention due to its close genetic relationship with SARSCoV-2 and its purported ability to bind human ACE2 receptors. This virus was  identified through meta-transcriptomic sequencing of bat rectal swabs in Yunnan, China, and presented as a high-risk candidate for zoonotic spillover. Subsequent publications and media coverage have amplified its significance, raising calls for pandemic preparedness.

Preliminary reanalysis of the publicly available sequencing data from the bat specimen reportedly harboring SARS-like coronavirus BtSY2 reveals a striking absence of supporting raw reads for large regions of the published genome. Most notably, the Spike protein coding region and other key genomic segments show little to no coverage in the SRA dataset, despite claims of a complete 29.8 kb genome assembly.

Although PCR and Sanger sequencing were used to fill in gaps, the extent of missing data suggests that substantial portions of the genome may not have originated from the specimen cited. While alternative explanations such as low viral load or metagenomic artifacts are possible, the pervasive gaps raise serious concerns about the integrity of the assembly.

These findings, though preliminary, are data-driven and robust enough to warrant further scrutiny. I strongly encourage the original authors or independent investigators to re-examine the BtSY2 genome and its origins using transparent, reproducible methods.

Files

BtSY2 genome sequence cannot be recreated from the raw reads.pdf

Files (250.8 kB)