Journal article Open Access

CONTAMINATION OR VACCINE RESEARCH? RNA Sequencing data of early COVID-19 patient samples show abnormal presence of vectorized H7N9 hemagglutinin segment

Quay, Steven C.; Rahalkar, Monali C.; Jones, Adrian; Bahulikar, Rahul A.

Abstract

A re-analysis was performed of the meta-transcriptome data (SRR11092059-63) generated at the Wuhan Institute of Virology (WIV) from bronchoalveolar lavage fluid (BALF) specimens of five early SARS-CoV-2 patients (WIV02, 04, 05, 06, 07-2) [2]. The WIV had sequenced these five samples on two different NGS machines: MiSeq and MGISEQ-2000RS (HiSeq 3000 equivalent). The MGISEQ-2000RS gave about 10 times more data than the MiSeq machine (5.6-12 Gb), which made it possible to see more detail. Surprisingly, all five samples analysed on the MGISEQ-2000RS showed the presence of an H7N9 ‘Hemagglutinin A (HA) segment 4’ gene sequence in relatively high proportion, in one case at six times the abundance of the SARS-CoV-2 sequences. The presence of non-SARS-CoV-2 sequences, including these influenza A genes, has been reported earlier, and those data were also used in the current study for comparison and analysis. The surprising finding was that the HA segment 4 gene was cloned in an expression vector, pVAX1, confirming previously identified vector sequences [3,4]. A WIV publication documented that DNA vaccines containing H7N9 HA genes were being developed and tested in mice at the WIV at the same time as the outbreak (2019-2020). In addition, all five samples showed a relatively high proportion of Spodoptera frugiperda rhabdovirus reads (13-83% of SARS-CoV-2 reads). The samples also contained other low-abundance, high-homology (LAHH) sequences, mostly of viral origin and not expected in human BALF specimens. These LAHH sequences could be contaminants, and we identified these viruses as subjects of previously published research at the WIV, which provides a genomic record of prior work. The ability to identify previously performed research in a laboratory's raw meta-transcriptome reads provides a new forensic tool.
The presence of a cloned H7N9 HA gene segment in the transcriptome data of the five early patients processed at the WIV should be treated as an important forensic clue and warrants a full investigation. Considering the plausible hypothesis that SARS-CoV-2 could have escaped through a lab accident, the most important question is: what does the co-occurrence of vectorized H7N9 sequences with SARS-CoV-2 sequences in the early COVID-19 patients suggest?
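The abundances quoted in the abstract (for example, "six times the abundance of the SARS-CoV-2 sequences" or "13-83% of SARS-CoV-2 reads") are ratios of read counts against the SARS-CoV-2 read count in the same sample. The Python sketch below shows one way such a ratio could be computed. It is an illustration only, not the authors' pipeline: the function name `relative_abundance` is hypothetical, and the read counts are made-up placeholders, not values from the study.

```python
def relative_abundance(read_counts, reference):
    """Express each taxon's read count as a percentage of the reference taxon's reads."""
    ref_reads = read_counts[reference]
    if ref_reads <= 0:
        raise ValueError("reference taxon must have a positive read count")
    return {
        name: 100.0 * reads / ref_reads
        for name, reads in read_counts.items()
        if name != reference
    }


# Hypothetical read counts for a single sample (placeholders, NOT data from the paper).
counts = {
    "SARS-CoV-2": 1000,
    "H7N9 HA segment 4": 6000,
    "Spodoptera frugiperda rhabdovirus": 130,
}

ratios = relative_abundance(counts, reference="SARS-CoV-2")
for name, pct in sorted(ratios.items(), key=lambda item: -item[1]):
    print(f"{name}: {pct:.0f}% of SARS-CoV-2 reads")
```

With these placeholder counts, the HA segment comes out at 600% of the SARS-CoV-2 reads (six times the abundance) and the rhabdovirus at 13%, which matches the form in which the abstract reports its figures.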

Files (1.5 MB)
Final-H7N9-Paper-07.02.2021-3.pdf (521.5 kB)
md5:bb66bbf1196a7c69eac36ab1d71f2549
Supp_Table_1_SARS-CoV-2_Early_Patients_fastv.xlsx (61.5 kB)
md5:3cc2691ff9bbebc041fc66f7f959c816
Supplementary Analyses.pdf (896.3 kB)
md5:017fbd8690cb10197f5b514f65bcf345