Polyidus — identifying chimeric sequencing reads with a tale of Python programming language
Creators
- 1. Department of Medical Biophysics, University of Toronto
- 2. Princess Margaret Cancer Centre
Description
The free Polyidus software identifies the exact genomic regions for integration of a known virus.
We developed Polyidus to identify viral integration sites with chimeric sequencing reads from any paired-end sequencing data. First, Polyidus aligns reads to a viral genome. It allows for partial mapping using local alignment, and removes any sequencing fragment where neither read maps to the virus. Second, Polyidus aligns the selected reads to the host genome, permitting partial mapping. Third, Polyidus identifies chimeric reads: those reads mapped partially to the host genome and partially to the virus genome. Fourth, for each chimeric read, Polyidus reports the start and strand of integration in both the host and viral genomes. Polyidus also reports the number of chimeric reads supporting each integration site.
In the previous version, we called our software Bellerophon. After noticing that the name was previously used for another software, we changed the name to Polyidus.
Update on version 1.1.0:
The host BAM file for paired-end analysis was expected to be sorted by reads, but was in fact sorted by position. The bug is fixed in v1.1.0
Files
polyidus_v1.1.0.zip
Files
(129.0 MB)
Name | Size | Download all |
---|---|---|
md5:28c0c8b17132535989b0d5fe54648f36
|
129.0 MB | Preview Download |