Published May 4, 2020 | Version v1.1.0
Software Open

Polyidus — identifying chimeric sequencing reads with a tale of Python programming language

  • 1. Department of Medical Biophysics, University of Toronto
  • 2. Princess Margaret Cancer Centre

Description

The free Polyidus software identifies the exact genomic regions for integration of a known virus.

We developed Polyidus to identify viral integration sites with chimeric sequencing reads from any paired-end sequencing data. First, Polyidus aligns reads to a viral genome. It allows for partial mapping using local alignment, and removes any sequencing fragment where neither read maps to the virus. Second, Polyidus aligns the selected reads to the host genome, permitting partial mapping. Third, Polyidus identifies chimeric reads: those reads mapped partially to the host genome and partially to the virus genome. Fourth, for each chimeric read, Polyidus reports the start and strand of integration in both the host and viral genomes. Polyidus also reports the number of chimeric reads supporting each integration site.

In the previous version, we called our software Bellerophon. After noticing that the name was previously used for another software, we changed the name to Polyidus.

Update on version 1.1.0:

The host BAM file for paired-end analysis was expected to be sorted by reads, but was in fact sorted by position. The bug is fixed in v1.1.0

Files

polyidus_v1.1.0.zip

Files (129.0 MB)

Name Size Download all
md5:28c0c8b17132535989b0d5fe54648f36
129.0 MB Preview Download