pggb: the PanGenome Graph Builder
Authors/Creators
- 1. University of Tennessee Health Science Center (UTHSC)
- 2. Human Technopole
- 3. QBiC
Description
PGGB builds pangenome variation graphs from a set of input sequences.
A pangenome variation graph can be understood as a kind of multiple sequence alignment. It lets us understand any kind of sequence variation between a collection of genomes. It shows us similarity where genomes walk through the same parts of the graph, and differences where they do not.
PGGB generates this kind of graph using an all-to-all alignment of input sequences (wfmash), graph induction (seqwish), and progressive normalization (smoothxg, gfaffix). After construction, PGGB generates diagnostic visualizations of the graph (odgi). A variant call report (in VCF) representing both small and large variants may be generated based on any reference genome included in the graph (vg). PGGB writes its output in GFAv1 format, which can be used as input by numerous "genome graph" and pangenome tools, such as the VG and ODGI toolkits.
PGGB has been tested at scale in the Human Pangenome Reference Consortium (HPRC) as a method to build a graph from the draft human pangenome.
Files
pangenome/pggb-v0.4.1.zip
Files
(5.5 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:fdfed744dc6b611be3c303f3b8fb341c
|
5.5 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/pangenome/pggb/tree/v0.4.1 (URL)