Published March 25, 2022 | Version v1
Software Open

precisionFDA Truth Challenge V2: Calling variants from short- and long-reads in difficult-to-map regions

  • 1. NIST

Description

The precisionFDA Truth Challenge V2 aimed to assess the state-of-the-art of variant calling in challenging genomic regions. Starting with FASTQs, 20 challenge participants applied their variant calling pipelines and submitted 64 variant callsets for one or more sequencing technologies (Illumina, PacBio HiFi, and Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with updated Genome In A Bottle benchmark sets and genome stratifications. Challenge submissions included numerous innovative methods, with graph-based and machine-learning methods scoring best for short-read and long-read datasets, respectively. With machine-learning approaches combining multiple sequencing technologies performed particularly well. Recent developments in sequencing and variant calling have enabled benchmarking variants in challenging genomic regions, paving the way for the identification of previously unknown clinically relevant variants. The pFDA challenge results can be found at https://precision.fda.gov/challenges/10, the challenge data are available on the precisionFDA platform and  doi:10.18434/mds2-2336.

Notes

Code repository for analysis of challenge results and manuscript figure generation.

Files

Files (2.6 GB)

Name Size Download all
md5:80b03728bd81e60f9397c10df31d7e73
2.6 GB Download

Additional details

Related works

Is supplement to
Dataset: 10.18434/mds2-2336 (DOI)