Reduce the sample size.

In this notebook, we reduce the sample size to make the analysis more approachable using average infrastructure.

TSV derived from the VCF dataset

Tb-profiler dataset

Create main monolabel dataset and segregate it into

Read the output of step-001 and derive the monolabel dataset