Mono resistance EDA

In this POC, we gloss over the resistance towards distinct drugs and focus only on the mono-resistance.

EDA for Tb-profiler results

Create data frame for train dataset

Create data frame for test dataset

Analyze the train and test datasets

Resistance and Sensitive genomes

Lineage distribution

Find the relationship between drtype and main_lin variables

Stacked Column Chart: visual form of the two-way table

EDA for genomic pre-processing results

EDA on the final binarized dataset