Published November 19, 2019
| Version v1
Journal article
Open
Discriminant analysis in Whole genome sequencing and phylogenetic characterization of a novel bat-associated picornavirus-like virus with an unusual genome organization.
Description
This archive contains necessary files to perform a discriminant analysis in order to classify the Kandabadicivirus sequence among many hosts: arthropods, birds, mammals, mollusks, and plants.
First, decompress the fasta sequences (sequences.zip) in a directory named sequences. Then:
for i in sequences/*.fasta; do
echo $i;
python3 scripts/fasta2count.py $i ${i%.fasta}.csv;
done
You now have csv files along with corresponding fasta files (csv are provided in this archive).
You can now perform discriminantAnalysis.R in R software to obtain the discriminant analysis and make a graphical output named "discrimin.pdf".