Published November 19, 2019 | Version v1
Journal article Open

Discriminant analysis in Whole genome sequencing and phylogenetic characterization of a novel bat-associated picornavirus-like virus with an unusual genome organization.

Authors/Creators

  • 1. Institut Pasteur

Description

This archive contains necessary files to perform a discriminant analysis in order to classify the Kandabadicivirus sequence among many hosts: arthropods, birds, mammals, mollusks, and plants.

First, decompress the fasta sequences (sequences.zip) in a directory named sequences. Then:

for i in sequences/*.fasta; do
   echo $i;
   python3 scripts/fasta2count.py $i ${i%.fasta}.csv;
done

You now have csv files along with corresponding fasta files (csv are provided in this archive).

You can now perform discriminantAnalysis.R in R software to obtain the discriminant analysis and make a graphical output named "discrimin.pdf".

Files

counts.zip

Files (1.4 MB)

Name Size Download all
md5:191b7bc65242e14e36d68440c92e9701
85.1 kB Preview Download
md5:a07dcee8c94ce9b99383b099c13a1428
1.6 kB Download
md5:8334c09f12c6ee3744946bb12217ed4d
1.2 kB Download
md5:9b19818a951753da7492c0dc9c35a293
1.3 MB Preview Download