Published April 28, 2019 | Version v1
Dataset Open

H7 hemagglutinin lineage identification

Authors/Creators

  • 1. University of Westminster

Description

This is the nucleotide sequence data and the R files for the k-means clustering of the H7 hemagglutinin segment of Influenza A. 

  1. H7_HA_IRDB_2019_1_14.fasta is the initial data downloaded from the Influenza Research Database
  2. H7_HA_IRDB_2019_1_14_aligned_muscle.fasta is the sequence data aligned with Muscle
  3. H7_HA_IRDB_2019_1_14_aligned_muscle_cleaned.fas is the aligned sequence data after problem sequence have been removed.
  4. RNA_distances_HA_kmeans_analysis.R is the R code for carrying out the k-means clustering and diagnostics.
  5. kmeans_clustering.pdf is the R-markdown generated document describing the R-code for k-means clustering.
  6. clustered_HA_kmeans_RNA.csv are the results of the clustering
  7. usearch_clustering.csv is a summary of the USEARCH results.
  8. Clade_Alpha.tree - alpha clade phylogenetic tree
  9. Clade_Beta.tree - beta clade phylogenetic tree
  10. Clade A-D trees are the phylogenetic trees for clades A to D.
  11. H7_HA_IRDB_2019_1_14_aligned_muscle_cleaned_fasttree_coloured.tree is the overview tree of all the sequences.

 

Files

clustered_HA_kmeans_RNA.csv

Files (33.2 MB)

Name Size Download all
md5:cd04c447228f43b089ff4b67e6f8a66e
343.8 kB Download
md5:2f56a01538205f943a27fa400c12b542
215.2 kB Download
md5:00d492226a96bd12f7d864473747bf16
8.1 kB Download
md5:48bc1a52c3e63079f39dd3d3e4810b91
9.5 kB Download
md5:8cb338c8eaa31739cf37b96a35b3d43e
209.0 kB Download
md5:cf21828ff8a05cf569df751d7cdf9065
235.5 kB Download
md5:8ac8ab358bf8b4269c954b7889cd330b
238.2 kB Preview Download
md5:3f588b9580b1ce94c53bf23d7044a485
4.7 MB Download
md5:f25e3ba6066096abde062513f164ec4b
5.3 MB Download
md5:1fe4dd052f3e05a53a870ab4eee15b4d
5.0 MB Download
md5:45c20a81a4c6ad75150291b0d1cf5994
617.5 kB Download
md5:d780c7cb40ec5dbc8f0c40519bdd6727
16.3 MB Preview Download
md5:b59ea8d7a4cb952a6fe87d44612f98af
1.9 kB Download
md5:7d5d14e5fffceeff508a9b6ecb6cac30
359 Bytes Preview Download