Preprint Open Access

Mutational dynamics of influenza A viruses: a principal component analysis of hemagglutinin sequences of subtype H1

Sanejouand, Yves-Henri

DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="" xmlns="" xsi:schemaLocation="">
  <identifier identifierType="DOI">10.5281/zenodo.2566385</identifier>
      <creatorName>Sanejouand, Yves-Henri</creatorName>
    <title>Mutational dynamics of influenza A viruses: a principal component analysis of hemagglutinin sequences of subtype H1</title>
    <subject>principal component analysis</subject>
    <subject>multiple sequence alignment</subject>
    <date dateType="Issued">2018-01-09</date>
  <resourceType resourceTypeGeneral="Text">Preprint</resourceType>
    <alternateIdentifier alternateIdentifierType="url"></alternateIdentifier>
    <relatedIdentifier relatedIdentifierType="arXiv" relationType="IsNewVersionOf">arXiv:1710.01594</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.2566384</relatedIdentifier>
    <rights rightsURI="">Creative Commons Attribution 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
    <description descriptionType="Abstract">&lt;p&gt;A principal component analysis of a multiple sequence alignment of hemagglutinin sequences of subtype H1 has been performed, the sequences being encoded using the amino-acid property that maximizes the weight of the major component. In the case of this alignment, it happens to be a well-known hydrophobicity scale. Interestingly, sequences of human origin have large positive amplitudes along the major component before 2009, and large negative ones afterwards. This strongly suggests that the 2009 pandemic was associated with a major change in the hydrophobicity pattern of hemagglutinin.&lt;br&gt;
The present analysis also highlights the high variability of viral sequences of swine origin. At a more general level, the method proposed herein makes it possible to describe a sequence taken from an alignment with a set of numbers, the original point being that the choice of the corresponding property is driven by the data.&lt;/p&gt;
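
The data-driven choice of property described in the abstract can be illustrated with a minimal sketch. It is not the author's code. It assumes that the "weight" of the major component is the fraction of total variance it carries, that PCA is run on column-centered (not rescaled) data, and that gaps are encoded as 0.0. The Kyte-Doolittle hydropathy scale stands in for the unnamed hydrophobicity scale, the charge scale is a crude illustrative alternative, and the toy alignment and all function names are hypothetical.

```python
import numpy as np

# Kyte-Doolittle hydropathy values (an assumed stand-in; the abstract does not name the scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Crude formal-charge scale, used only as a second candidate property.
CHARGE = {aa: 0.0 for aa in KYTE_DOOLITTLE}
CHARGE.update({"K": 1.0, "R": 1.0, "D": -1.0, "E": -1.0})

SCALES = {"hydropathy": KYTE_DOOLITTLE, "charge": CHARGE}

# Hypothetical toy alignment: equal-length sequences, '-' would denote a gap.
ALIGNMENT = ["MKAIL", "MKAIV", "MKDEL", "MRDEV"]


def encode(alignment, scale):
    """Map each aligned residue to its property value (gaps/unknowns -> 0.0, an assumption)."""
    return np.array([[scale.get(aa, 0.0) for aa in seq] for seq in alignment])


def major_component(alignment, scale):
    """Return (weight, amplitudes) of the first principal component.

    weight     : fraction of total variance carried by the first component
    amplitudes : projection of each sequence onto that component
    """
    x = encode(alignment, scale)
    x = x - x.mean(axis=0)  # center every alignment column
    # SVD of the centered matrix: squared singular values are
    # proportional to the eigenvalues of the covariance matrix.
    u, s, _ = np.linalg.svd(x, full_matrices=False)
    variances = s ** 2
    total = variances.sum()
    weight = variances[0] / total if total > 0 else 0.0
    amplitudes = u[:, 0] * s[0]
    return weight, amplitudes


def best_scale(alignment, scales):
    """Pick the name of the scale whose first component has the largest weight."""
    return max(scales, key=lambda name: major_component(alignment, scales[name])[0])


if __name__ == "__main__":
    for name, scale in SCALES.items():
        weight, amplitudes = major_component(ALIGNMENT, scale)
        print(name, round(float(weight), 3), np.round(amplitudes, 2))
    print("selected:", best_scale(ALIGNMENT, SCALES))
```

In this sketch each sequence is summarized by its amplitude along the major component, so a change of sign between groups of sequences (for example before and after a given year) would show up directly in the `amplitudes` array. The sign of a principal component is arbitrary, so only relative signs between sequences are meaningful.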

                    All versions    This version
Views               129             130
Downloads           28              28
Data volume         19.2 MB         19.2 MB
Unique views        126             127
Unique downloads    26              26

