Journal article Open Access

BIGNASim: a NoSQL database structure and analysis portal for nucleic acids simulation data

Adam Hospital; Pau Andrio; Cesare Cugnasco; Laia Codo; Yolanda Becerra; Pablo D. Dans; Federica Battistini; Jordi Torres; Ramón Goñi; Modesto Orozco; Josep Ll. Gelpí

DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="" xmlns="" xsi:schemaLocation="">
  <identifier identifierType="URL"></identifier>
      <creatorName>Adam Hospital</creatorName>
      <creatorName>Pau Andrio</creatorName>
      <creatorName>Cesare Cugnasco</creatorName>
      <creatorName>Laia Codo</creatorName>
      <creatorName>Yolanda Becerra</creatorName>
      <creatorName>Pablo D. Dans</creatorName>
      <creatorName>Federica Battistini</creatorName>
      <creatorName>Jordi Torres</creatorName>
      <creatorName>Ramón Goñi</creatorName>
      <creatorName>Modesto Orozco</creatorName>
      <creatorName>Josep Ll. Gelpí</creatorName>
    <title>BIGNASim: a NoSQL database structure and analysis portal for nucleic acids simulation data</title>
    <date dateType="Issued">2015-11-26</date>
  <resourceType resourceTypeGeneral="JournalArticle"/>
    <alternateIdentifier alternateIdentifierType="url"></alternateIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsSupplementedBy"></relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="References"></relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsIdenticalTo">10.1093/nar/gkv1301</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf"></relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf"></relatedIdentifier>
    <rights rightsURI="">Creative Commons Attribution Non Commercial 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
    <description descriptionType="Abstract">&lt;p&gt;Molecular dynamics simulation (MD) is, just behind genomics, the bioinformatics tool that generates the largest amounts of data, and that is using the largest amount of CPU time in supercomputing centres. MD trajectories are obtained after months of calculations, analysed &lt;em&gt;in situ&lt;/em&gt;, and in practice forgotten. Several projects to generate stable trajectory databases have been developed for proteins, but no equivalence exists in the nucleic acids world. We present here a novel database system to store MD trajectories and analyses of nucleic acids. The initial data set available consists mainly of the benchmark of the new molecular dynamics force-field, parmBSC1. It contains 156 simulations, with over 120 μs of total simulation time. A deposition protocol is available to accept the submission of new trajectory data. The database is based on the combination of two NoSQL engines, Cassandra for storing trajectories and MongoDB to store analysis results and simulation metadata. The analyses available include backbone geometries, helical analysis, NMR observables and a variety of mechanical analyses. Individual trajectories and combined meta-trajectories can be downloaded from the portal.&lt;/p&gt;

&lt;p&gt;The system is accessible through;br&gt;
Supplementary Material is also available on-line at &lt;/p&gt;

      <funderName>European Commission</funderName>
      <funderIdentifier funderIdentifierType="Crossref Funder ID">10.13039/501100000780</funderIdentifier>
      <awardNumber awardURI="info:eu-repo/grantAgreement/EC/H2020/675728/">675728</awardNumber>
      <awardTitle>Centre of Excellence for Biomolecular Research</awardTitle>
Views 51
Downloads 52
Data volume 98.6 MB
Unique views 48
Unique downloads 51


Cite as