Published November 7, 2024 | Version 0.2
Dataset Open

GRPM Dataset

  • 1. Department of Biology, University of Napoli "Federico II", Complesso Universitario Monte Sant'Angelo, Via Cinthia, 80126 Napoli, Italy
  • 2. Institute of Biomolecular Chemistry (ICB), National Research Council (CNR), Via Campi Flegrei 34, 80078 Pozzuoli, Italy

Description

This repository hosts datasets generated with the GRPM system, a modular resource for integrating and analyzing genetic polymorphism data reported in the literature on specific biomedical subjects.

GitHub: GRPM_system

medRxiv DOI: 10.1101/2023.08.04.23293659


Contents:

1. `grpm_dataset.zip`: This archive contains the GRPM Dataset, a consolidated resource of genetic polymorphism information relevant to various biomedical traits. The dataset is built by harmonizing data from LitVar and PubMed and is annotated with Medical Subject Headings (MeSH) terms. It provides a foundation for examining the genetic factors that influence these traits.

2. `grpm_surveys.zip`: The GRPM Surveys archive contains the results of targeted queries submitted to the GRPM Dataset using predefined MeSH terms. These queries cover ten topics relevant to nutrigenetics and nutrition-related traits.  

3. `nutrigentic_dataset.zip`: This dataset is a collection of genetic polymorphisms associated with nutrigenetics. It has been enriched with data from Genome-Wide Association Studies (GWAS) to give an overview of the genetic variation relevant to personalized nutrition. It is intended for researchers and nutritionists investigating the genetic basis of nutrition interventions.

The repository also includes the files needed to run the software and reproduce the full data integration and query workflow:

4. `human_genes.csv`: Ensembl Human Genes dataset.
5. `gwas_data.zip`: GWAS Catalog data and its semantic alignment with MeSH.
6. `ref-mesh.zip`: MeSH terms associated with nutrition-related traits.
7. `nbib_data.zip`: Accessory data for all the publications collected during the MEDLINE data retrieval process.
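
Before extracting any of the archives listed above, it can help to see what they contain. The following is a minimal sketch using Python's standard `zipfile` module; the helper name `list_archive` is hypothetical, and nothing is assumed here about the internal layout of the archives.

```python
import zipfile


def list_archive(path):
    """Return (member name, uncompressed size in bytes) for each file in a zip archive."""
    with zipfile.ZipFile(path) as archive:
        return [
            (info.filename, info.file_size)
            for info in archive.infolist()
            if not info.is_dir()  # skip directory entries
        ]
```

For example, `list_archive("grpm_surveys.zip")` returns the file names and sizes in the surveys archive, assuming it has been downloaded to the working directory.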

Files

Files (1.5 GB)

md5:db80eb945725d71a1d564bfd065914fc (42.1 MB)
md5:c9261b7ce6b1798ef4807a6f8382c37f (1.3 GB)
md5:d3623a951ad481f08a8db9757f2d972f (126.3 MB)
md5:46ea41d187bbe14c117c5e2692d03676 (15.3 MB)
md5:56b4b5b71213663d58f6d03d3c4c19f7 (4.8 MB)
md5:69620d14e1a9b387ade939cc4ef965a6 (16.6 MB)
md5:c32a1df8c3e4f58744458ee44223dced (4.0 MB)
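
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard `hashlib` module. The listing does not show which checksum belongs to which file, so the sketch only checks whether a file's digest appears among the listed values. The function names and the `downloads` directory are hypothetical.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing. The file each one belongs to is not
# shown there, so they are kept as an unordered set.
LISTED_MD5 = {
    "db80eb945725d71a1d564bfd065914fc",
    "c9261b7ce6b1798ef4807a6f8382c37f",
    "d3623a951ad481f08a8db9757f2d972f",
    "46ea41d187bbe14c117c5e2692d03676",
    "56b4b5b71213663d58f6d03d3c4c19f7",
    "69620d14e1a9b387ade939cc4ef965a6",
    "c32a1df8c3e4f58744458ee44223dced",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use low for the larger archives.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed=LISTED_MD5):
    """Return True if the file's MD5 digest is one of the listed checksums."""
    return md5_of_file(path) in listed


if __name__ == "__main__":
    # Hypothetical download directory; adjust to where the files were saved.
    for path in sorted(Path("downloads").glob("*")):
        if path.is_file():
            status = "ok" if matches_listing(path) else "MISMATCH"
            print(f"{path.name}: {status}")
```

A mismatch usually means the download was truncated or the file was modified, in which case downloading it again is the simplest remedy.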

Additional details

Related works

Is referenced by
Preprint: 10.1101/2023.08.04.23293659 (DOI)
Is supplement to
Software: https://github.com/johndef64/GRPM_system (URL)
Conference proceeding: 10.1007/978-3-031-71382-8_2 (DOI)