PGxMine

Jake Lever

doi:10.5281/zenodo.6617348

Published June 6, 2022 | Version v10

Dataset Open

Jake Lever¹

1. University of Glasgow

This record describes the output files of the PGxMine project. The code for this viewer is available in the PGxMine GitHub repo if you want to run it independently. Each file is tab-delimited, with a header row, no comment lines and no quoting.
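Because the files are tab-delimited with a header and no quoting, a reader should treat quote characters as ordinary data. The following is a minimal sketch using Python's standard csv module; the helper name read_tsv and the column names in the in-memory sample are placeholders, not names taken from the PGxMine files.

```python
import csv
import io


def read_tsv(handle):
    """Yield one dict per row, keyed by the names in the header line."""
    # QUOTE_NONE: quote characters get no special treatment and stay in the data.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    yield from reader


# Tiny in-memory stand-in for a real file; column names are placeholders.
sample = io.StringIO('col_a\tcol_b\nplain\t"kept as-is"\n')
rows = list(read_tsv(sample))
print(rows[0]["col_b"])  # the surrounding quotes are preserved
```

For a real file, the handle would typically come from open(path, newline="", encoding="utf-8"); the UTF-8 encoding is an assumption.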

If you only need the list of chemical/variant associations, use pgxmine_collated.tsv. For the supporting sentences, see pgxmine_sentences.tsv; the matching_id column connects the two files. To dig further, and if you can accept a higher false positive rate, see pgxmine_unfiltered.tsv.
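Connecting the two files through matching_id can be sketched as a simple group-and-lookup. This assumes matching_id appears as a column in both pgxmine_collated.tsv and pgxmine_sentences.tsv and that rows have already been parsed into dicts; the function names and the other column names in the demo data are hypothetical.

```python
from collections import defaultdict


def group_sentences(sentence_rows):
    """Group sentence rows by their matching_id value."""
    by_id = defaultdict(list)
    for row in sentence_rows:
        by_id[row["matching_id"]].append(row)
    return by_id


def attach_sentences(collated_rows, sentence_rows):
    """Yield (association, supporting_sentences) pairs."""
    by_id = group_sentences(sentence_rows)
    for assoc in collated_rows:
        # Associations without any matching sentence get an empty list.
        yield assoc, by_id.get(assoc["matching_id"], [])


# Placeholder rows; only the matching_id column name comes from the description.
collated = [{"matching_id": "id-1"}, {"matching_id": "id-2"}]
sentences = [
    {"matching_id": "id-1", "text": "first placeholder sentence"},
    {"matching_id": "id-1", "text": "second placeholder sentence"},
]
for assoc, supporting in attach_sentences(collated, sentences):
    print(assoc["matching_id"], len(supporting))
```

Grouping the sentences once keeps the lookup per association cheap, at the cost of holding the grouped sentence rows in memory.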

pgxmine_collated.tsv: This contains the chemical/variant associations with the citation counts supporting them. It contains the normalized chemical, variant and, where appropriate, gene names, with identifiers for PharmGKB, dbSNP and Entrez.

pgxmine_sentences.tsv: This contains the supporting sentences for the chemical/variant associations in the collated file. Each row is a single supporting sentence for one association. The file contains information on the source publication (e.g. journal, publication date), the actual sentence and the chemical/variant association extracted.

pgxmine_unfiltered.tsv: This is the combined raw output of the createKB.py script across all of PubMed, PubMed Central Open Access and the PubMed Central Author Manuscript Collection. It contains every predicted relation with a prediction score above 0.5, so it may contain many false positives. Each row contains information on the publication (e.g. journal, publication date) along with the sentence and the specific chemical/variant association.
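Since the unfiltered file keeps everything scoring above 0.5, one way to trade recall for precision is to apply a stricter cutoff yourself. The sketch below assumes the file exposes the prediction score as a column; the column name "score", the default threshold of 0.9 and the demo rows are all assumptions for illustration, so check the file's header for the actual name.

```python
def filter_by_score(rows, threshold=0.9, score_column="score"):
    """Yield only rows whose prediction score is at least `threshold`."""
    for row in rows:
        # Values parsed from a TSV are strings, so convert before comparing.
        if float(row[score_column]) >= threshold:
            yield row


# Placeholder rows; real rows would be parsed from pgxmine_unfiltered.tsv.
demo = [
    {"label": "a", "score": "0.97"},
    {"label": "b", "score": "0.55"},
]
kept = list(filter_by_score(demo, threshold=0.9))
print([row["label"] for row in kept])
```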

Files (324.3 MB)

Name	MD5	Size
pgxmine_collated.tsv	9df9a14c53994f42dd0d4cd207692790	2.2 MB
pgxmine_sentences.tsv	338ec3ba7a81bde93e373a40909e5c5e	110.5 MB
pgxmine_unfiltered.tsv	2f1fe8044c73b918b13686cd6b1e0ef0	211.6 MB
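The MD5 checksums listed for each file can be used to confirm that a download completed intact. This is a minimal sketch with Python's hashlib; the checksums are copied from the listing for version v10, while the helper names md5_of and verify are hypothetical.

```python
import hashlib

# MD5 checksums as listed for version v10 of this record.
EXPECTED_MD5 = {
    "pgxmine_collated.tsv": "9df9a14c53994f42dd0d4cd207692790",
    "pgxmine_sentences.tsv": "338ec3ba7a81bde93e373a40909e5c5e",
    "pgxmine_unfiltered.tsv": "2f1fe8044c73b918b13686cd6b1e0ef0",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, name):
    """Return True if the file at `path` matches the listed checksum for `name`."""
    return md5_of(path) == EXPECTED_MD5[name]
```

For example, verify("pgxmine_collated.tsv", "pgxmine_collated.tsv") should return True for an intact copy of the v10 file saved in the current directory.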
