Dataset Open Access

Drug Indications Extracted from FAERS

Stupp, Gregory S; Su, Andrew I


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Stupp, Gregory S</dc:creator>
  <dc:creator>Su, Andrew I</dc:creator>
  <dc:date>2018-09-28</dc:date>
  <dc:description>This dataset contains drug indications extracted from the FDA Adverse Event Reporting System (FAERS).

Source code here: https://github.com/stuppie/faers

Method Outline


	Data files are extracted from zip files, parsed as CSV files, and imported into a MySQL database (see parser.py).
	Records are then de-duplicated by keeping only the most recent version of each case ID (see dedupe.py).
	Indications are normalized to UMLS terms by string matching. Cross-references to the Human Phenotype Ontology are pulled from UMLS, and cross-references to the Monarch Disease Ontology (MONDO) are pulled from MONDO using the UMLS cross-references. (See normalize_indications.py)
	Drug names are normalized first by applying a few simple string-cleaning operations (strip, fix slashes and periods). They are then matched to RxNorm by exact string matching. Names that don't match are run against RxNorm's approximate matching service and are accepted if the score is higher than 67/100. The matched RxNorm CUIs are then mapped to their ingredient-level RxNorm ID. (See normalize_drugs.py)
	Indications are then retrieved for each drug ingredient and filtered to require a minimum of 20 individual occurrences. (See get_indications.py)
</dc:description>
  <dc:identifier>https://zenodo.org/record/1436000</dc:identifier>
  <dc:identifier>10.5281/zenodo.1436000</dc:identifier>
  <dc:identifier>oai:zenodo.org:1436000</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:relation>info:eu-repo/grantAgreement/NIH/NATIONAL+INSTITUTE+OF+GENERAL+MEDICAL+SCIENCES/2R01GM089820-06/</dc:relation>
  <dc:relation>doi:10.5281/zenodo.1435999</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/publicdomain/zero/1.0/legalcode</dc:rights>
  <dc:subject>faers, drug indications, fda, drugs</dc:subject>
  <dc:title>Drug Indications Extracted from FAERS</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
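
The three numeric rules in the method outline (keep the latest version per case ID, accept approximate matches scoring above 67, require at least 20 occurrences) can be illustrated with a minimal Python sketch. This is not the code from the linked repository: the function names, the `case_id` and `version` field names, and the in-memory data layout are all assumptions made for illustration, and the real pipeline works against a MySQL database.

```python
from collections import Counter


def dedupe_latest(reports):
    """Keep only the highest-versioned report for each case ID.

    `reports` is an iterable of dicts with hypothetical keys
    "case_id" and "version" (illustrative names, not the FAERS schema).
    """
    latest = {}
    for report in reports:
        kept = latest.get(report["case_id"])
        if kept is None or report["version"] > kept["version"]:
            latest[report["case_id"]] = report
    return list(latest.values())


def accept_match(score, threshold=67):
    """Accept an approximate match only if its score is strictly above the threshold."""
    return score > threshold


def filter_indications(pairs, min_count=20):
    """Count (ingredient, indication) pairs and keep those seen at least min_count times."""
    counts = Counter(pairs)
    return {pair: n for pair, n in counts.items() if n >= min_count}
```

The comparisons follow the wording of the description: "higher than 67/100" is read as a strict inequality, and "a minimum of 20" as inclusive.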