Dataset Open Access
Stupp, Gregory S;
Su, Andrew I
<?xml version='1.0' encoding='utf-8'?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:adms="http://www.w3.org/ns/adms#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dct="http://purl.org/dc/terms/" xmlns:dctype="http://purl.org/dc/dcmitype/" xmlns:dcat="http://www.w3.org/ns/dcat#" xmlns:duv="http://www.w3.org/ns/duv#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:frapo="http://purl.org/cerif/frapo/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:gsp="http://www.opengis.net/ont/geosparql#" xmlns:locn="http://www.w3.org/ns/locn#" xmlns:org="http://www.w3.org/ns/org#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:prov="http://www.w3.org/ns/prov#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:schema="http://schema.org/" xmlns:skos="http://www.w3.org/2004/02/skos/core#" xmlns:vcard="http://www.w3.org/2006/vcard/ns#" xmlns:wdrs="http://www.w3.org/2007/05/powder-s#"> <rdf:Description rdf:about="https://doi.org/10.5281/zenodo.1436000"> <rdf:type rdf:resource="http://www.w3.org/ns/dcat#Dataset"/> <dct:type rdf:resource="http://purl.org/dc/dcmitype/Dataset"/> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">https://doi.org/10.5281/zenodo.1436000</dct:identifier> <foaf:page rdf:resource="https://doi.org/10.5281/zenodo.1436000"/> <dct:creator> <rdf:Description rdf:about="http://orcid.org/0000-0002-0644-7212"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">0000-0002-0644-7212</dct:identifier> <foaf:name>Stupp, Gregory S</foaf:name> <foaf:givenName>Gregory S</foaf:givenName> <foaf:familyName>Stupp</foaf:familyName> <org:memberOf> <foaf:Organization> <foaf:name>The Scripps Research Institute</foaf:name> </foaf:Organization> </org:memberOf> </rdf:Description> </dct:creator> <dct:creator> <rdf:Description rdf:about="http://orcid.org/0000-0002-9859-4104"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> 
<dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">0000-0002-9859-4104</dct:identifier> <foaf:name>Su, Andrew I</foaf:name> <foaf:givenName>Andrew I</foaf:givenName> <foaf:familyName>Su</foaf:familyName> <org:memberOf> <foaf:Organization> <foaf:name>The Scripps Research Institute</foaf:name> </foaf:Organization> </org:memberOf> </rdf:Description> </dct:creator> <dct:title>Drug Indications Extracted from FAERS</dct:title> <dct:publisher> <foaf:Agent> <foaf:name>Zenodo</foaf:name> </foaf:Agent> </dct:publisher> <dct:issued rdf:datatype="http://www.w3.org/2001/XMLSchema#gYear">2018</dct:issued> <dcat:keyword>faers, drug indications, fda, drugs</dcat:keyword> <frapo:isFundedBy rdf:resource="info:eu-repo/grantAgreement/NIH/NATIONAL_INSTITUTE_OF_GENERAL_MEDICAL_SCIENCES/2R01GM089820-06/"/> <schema:funder> <foaf:Organization> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">10.13039/100000002</dct:identifier> <foaf:name>National Institutes of Health</foaf:name> </foaf:Organization> </schema:funder> <dct:issued rdf:datatype="http://www.w3.org/2001/XMLSchema#date">2018-09-28</dct:issued> <dct:language rdf:resource="http://publications.europa.eu/resource/authority/language/ENG"/> <owl:sameAs rdf:resource="https://zenodo.org/record/1436000"/> <adms:identifier> <adms:Identifier> <skos:notation rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">https://zenodo.org/record/1436000</skos:notation> <adms:schemeAgency>url</adms:schemeAgency> </adms:Identifier> </adms:identifier> <dct:isVersionOf rdf:resource="https://doi.org/10.5281/zenodo.1435999"/> <owl:versionInfo>2018q2</owl:versionInfo> <dct:description><p>This dataset contains drug indications extracted from the&nbsp;FDA Adverse Event Reporting System (<a href="https://www.fda.gov/drugs/guidancecomplianceregulatoryinformation/surveillance/adversedrugeffects/">FAERS</a>).</p> <p>Source code here:&nbsp;<a href="https://github.com/stuppie/faers">https://github.com/stuppie/faers</a></p> 
<p><strong>Method Outline</strong></p> <ul> <li>Data files are extracted from zip files, parsed from CSV files, and imported into a MySQL database (see&nbsp;parser.py).</li> <li>Records are then de-duplicated by keeping only the most recent version for each case ID (see&nbsp;dedupe.py).</li> <li>Indications are normalized to UMLS terms by string matching. Cross-references to Human Phenotype Ontology are pulled from UMLS, and xrefs to Monarch Disease Ontology (MONDO) are pulled from MONDO using the UMLS xrefs. (See&nbsp;normalize_indications.py)</li> <li>Drug names are normalized first by applying a few simple string cleaning operations (strip, fix slashes and periods). They are then matched to RxNorm by exact string matching. Those that don&#39;t match are run against RxNorm&#39;s <a href="https://rxnav.nlm.nih.gov/RxNormAPIs.html#uLink=RxNorm_REST_getApproximateMatch">approximate matching service</a>, and are accepted if the score is higher than&nbsp;67/100. The matched RxNorm CUIs are then mapped to their Ingredient-level RxNorm ID. (See&nbsp;normalize_drugs.py)</li> <li>Indications are then retrieved for each drug ingredient and filtered to require a minimum of 20 individual occurrences. 
(See&nbsp;get_indications.py)</li> </ul></dct:description> <dct:accessRights rdf:resource="http://publications.europa.eu/resource/authority/access-right/PUBLIC"/> <dct:accessRights> <dct:RightsStatement rdf:about="info:eu-repo/semantics/openAccess"> <rdfs:label>Open Access</rdfs:label> </dct:RightsStatement> </dct:accessRights> <dcat:distribution> <dcat:Distribution> <dct:license rdf:resource="https://creativecommons.org/publicdomain/zero/1.0/legalcode"/> <dcat:accessURL rdf:resource="https://doi.org/10.5281/zenodo.1436000"/> </dcat:Distribution> </dcat:distribution> <dcat:distribution> <dcat:Distribution> <dcat:accessURL rdf:resource="https://doi.org/10.5281/zenodo.1436000"/> <dcat:byteSize>242342</dcat:byteSize> <dcat:downloadURL rdf:resource="https://zenodo.org/record/1436000/files/faers_indications.csv"/> <dcat:mediaType>text/csv</dcat:mediaType> </dcat:Distribution> </dcat:distribution> </rdf:Description> <foaf:Project rdf:about="info:eu-repo/grantAgreement/NIH/NATIONAL_INSTITUTE_OF_GENERAL_MEDICAL_SCIENCES/2R01GM089820-06/"> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">2R01GM089820-06</dct:identifier> <dct:title>Gene Wiki: expanding the ecosystem of community-intelligence resources</dct:title> <frapo:isAwardedBy> <foaf:Organization> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">10.13039/100000002</dct:identifier> <foaf:name>National Institutes of Health</foaf:name> </foaf:Organization> </frapo:isAwardedBy> </foaf:Project> </rdf:RDF>
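The de-duplication, name-cleaning, score-threshold, and minimum-count steps in the method outline can be sketched in a few lines of Python. This is only an illustrative sketch, not the code in the linked repository: the record field names (`case_id`, `version`, `ingredient`, `indication`), the function names, and the exact cleaning rules are assumptions made for the example. Only the thresholds (score higher than 67, at least 20 occurrences) come from the description.

```python
from collections import Counter

MIN_APPROX_SCORE = 67   # from the description: accept if score is higher than 67/100
MIN_OCCURRENCES = 20    # from the description: minimum of 20 individual occurrences


def dedupe_latest(records):
    """Keep only the most recent version of each case ID.

    Assumes each record is a dict with hypothetical 'case_id' and
    'version' keys, where a larger 'version' means a more recent report.
    """
    latest = {}
    for rec in records:
        kept = latest.get(rec["case_id"])
        if kept is None or rec["version"] > kept["version"]:
            latest[rec["case_id"]] = rec
    return list(latest.values())


def clean_drug_name(name):
    """Guess at the 'strip, fix slashes and periods' cleaning step.

    The actual rules are not given in the description; here we trim
    whitespace, turn backslashes into forward slashes, and drop
    trailing periods.
    """
    return name.strip().replace("\\", "/").rstrip(".").strip()


def accept_approximate_match(score):
    """Accept an approximate match only if its score is higher than 67."""
    return score > MIN_APPROX_SCORE


def frequent_indications(records, min_count=MIN_OCCURRENCES):
    """Count (ingredient, indication) pairs and keep the frequent ones."""
    counts = Counter((r["ingredient"], r["indication"]) for r in records)
    return {pair: n for pair, n in counts.items() if n >= min_count}


if __name__ == "__main__":
    reports = [
        {"case_id": 1, "version": 1, "ingredient": "a", "indication": "x"},
        {"case_id": 1, "version": 2, "ingredient": "a", "indication": "y"},
    ]
    # Only version 2 of case 1 survives de-duplication.
    print(dedupe_latest(reports))
```

Keeping each step as a small pure function mirrors the one-script-per-step layout named in the outline and makes the two thresholds easy to adjust.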
Metric | All versions | This version |
---|---|---|
Views | 2,621 | 2,621 |
Downloads | 489 | 489 |
Data volume | 118.5 MB | 118.5 MB |
Unique views | 2,466 | 2,466 |
Unique downloads | 462 | 462 |