Published October 18, 2018 | Version 1.0.0
Dataset Open

PathoPhenoDB: a database of pathogen-phenotype associations

  • 1. King Abdullah University of Science and Technology
  • 2. University of Cambridge


PathoPhenoDB is a database containing pathogen-to-phenotype associations mined from the scientific literature. PathoPhenoDB relies on manual curation of pathogen-disease relations, and on ontology-based text mining to associate phenotypes with infectious disease. Using Semantic Web technologies, PathoPhenoDB also links to knowledge about drug resistance mechanisms and drugs used in the treatment of infectious diseases. The information in PathoPhenoDB can provide background knowledge about known pathogens, diseases, and phenotypes, and the drugs to which they respond; it therefore provides a tool for research on infectious diseases and drug mechanisms. PathoPhenoDB is accessible at, and the data is freely available through a public SPARQL endpoint.


Files (41.7 MB)

Name Size Download all
41.7 MB Download