Published October 22, 2022 | Version v2
Dataset Open

PHEE: A Dataset for Pharmacovigilance Event Extraction from Text

  • 1. University of Warwick
  • 2. Northeastern University
  • 3. Astra Zeneca
  • 4. King's College London

Description

The PHEE dataset contains over 5,000 finely annotated pharmacovigilance events from public medical case reports. Two types of events, the adverse events and the potential therapeutic events, are annotated. For each event, we annotate the event trigger and hierarchical arguments.  The main arguments (coarse-grained spans) include subject, treatment and effect. Further fine-grained sub-arguments - age, gender, race, number of patients (labelled as population) and preexisting conditions (labelled as subject.disorder) for the subject argument and drug (and their combinations), dosage, frequency, route, time-elapsed, duration, target disorder (labelled as treatment.disorder) for the treatment argument - are then annotated upon main arguments.

We provide two formats of data: visualisation-friendly brat-format data and structured json data for the convenience of use.

 

Files

data.zip

Files (8.5 MB)

Name Size Download all
md5:f9310f0ca475cb9eaef3789abc32bbcc
8.5 MB Preview Download

Additional details

Related works

Is published in
Conference paper: arXiv:2210.12560 (arXiv)