PHEE: A Dataset for Pharmacovigilance Event Extraction from Text
Authors/Creators
- 1. University of Warwick
- 2. Northeastern University
- 3. Astra Zeneca
- 4. King's College London
Description
The PHEE dataset contains over 5,000 finely annotated pharmacovigilance events from public medical case reports. Two types of events, the adverse events and the potential therapeutic events, are annotated. For each event, we annotate the event trigger and hierarchical arguments. The main arguments (coarse-grained spans) include subject, treatment and effect. Further fine-grained sub-arguments - age, gender, race, number of patients (labelled as population) and preexisting conditions (labelled as subject.disorder) for the subject argument and drug (and their combinations), dosage, frequency, route, time-elapsed, duration, target disorder (labelled as treatment.disorder) for the treatment argument - are then annotated upon main arguments.
We provide two formats of data: visualisation-friendly brat-format data and structured json data for the convenience of use.
Files
Additional details
Related works
- Is published in
- Conference paper: arXiv:2210.12560 (arXiv)