Published July 8, 2024 | Version V1.0
Publication
Open
LSD600: the first corpus of biomedical abstracts annotated with lifestyle–disease relations
Creators
- 1. Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Denmark
- 2. TurkuNLP Group, Department of Computing, University of Turku, Finland
Description
- In this study, we introduce LSD600, the first corpus specifically focused on relations between lifestyle factors (LSFs) and diseases. LSD600 consists of 600 abstracts annotated with 1897 LSF-disease relations covering eight different relation types.
- The LSF entities in these relations are pre-annotated using a comprehensive LSF classification (Nourani et al., 2024) and cover a wide spectrum of lifestyle factors belonging to nine categories.
- We have used LSD600 to train a transformer-based model on the multi-label LSF-disease relation extraction (RE) task.
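To illustrate what "multi-label" means for this task, the following minimal sketch encodes the relation types assigned to a single LSF-disease pair as a 0/1 vector, and decodes per-label scores back into a set of types. It is not the authors' implementation. The record does not list the eight relation types, so the label names here are placeholders, and the helper names `to_multi_hot` and `from_scores` as well as the 0.5 threshold are assumptions made for illustration.

```python
from typing import Iterable, List

# Placeholder names: the record states that there are eight relation types
# but does not list them, so these labels are hypothetical.
RELATION_TYPES: List[str] = [f"RELATION_TYPE_{i}" for i in range(1, 9)]


def to_multi_hot(labels: Iterable[str],
                 label_set: List[str] = RELATION_TYPES) -> List[int]:
    """Encode the relation types of one LSF-disease pair as a 0/1 vector.

    In a multi-label setting a pair may carry zero, one, or several
    relation types at once, so more than one position can be 1.
    """
    active = set(labels)
    unknown = active - set(label_set)
    if unknown:
        raise ValueError(f"Unknown relation types: {sorted(unknown)}")
    return [1 if name in active else 0 for name in label_set]


def from_scores(scores: List[float],
                threshold: float = 0.5,
                label_set: List[str] = RELATION_TYPES) -> List[str]:
    """Decode per-label scores: each label is decided independently,
    and every label scoring at or above the threshold is predicted."""
    return [name for name, score in zip(label_set, scores) if score >= threshold]
```

The key design point is that each label is thresholded on its own rather than picking a single best class, which is what distinguishes multi-label from multi-class classification.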
Files
RE_Annotation_Guidelines.pdf
Files (1.3 GB)
File checksum | Size |
---|---|
md5:beb9c3b06cae00b1ded81b99530e0823 | 1.3 GB |
md5:b5194313b0ff6443148f1f63cdd143ac | 604.2 kB |
md5:7177bbbfb4ef2c937c3297e37a97fcba | 208.7 kB |
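The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_checksum` are illustrative, and the local file path is whatever name the file was saved under, since the listing does not pair checksums with file names.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks so that
    large files (such as the 1.3 GB archive) are never fully loaded into memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: Path, listed: str) -> bool:
    """Compare a file against a checksum written as 'md5:<hex>' or plain '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_checksum(Path("downloaded_file"), "md5:beb9c3b06cae00b1ded81b99530e0823")` returns `True` only if the local file, here given a hypothetical name, matches the first listed checksum.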
Additional details
Dates
- Created: 2024