There is a newer version of the record available.

Published July 8, 2024 | Version V1.0
Publication Open

LSD600: the first corpus of biomedical abstracts annotated with lifestyle–disease relations

  • 1. Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Denmark
  • 2. TurkuNLP Group, Department of Computing, University of Turku, Finland

Description

  • In this study, we introduce LSD600, the first corpus specifically focused on LSF-disease relations. LSD600 consists of 600 abstracts annotated with LSF-disease relations, encompassing 1897 relations covering eight different relation types.

 

  • The annotated LSF entities in these relations are pre-annotate using a comprehensive LSF classification (Nourani et al., 2024) and cover  a wide spectrum of lifestyle factors belonging to nine categories.

 

  • We have used LSD600 to train a transformer-based model on the multi-label LSF-disease RE task.

 

Files

RE_Annotation_Guidelines.pdf

Files (1.3 GB)

Name Size Download all
md5:beb9c3b06cae00b1ded81b99530e0823
1.3 GB Download
md5:b5194313b0ff6443148f1f63cdd143ac
604.2 kB Download
md5:7177bbbfb4ef2c937c3297e37a97fcba
208.7 kB Preview Download

Additional details

Dates

Created
2024