Published November 12, 2023 | Version v1
Conference proceeding Open

The overview of the BioRED (Biomedical Relation Extraction Dataset) track at BioCreative VIII

  • 1. National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), MD, 20894, Bethesda, USA
  • 2. School of Computer Science and Technology, Dalian University of Technology, 116024, Dalian, China

Description

Abstract

The BioRED track at BioCreative VIII calls for a community effort to identify, semantically categorize, and highlight the novelty factor of the relationships between biomedical entities in unstructured text. Relation extraction is crucial for many Natural Language Processing (NLP) applications, from drug discovery to custom medical solutions. While previous community challenges focused on identifying relationships of a single type (i.e., protein-protein interactions), categorized relationships into different semantic categories but did not require entity normalization, or worked at the sentence level, real-world applications necessitate that entities are linked to specific knowledge base records, relationships encountered in any given document generally occur between different entity types, and perusal of the whole document provides valuable additional detail. In addition, journal publications often distinguish between novel findings and background information or prior knowledge.

 

This article is part of the Proceedings of the BioCreative VIII Challenge and Workshop: Curation and Evaluation in the era of Generative Models.

Files

BioCreativeVIII-BioRED-Track-Overview.pdf

Files (450.2 kB)

Name Size Download all
md5:eb5f8a4dd003323c72ebf99e78557fa8
450.2 kB Preview Download

Additional details

Related works

Is published in
Conference proceeding: 10.5281/zenodo.10103190 (DOI)