Published April 4, 2024 | Version v1
Dataset Open

A BIDIRECTIONAL ENCODER-DECODER MODEL WITH ATTENTIONMECHANISM FOR NESTED NAMED ENTITY RECOGNITION

  • 1. Ecole Doctorale Polytechnique, Institut National Polytechnique (INP-HB), Yamoussoukro, Cote dIvoire.
  • 2. Laboratoire de Recherche en Informatique et Telecommunication (LARIT).

Description

Named entity recognition is a fundamental task for several natural language processing applications. It consists in identifying mentions of named entities in a text, then classifying them according to predefined entity types. Most labeling methods for this task use a label to recognize flat named entities because they belong to a single entity type. Therefore, they cannot recognize named entities that belong to multiple entity types.In this work, we concatenated all the labels of a word of a named entity into a joint in order to recognize flat or nested named entities. Then, we proposed a bidirectional encoder-decoder model with attention mechanism that uses this joint label to fine-tune a pre-trained language model for named entity recognition.We experimented our method on GENIA (a nested named entity dataset) and on two flat named entity datasets: CoNLL-2003 and i2b2 2010. Using the BioBERT model, our method achieved an F1 score of 78.85% on the GENIA dataset, 93.22% and 87.51% on CoNLL-2003 and i2b2 2010 respectively. These results show that our method can effectively recognize flat named entities as well as nested named entities.

 

Files

7.pdf

Files (636.5 kB)

Name Size Download all
md5:90938cc33c381b3abda9c538cbd76b44
636.5 kB Preview Download