Published March 7, 2022 | Version v1
Conference paper Open

Elastic Embedded Background Linking for News Articles with Keywords, Entities and Events

  • 1. University of La Rochelle

Description

In this paper, we present a collection of five flexible background linking models created for the News Track at TREC 2021. Each model generates a ranked list of articles that provide contextual information. The collection is built on sentence-embedding indexes created with Sentence-BERT and Open Distro for Elasticsearch. For each model, we explore additional tools, ranging from keyword extraction with YAKE to entity and event detection, along with a linear combination. The associated code is available online as open-source software.
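
As an illustration only, the sketch below shows one way a linear combination could rank candidate background articles. It is not the authors' implementation: the description does not say which scores are combined, so the sketch assumes a mix of embedding cosine similarity and keyword overlap. The function names, the weight `alpha`, and the toy three-dimensional vectors (standing in for real sentence embeddings) are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def keyword_overlap(query_keywords, doc_keywords):
    """Jaccard overlap between two keyword sets (assumed keyword score)."""
    q, d = set(query_keywords), set(doc_keywords)
    union = q | d
    return len(q & d) / len(union) if union else 0.0


def rank_background(query, candidates, alpha=0.7):
    """Rank candidates by alpha * cosine + (1 - alpha) * keyword overlap.

    alpha is a hypothetical weight, not a value taken from the paper.
    """
    scored = []
    for doc in candidates:
        score = (alpha * cosine(query["embedding"], doc["embedding"])
                 + (1 - alpha) * keyword_overlap(query["keywords"], doc["keywords"]))
        scored.append((doc["id"], score))
    # Highest combined score first.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data: hand-written vectors and keywords, not real model output.
query = {"embedding": [1.0, 0.0, 0.0], "keywords": ["election", "senate"]}
candidates = [
    {"id": "a", "embedding": [1.0, 0.0, 0.0], "keywords": ["sports"]},
    {"id": "b", "embedding": [0.0, 1.0, 0.0], "keywords": ["election", "senate"]},
    {"id": "c", "embedding": [1.0, 1.0, 0.0], "keywords": ["election"]},
]
ranking = rank_background(query, candidates)
```

In a real pipeline the embeddings would presumably come from a Sentence-BERT model, the keywords from YAKE, and the nearest-neighbour search from the Elasticsearch index rather than a Python loop; the weighting step shown here would stay the same in shape.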

Files

TREC_News_2021.pdf (341.9 kB)
md5:1de7b096e8662eb7ac28508442b19bbd

Additional details

Funding

European Commission
NewsEye: A Digital Investigator for Historical Newspapers (770299)