00000nam##2200000uu#4500 6369944 doi 10.5281/zenodo.6369944 oai:zenodo.org:6369944 user-newseye user-embeddia user-eu Emanuela Boros Boshko Koloski Lidia Pivovarova EMBEDDIA at SemEval-2022 Task 8: Investigating Sentence, Image, and Knowledge Graph Representations for Multilingual News Article Similarity Elaine Zosa info:eu-repo/semantics/openAccess Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode cc-by-4.0 spdx <p>In this paper, we present the participation of the EMBEDDIA team in SemEval 2022 Task 8 (Multilingual News Article Similarity). We cover several techniques and propose different methods for estimating multilingual news article similarity, making use of every part of the dataset. We take advantage of the textual content of the articles, the provided metadata (e.g., titles, keywords, topics), the translated articles, the images (where available), and knowledge graph-based representations of the entities and relations present in the articles. We then compute the semantic similarity between the different features and predict the similarity scores through regression. Our findings show that, while the other methods we investigated obtained promising results, exploiting semantic textual similarity with sentence representations performed best.
Finally, in the official SemEval 2022 Task 8 results, we ranked fifth in the overall team ranking for the cross-lingual results and second for the English-only results.</p> Zenodo 2022-03-19 user-newseye user-embeddia user-eu info:eu-repo/semantics/conferencePaper 770299 NewsEye: A Digital Investigator for Historical Newspapers 825153 Cross-Lingual Embeddings for Less-Represented Languages in European News Media 20220325124246.0 2347635 md5:152af77ead089f38cf624d3830cafb3d https://zenodo.org/records/6369944/files/SemEval_2022___28_February_2022___5_pages___EMBEDDIA_at_SemEval_2022_Task_8__Investigating_Sentence__Image__and_Knowledge_Graph_Representations.pdf open 10.5281/zenodo.6369943 isVersionOf doi