Using deep learning models for learning semantic text similarity of Arabic questions

Mahmoud Hammad; Mohammed Al-Smadi; Qanita Bani Baker; Sa'ad A. Al-Zboon

doi:10.11591/ijece.v11i4.pp3519-3528

Published August 1, 2021 | Version v1

Journal article Open

Using deep learning models for learning semantic text similarity of Arabic questions

1. ordan University of Science and Technology

Question-answering platforms serve millions of users seeking knowledge and solutions for their daily life problems. However, many knowledge seekers are facing the challenge to find the right answer among similar answered questions and writer’s responding to asked questions feel like they need to repeat answers many times for similar questions. This research aims at tackling the problem of learning the semantic text similarity among different asked questions by using deep learning. Three models are implemented to address the aforementioned problem: i) a supervised-machine learning model using XGBoost trained with pre-defined features, ii) an adapted Siamese-based deep learning recurrent architecture trained with pre-defined features, and iii) a pre-trained deep bidirectional transformer based on BERT model. Proposed models were evaluated using a reference Arabic dataset from the mawdoo3.com company. Evaluation results show that the BERT-based model outperforms the other two models with an F1=92.99%, whereas the Siamese-based model comes in the second place with F1=89.048%, and finally, the XGBoost as a baseline model achieved the lowest result of F1=86.086%.

Files

83 1570662461 24008 EM 13jan21 9dec20 22jul20 N.pdf

Files (792.4 kB)

Name	Size	Download all
83 1570662461 24008 EM 13jan21 9dec20 22jul20 N.pdf md5:1263905420576fe9631f6e04af59ce52	792.4 kB	Preview Download

	All versions	This version
Views	16	16
Downloads	38	38
Data volume	31.7 MB	31.7 MB

Using deep learning models for learning semantic text similarity of Arabic questions

Creators

Description

Files

83 1570662461 24008 EM 13jan21 9dec20 22jul20 N.pdf

Files (792.4 kB)