Published June 1, 2021 | Version v1
Journal article Open

Geographical queries reformulation using a parallel association rules generator to build spatial taxonomies

  • 1. Mohammed V University in Rabat
  • 2. Ecole Marocaine des Sciences de l'Ingénieur de Rabat

Description

Geographical queries need a special process of reformulation by information retrieval systems (IRS) due to their specificities and hierarchical structure. This fact is ignored by most of web search engines. In this paper, we propose an automatic approach for building a spatial taxonomy, that models’ the notion of adjacency that will be used in the reformulation of the spatial part of a geographical query. This approach exploits the documents that are in top of the retrieved list when submitting a spatial entity, which is composed of a spatial relation and a noun of a city. Then, a transactional database is constructed, considering each document extracted as a transaction that contains the nouns of the cities sharing the country of the submitted query’s city. The algorithm frequent pattern growth (FP-growth) is applied to this database in his parallel version (parallel FP-growth: PFP) in order to generate association rules, that will form the country’s taxonomy in a Big Data context. Experiments has been conducted on Spark and their results show that query reformulation using the taxonomy constructed based on our proposed approach improves the precision and the effectiveness of the IRS.

Files

80 1570670524 23988 ES 12oct 15aug N.pdf

Files (563.1 kB)

Name Size Download all
md5:ef966c9a9d16283078939e143743908f
563.1 kB Preview Download