Published October 22, 2021 | Version v1
Preprint Open

On Generalizing Permutation-Based Representations for Approximate Search

  • 1. ISTI-CNR

Description

In the domain of approximate metric search, the Permutation-based Indexing (PBI) approaches have been proved to be particularly suitable for dealing with large data collections. 
These methods employ a permutation-based representation of the data, which can be efficiently indexed using data structures such as inverted files. In the literature, the definition of the permutation of a metric object was derived by reordering the distances of the object to a set of pivots. In this paper, we aim at generalizing this definition 
in order to enlarge the class of permutations that can be used by PBI approaches. As a practical outcome, we defined a new type of permutation that is calculated using distances from pairs of pivots. The proposed technique permits us to produce longer permutations than traditional ones for the same number of object-pivot distance calculations. The advantage is that the use of inverted files built on permutation prefixes leads to greater efficiency in the search phase when longer permutations are used. 

 

Notes

The final publication is available at Springer via https://doi.org/10.1007/978-3-030-89657-7_6 Cite this as: Vadicamo, L., Gennaro, C. and Amato, G., 2021, September. On Generalizing Permutation-Based Representations for Approximate Search. In International Conference on Similarity Search and Applications (pp. 66-80). Springer, Cham. https://doi.org/10.1007/978-3-030-89657-7_6

Files

SISAP2021_Vadicamo_OnGeneralizingPerm.pdf

Files (910.0 kB)

Name Size Download all
md5:00187b1149d78f2f833b325d11ace95e
910.0 kB Preview Download

Additional details

Related works

Is published in
Conference paper: 10.1007/978-3-030-89657-7_6 (DOI)

Funding

AI4Media – A European Excellence Centre for Media, Society and Democracy 951911
European Commission
AI4EU – A European AI On Demand Platform and Ecosystem 825619
European Commission