Published June 1, 2020 | Version v1
Journal article Open

Combined cosine-linear regression model similarity with application to handwritten word spotting

  • 1. Sidi Mohamed Ben Abdellah University

Description

The similarity or the distance measure have been used widely to calculate the similarity or dissimilarity between vector sequences, where the document images similarity is known as the domain that dealing with image information and both similarity/distance has been an important role for matching and pattern recognition. There are several types of similarity measure, we cover in this paper the survey of various distance measures used in the images matching and we explain the limitations associated with the existing distances. Then, we introduce the concept of the floating distance which describes the variation of the threshold’s selection for each word in decision making process, based on a combination of Linear Regression and cosine distance. Experiments are carried out on a handwritten Arabic image documents of Gallica library. These experiments show that the proposed floating distance outperforms the traditional distance in word spotting system.

Files

16 25Nov 9Nov 5Apr 19263 fa.pdf

Files (643.9 kB)

Name Size Download all
md5:ca02297b157984d8bde00d87d506c8cd
643.9 kB Preview Download