Published July 31, 2025 | Version v1
Journal article Open

Features extraction for image identification using computer vision

  • 1. Department of Computing and Information System, Kenyatta University, Kenya.
  • 2. Department of Mathematics, Institute for Basic Science, Technology and Innovation, Pan-African University, Kenya.
  • 3. Department of Software Engineering, College of Software, Nankai University, China.

Description

This study examines various feature extraction techniques in computer vision, the primary focus of which is on Vision Transformers (ViTs) and other approaches such as Generative Adversarial Networks (GANs), deep feature models, traditional approaches (SIFT, SURF, ORB), and non-contrastive and contrastive feature models. Emphasizing ViTs, the report summarizes their architecture, including patch embedding, positional encoding, and multi-head self-attention mechanisms with which they overperform conventional convolutional neural networks (CNNs). Experimental results determine the merits and limitations of both methods and their utilitarian applications in advancing computer vision.

Files

WJARR-2025-2647.pdf

Files (742.6 kB)

Name Size Download all
md5:75e2a1aadd5a72fd6dcb82464d237eeb
742.6 kB Preview Download

Additional details