Published July 31, 2025
| Version v1
Journal article
Open
Features extraction for image identification using computer vision
Authors/Creators
- 1. Department of Computing and Information System, Kenyatta University, Kenya.
- 2. Department of Mathematics, Institute for Basic Science, Technology and Innovation, Pan-African University, Kenya.
- 3. Department of Software Engineering, College of Software, Nankai University, China.
Description
This study examines various feature extraction techniques in computer vision, the primary focus of which is on Vision Transformers (ViTs) and other approaches such as Generative Adversarial Networks (GANs), deep feature models, traditional approaches (SIFT, SURF, ORB), and non-contrastive and contrastive feature models. Emphasizing ViTs, the report summarizes their architecture, including patch embedding, positional encoding, and multi-head self-attention mechanisms with which they overperform conventional convolutional neural networks (CNNs). Experimental results determine the merits and limitations of both methods and their utilitarian applications in advancing computer vision.
Files
WJARR-2025-2647.pdf
Files
(742.6 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:75e2a1aadd5a72fd6dcb82464d237eeb
|
742.6 kB | Preview Download |