IMPROVE GOOGLENET ARCHITECTURE TO ENHANCE IMAGE CLASSIFICATION ACCURACY
Description
GoogLeNet (Inception v1) introduced a computationally efficient deep convolutional neural network architecture that achieved state-of-the-art performance in large-scale image classification, notably in the ImageNet Large Scale Visual Recognition Challenge. Its Inception module enabled multi-scale feature extraction while significantly reducing the number of parameters compared to earlier networks. However, advances in deep learning-such as residual learning, attention mechanisms, improved normalization, and optimized training strategies-have surpassed the original design in both accuracy and robustness. This paper proposes architectural enhancements to GoogLeNet aimed at improving image classification accuracy without sacrificing computational efficiency. The proposed improvements include integrating residual connections into Inception modules, incorporating channel and spatial attention mechanisms, refining convolution factorization, applying batch normalization systematically, and adopting modern optimization strategies. Through conceptual analysis and practical examples across datasets such as ImageNet and CIFAR-10, the enhanced architecture demonstrates improved convergence stability, stronger feature representation, and higher classification accuracy. The findings indicate that modernizing GoogLeNet with contemporary deep learning innovations significantly enhances its competitiveness for current computer vision applications.
Files
815-818.pdf
Files
(185.3 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:a3e40d4f75ef6c3d84354cf8c6004f6d
|
185.3 kB | Preview Download |