Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

Kota Akshith Reddy; Satish C. J; Polsani Jahnavi; Chintapalli Teja Naveen; Gangapatnam Sai Ananya

doi:10.35940/ijrte.C6439.0910321

Published September 30, 2021 | Version v1

Journal article Open

Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

1. Department of Computer Science, Vellore Institute of Technology, Vellore, Tamil Nadu, India
2. Department of Computer Science, Anna University, Tamil Nadu, India.
3. Department of Computer Science, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
4. Department of Computer Science, Narayana Engineering College, Tamil Nadu, India.

Contributors

Sponsor:

Blue Eyes Intelligence Engineering & Sciences Publication (BEIESP)¹

1. Publisher

Automatic Image Caption Generation is one of the core problems in the field of Deep Learning. Data Augmentation is a technique which helps in increasing the amount of data at hand and this is done by augmenting the training data using various techniques like flipping, rotating, Zooming, Brightening, etc. In this work, we create an Image Captioning model and check its robustness on all the major types of Image Augmentation techniques. The results show the fuzziness of the model while working with the same image but a different augmentation technique and because of this, a different caption is produced every time a different data augmentation technique is employed. We also show the change in the performance of the model after applying these augmentation techniques. Flickr8k dataset is used for this study along with BLEU score as the evaluation metric for the image captioning model.

Files

C64390910321.pdf

Files (618.0 kB)

Name	Size	Download all
C64390910321.pdf md5:bfcfcbf2e69ea3834abd15268b68742d	618.0 kB	Preview Download

Additional details

Is cited by: Journal article: 2277-3878 (ISSN)

ISSN: 2277-3878
Retrieval Number: 100.1/ijrte.C64390910321

	All versions	This version
Views	119	112
Downloads	114	114
Data volume	72.9 MB	72.9 MB

Contributors

Sponsor:

C64390910321.pdf

Files (618.0 kB)

Related works

Subjects

Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

Authors/Creators

Contributors

Sponsor:

Description

Files

C64390910321.pdf

Files (618.0 kB)

Additional details

Related works

Subjects