Image Description using Encoder and Decoder LSTM Methods: Some Issues

Nirmala,; Gopalkrishna Joshi; P S Hiremath

doi:10.35940/ijitee.K7729.0991120

Published September 30, 2020 | Version v1

Journal article Open

Image Description using Encoder and Decoder LSTM Methods: Some Issues

1. Department of Computer Science & Engineering, Nitte Meenakshi Institute of Technology, Bangalore, Karnataka, India.
2. Dean Director, Centre for Engineering Education Research B. V. Bhoomaraddi College of Engg. & Technology, Hubli, Karnataka, India
3. Professor, Department of Computer Science, BVB College of Engineering & Technology, Hubli Karnataka, India

Contributors

Sponsor:

Blue Eyes Intelligence Engineering and Sciences Publication(BEIESP)¹

1. Publisher

Description of images has an important role in image mining. The description of images provides an insight into the location, its surroundings and other information related to it. Different procedures of describing the images exist in literature. However, a well trained description of images is still a tedious task to achieve. Several researchers have come up with solutions to this problem using various techniques. Herein, the concept of LSTM is used in generating a trained description of images. The said process is achieved through encoders and decoders. Encoders use techniques of maxpooling and convolution, while the decoders use the concept of recurrent neural networks. The combined architecture of encoders and decoders result in trained classifiers, which enable reliable description of images. The working has been implemented by considering a sample image. It has been found that slight variations with regard to accuracy, naturalness, missing concepts, deficiency of sufficient semantics and incomplete description of image still exist. Hence, it can be inferred that, with reasonable amount of enhancement in the technique and using the techniques of natural language processing, more accuracy in image descriptions could be achieved.

Files

K77290991120.pdf

Files (642.7 kB)

Name	Size	Download all
K77290991120.pdf md5:6a5695f151c317bfd53d7d70a6e88038	642.7 kB	Preview Download

Additional details

Is cited by: Journal article: 2278-3075 (ISSN)

ISSN: 2278-3075
Retrieval Number: 100.1/ijitee.K77290991120

	All versions	This version
Views	50	50
Downloads	61	61
Data volume	39.8 MB	39.8 MB

Image Description using Encoder and Decoder LSTM Methods: Some Issues

Contributors

Sponsor:

Files

K77290991120.pdf

Files (642.7 kB)

Additional details

Related works

Subjects

Image Description using Encoder and Decoder LSTM Methods: Some Issues

Creators

Contributors

Sponsor:

Description

Files

K77290991120.pdf

Files (642.7 kB)

Additional details

Related works

Subjects