Bidirectional Image-to-Text and Text-to-Image Conversion using Deep Learning Models

Riya Saji; Mr. Rony Tom

doi:10.5281/zenodo.11202862

Published April 16, 2024 | Version v1

Conference paper Open

Bidirectional Image-to-Text and Text-to-Image Conversion using Deep Learning Models

Abstract— In this paper, we introduce a system that facilitates bidirectional conversion between images and text using pre-trained deep learning models. Our approach incorporates a Vision Encoder-Decoder model for image captioning and utilizes the Stable Diffusion method for generating images from textual prompts. The implementation is integrated into a user-friendly UI application developed using Streamlit, enabling smooth transitions between images and text. Users have the ability to upload images for automatic captioning or input textual prompts to generate corresponding images, allowing for intuitive exploration of the interplay between visual and textual data.

Files

Bidirectional Image-to-Text and Text-to-Image Conversion using Deep Learning Models.pdf

Files (555.9 kB)

Name	Size	Download all
Bidirectional Image-to-Text and Text-to-Image Conversion using Deep Learning Models.pdf md5:d7987dc66f74cd7da1f4d6bb10a213fe	555.9 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	35	35
Downloads	33	33
Data volume	18.3 MB	18.3 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Amal Jyothi College of Engineering Kanjirappally, Kottayam

Conference

National Conference on Emerging Computer Applications (NCECA -2024) , Kanjirappally, Kerala, 16-04-2024

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: May 16, 2024
Modified: July 5, 2024

Bidirectional Image-to-Text and Text-to-Image Conversion using Deep Learning Models

Authors/Creators

Description

Files

Bidirectional Image-to-Text and Text-to-Image Conversion using Deep Learning Models.pdf

Files (555.9 kB)