Published June 30, 2020 | Version v1
Journal article Open

Classifying Books by Genre Based on Cover

  • 1. Department of Computer Science and Engineering, Bangalore Institute of Technology, Bengaluru, India.
  • 1. Publisher

Description

A book cover can convey a lot about the content of the book. Despite the adage not to judge something by its outward appearance, we apply machine learning to see whether we can, in fact, judge a book by its cover, or more specifically by its cover art and text. Classification was performed in three settings: cover image only, cover text only, and both image and text in a multimodal approach. Image classification was done using transfer learning with Inception-v3. For text detection, cover images were first converted to greyscale, and different thresholds were applied to detect as much text as possible. This text was then vectorized and used to train a Multinomial Naïve Bayes model. We also trained custom CNNs for the image and text modalities. For multimodal classification, we examine a late fusion model, where the modalities are combined at the decision level, and an early fusion model, where the modalities are combined at the feature level. Our results show that the late fusion model performs best in our setting. We also observe that text is more informative than the image for genre prediction, and that significant effort is still needed to solve the image-based classification task to a satisfactory level. This research can be used to aid the product design process by revealing underlying information. It could also be used in recommender systems and to support promotion and sales processes through automatic genre suggestion.
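The difference between the two fusion strategies named in the description can be illustrated with a minimal sketch. This is not the authors' implementation: the genre labels, the probability values, the feature values, and the weighted-average combination rule are all assumptions chosen for illustration, since the description does not state how the decisions are combined.

```python
from typing import List, Sequence

# Hypothetical label set; the record does not list the genres used.
GENRES = ["fantasy", "romance", "cookbook"]


def late_fusion(image_probs: Sequence[float],
                text_probs: Sequence[float],
                text_weight: float = 0.5) -> List[float]:
    """Decision-level fusion: combine per-modality class probabilities.

    A weighted average is assumed here; other combination rules are possible.
    """
    if len(image_probs) != len(text_probs):
        raise ValueError("both modalities must score the same classes")
    return [(1.0 - text_weight) * p_img + text_weight * p_txt
            for p_img, p_txt in zip(image_probs, text_probs)]


def early_fusion(image_features: Sequence[float],
                 text_features: Sequence[float]) -> List[float]:
    """Feature-level fusion: concatenate features for one joint classifier."""
    return list(image_features) + list(text_features)


def predict(probs: Sequence[float]) -> str:
    """Return the genre with the highest fused probability."""
    best = max(range(len(probs)), key=lambda i: probs[i])
    return GENRES[best]


# Made-up outputs of an image classifier and a text classifier.
image_probs = [0.5, 0.3, 0.2]
text_probs = [0.1, 0.2, 0.7]

fused = late_fusion(image_probs, text_probs)
label = predict(fused)

# Made-up feature vectors; a single model would be trained on the result.
joint_features = early_fusion([0.9, 0.1], [1.0, 0.0, 2.0])
```

In the late fusion case each modality keeps its own classifier and only the outputs are merged, whereas in the early fusion case a single classifier sees the concatenated vector.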

Files

E9561069520.pdf (623.4 kB)
md5:32a1a2f261bd6593eea1a03ffd6f9095

Additional details

Related works

Is cited by
Journal article: 2249-8958 (ISSN)

Subjects

ISSN
2249-8958
Retrieval Number
E9561069520/2020©BEIESP