Multimodal 3D Object Retrieval

Pegia, Maria-Eirini; Jónsson, Björn Þór; Moumtzidou, Anastasia; Diplaris, Sotiris; Gialampoukidis, Ilias; Vrochidis, Stefanos; Kompatsiaris, Ioannis

doi:10.5281/zenodo.10226589

Published January 29, 2024 | Version v1

Preprint, open access

1. Information Technologies Institute - Centre for Research and Technology Hellas
2. Reykjavík University

Three-dimensional (3D) retrieval of objects and models plays a crucial role in many application areas, such as industrial design, medical imaging, gaming and virtual and augmented reality. Such 3D retrieval involves storing and retrieving different representations of single objects, such as images, meshes or point clouds. Early approaches considered only one such representation modality, but recently the CMCL method has been proposed, which considers multimodal representations. Multimodal retrieval, meanwhile, has recently seen significant interest in the image retrieval domain. In this paper, we therefore explore the application of state-of-the-art multimodal image representations to 3D retrieval, in comparison to existing 3D approaches. In a detailed study over two benchmark 3D datasets, we show that the MuseHash approach from the image domain outperforms other approaches, improving recall over the CMCL approach by about 11% for unimodal retrieval and 9% for multimodal retrieval.

Files

mmm2024_paperID_371_zenodo_version.pdf (828.3 kB, md5:b5479c6de3e29ae942d08c86e046c89b)

Additional details

European Commission
XRECO - XR mEdia eCOsystem 101070250

Accepted: 2023-11-29

	All versions	This version
Views	196	196
Downloads	134	134
Data volume	136.7 MB	136.7 MB

Resource type

Preprint

Publisher

Zenodo

Conference

30th International Conference on Multimedia Modeling (MMM), Amsterdam, The Netherlands, 29 January - 2 February 2024 (Session XR-MACCI: eXtended Reality and Multimedia - Advancing Content Creation and Interaction)

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: December 5, 2023
Modified: July 10, 2024
