Multimodality in Media Retrieval
Creators
Description
The quest for retrieving relevant media for a given query is well-studied and has various applications. Modern publicly available media collections provide diverse modalities of the same objects, which can enhance search. Our research delves into enhancing media retrieval by effectively representing and querying multimodal data. In the retrieval methods' ranking procedure, we examine efficiency through techniques like approximate nearest neighbor (ANN) indexing and high-performance computing (HPC). Our method, MuseHash, is proposed for single media object retrieval and is applied to images and 3D objects, outperforming existing methods on diverse datasets. Moreover, it significantly reduces execution times with ANN and HPC. Future plans include considering multimodality in the video retrieval domain.
Files
ICMR2024_PhDSymposium_Paper.pdf
Files
(918.8 kB)
Name | Size | Download all |
---|---|---|
md5:c37bc91d403d5e85e9742522768354e7
|
918.8 kB | Preview Download |