Published January 29, 2024 | Version v1
Conference paper Open

Mining Landmark Images for Scene Reconstruction from Weakly Annotated Video Collections

  • 1. Joanneum Research Forschungsgesellschaft mbH

Description

Many XR productions require reconstructions of landmarks such as buildings or public spaces. Shooting content on demand is often not feasible, thus tapping into audiovisual archives for images and videos as input for reconstruction is a promising way. However, if annotated at all, videos in (broadcast) archives are annotated on item level, so that it is not known which frames contain the landmark of interest. We propose an approach to mine frames containing relevant content in order to train a fine-grained classifier that can then be applied to unlabeled data. To ensure the reproducibility of our results, we construct a weakly labelled video landmark dataset (WAVL) based on Google Landmarks v2. We show that our approach outperforms a state-of-the-art landmark recognition method in this weakly labeled input data setting on two large datasets.

Files

MMM2024_Weakly_supervised_classification.pdf

Files (2.8 MB)

Name Size Download all
md5:7436690dfea33c37abb1292ce445a0ae
2.8 MB Preview Download

Additional details

Funding

European Commission
XRECO - XR mEdia eCOsystem 101070250