Mining Landmark Images for Scene Reconstruction from Weakly Annotated Video Collections
Authors/Creators
- 1. Joanneum Research Forschungsgesellschaft mbH
Description
Many XR productions require reconstructions of landmarks such as buildings or public spaces. Shooting content on demand is often not feasible, thus tapping into audiovisual archives for images and videos as input for reconstruction is a promising way. However, if annotated at all, videos in (broadcast) archives are annotated on item level, so that it is not known which frames contain the landmark of interest. We propose an approach to mine frames containing relevant content in order to train a fine-grained classifier that can then be applied to unlabeled data. To ensure the reproducibility of our results, we construct a weakly labelled video landmark dataset (WAVL) based on Google Landmarks v2. We show that our approach outperforms a state-of-the-art landmark recognition method in this weakly labeled input data setting on two large datasets.
Files
MMM2024_Weakly_supervised_classification.pdf
Files
(2.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:7436690dfea33c37abb1292ce445a0ae
|
2.8 MB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/XRecoEU/WAVL-Dataset