Conference paper Open Access
Gkountakos, Konstantinos; Galanopoulos, Damianos; Mpakratsas, Marios; Touska, Despoina; Moumtzidou, Anastasia; Ioannidis, Konstantinos; Gialampoukidis, Ilias; Vrochidis, Stefanos; Mezaris, Vasileios; Kompatsiaris, Ioannis
This paper provides an overview of the runs submitted to TRECVID 2020 by ITI-CERTH. ITI-CERTH participated in the Ad-hoc Video Search (AVS), Disaster Scene Description and Indexing (DSDI) and Activities in Extended Video (ActEV) tasks. Our AVS task participation is based on an attention-based cross-modal deep network method for retrieving video shots relevant to ad-hoc textual queries. The DSDI task is performed by implementing a multi-label image classification model, trained on all humanly annotated images and estimating the final classes on averaging the predictions on the keyframes of the video shots. For the ActEV task, we deploy an object detection algorithm and then convert the individual detected objects to activities by following an object tracking technique in order to detect human and vehicle-related activities.