Cross-modal networks and dual softmax operation for MediaEval NewsImages 2022
Description
Matching images to articles is challenging and can be considered a special version of the cross-media retrieval problem. This working note paper presents our solution for the MediaEval NewsImages benchmarking task. We investigated the performance of two cross-modal networks, a pre-trained network and a trainable one, the latter originally developed for text-video retrieval tasks and adapted to the NewsImages task. Moreover, we utilize a method for revising the similarities produced by either one of the cross-modal networks, i.e., a dual softmax operation, to improve our solutions’ performance. We report the official results for our submitted runs and additional experiments we conducted to evaluate our runs internally.
Files
mediaeval2022.pdf
Files
(634.6 kB)
Name | Size | Download all |
---|---|---|
md5:d9c4b92ab2effbb2561dea9e59646860
|
634.6 kB | Preview Download |