Localized Medical VQA
Description
This dataset contains 3 sub-datasets with questions about regions for the Medical Visual Question Answering (VQA) task. Traditionally, questions are asked about the entire image. In these datasets, we ask questions about randomly generated regions in an image for fundus images as well as cataract surgery frames and surgeries performed with the DaVinci robot.
All three datasets were created using publicly available datasets.
For more information and code, visit our GitHub page.
If you use this dataset, please cite our work:
@inproceedings{tascon2023localized,
title={Localized Questions in Medical Visual Question Answering},
author={Tascon-Morales, Sergio and M{\'a}rquez-Neila, Pablo and Sznitman, Raphael},
booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
pages={361--370},
year={2023},
organization={Springer}
}
Files
LocalizedVQA.zip
Files
(2.0 GB)
Name | Size | Download all |
---|---|---|
md5:9e70c0fce3e7abefe4fe9c22c7130ba2
|
2.0 GB | Preview Download |