A survey on bias in visual datasets

doi:10.1016/j.cviu.2022.103552

Published September 5, 2022 | Version v1

Journal article Open

1. CERTH-ITI
2. FUB

Computer Vision (CV) has achieved remarkable results, outperforming humans in several tasks. Nonetheless, if not handled properly, it may result in significant discrimination, because CV systems depend heavily on their training datasets and can learn and amplify the biases those datasets carry. Thus, the problem of understanding and discovering bias in visual datasets is of utmost importance; yet it has not been studied systematically to date. Hence, this work aims to: (i) describe the different kinds of bias that may manifest in visual datasets; (ii) review the literature on methods for bias discovery and quantification in visual datasets; (iii) discuss existing attempts to collect visual datasets in a bias-aware manner. A key conclusion of our study is that the problem of bias discovery and quantification in visual datasets is still open, and there is room for improvement in terms of both methods and the range of biases that can be addressed. Moreover, there is no such thing as a bias-free dataset, so scientists and practitioners must become aware of the biases in their datasets and make them explicit. To this end, we propose a checklist to spot different types of bias during visual dataset collection.

Files

Name	Size
visual_bias_arxiv.pdf (md5:f3ce6498ced601475755f629abe43fe2)	1.1 MB

Additional details

Funding (European Commission):
AI4Media – A European Excellence Centre for Media, Society and Democracy, grant 951911
NoBIAS – Artificial Intelligence without Bias, grant 860630
