Published May 29, 2023 | Version v1
Dataset Open

Experimental data analyzed in: Signal detection models as contextual bandits

  • 1. Carleton University

Description

Signal detection theory (SDT) has been widely applied to identify the optimal discriminative decisions of receivers under uncertainty. However, the approach assumes that decision-makers immediately adopt the appropriate acceptance threshold, even though the optimal response must often be learned. Here we recast the classical normal-normal (and power-law) signal detection model as a contextual multi-armed bandit (CMAB). Thus, rather than starting with complete information, decision-makers must infer how the magnitude of a continuous cue is related to the probability that a signaller is desirable, while simultaneously seeking to exploit the information they acquire. We explain how various CMAB heuristics resolve the trade-off between better estimating the underlying relationship and exploiting it. Next, we determined how naïve human volunteers resolve signal detection problems with a continuous cue. As anticipated, a model of choice (accept/reject) that assumed volunteers immediately adopted the SDT-predicted acceptance threshold did not predict volunteer behaviour well. The Softmax rule for solving CMABs, with choices based on a logistic function of the expected payoffs, best explained the decisions of our volunteers but a simple midpoint algorithm also predicted decisions well under some conditions. CMABs offer principled parametric solutions to solving many classical SDT problems when decision-makers start with incomplete information.

Notes

We used Stan (https://mc-stan.org/) to fit and compare multi-level models of human choices. Stan was accessed in R via RStan and the models were coded using the ulam function in the rethinking package. All posterior distributions were estimated using Markov Chain Monte Carlo (MCMC) sampling for 4000 iterations in four separate chains. To facilitate model fitting, the RGB values (C) of all signallers were rescaled by dividing C by 255, ensuring a value of perceived appearance (x) between 0 and 1. We provide a full listing of R code as part of this submission.

Funding provided by: Natural Sciences and Engineering Research Council of Canada
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100000038
Award Number:

Files

README.md

Files (272.2 kB)

Name Size Download all
md5:f21e0e085332670f34557dd68fd387e5
226.4 kB Download
md5:478f7fbb88f84ed48d6d4ffc5b8a1b8d
40.4 kB Download
md5:53153c5d11ae7d856b732e88c3670209
5.4 kB Preview Download

Additional details

Related works

Is derived from
10.5281/zenodo.7974937 (DOI)