---
title: 'Sampling Strategies in Decisions from Experience'
author: "Linus Hof, Thorsten Pachur, Veronika Zilker"
bibliography: sampling-strategies-in-dfe.bib
output:
  html_document:
    code_folding: hide
    toc: yes
    toc_float: yes
    number_sections: no
  pdf_document:
    toc: yes
csl: apa.csl
editor_options: 
  markdown: 
    wrap: sentence
---

```{r}
# load packages
pacman::p_load(repro,
               tidyverse,
               knitr, 
               viridis)
```

# Author Note

This document was created from the commit with the hash `r repro::current_hash()`. 

- Add information on how to reproduce the project.
- Add contact.

# Abstract

A probability theoretic definition of sampling and a rough stochastic model of the random process underlying decisions from experience are proposed.
It is demonstrated how the stochastic model can be used a) to explicate assumptions about the sampling and decision strategies that agents may apply and b) to derive predictions about the resulting decision behavior in terms of function forms and parameter values.
Synthetic choice data is simulated and modeled in cumulative prospect theory to test these predictions. 

# Introduction

...

## Random Processes in Sequential Sampling 

In decision research, a standard paradigm is the choice between at least two (monetary) prospects.
Let a prospect be a probability space $(\Omega, \mathscr{F}, P)$.
$\Omega$ is the sample space 

$$\begin{equation}
\Omega = \{\omega_1, ..., \omega_n\}
\end{equation}$$ 

containing a finite set of possible outcomes, gains and/or losses respectively. 
$\mathscr{F}$ is a set of subsets of $\Omega$, i.e., the event space

$$\begin{equation}
A_i \in \mathscr{F} = \mathscr{P}(\Omega) 
\; .
\end{equation}$$

$\mathscr{P}(\Omega)$ denotes the power set of $\Omega$. 

$P$ is a probability measure

$$\begin{equation}
P: \mathscr{F} \mapsto [0,1] 
\end{equation}$$

that assigns each $\omega_i \in \Omega$ a probability of $0 \leq p_i \leq 1$ with $P(\Omega) = 1$ [cf. @kolmogorovFoundationsTheoryProbability1950, pp. 2-3].

In such a choice paradigm, agents are asked to evaluate the prospects and form a preference for one of them. 
It is common to make a rather crude distinction between two variants of this evaluation process [cf. @hertwigDescriptionExperienceGap2009]. 
For decisions from description (DfD), agents are provided a full symbolic description of the triples $(\Omega, \mathscr{F}, P)_j$, where $j$ denotes a prospect.
For decisions from experience [DfE; e.g., @hertwigDecisionsExperienceEffect2004], the probability triples are not described but must be explored by means of sampling. 
To provide a formal definition of sampling in risky or uncertain choice, we make use of the mathematical concept of a random variable. 
Thus, if for each

$$\begin{equation}
\omega_{i} \in \Omega: p(\omega_{i}) \neq 1 
\; ,
\end{equation}$$

we refer to the respective prospect as *"risky"*, where risky describes the fact that if agents chose the prospect, none of its outcomes $\omega_{i}$ would occur with certainty; instead, the outcomes occur according to the probability measure $P$. 
It is acceptable to speak of the occurrence of $\omega_{i}$ as the realization of a random variable iff the following conditions A and B are met: 

A) The random variable $X$ is defined as the function 

$$\begin{equation}
X: (\Omega, \mathscr{F})  \mapsto (\Omega', \mathscr{F'}) 
\; , 
\end{equation}$$

where the image $\Omega'$ is the set of possible values $X$ can take and $\mathscr{F'}$ is a set of subsets of $\Omega'$.
I.e., $X$ maps any event $A_i \in \mathscr{F}$ to a subset $A'_i \in \mathscr{F'}$: 

$$\begin{equation}
A'_i \in  \mathscr{F'} \Rightarrow X^{-1}A'_i \in \mathscr{F}
\end{equation}$$

[@kolmogorovFoundationsTheoryProbability1950, p. 21].

B) The mapping $X: \Omega \mapsto \Omega'$ must be such that $X(\omega_i) = x_i = \omega_i$ for all $\omega_i \in \Omega$, i.e., each outcome is mapped onto itself. 

Given conditions A and B, we refer to any realization of a random variable defined on the triple $(\Omega, \mathscr{F}, P)$ as a *"single sample"* of the respective prospect and to any systematic approach to generate a sequence of single samples from multiple prospects as a sampling strategy [see also @hillsInformationSearchDecisions2010]. 
Because the relative frequencies of the outcomes $\omega_{i}$ approach their probabilities $p_i$ as the number of single samples $n$ from a given prospect grows, i.e., for $n \to \infty$ [@bernoulliOpusPosthumumAccedit1713], sampling in principle allows agents to explore a prospect's probability space. 
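
The convergence of relative frequencies can be illustrated with a minimal simulation sketch. The prospect below (outcomes 0 and 10 with probabilities .8 and .2), the helper `draw_samples()`, and the chosen sample sizes are hypothetical and serve illustration only; they are not part of the model or the reported simulations.

```r
# Hypothetical risky prospect (illustrative values, not taken from the test set)
outcomes <- c(0, 10)
probs    <- c(0.8, 0.2)

set.seed(1) # make the illustration reproducible

# Draw n single samples, i.e., n realizations of the random variable X
draw_samples <- function(n) {
  sample(outcomes, size = n, replace = TRUE, prob = probs)
}

# Relative frequency of the outcome 10 for increasing numbers of single samples
n_values <- c(10, 100, 10000)
rel_freq <- sapply(n_values, function(n) mean(draw_samples(n) == 10))
rel_freq # expected to move closer to the probability .2 as n grows
```

With few single samples the relative frequency can deviate substantially from .2, which is why small samples may misrepresent rare outcomes.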

## A Stochastic Sampling Model for DfE

Consider a choice between $n \geq 2$ prospects, indexed by $j = 1,\, ...,\, n$.
To construct a rough stochastic sampling model (hereafter SSM) of the random process underlying DfE, it is assumed that agents base their decisions on the information provided by the prospects, which is in principle fully described by their probability triples. 
Thus, a decision variable 

$$\begin{equation}
D := f((\Omega, \mathscr{F}, P)_j)
\end{equation}$$

is defined. 
Since in DfE no symbolic descriptions of the triples are provided, the model is restricted to the case where decisions are based on sequences of single samples generated from the triples:

$$\begin{equation}
D := f((X: (\Omega, \mathscr{F}) \mapsto (\Omega', \mathscr{F'}))_j) = f(X_1, ..., X_j, ..., X_n) 
\; ,
\end{equation}$$

where $\Omega_j = \Omega'_j$. 

Note that the decision variable $D$ is defined as a function $f$ of the random variables associated with the prospects' probability spaces, where $f$ can operate on any quantitative measure, or moment, related to these random variables.
Since decision models differ in the form of $f$ and the measures the latter utilizes [@heOntologyDecisionModels2020, for an ontology of decision models], we take the stance that these choices should be informed by psychological or other theory and empirical protocols. 
After all, these choices reflect assumptions about the kind of information agents process and the way they process it, not to mention the question of whether agents are capable of doing so.
In the following section, it is demonstrated how such assumptions about the processing strategies that agents may apply in DfE can be captured by the SSM.

## Integrating sampling and decision strategies into the SSM

Hills and Hertwig [-@hillsInformationSearchDecisions2010] discussed a potential link between the sampling and decision strategies of agents in DfE, i.e., a systematic relation between the pattern according to which sequences of single samples are generated and the mechanism of integrating and evaluating these sample sequences to arrive at a decision. 
Specifically, the authors suppose that frequent switching between prospects in the sampling phase translates to a round-wise decision strategy, for which the evaluation process is separated into multiple rounds of ordinal comparisons between single samples (or small chunks thereof), such that the units of the final evaluation are round wins rather than raw outcomes.
In contrast, infrequent switching is supposed to translate to a decision strategy for which only a single ordinal comparison of the summaries across all samples of the respective prospects is conducted [@hillsInformationSearchDecisions2010, see Figure 1].
The authors assume that these distinct sampling and decision strategies lead to characteristic patterns in decision behavior and may serve as an additional explanation for the many empirical protocols which indicate that DfE differ from DfD [@wulffMetaanalyticReviewTwo2018, for a meta-analytic review; but see @foxDecisionsExperienceSampling2006]. 

In the following, choices between two prospects are considered to integrate the assumptions about the sampling and decision strategies from above into the SSM. 

Let $X$ and $Y$ be random variables related to the prospects with the probability spaces $(\Omega, \mathscr{F}, P)_X$ and $(\Omega, \mathscr{F}, P)_Y$.
By definition, the decision variable $D$ should quantify the accumulated evidence for one prospect over the other, which Hills and Hertwig [-@hillsInformationSearchDecisions2010] describe in units of won comparisons.
Hence, $f$ should map the possible outcomes of a comparison of quantitative measures related to $X$ and $Y$, hereafter the sampling space $S = \mathbb{R}$, to a measure space $S' = \{0,1\}$, indicating the possible outcomes of a single comparison:

$$\begin{equation}
D:= f: S \mapsto S' 
\; .
\end{equation}$$

Since Hills and Hertwig [-@hillsInformationSearchDecisions2010] assume that comparisons of prospects are based on sample means, $S$ is the set

$$\begin{equation}
S = 
  \left\{
    \frac{\frac{1}{N_X} \sum\limits_{i=1}^{N_X} x_i}
    {\frac{1}{N_Y} \sum\limits_{j=1}^{N_Y} y_j} 
  \right\}^{\mathbb{N}}  
  = 
  \left\{
    \frac{\overline{X}} {\overline{Y}}
  \right\}^{\mathbb{N}} 
  \; ,
\end{equation}$$

where $\mathbb{N}$ is the number of comparisons, $x_i$ and $y_j$ are the realizations of the respective random variables, i.e., the single samples, and $N_X$ and $N_Y$ are the numbers of single samples within a comparison.  
To indicate that the comparison of prospects on the ordinal scale is of primary interest, we define 

$$\begin{equation}
\mathscr{D} = \left\{\frac{\overline{X}}{\overline{Y}} > 0, \frac{\overline{X}}{\overline{Y}} \leq 0 \right\} 
\end{equation}$$

as a set of subsets of $S$ and the decision variable as the measure 

$$\begin{equation}
D:= f: (S, \mathscr{D}) \mapsto S'
\end{equation}$$

with the mapping

$$\begin{equation}
D:= 
  \left(
    \frac{\overline{X}} {\overline{Y}}
  \right) 
  \in S : 
  f
  \left(
    \frac{\overline{X}} {\overline{Y}}
  \right) 
  =
  \begin{cases}
    1 & \text{if} & \frac{\overline{X}}{\overline{Y}} > 0 \in \mathscr{D} \\
    0 & \text{if} & \frac{\overline{X}}{\overline{Y}} \leq 0 \in \mathscr{D}
  \end{cases} 
  \; .
\end{equation}$$
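
The difference between a round-wise and a summary comparison can be made concrete with a small sketch. The sample sequences below are hypothetical, each round consists of one single sample per prospect, and the ordinal comparison is coded directly as `x > y`; these are simplifying assumptions for illustration rather than a statement of the implementation used for the simulations.

```r
# Hypothetical sample sequences (illustrative values)
x <- c(3, 3, 3, 3)   # prospect X: safe outcome of 3
y <- c(0, 0, 0, 20)  # prospect Y: mostly 0, rarely 20

# Round-wise strategy: one ordinal comparison per round, evidence counted in round wins
wins_x <- sum(x > y)
wins_y <- sum(y > x)
choice_roundwise <- if (wins_x > wins_y) "X" else if (wins_y > wins_x) "Y" else "tie"

# Summary strategy: a single ordinal comparison of the sample means
choice_summary <- if (mean(x) > mean(y)) "X" else if (mean(y) > mean(x)) "Y" else "tie"

c(roundwise = choice_roundwise, summary = choice_summary)
```

In this example the round-wise strategy favors X (three round wins against one), whereas the summary strategy favors Y (mean of 5 against 3), because the rare high outcome counts as only one round win.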


# Method

## Test set

Under each condition, i.e., each strategy-parameter combination, all gambles are played by 100 synthetic agents.
We test a set of gambles in which one of the prospects contains a safe outcome and the other contains two risky outcomes (*safe-risky gambles*).
To this end, 60 gambles are sampled from an initial set of 10,000.
Both outcomes and probabilities are drawn from uniform distributions, ranging from 0 to 20 for outcomes and from .01 to .99 for the probabilities of the lower risky outcomes $p_{low}$.
The probabilities of the higher risky outcomes are $1-p_{low}$.
To omit dominant prospects, safe outcomes fall between both risky outcomes.
The table below contains the test set of 60 gambles.
Sampling of gambles was stratified, randomly drawing 20 gambles each with no rare outcome, an attractive rare outcome, and an unattractive rare outcome.
Risky outcomes are considered *"rare"* if their probability is $p < .2$ and *"attractive"* (*"unattractive"*) if they are higher (lower) than the safe outcome.
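
The construction described above could be sketched roughly as follows. This is not the script that produced `sr_subset.csv`; the column names, the seed, and the assumption that safe outcomes are drawn uniformly between the two risky outcomes are hypothetical.

```r
set.seed(3) # hypothetical seed for the illustration
n <- 10000  # size of the initial gamble set

# Two risky outcomes per gamble, drawn uniformly from [0, 20]
o1 <- runif(n, 0, 20)
o2 <- runif(n, 0, 20)

gambles <- data.frame(low = pmin(o1, o2), high = pmax(o1, o2))
gambles$p_low  <- runif(n, .01, .99) # probability of the lower risky outcome
gambles$p_high <- 1 - gambles$p_low  # probability of the higher risky outcome

# Assumption: the safe outcome is drawn uniformly between both risky outcomes
gambles$safe <- runif(n, gambles$low, gambles$high)

# Classify rare outcomes (p < .2): the higher outcome exceeds the safe outcome
# and is thus attractive, the lower outcome is unattractive
gambles$rare <- ifelse(gambles$p_high < .2, "attractive",
                       ifelse(gambles$p_low < .2, "unattractive", "none"))

# Stratified sampling: 20 gambles per category
test_set <- do.call(rbind, lapply(split(gambles, gambles$rare),
                                  function(g) g[sample(nrow(g), 20), ]))
table(test_set$rare)
```

Because the safe outcome always lies between the risky outcomes, neither prospect dominates the other in any sampled gamble.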

```{r message=FALSE}
gambles <- read_csv("data/gambles/sr_subset.csv")
gambles %>% kable()
```

## Model Parameters

**Switching probability** $s$ is the probability with which agents draw the following single sample from the prospect they did not get their most recent single sample from.
$s$ is varied from .1 to 1 in increments of .1.

The **boundary type** is either the minimum value any prospect's sample statistic must reach (absolute) or the minimum value for the difference of these statistics (relative).
Sample statistics are sums over outcomes (comprehensive strategy) and sums over wins (piecewise strategy), respectively.

For comprehensive integration, the **boundary value** $a$ is varied from 15 to 75 in increments of 15.
For piecewise integration, $a$ is varied from 1 to 5 in increments of 1.
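
To make the roles of $s$, the boundary type, and $a$ more tangible, the following is a minimal sketch of a single agent using comprehensive integration (sums over outcomes). The function `simulate_comprehensive()`, its arguments, the random choice of the first prospect, and the safeguard `max_n` are hypothetical; the sketch is not the algorithm that generated the choice data analyzed below.

```r
simulate_comprehensive <- function(draw_a, draw_b, s, a,
                                   boundary = c("absolute", "relative"),
                                   max_n = 10000) {
  boundary <- match.arg(boundary)
  sums <- c(A = 0, B = 0)              # sample statistic: sum over outcomes
  current <- sample(c("A", "B"), 1)    # assumption: random starting prospect
  n <- 0
  repeat {
    n <- n + 1
    outcome <- if (current == "A") draw_a() else draw_b()
    sums[current] <- sums[current] + outcome
    reached <- if (boundary == "absolute") {
      max(sums) >= a                          # any statistic reaches a
    } else {
      abs(sums[["A"]] - sums[["B"]]) >= a     # difference of statistics reaches a
    }
    if (reached || n >= max_n) break
    # switch to the other prospect with probability s
    if (runif(1) < s) current <- setdiff(c("A", "B"), current)
  }
  list(choice = names(which.max(sums)), n_sample = n)
}

set.seed(4)
res <- simulate_comprehensive(
  draw_a = function() sample(c(0, 10), 1, prob = c(.8, .2)), # hypothetical risky prospect
  draw_b = function() 3,                                     # hypothetical safe prospect
  s = .5, a = 15, boundary = "absolute"
)
res
```

The returned list contains the chosen prospect and the number of single samples drawn before the boundary was reached.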

```{r message=FALSE}
# read choice data 
cols <- list(.default = col_double(),
             strategy = col_factor(),
             boundary = col_factor(),
             gamble = col_factor(),
             rare = col_factor(),
             agent = col_factor(),
             choice = col_factor())
choices <- read_csv("data/choices/choices.csv", col_types = cols)
```

In sum, 2 (strategies) x 60 (gambles) x 100 (agents) x 100 (parameter combinations) = `r nrow(choices)` choices are simulated.
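
The number of parameter combinations follows from the parameter ranges given above. As a quick check, the grid can be enumerated; `a_index` is a hypothetical placeholder for the five boundary values, which differ between strategies.

```r
# 10 switching probabilities x 2 boundary types x 5 boundary values
param_grid <- expand.grid(s = seq(.1, 1, .1),
                          boundary = c("absolute", "relative"),
                          a_index = 1:5)
n_combinations <- nrow(param_grid)           # combinations per strategy

# strategies x gambles x agents x parameter combinations
n_choices <- 2 * 60 * 100 * n_combinations
c(n_combinations = n_combinations, n_choices = n_choices)
```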

# Results

Because we are not interested in deviations from normative choice due to sampling artifacts (e.g., ceiling effects produced by low boundaries), we remove trials in which only one prospect was attended.
In addition, we use relative frequencies of sampled outcomes rather than 'a priori' probabilities to compare actual against normative choice behavior.

```{r}
# remove choices where prospects were not attended
choices <- choices %>%
  filter(!(is.na(a_ev_exp) | is.na(b_ev_exp)))
```

```{r eval = FALSE}
# remove choices where not all outcomes were sampled
choices <- choices %>% 
  filter(!(is.na(a_ev_exp) | is.na(b_ev_exp) | a_p1_exp == 0 | a_p2_exp == 0))
```

Removing the respective trials, we are left with `r nrow(choices)` choices.

## Sample Size

```{r message=FALSE}
samples <- choices %>% 
  group_by(strategy, s, boundary, a) %>% 
  summarise(n_med = median(n_sample))
samples_piecewise <- samples %>% filter(strategy == "piecewise")
samples_comprehensive <- samples %>% filter(strategy == "comprehensive")
```

The median sample sizes generated by different parameter combinations ranged from `r min(samples_piecewise$n_med)` to `r max(samples_piecewise$n_med)` for piecewise integration and `r min(samples_comprehensive$n_med)` to `r max(samples_comprehensive$n_med)` for comprehensive integration.

### Boundary type and boundary value (a)

As evidence is accumulated sequentially, relative boundaries and large boundary values naturally lead to larger sample sizes, irrespective of the integration strategy.

```{r message=FALSE}
group_med <- samples_piecewise %>%
  group_by(boundary, a) %>% 
  summarise(group_med = median(n_med)) # to get the median across all s values

samples_piecewise %>%
  ggplot(aes(a, n_med, color = a)) + 
  geom_jitter(alpha = .5, size = 2) +
  geom_point(data = group_med, aes(y = group_med), size = 3) +
  facet_wrap(~boundary) + 
  scale_color_viridis() + 
  labs(title = "Piecewise Integration",
       x = "a", 
       y = "Sample Size", 
       col = "a") + 
  theme_minimal()
```

```{r message=FALSE}
group_med <- samples_comprehensive %>%
  group_by(boundary, a) %>% 
  summarise(group_med = median(n_med)) # to get the median across all s values

samples_comprehensive %>%
  ggplot(aes(a, n_med, color = a)) + 
  geom_jitter(alpha = .5, size = 2) +
  geom_point(data = group_med, aes(y = group_med), size = 3) +
  facet_wrap(~boundary) + 
  scale_color_viridis() + 
  labs(title = "Comprehensive Integration",
       x = "a", 
       y = "Sample Size", 
       col = "a") + 
  theme_minimal()
```

### Switching probability (s)

For piecewise integration, there is an inverse relationship between switching probability and sample size.
I.e., the lower $s$, the less frequently prospects are compared, and thus boundaries are only reached with larger sample sizes.
This effect is particularly pronounced for low switching probabilities, such that the increase in sample size accelerates as switching probability decreases.

```{r message=FALSE}
group_med <- samples_piecewise %>%
  group_by(boundary, s) %>% 
  summarise(group_med = median(n_med)) # to get the median across all a values

samples_piecewise %>%
  ggplot(aes(s, n_med, color = s)) + 
  geom_jitter(alpha = .5, size = 2) +
  geom_point(data = group_med, aes(y = group_med), size = 3) +
  facet_wrap(~boundary) + 
  scale_color_viridis() + 
  labs(title = "Piecewise Integration",
       x = "s", 
       y = "Sample Size", 
       col = "s") + 
  theme_minimal()
```

For comprehensive integration, boundary types differ in the effects of switching probability.
For absolute boundaries, switching probability has no apparent effect on sample size as the distance of a given prospect to its absolute boundary is not changed by switching to (and sampling from) the other prospect.
For relative boundaries, however, sample sizes increase with switching probability.

```{r message=FALSE}
group_med <- samples_comprehensive %>%
  group_by(boundary, s) %>% 
  summarise(group_med = median(n_med)) # to get the median across all a values

samples_comprehensive %>%
  ggplot(aes(s, n_med, color = s)) + 
  geom_jitter(alpha = .5, size = 2) +
  geom_point(data = group_med, aes(y = group_med), size = 3) +
  facet_wrap(~boundary) + 
  scale_color_viridis() + 
  labs(title = "Comprehensive Integration",
       x = "s",
       y = "Sample Size", 
       col = "s") + 
  theme_minimal()
```

## Choice Behavior

Below, extending Hills and Hertwig [-@hillsInformationSearchDecisions2010], we investigate how integration strategies, gamble features, and model parameters jointly affect choice behavior in general and contribute to the underweighting of rare events in particular.
We apply two definitions of underweighting of rare events.
Considering false response rates, we define underweighting such that the rarity of an attractive (unattractive) outcome leads agents to choose the safe (risky) prospect although the risky (safe) prospect has a higher expected value.

```{r message=FALSE}
fr_rates <- choices %>% 
  mutate(ev_ratio_exp = round(a_ev_exp/b_ev_exp, 2), 
         norm = case_when(ev_ratio_exp > 1 ~ "A", ev_ratio_exp < 1 ~ "B")) %>% 
  filter(!is.na(norm)) %>% # exclude trials with normative indifferent options
  group_by(strategy, s, boundary, a, rare, norm, choice) %>% # group correct and incorrect responses
  summarise(n = n()) %>% # absolute numbers 
  mutate(rate = round(n/sum(n), 2), # response rates 
         type = case_when(norm == "A" & choice == "B" ~ "false safe", norm == "B" & choice == "A" ~ "false risky")) %>% 
  filter(!is.na(type)) # remove correct responses
```

Considering the parameters of Prelec's [-@prelecProbabilityWeightingFunction1998] implementation of the weighting function [CPT; cf. @tverskyAdvancesProspectTheory1992], underweighting is reflected by decision weights estimated to be smaller than the corresponding objective probabilities.

### False Response Rates

```{r message=FALSE}
fr_rates_piecewise <- fr_rates %>% filter(strategy == "piecewise")
fr_rates_comprehensive <- fr_rates %>% filter(strategy == "comprehensive")
```

The false response rates generated by different parameter combinations ranged from `r min(fr_rates_piecewise$rate)` to `r max(fr_rates_piecewise$rate)` for piecewise integration and from `r min(fr_rates_comprehensive$rate)` to `r max(fr_rates_comprehensive$rate)` for comprehensive integration.
However, false response rates vary considerably as a function of rare events, indicating that the presence and attractiveness of rare events are major determinants of false response rates.

```{r message=FALSE}
fr_rates %>% 
  group_by(strategy, boundary, rare) %>% 
  summarise(min = min(rate),
            max = max(rate)) %>% 
  kable()
```

The heatmaps below show the false response rates for all strategy-parameter combinations.
Consistent with our (somewhat rough) definition of underweighting, the rate of false risky responses is generally higher if the unattractive outcome of the risky prospect is rare (top panel).
Conversely, if the attractive outcome of the risky prospect is rare, the rate of false safe responses is generally higher (bottom panel).
As indicated by the larger range of false response rates, the effects of rare events are considerably larger for piecewise integration.

```{r message=FALSE}
fr_rates %>% 
  filter(strategy == "piecewise", boundary == "absolute") %>% 
  ggplot(aes(a, s, fill = rate)) + 
  facet_grid(type ~ fct_relevel(rare, "attractive", "none", "unattractive"), switch = "y") +
  geom_tile(colour="white", size=0.25) + 
  scale_x_continuous(expand=c(0,0), breaks = seq(1, 5, 1)) +
  scale_y_continuous(expand=c(0,0), breaks = seq(.1, 1, .1)) +
  scale_fill_viridis() + 
  labs(title = "Piecewise Integration | Absolute Boundary",
       x = "a", 
       y= "s", 
       fill = "% False Responses") + 
  theme_minimal() 
```

```{r message=FALSE}
fr_rates %>% 
  filter(strategy == "piecewise", boundary == "relative") %>% 
  ggplot(aes(a, s, fill = rate)) + 
  facet_grid(type ~ fct_relevel(rare, "attractive", "none", "unattractive"), switch = "y") +
  geom_tile(colour="white", size=0.25) + 
  scale_x_continuous(expand=c(0,0), breaks = seq(1, 5, 1)) +
  scale_y_continuous(expand=c(0,0), breaks = seq(.1, 1, .1)) +
  scale_fill_viridis() + 
  labs(title = "Piecewise Integration | Relative Boundary",
       x = "a", 
       y= "s", 
       fill = "% False Responses") + 
  theme_minimal() 
```

```{r message=FALSE}
fr_rates %>% 
  filter(strategy == "comprehensive", boundary == "absolute") %>% 
  ggplot(aes(a, s, fill = rate)) + 
  facet_grid(type ~ fct_relevel(rare, "attractive", "none", "unattractive"), switch = "y") +
  geom_tile(colour="white", size=0.25) + 
  scale_x_continuous(expand=c(0,0), breaks = seq(15, 75, 15)) +
  scale_y_continuous(expand=c(0,0), breaks = seq(.1, 1, .1)) +
  scale_fill_viridis() + 
  labs(title = "Comprehensive Integration | Absolute Boundary",
       x = "a", 
       y= "s", 
       fill = "% False Responses") + 
  theme_minimal() 
```

```{r message=FALSE}
fr_rates %>% 
  filter(strategy == "comprehensive", boundary == "relative") %>% 
  ggplot(aes(a, s, fill = rate)) + 
  facet_grid(type ~ fct_relevel(rare, "attractive", "none", "unattractive"), switch = "y") +
  geom_tile(colour="white", size=0.25) + 
  scale_x_continuous(expand=c(0,0), breaks = seq(15, 75, 15)) +
  scale_y_continuous(expand=c(0,0), breaks = seq(.1, 1, .1)) +
  scale_fill_viridis() + 
  labs(title = "Comprehensive Integration | Relative Boundary",
       x = "a", 
       y= "s", 
       fill = "% False Responses") + 
  theme_minimal() 
```

#### Switching Probability (s) and Boundary Value (a)

Because the differences between boundary types are rather minor for both piecewise and comprehensive integration and concern the magnitude rather than the qualitative pattern of false response rates, the remaining analyses of false response rates are summarized across absolute and relative boundaries.

Below, the $s$ and $a$ parameters are considered as additional sources of variation in the false response pattern above and beyond the interplay of integration strategies and the rarity and attractiveness of outcomes.

```{r message=FALSE}
fr_rates %>% 
  filter(strategy == "piecewise") %>% 
  ggplot(aes(s, rate, color = a)) + 
  facet_grid(type ~ fct_relevel(rare, "attractive", "none", "unattractive"), switch = "y") +
  geom_jitter(size = 2) + 
  scale_x_continuous(breaks = seq(0, 1, .1)) +
  scale_y_continuous(breaks = seq(0, 1, .1)) +
  scale_color_viridis() + 
  labs(title = "Piecewise Integration",
       x = "s", 
       y= "% False Responses", 
       color = "a") + 
  theme_minimal() 
```

```{r message=FALSE}
fr_rates %>% 
  filter(strategy == "comprehensive") %>% 
  ggplot(aes(s, rate, color = a)) + 
  facet_grid(type ~ fct_relevel(rare, "attractive", "none", "unattractive"), switch = "y") +
  geom_jitter(size = 2) + 
  scale_x_continuous(breaks = seq(0, 1, .1)) +
  scale_y_continuous(breaks = seq(0, 1, .1)) +
  scale_color_viridis() + 
  labs(title = "Comprehensive Integration",
       x = "s", 
       y= "% False Responses", 
       color = "a") + 
  theme_minimal() 
```

For piecewise integration, switching probability is naturally related to the size of the samples on which the round-wise comparisons of prospects are based, with low values of $s$ indicating large samples and vice versa.
Accordingly, switching probability is positively related to false response rates.
I.e., the larger the switching probability, the smaller the round-wise sample size and, in turn, the probability of experiencing a rare event within a given round.
Because round-wise comparisons are independent of each other and binomial distributions within a given round are skewed for small samples and outcome probabilities [@kolmogorovFoundationsTheoryProbability1950], increasing boundary values do not reverse but rather amplify this relation.
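
A back-of-the-envelope calculation illustrates this relation. It assumes that an agent switches after each single sample with probability $s$, so that the number of consecutive samples from one prospect has an expected value of $1/s$, and that single samples are independent. The rare-event probability of .1 is hypothetical, and plugging the expected run length into the formula is only an approximation.

```r
p_rare <- 0.1            # hypothetical probability of the rare outcome
s      <- c(0.1, 0.5, 1) # switching probabilities

# Expected number of consecutive samples from one prospect (mean of a geometric distribution)
expected_run <- 1 / s

# Probability of observing the rare outcome at least once within a run of that length
p_seen <- 1 - (1 - p_rare)^expected_run
round(data.frame(s, expected_run, p_seen), 3)
```

Under these assumptions, the chance of encountering the rare outcome within a round drops from roughly .65 at $s = .1$ to .10 at $s = 1$.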

For comprehensive integration, switching probability is negatively related to false response rates, i.e., an increase in $s$ is associated with decreasing false response rates.
This relation, however, may be the result of an artificial interaction between the $s$ and $a$ parameters.
Specifically, in the current algorithmic implementation of sampling with a comprehensive integration mechanism, decreasing switching probabilities cause comparisons of prospects based on increasingly unequal sample sizes immediately after switching prospects.
Consequently, reaching (low) boundaries is a function of switching probability and the associated sample sizes rather than of actual evidence for a given prospect over the other.

### Cumulative Prospect Theory

In the following, we examine the possible relations between the parameters of the *choice-generating* sampling models and the *choice-describing* cumulative prospect theory.

For each distinct strategy-parameter combination, we ran 20 chains of 40,000 iterations each, after a warm-up period of 1,000 iterations.
To reduce potential autocorrelation during the sampling process, we only kept every 20th sample (thinning).
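
The number of posterior samples retained under these settings can be worked out as follows.
This is a sketch that assumes the 40,000 iterations are counted per chain after warm-up and that thinning is applied to exactly these iterations; the variable names are hypothetical.

```{r}
# Illustrative arithmetic only; assumes 40,000 post-warm-up iterations per chain
n_chains         <- 20
n_iter_per_chain <- 40000
thin_interval    <- 20

kept_per_chain <- n_iter_per_chain / thin_interval  # retained samples per chain
kept_total     <- n_chains * kept_per_chain         # retained samples per strategy-parameter combination

c(kept_per_chain = kept_per_chain, kept_total = kept_total)
```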

```{r}
# read CPT data
cols <- list(.default = col_double(),
             strategy = col_factor(),
             boundary = col_factor(),
             parameter = col_factor())
estimates <- read_csv("data/estimates/estimates_cpt_pooled.csv", col_types = cols)
```

#### Convergence

```{r}
gel_92 <- max(estimates$Rhat) # get largest scale reduction factor (Gelman & Rubin, 1992) 
```

The potential scale reduction factor was $\hat{R} \leq$ `r round(gel_92, 3)` for all estimates, indicating good convergence.
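
For readers unfamiliar with $\hat{R}$, the following sketch shows how the classic potential scale reduction factor compares between-chain and within-chain variance.
The function `rhat_classic` and the simulated draws are hypothetical and serve only as an illustration; the `Rhat` values in `estimates` come from the estimation output and may be based on a refined variant of this statistic.

```{r}
# Illustrative sketch of the classic potential scale reduction factor.
# rhat_classic() and the simulated draws are hypothetical, not part of the analysis.
rhat_classic <- function(draws) {
  # draws: numeric matrix, one column per chain, one row per retained iteration
  n_draws_chain <- nrow(draws)
  within_var  <- mean(apply(draws, 2, var))          # W: mean within-chain variance
  between_var <- n_draws_chain * var(colMeans(draws)) # B: between-chain variance
  pooled_var  <- (n_draws_chain - 1) / n_draws_chain * within_var +
    between_var / n_draws_chain
  sqrt(pooled_var / within_var)
}

set.seed(1)
draws_mixed <- matrix(rnorm(4 * 1000), ncol = 4)          # four chains, same distribution
draws_stuck <- sweep(draws_mixed, 2, c(0, 0, 3, 3), "+")  # two of the chains shifted by 3

rhat_mixed <- rhat_classic(draws_mixed)  # close to 1: chains agree
rhat_stuck <- rhat_classic(draws_stuck)  # clearly above 1: chains disagree
c(rhat_mixed = rhat_mixed, rhat_stuck = rhat_stuck)
```

Values close to 1 indicate that the chains are indistinguishable with respect to their variance, which is why the maximum $\hat{R}$ across all estimates is reported above.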

#### Piecewise Integration

```{r}
# generate subset of all strategy-parameter combinations (rows) and their parameters (columns)
curves_cpt <- estimates %>% 
  select(strategy, s, boundary, a, parameter, mean) %>% 
  pivot_wider(names_from = parameter, values_from = mean)
```

##### Weighting function w(p)

We start by plotting the weighting curves for all parameter combinations under piecewise integration.
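
For reference, the decision weights computed in the code follow the two-parameter weighting function of Prelec (1998), written here with the parameter names used in the code (`gamma`, `delta`):

$$
w(p) = \exp\left(-\delta \, (-\ln p)^{\gamma}\right)
$$

Here, $\gamma$ governs the curvature and $\delta$ the elevation of the weighting function.
For $\gamma = 1$ and $\delta = 1$, the function reduces to $w(p) = p$, i.e., the red identity line in the plots.
For any $\gamma$, $w(1/e) = e^{-\delta}$, so that with $\delta = 1$ the curve crosses the identity line at $p = 1/e \approx .37$.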

```{r}

cpt_curves_piecewise <- curves_cpt %>% 
  filter(strategy == "piecewise") %>% 
  expand_grid(p = seq(0, 1, .1)) %>% # add vector of objective probabilities
  mutate(w = round(exp(-delta*(-log(p))^gamma), 2)) # compute decision weights (cf. Prelec, 1998)

# all strategy-parameter combinations 

cpt_curves_piecewise %>% 
  ggplot(aes(p, w)) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Piecewise Integration: Weighting functions",
       x = "p", 
       y= "w(p)") + 
  theme_minimal() 
```

```{r}
cpt_curves_piecewise %>% 
  ggplot(aes(p, w)) + 
  geom_path() +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  facet_wrap(~a) + 
  labs(title = "Piecewise Integration: Weighting functions",
       x = "p",
       y = "w(p)",
       color = "Switching Probability") + 
  scale_color_viridis() +
  theme_minimal() 
```

```{r}
cpt_curves_piecewise %>% 
  ggplot(aes(p, w, color = s)) + 
  geom_path() +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Piecewise Integration: Weighting functions",
       x = "p", 
       y= "w(p)", 
       color = "Switching Probability") + 
  scale_color_viridis() +
  theme_minimal() 
```

```{r}
cpt_curves_piecewise %>% 
  ggplot(aes(p, w, color = s)) + 
  geom_path() +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  facet_wrap(~a) + 
  labs(title = "Piecewise Integration: Weighting functions",
       x = "p",
       y= "w(p)",
       color = "Switching Probability") + 
  scale_color_viridis() +
  theme_minimal() 
```

##### Value function v(x)
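
For reference, the subjective values computed in the code follow a power value function for non-negative outcomes, written here with the parameter name used in the code (`alpha`):

$$
v(x) = x^{\alpha}, \quad x \geq 0
$$

For $\alpha = 1$, the value function is linear and coincides with the red identity line in the plots; for $\alpha < 1$, it is concave.
As a worked example, $\alpha = .5$ and $x = 16$ yield $v(16) = 16^{.5} = 4$.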

```{r}
cpt_curves_piecewise <- curves_cpt %>% 
  filter(strategy == "piecewise") %>% 
  expand_grid(x = seq(0, 20, 2)) %>% # add vector of objective outcomes
  mutate(v = round(x^alpha, 2)) # compute subjective values (power value function)

# all strategy-parameter combinations 
# group by parameter combination so that curves are not connected across combinations

cpt_curves_piecewise %>% 
  ggplot(aes(x, v, group = interaction(s, boundary, a))) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Piecewise Integration: Value functions",
       x = "x", 
       y = "v(x)") + 
  theme_minimal() 
```

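```{r}
# color curves by switching probability; one curve per parameter combination
cpt_curves_piecewise %>% 
  ggplot(aes(x, v, color = s, group = interaction(s, boundary, a))) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Piecewise Integration: Value functions",
       x = "x", 
       y = "v(x)", 
       color = "Switching Probability") + 
  scale_color_viridis() + 
  theme_minimal() 
```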

```{r}
# same as above, with one panel per value of a
cpt_curves_piecewise %>% 
  ggplot(aes(x, v, color = s, group = interaction(s, boundary, a))) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  facet_wrap(~a) + 
  labs(title = "Piecewise Integration: Value functions",
       x = "x", 
       y = "v(x)", 
       color = "Switching Probability") + 
  scale_color_viridis() + 
  theme_minimal() 
```

#### Comprehensive Integration

##### Weighting function w(p)

We again start by plotting the weighting curves for all parameter combinations, now under comprehensive integration.

```{r}

cpt_curves_comprehensive <- curves_cpt %>% 
  filter(strategy == "comprehensive") %>% 
  expand_grid(p = seq(0, 1, .1)) %>% # add vector of objective probabilities
  mutate(w = round(exp(-delta*(-log(p))^gamma), 2)) # compute decision weights (cf. Prelec, 1998)

# all strategy-parameter combinations 

cpt_curves_comprehensive %>% 
  ggplot(aes(p, w)) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Comprehensive Integration: Weighting functions",
       x = "p", 
       y= "w(p)") + 
  theme_minimal() 
```

```{r}
cpt_curves_comprehensive %>% 
  ggplot(aes(p, w)) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Comprehensive Integration: Weighting functions",
       x = "p", 
       y = "w(p)") + 
  facet_wrap(~a) + 
  theme_minimal() 
```

```{r}
cpt_curves_comprehensive %>% 
  ggplot(aes(p, w, color = s)) + 
  geom_path() +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Comprehensive Integration: Weighting functions",
       x = "p", 
       y = "w(p)", 
       color = "Switching Probability") + 
  scale_color_viridis() +
  theme_minimal() 
```

```{r}
cpt_curves_comprehensive %>% 
  ggplot(aes(p, w, color = s)) + 
  geom_path() +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  facet_wrap(~a) + 
  labs(title = "Comprehensive Integration: Weighting functions",
       x = "p",
       y = "w(p)",
       color = "Switching Probability") + 
  scale_color_viridis() +
  theme_minimal() 
```

```{r}
# restrict to high switching probabilities
cpt_curves_comprehensive %>% 
  filter(s >= .7) %>% 
  ggplot(aes(p, w, color = s)) + 
  geom_path() +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  facet_wrap(~a) + 
  labs(title = "Comprehensive Integration: Weighting functions",
       x = "p",
       y = "w(p)",
       color = "Switching Probability") + 
  scale_color_viridis() +
  theme_minimal() 
```

##### Value function v(x)

```{r}
cpt_curves_comprehensive <- curves_cpt %>% 
  filter(strategy == "comprehensive") %>% 
  expand_grid(x = seq(0, 20, 2)) %>% # add vector of objective outcomes
  mutate(v = round(x^alpha, 2)) # compute subjective values (power value function)

# all strategy-parameter combinations 
# group by parameter combination so that curves are not connected across combinations

cpt_curves_comprehensive %>% 
  ggplot(aes(x, v, group = interaction(s, boundary, a))) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Comprehensive Integration: Value functions",
       x = "x", 
       y = "v(x)") + 
  theme_minimal() 
```

```{r}
# one panel per value of a; one curve per parameter combination
cpt_curves_comprehensive %>% 
  ggplot(aes(x, v, group = interaction(s, boundary, a))) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  facet_wrap(~a) + 
  labs(title = "Comprehensive Integration: Value functions",
       x = "x", 
       y = "v(x)") + 
  theme_minimal() 
```

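```{r}
# color curves by switching probability; one curve per parameter combination
cpt_curves_comprehensive %>% 
  ggplot(aes(x, v, color = s, group = interaction(s, boundary, a))) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  labs(title = "Comprehensive Integration: Value functions",
       x = "x", 
       y = "v(x)", 
       color = "Switching Probability") + 
  scale_color_viridis() + 
  theme_minimal() 
```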

```{r}
# same as above, with one panel per value of a
cpt_curves_comprehensive %>% 
  ggplot(aes(x, v, color = s, group = interaction(s, boundary, a))) + 
  geom_path(size = .5) +
  geom_abline(intercept = 0, slope = 1, color = "red", size = 1) +
  facet_wrap(~a) + 
  labs(title = "Comprehensive Integration: Value functions",
       x = "x", 
       y = "v(x)", 
       color = "Switching Probability") + 
  scale_color_viridis() + 
  theme_minimal() 
```

# Discussion 

# Conclusion

# References