Experimentally Revealed Stochastic Preferences for Multicomponent Choice Options

Realistic, everyday rewards contain multiple components. An apple has taste and size. However, we choose in single dimensions, simply preferring some apples to others. How can such single-dimensional preference relationships refer to multicomponent choice options? Here, we measured how stochastic choices revealed preferences for 2-component milkshakes. The preferences were intuitively graphed as indifference curves that represented the orderly integration of the 2 components as trade-off: parts of 1 component were given up for obtaining 1 additional unit of the other component without a change in preference. The well-ordered, nonoverlapping curves satisfied leave-one-out tests, followed predictions by machine learning decoders and correlated with single-dimensional Becker-DeGroot-Marschak (BDM) auction-like bids for the 2-component rewards. This accuracy suggests a decision process that integrates multiple reward components into single-dimensional estimates in a systematic fashion. In interspecies comparisons, human performance matched that of highly experienced laboratory monkeys, as measured by accuracy of the critical trade-off between bundle components. These data describe the nature of choices of multicomponent choice options and attest to the validity of the rigorous economic concepts and their convenient graphic schemes for explaining choices of human and nonhuman primates. The results encourage formal behavioral and neural investigations of normal, irrational, and pathological economic choices.

scalar; their firing rate either increases, remains unchanged, or decreases (thus, constituting a distribution), even when encoding multidimensional variables (Pastor-Bernier, Stasiak, & Schultz, 2019). Hence the question: how can single-dimensional preferences, utility, and neural signals concern multicomponent choice options? Or, put more formally, how can scalar measures reflect vectorial bundles? And can we test the issue empirically, using well worked-out, rigorous theoretical concepts that should reduce possible confounds?

Concepts and Hypotheses
The issue of vectorial-to-scalar transformation of economic goods during economic decisions can be addressed by studying revealed preferences. The key notion posits that preferences cannot be observed directly but are revealed by measurable choice. Revealed preferences contrast with stated preferences; both are unobservable, but only revealed preferences refer to immediate, actual choices. Revealed preferences may be inherent and fixed, sitting there, and waiting to be revealed by the choice. This is a basic assumption of formal Revealed Preference Theory (Fisher, 1892;Samuelson, 1937Samuelson, , 1938. However, preferences vary with many factors such as context, frame, and number of options, which might indicate that preferences are flexible and constructed on the fly, at the moment of choice. These distinctions are important and a matter of ongoing debate (Dhar & Novemsky, 2008;Kivetz, Netzer, & Schrift, 2008;Payne, Bettman, & Schkade, 1999;Simonson, 2008;Warren, McGraw, & Van Boven, 2011). However, the possibility to empirically infer preferences from observable choice is invaluable for investigating choices of multicomponent options. Therefore, we like to restrict the use of the term "revealed" to the inference of preferences from observable choice irrespective of their assumed origin, while nevertheless benefiting from the rigorous concepts and graphics of Revealed Preference Theory.
We used the following notions of Revealed Preference Theory and stochastic choice theories (Luce, 1959;McFadden, 2005;McFadden & Richter, 1990;Stott, 2006) for humans (for the design of our parallel study on monkeys, see Method; Pastor-Bernier, Plott, & Schultz, 2017): 1. The option set contains two simultaneously presented bundles (binary choice). Each bundle contains two components (A and B). The options are mutually exclusive (choose one bundle or its alternative but not both) and collectively exhaustive (they contain all available bundles).

Each bundle can be plotted at the intersection of an
x-coordinate (component A) and an y-coordinate (component B) on a two-dimensional graph ( Figure 1A).
3. Stochastic preference is revealed by the probability of choosing one bundle over its alternative, which depends on all components of both bundles.
4. Every bundle has a utility for the decision maker that depends only on the amounts of both of its components. A bundle is chosen with higher probability than any other bundle in the same option set if and only if its utility is higher than in any other bundle in that option set. In other words, the preference relationship between two bundles is monotone if the higher utility of one bundle implies that it is preferred to its alternative (Mas-Colell et al., 1995).
5. Two bundles are chosen with equal probability if and only if their utility is the same. Equal choice probability of p ϭ .5 for each bundle reveals equal stochastic preference and is graphically represented by a twodimensional indifference point (IP). At the IP, some amount of one component is given up to gain one unit of the other component without change in preference (marginal rate of substitution, MRS; Figure 1A). In the apple example, some size is given up for more sweetness (the sweeter apple was smaller). The IP is estimated from an S-shaped psychophysical function fitted to the choice probabilities while varying one component of one bundle and keeping all other components constant ( Figure 1B, 1C and 1D). The trade-off at the IP is conceptually important, as it demonstrates same preference despite oppositely varying bundle composition.
6. Multiple IPs align on a single, continuous indifference curve (IC). The ICs are graphically characterized by two parameters: (a) the slope, which reflects the relative utility (currency) of the two bundle components; it could be asymmetric between x-axis and y-axis; (b) the curvature, which captures any slope change between IC center and IC periphery. The ICs are typically convex (viewed from the origin): larger reward amounts are associated with decreasing value increment (reflecting diminishing marginal utility of concave utility function) and, therefore, require larger trade-in amounts for getting a smaller amount of the other component.
7. Bundles with larger amounts of one or both components are plotted farther away from the origin and are assumed to be revealed preferred to bundles with smaller amounts (the "value function" for the component is strictly monotonically positive; "more is always better"). Hence, any bundle above an IC (farther away from origin) would be revealed preferred to any bundle on that IC, and any bundle on an IC would be revealed preferred to any bundle below that IC (closer to origin).
8. The preference relationship between two bundles may hold even when one component of the preferred bundle has a smaller amount than the alternative bundle (partial physical nondominance, requiring overcompensation by the other bundle component). This aspect, together with the equal-preference trade-off, crucially reflects the integration of the physical value of both bundle components into single-dimensional preference relationships and utility.
With such "lexicographic" preferences, the decision maker would forego potential gains from the neglected component and, thus, violate the principle of utility maximization. In such cases, ICs would be parallel to one of the axes: any bundle on such a line would be equally preferred, and more preferred bundles would lie on one or more parallel lines farther out on the graph. Such scenarios are not far-fetched, as decision-makers may have different discrimination thresholds for different bundle components, such as seen with reward amounts and probabilities (Tversky, 1969; Just-Noticeable-Difference, JND; Rieskamp, Busemeyer, & Mellers, 2006).

The Current Study
Our experiment revealed stochastic preferences in humans and compared human performance with that of monkeys. Monkeys, as our evolutionary close neighbors, have superb and understandable cognitive and behavioral abilities that allow generalization across primate species and high-resolution neuronal investigations. Interspecies comparisons are also methodologically useful, as monkeys can participate longer in experiments and provide much larger numbers of trials for more sophisticated tests, including preference between bundles not used for IC fitting, transitivity across partially physically nondominating bundles on different ICs, and change of option set size, all in the same individual.
The perspective of serving as template for future human and animal studies determined the experimental design and imposed constraints concerning event timing, trial repetition and immediately consumable payouts comparable to tangible liquid and food rewards for animals (Kagel et al., 1975). We followed the notions above and used the same design and data analysis as in rhesus monkeys (Pastor-Bernier et al., 2017). Participants repeatedly chose between two composite visual stimuli presented on a computer monitor. Each stimulus predicted a bundle of the same two milkshakes that varied in amount of fat and sugar concentration and were delivered directly to the participant's mouth (component A, component B; Figure 1B). The milkshakes were fully known to the participants and constituted rewards, as evidenced by the participants' voluntary consumption. We kept effort cost constant and equal for both choice options, but we did not test budget constraints to stay compatible with neurophysiological primate studies in which budgets would add confounds for interpreting neuronal responses and, thus, increase experimental complexity. We estimated ICs from psychophysically assessed IPs and confirmed the resulting two-dimensional preference map with mechanismindependent Becker-DeGroot-Marschak (BDM) auction-like bidding that truthfully reveals the bidder's value of desired objects in humans and monkeys (Al-Mohammad & Schultz, 2019;Becker, DeGroot, & Marschak, 1964;Plassmann, O'Doherty, & Rangel, 2007). The choices conformed with the ICs of economic theory, satisfied leaveout statistics, corresponded to decoder predictions, and correlated with BDM bids. The trade-off accuracy of humans compared well with that of laboratory monkeys.

Human Participants
A total of 24 human participants (11 men, 13 women; mean age 25.4 years, range 19 -36 years) completed a binary choice task for measuring revealed preferences and performed a Becker-DeGroot-Marschak (BDM) control task. The sample size of 24 participants was chosen with future neuroimaging tests on these participants in mind; an earlier study in our laboratory suggested that this sample size was adequate for avoiding false negatives (Zangemeister, Grabenhorst, & Schultz, 2016). The participants were fully informed about the upcoming consumption of milkshakes, the general nature of the binary choice task, the requirement to perform this task and a BDM bidding task in a separate session in the human imaging scanner (whose neural results will be published elsewhere), the remuneration (20 U.K. £, paid into their bank account a few days after the second session), and the duration of each of the two experimental sessions. Because of this information, the participants were able to construct the "meal" of milkshakes in advance. None of the participants had diabetes or lactose intolerance, nor did they require specific diets, to avoid medical and cultural interference. All participants had a known appetite for milkshakes and provided written informed consent based on a detailed information sheet. We used similar design, data analysis and validation procedures as in our parallel study on rhesus monkeys, where they are described and discussed in more detail (Pastor-Bernier et al., 2017). The Local Research Ethics Committee of the Cambridgeshire Health Authority approved the study.

Stimuli, Rewards, and Delivery Apparatus
The human participants viewed quantitative, colored visual stimuli on a computer monitor, which represented the two milkshakes and their amounts in each of the two bundles ( Figure 1B and 1C). Each bundle stimulus consisted of two vertically aligned rectangles. The color of each rectangle indicated the bundle component. The vertical position of a bar in each rectangle indicated the physical amount of each component (higher was more).
After extensive piloting with various liquids and liquidized foods, we found milkshakes with a controlled mix of sugar and fat to give the most reliable behavioral performance. As the milkshakes were delivered individually (see below), drinks containing either only sugar or only fat were deemed to be too unnatural. Thus, in our bundles, component A (top, blue) consisted of a low-sugar high-fat milkshake (25% whole milk, 75% double cream, no sugar), and component B (bottom, red) consisted of a high-sugar low-fat milkshake (10% sugar in skimmed milk). The Psychtoolbox in Matlab (Version R2015b) running on a Dell Windows computer served for stimulus display and recording of behavioral choices.
Participants were tested one at a time and in the absence of other persons in the room; they were seated on a standard, padded office chair at a standard-height desk in a closed, small, window-less experimental room (ca. 3 ϫ 4 m) with artificial light; the room was located in a quiet laboratory area, without specific ambient noise. Besides the desk and chair, the room was empty except for one standard-height table with a large storage carton box underneath that was unrelated to the experiment. The two milkshakes were delivered directly into the participant's mouth via a custom-made mouthpiece with two single-use pipette tips onto which the participant bit; the mouthpiece was connected to two silicone tubes approved for delivery of food stuffs; the two tubes were, respectively, attached to two 50-ml syringes driven by two piston pumps (NE-500, New Era Pump Systems Inc.; www.syringepump.com). As used before (Zangemeister et al., 2016), each pump delivered a programmable quantity of one milkshake with milliliter precision (VWR International Ltd.) and was controlled by the computer using a National Instruments card (NI-USB-6009) via the Matlab Data Acquisition Toolbox.

Binary Choice Task
Each trial started with an initial fixation cross in the center of a computer monitor in front of the participant. After a pseudorandomly varying interval (mean 0.5 s, flat hazard rate from Poisson distribution), the two bundle stimuli appeared simultaneously at pseudorandomly alternating fixed left and right positions on the monitor; each bundle stimulus indicated the same two milkshakes with independently set amounts ( Figure 1B). The participant chose one of the two bundle stimuli by pressing a single button once (left or right computer keyboard arrow for corresponding left or right bundle choice), upon which a green rectangle appeared for 200 ms around the chosen bundle to confirm the choice. There was no time limit on the button press, and most RTs (from appearance of bundle stimuli to button press) were between 0.5 and 4.0 s (medians of 1.98 to 2.58 s depending on distance-to-choice indifference). At 4.0 s after trial start or at 0.5 s after button press, whatever occurred later, all stimuli extinguished, and the participant received either no payout (80% of trials), or a payout (20% of trials); thus, every fifth chosen bundle was paid out on average, using a Poisson distribution. No-payout trials ended here, and a new trial started after an intertrial interval (ITI) of 0.5 s. Thus, given that button press was allowed to occur later than 4.0 s after trial start, the median duration of unrewarded trials was 5.0 s, and total cycle time (trial ϩ ITI) was 5.5 s.
By contrast, in payout trials, both milkshakes of the chosen bundle were delivered; component A immediately after the choice, and component B at a constant interval of 0.5 s after onset of delivery of component A. The constant delay between the two components, rather than simultaneous delivery or pseudorandomly alternating sequential liquid delivery, served to clearly demarcate the two distinct components and maintain their discriminability. Thus, the utility for component B reflected the subjective value of the milkshake itself and a temporal discount because of longer delay. Delivery of each milkshake by the syringe pump system took between 0.5 and 5.0 s depending on amount (up to 8.0 ml); thus, the participant could swallow milkshake A while waiting for milkshake B or mix the milkshakes in the mouth before full swallowing. The trial ended 5.0 s after component B delivery onset, which started 0.5 s after component A delivery onset. Adding these 5.5 s for milkshake delivery to the median trial duration of 5.0 s before payout, payout trials lasted about 10.5 s, and total cycle time (trial ϩ ITI of 0.5 s) was 11.0 s.
The full assessment of 3 ICs with 4 IPs required a total of 504 trails (see below for details). With 20% rewarded trials lasting 11.0 s and 80% unrewarded trials lasting 5.5 s, the total duration of an experiment for each participant was 55 min (0.2 ϫ 504 ϫ 11 s ϩ 0.8 ϫ 504 ϫ 5.5 s ϭ 1,109 s ϩ 2,218 s ϭ 3,327 s ϭ 55 min).
Although participants were instructed to not eat or drink up to 4 hr before testing, satiety was a concern because of the high sugar and fat content of the milkshakes. We addressed the issue by the 20% payout schedule, by limiting each payout to maximally 10.0 ml, and by delivering not more than 200 ml of total liquid to each participant on each session. We did not find evidence for satiety with a specific data analysis.

Psychophysical Assessment of Binary Choice
We used the same standard psychophysical staircase procedure (Green & Swets, 1966) as in our parallel study on rhesus monkeys (Pastor-Bernier et al., 2017) to estimate IPs at which each of two bundles were chosen with equal probability (p ϭ .5 each option), revealing equal preference for each option. We tested seven bundles for obtaining each IP and, to avoid hysteresis, alternated their sequence randomly. We obtained five IPs for estimating each of three ICs. The procedure required repeated testing, which was also necessary for the subsequent neuroimaging experiment with the same participants (to be reported elsewhere).
We advanced in several steps. First, we realized that participants had different sensitivity ranges for milkshake amounts. To assess the appropriate ranges of good discrimination of both milkshake components, we had each participant chose between singlecomponent bundles containing only milkshake B with the amount of 4 ml (the starting amount for the intermediate IC to be estimated in the full procedure) and amounts varying between 0 and 8 ml of only milkshake A. Once these individual ranges had been determined, the Matlab function interp1q was used to set test amounts at five even incremental steps for component A (x-axis) within the 371 PREFERENCES FOR MULTICOMPONENT CHOICE OPTIONS specific sensitive range of each participant. The used test amounts for both component for each of the three IC levels are listed in online Supplemental Materials Tables S1A-C and S2. This initial test was also used to discard data from participants with differences in preferences between the two milkshakes larger than 1:3, which would have resulted in technical difficulties for milkshake delivery and control of satiety.
Once the participant-specific test amounts for component A had been determined, we performed the full psychophysical procedure. We set, in the Reference Bundle, component A to 0.0 ml and component B randomly to either 2.0, 5.0, or 8.0 ml as starting points for the future three ICs (thus, estimating the 3 ICs in random order). Then we set the alternative Variable Bundle as follows: we set the participant-specific test amount of component A one step higher (Step 2 in online Supplemental Materials Table S1A-C), and we randomly selected (without replacement) one of the seven amounts of component B (online Supplemental Materials Table S2); we repeated the selection until all seven amounts had been tested once ( Figure 1D). The seven amounts were even divisions of the vertical space (y-axis) between 0 and 2 ml (lowest IC), 0 and 5 ml (intermediate IC), and 0 and 8 ml (highest IC). We performed this procedure six times to estimate each IP using sigmoid fitting (see Equations 1 and 2 below); thus, requiring 42 choices per IP ( Figure 1D). At each IP, the amount of component B was usually lower in the Variable Bundle compared with the Reference Bundle. In this way, we implemented the marginal rate of substitution (MRS) at each IP that indicated how much of component B the participant was willing to give up to gain one unit of component A, always relative to the constant Reference Bundle.
We obtained three further IPs in choices between the constant Reference Bundle and the Variable Bundle whose amounts of component A increased stepwise (Steps 3, 4, and 5 in online Supplemental Materials Tables S1A-C); thus, advancing from top left to bottom right on a two-dimensional x-y indifference map. As the distance in physical space of component B between the Reference Bundle and the Variable Bundle increased, the confidence intervals (CI) increased also rightward for IPs. This effect was consistent across all participants and ICs (however, differences in slope and curvature between participants reflected genuine properties of IC and did not result from the method). We are aware that the unidirectional progression of testing may lead to somewhat different IP estimates than testing in the opposite direction or in random sequence (Knetsch, 1989). However, in this initial study, we were primarily interested in the systematic assessment of consistent IPs rather than exploring potential pitfalls.
We obtained three ICs with the three starting amounts of component B in the Reference Bundle (2.0, 5.0, or 8.0 ml). Each IC required four psychophysically estimated IPs (from five test bundles); thus, the three ICs required a total of 12 IP estimations. Each of the 12 IP estimations involved seven test amounts and was done six times; thus, requiring a total of 12 ϫ 7 ϫ 6 ϭ 504 choices for each participant.

Becker-DeGroot-Marschak (BDM) Bidding Task
The preferences represented by the ICs should reflect the utility of each IP bundle on a given IC (see Notions 4 and 5 above). To confirm the correspondence between revealed preference and inferred utility with an independent estimation method, we assessed the utility of each IP bundle with a BDM mechanism that is akin to a second price auction and estimates the true subjective value of the participant for the bundle (Becker et al., 1964). The BDM estimates for bundles on IPs should increase with higher ICs but vary only insignificantly along the same IC; thus, following the two-dimensional pattern of preferences represented by the ICs.
We ran the BDM task in the same 24 participants as a separate task a few days after the binary choice task during neuroimaging in pseudorandomly selected 50% of trials immediately after a binary choice trial. During neuroimaging, the participant was lying in an functional magnetic resonance imaging (fMRI) scanner in the supine position (instead of sitting on a chair at a desk) and viewed the stimuli on a computer monitor above the head (instead of the monitor in front). The BDM value bids were then compared with the three levels of revealed preference defined by the IPs estimated in the binary choice task (we compared with revealed preference levels rather than fitted ICs to avoid potential inaccuracies from curve fitting). The data obtained from the binary choice task in the scanner confirmed the currently reported data but will not be further used in the current study; the neuroimaging results will be reported separately.
In the BDM, the participant received a fresh monetary endowment (20 U.K. pence) on each trial. The participant bid for a bundle against a pseudorandomly set computer bid that was retrieved from a normal distribution with replacement; a normal compared with a uniform distribution slightly focusses behavior into the bidding range of our untrained participants without unduly affecting the cost of misbehavior (Lusk, Alexander, & Rousu, 2007), in analogy to the beta distribution used on monkeys (Al-Mohammad & Schultz, 2019). If the participant's bid was higher than or equal to the computer bid then the participant received both component rewards of the bundle and paid an amount equal to the computer bid. If the participant's bid was lower than the computer bid, the participant lost the auction, paid nothing and did not receive any bundle reward. The participant was informed about a win or a loss immediately after placing the bid; when winning the bid, the participant received the bundle rewards in the same sequence and frequency (every fifth trial on average) as in the choice task assessing revealed preferences. Although we did not assess in an objective manner whether the participants understood the BDM, the similarity in payout schedule with the binary choice task and the correlation in performance between the two tasks suggested a sufficient amount of comprehension for making BDM a valid mechanism.
We showed each participant single bundles that were randomly selected (without replacement) from the set of 15 IP bundles on the three revealed preference levels (the same 15 IPs as had been used to fit the three ICs). A given bundle was set to the participant's psychophysically estimated IP. We presented each of the 15 bundles 12 times, resulting in 180 trials in total, and considered the mean of these bids as the BDM-estimated utility. The participant indicated the bid by moving a cursor horizontally on the computer monitor with left and right keyboard arrows. The BDM bid was registered from the cursor position at 5.0 s after onset of presentation of the horizontal bidding scale. 372 PASTOR-BERNIER ET AL.

Statistical Analysis
This analysis allowed us to construct the graphic ICs that represent revealed preferences for two-component choice options. Specifically, we used a probit function to estimate each IP bundle from the psychophysical procedure described above, and we used a hyperbolic regression to obtain each of the three ICs from the respective sets of one start bundle (on the y-axis) and four IP estimations. A separate random effects analysis confirmed that the measured choices were explained by bundle components rather than other regressors.
To estimate the IPs numerically, we obtained a sigmoid fit to the empirically assessed choice frequencies via a general linear regression. To do so, we used the Matlab function glmfit (Matlab R2015b; Mathworks) on a binomial distribution with a probit link function, which is the inverse of the normal cumulative distribution function (G). Specifically, the generalized linear regression y ϭ ␤ 0 ϩ ␤ 1 B var ϩ ε can be rewritten after applying the link function as: with y as number of times the subject chose the Variable Bundle in the current block from a series of six repetitions, ␤ 0 as offset constant, ␤ 1 as regression slope coefficient, B var as reward amount (milliliter) of component B in the Variable Bundle, and ε as residual error. We chose the probit model because it assumes that random errors have a multivariate normal distribution, which makes it attractive as the normal distribution provides a good approximation to many other distributions. The model does not rely on the assumption of error independence and is used frequently by econometricians (Razzaghi, 2013). Further, preliminary data analysis had revealed a slightly better fit for the probit model compared with the logit model (deviance of 0.4623, as twice log-likelihood difference between probit model and maximumparameter model, compared with deviance for logit model of 0.5907; 3,200 trials, five participants). Therefore, we estimated the IPs from the sigmoid fit provided by the probit model, using the following equation: with ␤ 0 and ␤ 1 as coefficients of the general linear regression (Equation 1). We obtained single ICs, separately for each individual participant, from a set of individual IPs by weighted least-mean-square, nonlinear regression (as opposed to the probit regression for estimating each IP). We applied a weight to account for withinparticipant choice variability; the weight was the inverse of the standard deviation of the titrated amount of the B-component at the corresponding IP (the IP having been estimated by the probit regression). We estimated the best-fitting ␤ coefficients from least-mean-square fitting to obtain the equal-preference IC (that is a utility level) and wrote the basic hyperbolic equation in our notation as: with A and B as amounts of components A and B (ml) referring to the x-axis and y-axis, respectively, ␤ 1 and ␤ 2 as slope coefficients of the regressors B and A, and ␤ 3 as curvature coefficient. The overall slope of the IC itself (global MRS) is calculated as y/x; as components B and A extend on the y-axis and x-axis, respectively, and as ␤ is inversely related to the impact of physical amount of the respective component, the IC slope is (1/␤ 1 )/(1/␤ 2 ) or ␤ 2 /␤ 1 . As IC is a constant, we merged the other constants offset (␤ 0 ) and error (ε) into a common final constant k. To draw the ICs, we computed the amount of component B as a function of component A from the derived equation: To graphically display a fitted IC, we plotted the preset amount of component A on the x-axis, and the computed fitted amount of component B (Equation 3a) on the y-axis. The error on the hyperbolic curve was measured as 95% confidence interval (CI). The higher the error around an IP the less weight was given to this point when the IC was calculated. This model resulted in good fits in earlier work (Pastor-Bernier et al., 2017). In this way, the IPs of five equally revealed preferred but differently composed bundles aligned as a single fitted IC. The three ICs representing increasing revealed preference levels (low, medium, and high) were located increasingly farther away from the origin ( Figure 1F). The indifference map of 3 ϫ 5 IPs was unique for each participant (online Supplemental Materials Figure S1).
The IC shape was derived from hyperbolic fit and was quantified by two coefficients: slope and curvature. The IC slope coefficient, derived from the ratio of regression slope coefficients (␤ 2 /␤ 1 ), reflected the currency relationship between the components and described the participant's preference for component A relative to component B. For example, an IC slope of Ϫ60°i ndicated that component A was valued twice as much as the same milliliter amount of component B. The curvature coefficient (␤ 3 ) quantified the constancy in the trade-off between bundle components. A linear curve (curvature coefficient ϭ 0) indicated a constant rate of exchange for the bundle components, suggesting that the components were perfect substitutes. A more convex IC (curvature coefficient Ͼ0) indicated a varying rate of exchange, suggesting that the participant was giving up lesser amounts of component B to obtain one unit of component A when advancing on the IC from top left to bottom right. For a more intuitive measure, we quantified the curvature by measuring the largest perpendicular distance between the IC and the line between the x-axis and y-axis intercepts: with d as maximal perpendicular distance (milliliter; whereas ␤ 3 was a best-fitted, estimated parameter and, thus, less conservative), B IC as amount of component B on the IC (ml), and B linearIC as amount of component B at the line connecting the x-and y-intercepts (constant amount of component A, x-axis; milliliter). This simplified curvature measure reflected the change in trade-off between the two components across the tested range of reward amounts, in milliliter of component B. We used logistic regression on trial-by-trial choices to confirm that the measured choices were explained by bundle components rather than other factors. In a random-effects analysis, we fitted a logistic regression to the data from each individual participant and then averaged the obtained ␤ coefficients and p values across all participants. We used the following regression:  (milliliter), RT as RT (millisecond; interval between appearance of the two bundle stimuli and participant's computer button press), VarPos as left or right position of the Variable Bundle on the computer monitor relative to the Reference Bundle (0 or 1), and PChoice as choice in the previous trial (0 or 1). Each ␤ coefficient was standardized by multiplication with standard deviation of the respective independent variable and division by standard deviation of y (dependent variable). A subsequent one-sample t test against 0 served to assess the significance of the beta (␤) coefficients in the population of the 24 participants. Satiety may have occurred and could have affected the preferences for the two bundle components in an uncontrolled manner, even though the bundle rewards were only paid out on every fifth trial on average and were limited to a total of 200 ml. A prime suspected effect might have been differential devaluation between the two bundle components that would result in changed currency relationship between the two components. We assessed such change between the two components by searching for gradual change in instantaneous choice probability above and below the IPs over six repeated test steps of 2 ϫ 12 trials each (4 IPs on each of 3 ICs; total of 144 choices). We calculated the instantaneous choice probability at each test step, separately above and below the IP, as: y ϭ ⌺ (n ϭ 1 to12)(CV ⁄ 12) with y as instantaneous probability (p ϭ 0.0 to 1.0), CV as choice of Variable Bundle (0 or 1). To analyze the BDM bidding data, we first assessed the basic question whether the monetary bids increased for higher valued bundles (between revealed preference levels) but were similar for equally valued bundles (that constituted IPs; along the same preference levels), using two-way analysis of variance (ANOVA) between and along preference levels, respectively, and confirmation by Spearman rank correlation analysis between preference levels. In addition, we performed a random-effects analysis with a general linear regression with a normal (Gaussian) link function on separate data from each participant and averaged the obtained ␤ coefficients and their p values across participants. We used the following regression: with y as monetary bid, PrefLev as revealed preference level (low, medium, and high), AmBundle as summed milliliter amount of components A and B in the currency of component A (A ϩ (k -(␤ 2 /␤ 1 )A ϩ ␤ 3 A) as in Equation 3a), TrialN as trial number, PrevBid as BDM bid in previous trial (U.K. pence), and Consum as accumulated drinks consumption for component A and component B up to this point in the experiment (milliliter). We included PrefLev and AmBundle as separate regressors to account for their distinction because of partial physical nondominance of bundles on different preference levels, combined with currency differences between the two components. Each ␤ coefficient was standardized by multiplication with standard deviation of the respective independent variable and division by standard deviation of y (dependent variable).
A subsequent one-sample t test against 0 assessed the significance of the ␤ coefficients in all 24 participants. Finally, we compared hyperbolically fitted BDM isolines directly with hyperbolically fitted revealed preference ICs (rather than with revealed preference levels just described), separately for each individual participant (note that the BDM data were acquired in the human imaging scanner and the IC data used in this analysis were acquired in a prior session with the binary choice task outside the scanner). This procedure required to present BDM bids on the same scale as revealed preference ICs. To this end, we fitted isolines of same BDM-bids in analogy to fitting same-preference ICs. We fitted a hyperbolic function to the measured mean BDM bids in analogy to Equation 3: with ␤ 1 and ␤ 2 as regression slopes, and ␤ 3 as curvature coefficients, and A and B as amounts of components A and B (milliliter), respectively. Coefficients ␤ 1 and ␤ 2 were standardized by multiplication with standard deviation of components B and A, respectively (independent variables), and division by standard deviation of BDMBid (dependent variable). We obtained separate ␤ coefficients from all participants and averaged them and their p values across participants. A subsequent one-sample t test used the individual ␤ coefficients from all 24 participants to test overall significance against 0.
To compare BDM bids with ICs, we graphically displayed BDM isolines along which all mean BDM bids were equal. As a BDM isoline is a constant, we merged the constants offset (␤ 0 ) and error (ε) into a common final constant k. To draw the BDM isolines, we computed the amount of component B as a function of component A from the derived equation: To display a three-dimensional map, we graphed colored BDM isoline zones on the z-axis as a function of the amounts of components A (x-axis) and B (y-axis). For a two-dimensional map of BDM isolines, we plotted the preset amount of component A on the x-axis and the amount of component B computed from the isolines (Equation 8a) on the y-axis. For comparison, we plotted the revealed-preference ICs on the same two-dimensional map using the same scale. We also compared numerically, separately for each participant, CIs and slope and curvature coefficients between hyperbolically fitted BDMBids (Equation 8a) and hyperbolically fitted revealed preference ICs (Equation 3a), using the paired Wilcoxon's test.

Comparison With Monkeys
To compare the results of this human study across a closely related species, we reanalyzed existing data from a previous experiment on rhesus monkeys; all methods of this study have been described (Pastor-Bernier et al., 2017). That study tested in an analogous manner the same notions of revealed stochastic preferences stated above. Each monkey chose with a hand touching a horizontally mounted monitor between two visually presented 374 PASTOR-BERNIER ET AL. bundles composed of the same two liquids (fruit juices or water) in varying amounts. We used analogous psychophysical procedures and statistics to estimate IPs and ICs. For the current comparisons, we assessed the accuracy of integration of the two option components by the monkeys with two measures: (a) the 95% CIs of the psychophysical fits to the choice probabilities used for estimating each IP, which indicated how well the animals had estimated the IPs; (b) the CIs of the hyperbolic fits of ICs to all equally preferred IPs, which indicated the accuracy of the trade-off between the two bundle components that how well the animals had estimated the I characterizes the value integration from both bundle components.

Human Revealed Preferences Follow the Graphic Scheme of IC
We obtained individual ICs from each set of five equally revealed preferred IP bundles by hyperbolic fitting (Equations 3 and 3a; Figure 1E and 1F). Such an IC defined the trade-off between the two components of an equally preferred bundle: it indicated how much of component B a participant gave up for obtaining one more unit of component A without a change in utility. Thus, the IC characterized the orderly integration of both bundle components into a single-dimensional estimate. The continuous ICs were asymmetric between the x-axis and y-axis, indicating different subjective weighting of the two milkshakes; the convex IC curvature suggests that lower amounts of both milkshakes together were as much preferred as higher amounts of singular milkshakes (possibly reflecting gradually flattening, concave utility functions, and/or complementarity between high-sugar and high-fat components). Although the ICs varied in slope and curvature between participants, the ICs of bundles with larger reward amounts were located farther away from the origin (for two example participants, see Figure 2A and 2B; for all participants, see online Supplemental Materials Figure S1; the subjective, individual nature of ICs does not allow to assume a common preference and utility scale, which precludes averaging ICs across participants). The three ICs were well ordered and failed to overlap. Only the 95% CI overlapped partly in four of the 24 participants (17%).
Closer inspection of Figures 1F, 2A and 2B, and online Supplemental Materials S1 shows a typical feature of ICs. During the psychophysical assessment of individual IPs, the Variable Bundle was preferred to the Reference Bundle (choice p Ͼ .5) even when component B of the preferred bundle had lower amounts compared with the Variable Bundle at the y-axis anchor ( Figure 1F, arrows), indicating partial physical nondominance (Notions 7 and 8; involving overcompensation by higher amount of component A). As with the trade-off, such preferences for bundles with one physically lower component indicate the integration of the values of both bundle components into single-dimensional utility.
Taken together, the maps of systematic and continuous ICs reflect well the integration of both bundle components into singledimensional utility and preference relationships. As an alternative to these well integrated ICs, lexicographic preferences would not show such integration but follow only a single bundle component, manifested as lines parallel to one of the graph's axes. Taken together, the true, multicomponent-integrating ICs corresponded well to and, thus, seemed to validate empirically, the intuitive graphic schemes of Revealed Preference Theory. The following tests will address the validations numerically.

IC Coefficients
The shape of ICs reflects the trade-off between the components and can be quantified by slope and curvature coefficients of hyperbolic fits (Equation 3a). The global IC slope, between y-axis and x-axis intercepts, was measured as ratio of the two regression coefficients ␤ 2 /␤ 1 in Equation 3; it indicated how much the participant was globally willing to give up to obtain one unit of the other component; the measure reflected the relative utility (currency; global MRS) of the two bundle components. The IC slopes steeper than Ϫ45°indicated that a participant gave up a higher physical amount of component B (high-sugar, y-axis) for a smaller amount of component A (high-fat, x-axis); thus, indicating higher subjective value of fatty than sugary milkshake per unit of ml. The IC slope was Ϫ71°Ϯ 6.5°(mean Ϯ SEM; range Ϫ45°to Ϫ76°; N ϭ 24 participants; Figure 2C). The higher valuation of high-fat component A over high-sugar component B amounted to a factor of 3:1 in 18 of the 24 participants (75%). The predominantly asymmetric trade-off between the two milkshake components documents that each component contributed to bundle preference in its own distinct way.
The IC curvature showed substantial convexity in 18 of the 24 participants (75%), as indicated by ␤ 3 coefficients from the hyperbolic fit (Equation 3) that were significantly larger than 0.0 (8.89; mean Ϯ 5.9 SEM; p Ͻ .05, t test; Figure 2D). For graphically assessing IC curvature, we measured the distance between the IC center and a straight line connecting equally revealed preferred bundles at the x-and y-intercepts, in units of milliliter on the y-axis ( Figure 2E). This distance ranged from 0.09 ml (quasilinear IC) to 3.76 ml (most convex IC; mean of 1.28 ml Ϯ 0.19 SEM; Figure 2F). The distribution of the IC distance was overall similar to that of the ␤ 3 curvature coefficient from Equation 3. The two highest histogram bars in Figure 2F show data from six participants with rather similar, considerably convex IC, and from six other participants with rather similar but quasi-linear IC. Thus, the coefficients confirmed numerically the well-ordered nature of the ICs representing revealed preferences.

Control for Other Choice Variables
To test whether the choices reflected the components of the bundles rather than other, unrelated factors, we performed a logistic regression analysis separately for each of the 24 participants, using the following regressors: amount of each bundle component, RT, Reference Bundle position on participant's monitor, and previous trial choice (Equation 5). Median RTs were 2.10 and 1.98 s with Ͼ1 ml milkshake difference above or below IP, respectively, and 2.58 s with Ͻ1 ml difference from IP.  Figure 2G). The ␤ coefficients for these three variables differed significantly from 0 (p ϭ .012, p ϭ .00088, and p ϭ .00028, respectively; one-sample t test), confirming the validity of the ␤s. Thus, the Variable Bundle was preferred with increasing amounts of either of its components and with lower amounts of the Reference Bundle. The result suggests that both bundle components, rather than a single component alone, were important for the observed preferences. The remaining variables, including RT, position of Reference Bundle on the monitor and previous trial choice, failed to significantly account for current choice of the Variable Bundle (p ϭ .754 Ϯ .003 to p ϭ .988 Ϯ .290). Thus, the revealed preference relationships concerned the bundles with their two components rather than other task factors.
To assess potential consumption effects derived from the flow of the milkshake during the experiment, we searched for signs of satiety. We followed choice probability across the total test duration in each of the 24 participants. We selected two bundles that were situated above and below the IP, respectively. These two bundles contained high-sugar low-fat (above IP) and low-sugar high-fat (below IP) milkshakes. We plotted choice probability over six repeated test steps. Choice probabilities fluctuated without conspicuous upward or downward trend (see Figure 3) and varied only insignificantly (above IP: F(5, 41) ϭ 0.28, p Ͼ .05; below IP: F(5, 41) ϭ 1.53, p Ͼ .05; one-way repeated measures ANOVA). Even at the final, sixth step, choice probability differed only insignificantly from any other step. Thus, the revealed references did not seem to be importantly confounded by satiety for neither sugar nor fat within the amounts and concentrations used in our experiment.

Internal Validation of IPs and ICs
We assessed the contribution of individual IPs to the hyperbolically fitted ICs with three tests. (a) Using a leave-one-out analysis, we compared ICs fitted to all five IPs with ICs fitted with one IP left out and found good correspondence in all of four tests (online Supplemental Materials Figure S2). (b) Using a previously devel-376 oped single-dimensional linear support vector machine (SVM) algorithm (Grabenhorst, Hernadi, & Schultz, 2016;Tsutsui, Grabenhorst, Kobayashi, & Schultz, 2016), and (c) a two-dimensional linear discriminant analysis (LDA), we assessed the accuracy of reversely assigning individual IPs to their original preference level. Both decoders reported between-IC accuracies largely in the 70 -100% range, and the LDA showed only random distinction along-ICs (online Supplemental Materials Figure S3 and Table S3 left); as a control, shuffled data did not discriminate between preference levels (SVM accuracies of 44.7-54.6%; online Supplemental Materials Table S4 left). These procedures confirmed that the hyperbolically fitted ICs captured the IPs consistently and provided valid representations of the revealed preferences for twocomponent choice options. For details, see online supplementary material.

Mechanism-Independent Validation
The ordered representation of revealed preferences can be further validated by comparison with utility inferred from a different estimation mechanism. We used a monetary BDM auction-like bidding task that estimates participants' utility on a trial-by-trial basis. The property of truthful revelation (incentive compatibility) makes BDM an indispensable tool for experimental economics and explains its increasing popularity in neuroscientific studies of human decision making (Medic et al., 2014;Plassmann et al., 2007). We ran this task independently from the binary choice task in an fMRI scanner (fMRI data to be reported separately) and obtained BDM bids in U.K. pence for each of the 15 IP bundles ( Figure 4A). Although BDM bids are assessed as single shots, we repeated each Trial 12 times to approach the nature of stochastic choice for IP estimation. We compared the 12-trial averaged bids to preference levels rather than to ICs to avoid fitting inaccuracies.
A two-way ANOVA showed that BDM bids varied between the three revealed preference levels in all 24 participants (p ϭ 5.37 ϫ 10 Ϫ40 to p ϭ 5.38 ϫ 10 Ϫ116 ) but rarely between the five equally revealed preferred bundles (IPs) on each level (p Ͻ .05; except for three participants; Figure 4B; red, green, blue; online Supplemental Materials Table S5). This variation pattern was confirmed with separate one-way ANOVAS between the three preference levels (p ϭ .038534 to p ϭ 6.86 ϫ 10 Ϫ16 ) and the five IPs along the same preference levels (p ϭ .055 to p ϭ .951; except for three participants p Ͻ .05). A Spearman rank analysis confirmed and refined the result between preference levels; it showed a positive monotonic correlation between bids for bundles on the same preference level (means from all five bundles) and the three levels ( ϭ 0.60 Ϯ 0.05; M Ϯ SEM; N ϭ 24 participants; p Ͻ .01). Thus, the ANOVAS and Spearman correlation showed higher BDM bids between the three increasing revealed preference levels but mostly similar bids for bundles on same preference levels (with the ANOVA being sensitive to bundle amounts, as shown by increasing bids between levels). This BDM result confirmed the pattern of revealed preference seen with binary choice: higher value for bundles on higher preference levels, and similar value at trade-off between bundle components.
Using SVM and LDA decoders, we assessed the accuracy of reversely distinguishing original preference levels from individual BDM bids. Both decoders reported discrimination accuracies largely in the 50 -70% and the 88 -100% range, respectively, between preference levels (online Supplemental Materials Table  S3 right), but the LDA showed only random distinction between bundles on same preference levels (43-51%; online Supplemental Materials Figure S4); shuffled data did not discriminate between preference levels (SVM accuracies of 45.8 -54.7%; online Supplemental Materials Table S4 right). Thus, decoding accuracy confirmed well the two-dimensional scheme of revealed preference represented by the ICs. For details, see online supplementary material.
A stronger mechanism-independent validation of the IC scheme may be achieved by direct graphic comparison between BDM bids and hyperbolically fitted ICs. To this end, we estimated isolines that were fit to BDM bids using Equations 8 and 8a and compared them with ICs that had been hyperbolically fitted to IPs (Equations 3 and 3a). The BDM isolines represented BDM bids from 12 averages, and the revealed preference ICs derived from fits to IPs estimated from 42 stochastic choices. The BDM isolines increased for IPs on increasing preference levels (farther away from the origin), but were similar for IPs on the same revealed preference level ( Figure 5A). The BDM isolines matched the revealed preference ICs within their 95% CIs in every one of the 24 participants ( Figure 5B). Statistical comparisons showed significantly higher CIs of BDM isolines compared with revealed preference ICs ( Figure 5C; p Ͻ .0884 ϫ 10 Ϫ8 , Wilcoxon's paired test; N ϭ 24 participants). Despite their larger variability, both BDM isoline slope and curvature coefficients, derived from the respective ␤ 2 / ␤ 1 -ratio and ␤ 3 regression coefficient in Equation 8, failed to differ significantly from the respective slope and curvature coefficients of revealed preference ICs (Equation 3; Figure 5D and 5E; both p Ͼ .05, Wilcoxon's test on BDM vs. IC coefficients paired from each participant; N ϭ 24). Thus, despite larger variability, the BDM bids matched well the revealed preference ICs when assessed in a comparable way. Taken together, the mechanism-independent validation with BDM bids provided good confirmation of the graphic representation of revealed preferences by the ICs and, thus, strengthens the validity of these conceptual schemes.

Performance Comparison With Nonhuman Primates
The systematic IPs reflecting the trade-off between the two bundle components resulted in well-ordered ICs of our human participants that reflected systematic integration of utilities of all option components. The question arises to what extent these basic measures of multicomponent integration in humans compared with nonhuman primates whose lack of verbal interaction makes their behavior primarily dependent on experienced reinforcement. Using the same economic concepts and experimental design as in our human participants, we had tested two rhesus monkeys during several months and estimated Ͼ600 IPs that conformed to three to six convex or linear, negatively sloped ICs with each of five different types of two-component bundles containing blackcurrant juice, grape juice, water, and strawberry juice ( Figure 6A-E; Pastor-Bernier et al., 2017). The ICs were consistent, as shown by out-of-sample prediction, transitivity, and independence of number of choice options; thus, reflecting the animals' extended laboratory experience and allowing comparison with human performance.
We used two measures for the decision process underlying revealed preferences for multicomponent choice options. First, to assess the accuracy of the estimated IPs, we computed the 95% CIs of the psychophysical fits to the choice probabilities when assessing individual IPs (which were used for constructing ICs). The CIs for IPs in humans were in a similar range but slightly smaller in comparison with all but one of the five bundle types in monkeys ( Figure 6: compare F with G-K); the higher human accuracy was ). Second, we tested the accuracy of the fit of ICs to the IPs that were derived from the trade-off between the two bundle components; this trade-off critically reflected the integration of the two bundle components into single-dimensional measures. To this end, we computed the CIs of the hyperbolic IC fits to the estimated IPs. A narrow CI would indicate a good match of ICs to IPs and, thus, a good trade-off and utility integration between the two bundle components. The human participants showed slightly smaller CIs and, thus, higher accuracy, than the monkeys (p ϭ .0018; Figure 6M). Thus, despite differences in number of participants (N ϭ 24 for humans, N ϭ 2 for monkeys), task experience (single days for humans, several months for laboratory monkeys) and reward (milkshakes for humans, fruit juices and water for monkeys), the CI comparisons for IPs and ICs suggested comparable, although slightly better, choice accuracy in humans compared with the monkeys. Besides these encouraging results, the species comparison suggests robust correspondence of empirical ICs to the schematic graphs; thus, validating the theoretical concepts.

Discussion
This study used formalisms of Revealed Preference Theory to empirically investigate decisions for multicomponent rewards. We measured stochastic choices of small, tangible, and imme-diately consumable milkshake rewards that were delivered in repeated trials in a controlled, nonverbal laboratory setting. The estimated ICs were systematic and corresponded well to graphic economic schemes that are used for interpreting human choices (Kreps, 1990;Laidler & Estrin, 1989;Mas-Colell et al., 1995;Perloff, 2009;Varian, 1992). The similarity in performance accuracy with monkeys suggested maintained preference mechanisms across evolutionarily gaps. Further comparisons between the two species, in particular concerning the nature of preferences indicated by IC shape, should be done with caution, as IC shape was also influenced by bundle composition that differed between the species. The experimental feasibility opens the possibility for concept-driven, empirical neuroimaging studies of rational and irrational behavior in normal and brain-damaged individuals.
Our experimental controls and comparisons attested to the validity of these measurements; the ICs were asymmetric and nonlinear, failed to overlap, complied with decoder predictions and correlated with independent BDM bids. The integration of multiple bundle components was particularly evident in the continuous, graded trade-off along ICs. Preference was unchanged when reduction in one bundle component was compensated by increase in another bundle component, and bundles with one smaller component could even be preferred to bundles with a larger component when the other component compensated (partial physical non- The BDM bids were made for bundles that had been placed on psychophysically estimated revealed preference indifference points (black dots). The BDM isolines were similar along same levels and increased between revealed preference levels. (B) Match between hyperbolically fitted isolines of mean BDM bids (black; Equations 8 and 8a) and hyperbolically fitted revealed preference ICs (blue, green, red; Ϯ 95% confidence intervals, CI, shaded) from a typical participant. Thus, the BDM isolines fell within the respective CIs of the revealed preference ICs. (C) CIs of fitted BDM isolines (red) and revealed preference ICs (blue) from all 24 participants (averaged along each isoline/IC). The larger 95% CIs of BDM isolines (BDMiso) suggest more variability compared with revealed preference ICs. (D, E) Comparison of slope and curvature coefficient estimates, respectively, between BDM isolines (red) and revealed preference ICs (blue; same data as shown in Figure 2C and 2D) from all 24 participants. See the online article for the color version of this figure.

379
PREFERENCES FOR MULTICOMPONENT CHOICE OPTIONS dominance; Figure 1F). Such choices suggested proportionate contributions of individual bundle components.
The representation of vectorial, multicomponent choice options by single-dimensional neural signals is an open issue in experimental neuroeconomics. Our behavioral tests provide a formal, concept-driven foundation for investigating such signals in humans. The immediately consumed, small payouts facilitate comparisons with animal studies and control for temporal discounting. The tangible payouts after small sets of trials do not rely on language and should reduce mental ruminations about future rewards and assure reliable cooperation by the participants; thus, reducing interfering and confounding brain activity. Future neuroeconomic work on underlying decision mechanisms should par-ticularly benefit from the empirical estimation of whole maps of well-ordered ICs derived from multiple IPs that conform to predictive mathematical functions; thus, avoiding to test preferences for a few bundles with limited general validity. For example, different neurons in the orbitofrontal cortex of monkeys combine both bundle components into a common scalar neural signal or code each component separately (requiring later integration for contribution to the decision; Pastor-Bernier et al., 2019). The systematic ICs may also help to investigate neural underpinnings of specific theories, such as the switching of attentional processes between components conceptualized by multialternative decision field theory (Roe, Busemeyer, & Townsend, 2001). Finally, our detailed ICs would be helpful for investigating neural mechanisms  380 underlying inconsistent decision making when choice options are added (independence of irrelevant alternatives, IIA), beyond the decoy effects already being addressed (Chung et al., 2017;Gluth et al., 2017). Our multitrial approach corresponds to the standard psychophysical elicitation of choice functions (Green & Swets, 1966;Sutton & Barto, 1998), allows comparison with animal studies (Kagel et al., 1975;Pastor-Bernier et al., 2017), and was tailored to the statistical requirements of neural studies. In humans and monkeys, these methods deliver systematically varying, graded choice probabilities rather than single, all-or-none selection. The visible trial-by-trial variations are assumed to reflect underlying random processes that make the choice process stochastic, as captured by well worked-out stochastic choice theories that facilitate data interpretation (Luce, 1959;McFadden, 2005;McFadden & Richter, 1990;Stott, 2006). We appreciate that our multitrial schedule is at odds with the frequently used, standard, single-shot, deterministic assessment of ICs in experimental economics (Mac-Crimmon & Toda, 1969;Thurstone, 1931; but see Mosteller & Nogee, 1951), and the obtained consistent and robust ICs seem to validate the approach.
Economic choice often involves substantial but imaginary sums of money or consumer items, or random, singular payouts (Rieskamp et al., 2006;Simonson, 1989;Tversky & Simonson, 1993). By contrast, our payout schedule was tailored to requirements of neural studies in humans and animals and allowed immediate reward consumption over many trials (while controlling for satiety). These behavioral choices resembled small daily activities, such as consuming snacks and drinks, and were met by the good motivation of our participants. In this way, we obtained well-ordered ICs without requiring hypothetical items or large sums of imagined money.
Previous investigations of multicomponent economic choice have revealed several inconsistencies, including reference biases, difference between willingness-to-pay and willingness-to-sell, and violation of independence of irrelevant alternatives such as similarity, compromise, asymmetric dominance, and attraction effects (Bateman, Munro, Rhodes, Starmer, & Sugden, 1997;Knetsch & Sinden, 1984;Rieskamp et al., 2006;Simonson, 1989;Tversky, 1972;Tversky & Simonson, 1993). These phenomena may be because of revealed preferences being constructed at the time of choice rather being fixed (Dhar & Novemsky, 2008;Kivetz et al., 2008;Payne et al., 1999;Simonson, 2008;Warren et al., 2011) or reflect the adaptive nature of biological processes (Li et al., 2018;Soltani et al., 2012). We aimed to avoid interference from adaptive processes by designing stable and highly reproducible test conditions in a well-controlled laboratory environment, nonreliance on verbal report, single, uninterrupted test sessions, singular changes of bundle components, constant direction of testing (from top left to bottom right on the revealed preference map), and preventing satiety by limiting total milkshake intake to 200 ml. These measures may explain why our IPs remained stable over successive testing steps. We used the exact same conditions for eliciting BDM bids, which may have facilitated their correspondence to revealed preference ICs. With these testing conditions, we avoided known compromising factors that might hinder identification of basic factors underlying choice irregularities.
Our experiment used a limited range of closely related, basically substitutable reward types; thus, the validity of our results is restricted to that range and might not necessarily generalize to bundles with more varied kinds of rewards. Apes, monkeys, dogs, and pigeons may display seemingly irrational preference patterns for bundles of food items with very different values. According to the "selective value effect," an animals may assign value primarily or exclusively to a preferred reward item and partly or completely forego less desired items, rather than revealing more graded preferences (Pattison & Zentall, 2014;Silberberg et al., 1998). One possible explanation might be that consumption of a less preferred reward item would delay acquisition of the next preferred reward item (Beran et al., 2009), but many other reasons might exist (Zentall, 2019). However, none of these tests assessed ICs, and the results are difficult to compare with ours. With more formal testing, some of these anomalies may have been comparable with lexicographic preferences that concern only a single component and become evident by ICs that run parallel to one of the axes; our study failed to find lexicographic preferences. Further, our study only observed preference ratios up to 3:1, which suggested reasonable acceptance of both bundle components rather than generating selective value effects (see the IC slopes in Figures 2A and  2B and online Supplemental Materials S1). Our parallel study on monkeys tested a wider range of liquid rewards and found similar two-dimensional ICs and no evidence for major violation, refuting possible reasons in species difference. Taken together, choices of multicomponent bundles may be more prone to irregularities than choices of single-component options, but our experiment was too limited to address this tissue in an exhaustive manner.
The coefficients of hyperbolic fits to the ICs characterize numerically the representation of revealed preferences. The slope coefficient indicates the relative weighting of the two bundle components. For example, the amount of equally revealed preferred single milkshakes (graphed at the respective x-axis and y-axis intercepts of the two-dimensional map) was lower for highfat (component A) than high-sugar (component B) milkshakes in our participants, which was represented by a IC slope steeper than Ϫ45 degrees; thus, participants would have revealed preferred high-fat over high-sugar milkshakes if they came in same amounts. Such asymmetric IC slopes are also seen with various bundles in monkeys (Pastor-Bernier et al., 2017). Key reasons for the visible asymmetry on the graphic axes may be the physical amount scale and the absence of a simple common physical scale for the fat and sugar content of the milkshakes. Further, the slope for identical bundle compositions varied between our participants, which demonstrates an additional subjective component in revealed preferences. Despite these scaling and subjectivity issues, our estimated ICs had well-ordered slopes and failed to overlap.
The convex IC curvature in most participants indicated that disproportionately smaller amounts of combined fat-sugar milkshakes (at IC center) were equally revealed preferred as larger amounts of milkshakes containing primarily fat or sugar (closer to the axes). The BDM isolines confirmed the convex curvature for probably the same reasons. A previous study on food snacks also found higher BDM bids for fat-sugar combinations than for fat or sugar alone, despite similar calories (DiFeliceantonio et al., 2018). However, not all ICs need to be convex; bundles with one unattractive component showed concave ICs in primates (Pastor-Bernier et al., 2017). The IC convexity may be ascribed to concave utility of each bundle component (Perloff, 2009); more of the same component (closer to IC x-or y-intercepts) has lower marginal gain 381 PREFERENCES FOR MULTICOMPONENT CHOICE OPTIONS and, therefore, the decision-maker is willing to trade in more of an abundant, undervalued component for one unit of a less abundant, more valued component. Thus, the defining slope and curvature coefficients suggest that the ICs reliably represented well-ordered preferences for the milkshake bundles.
The estimating mechanism of BDM bids differed substantially from that underlying revealed preference ICs. Bidding occurred along a single scale and involved the physical movement of a cursor rather than hitting choice buttons between two simultaneous options. BDM bidding is incentive compatible, such that erroneous or deceiving bids lead to suboptimal outcome, as conceptualized by the expected cost of misbehavior (Lusk et al., 2007); bidders should state exactly their subjective value to avoid paying an exaggerated price (with too high bids) or foregoing the desired object (with too low bids). With these properties, BDM bidding constituted a well-controlled, authoritative test for eliciting true subjective values and provided a useful validation mechanism for preferences revealed by binary choice. Indeed, the obtained SVMand LDA-consistent BDM bids followed the preference scheme of ICs, namely higher bids for revealed preferred bundles and similar bids with choice indifference despite varying bundle composition. Most strikingly, hyperbolically fitted BDM isolines closely resembled the hyperbolically fitted revealed preference ICs in graphics and numerics (see Figure 5). The only notable difference was higher BDM bid variability. Our data align well with, and extend, the previously noted correspondence between binary choices and BDM bids for single-component options and bundle choices on paper or via verbal communication (Roosen, Marette, & Blanchemanche, 2010). This correspondence is particular interesting in light of conflicting and unclear accounts of economic choice and utility, such as the unresolved distinction between inherent and constructed revealed preferences (Payne et al., 1999;Simonson, 2008;Warren et al., 2011) and the fundamental question whether utility, supposedly underlying both revealed preferences and BDM, may not be a required inferred variable, and choices may simply involve heuristics (Piantadosi & Hayden, 2015;Vlaev, Chater, Stewart, & Brown, 2011). Whatever the answer might be, the BDM bids validated the empirical assessments of the ICs and confirmed their representation of revealed preferences.
The revealed preferences in humans indicated a similar level of integration of the two-component choice options as in rhesus monkeys (Pastor-Bernier et al., 2017) and, in a more general way, also observed in rodents (Kagel, Battalio, & Green, 1995;Kagel et al., 1975). In two monkeys, the estimated IPs conformed to three to six convex or linear, negatively sloped ICs with five types of two-liquid bundles. Preferences for other bundles were characterized by concave or positively sloped ICs reflecting satiety or disfavored juices. The validity of preference representation by the consistent and orderly ICs in monkeys was shown by out-ofsample prediction from homothetic polynomials, various transitivity tests, axiomatic compliance with independence of number of choice options, and classifier prediction. In contrast to the unexperienced humans, the monkeys had several months of training in well-controlled, stable laboratory conditions. The reward difference (milkshakes in humans vs. juices in monkeys) and the difference in IC estimation (unidirectional IP progression vs. random alternation) failed to prevent the general similarity of ICs between the two species, despite known hysteresis issues (Knetsch, 1989). Only quantitative differences were observed, with higher precision in humans, and it remains to be seen which of the different factors might account for these differences: cognition, task experience, and/or other factors. Thus, the robustness of performance across species is reassuring for the validity of our experimental procedures and demonstrates the basic and precise nature of this economic decision process with revealed stochastic preferences for multicomponent choice options.