An experiment on information presentation and investor mutual fund selection

Abstract Our experiments evaluate the role of information presentation in reducing violations of the Law of One Price in individual investor selection of index mutual funds. The results indicate that most individuals fail to minimize fees. However, individuals allocate nearly 27% (43%) more of their investment dollars to the lowest fee index mutual fund when receiving fee information in the form of a table compared to a graph presentation (in ten-year rather than one-year form). Overall, a simple change from table to graph fee presentation results in a statistically and economically significant reduction in the fees paid by investors.


Introduction
Index mutual funds provide an experimental environment well suited to the analysis of individual investor decisions. While allocation amongst actively managed mutual funds requires the investor to consider both expected (but unknown) future returns and fees, for index funds, only a single factor must ultimately be evaluated, that is, fees. This is because index funds seek to track the returns of the underlying index. As such, in the case of Standard and Poor's (S&P) 500 Index mutual funds, for example, future before-fee returns will be identical for all funds and equal to the return of the S&P 500. Given the commodity-like nature of S&P 500 index funds, the rational course of action for investors is to seek out the lowest fee fund.
Despite this fact, there continues to be a wide range of fees charged by S&P 500 Index mutual funds. For example, Vanguard's 500 Index Fund (Admiral Shares) offers an expense ratio of 0.04% and the Rydex S&P 500 Fund Class A offers an expense ratio of 1.65% as of June 2022. Experimental evidence confirms that investors do not minimize fees when selecting amongst S&P 500 Index funds, a clear violation of the Law of One Price (Choi et al., 2010). Iannotta and Navone (2012) note that price dispersion for homogenous products (including index funds) is often considered an indirect measure of market inefficiency.
Various explanations for this behavior include search costs, marketing, return chasing, diversification bias, and financial literacy (Choi et al., 2010;Mauck & Salzsieder, 2017). While there is emerging literature on investor debiasing interventions, relatively less is known about efforts to reduce individual investor misallocations in general, to our knowledge, no effort has been undertaken specifically in the context of index mutual funds.
The primary focus of this paper is to determine whether, and to what extent, violations of the Law of One Price in S&P 500 Index Funds can be reduced through changing information presented to investors. The experimental design is structured to eliminate nearly all behavioral explanations for investors' failure to minimize fees in this context. Investors are asked to allocate $10,000 amongst four generically named S&P 500 Index Mutual Funds. They can allocate all of their money to a single investment or spread out the investment so long as the sum is $10,000. The only information provided pertains to fees. As such, marketing (generic fund names instead of brand names), return chasing (no historical returns reported), and search costs (single construct of fees) are not likely to play a role, and it may be that simplifying the information set yields better investor decisions.
In addition to the removal of potentially distracting or confusing disclosures (i.e., no prospectus or historical returns provided), we examine if disclosure information affecting fees can influence investor selection. Specifically, each of our two experiments evaluates whether tables or graphs are associated with better investor decisions. Similarly, we examine the role of presenting fees in percentage or dollar form. Finally, while our first experiment focuses only on fees from one year, the second experiment seeks to make the longer-term implications of fees more salient by providing total fees over ten years.
Our experiment is motivated by the emerging literature on the role of data visualization and information presentation in reducing behavioral biases in real-world decision-making. Most closely related to our work, Newall and Parker (2019) seek to debias investors' irrational preference for return chasing in mutual fund selection. Their results indicate that a statement of warning about using past performance yields the best results, and they also find support for converting annual fees to ten-year equivalents. However, we note that their context (actively managed mutual funds) and intervention approach (statements versus visual presentation) differ significantly from our study.
Outside of the mutual fund literature, a variety of data visualization and investment presentation methods have been linked to improved decision-making. Rosdini et al. (2020) examine whether information presentation, visual illusions, and mood are related to biased interpretation of financial statements. They find that data presented in table format provides better accuracy than information presented in graph form. Isler et al. (2022) examine default and education nudges in financial literacy training. They find that while educative nudges are promising, better techniques are needed. Anic and Wallmeier (2020) find that a probability histogram revealing loss probabilities in structured finance products influences the perceived attractiveness of the investment. Harcourt-Cooke et al. (2022) provide experimental evidence indicating that comics can improve the usability of disclosure, leading to wider adoption of a product. Scholl and Fontes (2022) note that investor financial knowledge and regulatory disclosure are closely interrelated. In order to better understand individual financial literacy they develop a measure of mutual fund knowledge. The results of their survey indicate that only 14% of participants were able to correctly compute fees associated with mutual funds and they conclude that roughly 80% of the participants lack the understanding to make informed mutual fund decisions. They provide an example using exactly the context of our study, S&P 500 index funds. They note that an investor allocating $100,000 could see a nearly $80,000 difference in fees paid over 20 years as a result of mistakenly selecting a higher fee fund. This evidence is consistent with the literature indicating that individual investors are relatively naïve (Huang et al., 2022) in mutual fund allocation decisions relative to professional investors.
While our study concentrates on index funds, the literature provides evidence on the dispersion of fees for actively managed funds. Iannotta and Navone (2012) find that fee dispersion is driven in part by search costs and Mansor et al. (2015) find that fees have a severe negative impact on fund performance. Further work has focused on the role of uncertainty in decision making (Ashraf et al., 2022;Ma et al., 2022) as well as ethical concerns (Al Rahahleh & Bhatti, 2022;Al Rahahleh et al., 2019;Guo et al., 2022).
The results of our experiments yield consistent and clear conclusions. The results indicate that even when given a very simple allocation decision stripped of much of the real-world complexity faced by individuals, investors consistently fail to minimize fees. Regardless of the treatment used in the two experiments, no more than 17% (29%) of participants in the first (second) experiment make the optimal allocation decision of placing 100% of funds into the lowest fee fund. This is consistent with Mauck and Salzsieder (2017) who find that no more than 22% of participants minimize fees in a similar experiment which potentially includes historical returns in addition to fee information. We are unable to eliminate the diversification bias identified in their paper because such a bias is inherently possible given the option to diversify investment dollars. Second, using tables results in statistically and economically significantly less biased allocation than graphs. Individuals allocate nearly 27% (15%) more of their investments to the lowest fee index mutual fund when receiving fee information in the form of a table compared to a graph in the first (second) experiment. This is consistent with Rosdini et al. (2020) who find evidence of the superiority of tables compared to graphs in terms of accurately interpreting financial statements. Third, the presentation format of fees in dollars or percentages is not linked to significantly different allocation decisions. This is consistent with Choi et al. (2010) &Love (2015) who find that reframing fees as a dollar cost is either marginally effective or even counterproductive when the dollar cost is low (i.e., less than $15). Finally, although we are unable to statistically test differences between the two experiments, our results suggest that presenting fees over ten years leads to greater allocation to the lowest fee fund. In particular, the total dollar amount allocated to the lowest fee fund in experiment two is 43% higher than in experiment one. This is consistent with Newall and Parker (2019) who find evidence of improved decision-making when fees are converted from annual to ten-year equivalents.
This study builds on previous work that examines how individual investors make decisions based on whether a table or graph is displayed (Cardoso et al., 2016;DeSanctis, 1984;Harvey & Bolger, 1996). The debate among which format is superior varies significantly between empirical studies from various fields of study from the social sciences to managerial decision making (Eberhard, 2021). An important distinction of this study compared to previous work is that the focus is on subjects who are not known to have any formal investment background. Studying how subjects with no known investment background make decisions related to investment choices can provide insight into how to best display financial product selections to naive investors.
The results of our experiment add to an emerging literature on debiasing individual investor decision-making. While the results point to simple to implement promising improvements, we note that even in the best-case scenarios in our experiments, the vast majority of participants fail to minimize fees and a relatively large amount of money remains allocated to the highest fee fund. Thus, debiasing investors to eliminate the violation of the Law of One Price in S&P 500 Index Mutual Funds is not achieved in our experimental designs.
More work is required to better understand the underlying causes of the misallocation as well as the evaluation of potential mitigation efforts.

Experiments
This study examines two different experimental designs that were both focused on whether and why individual investors are willing to pay higher fees to purchase identical assets (index funds). In both experiments, subjects are asked to allocate $10,000 into four index funds. The four funds have different fees associated with them. Both experiments consist of a 2 (table vs. graph) x 2 (dollars vs. percentage) between-subjects full factorial design. The dependent variable is the amount of money allocated to the lowest fund fee. The range of fees was selected based on actual prevailing fees on S&P 500 Index Funds offered as of the time of the experiment, although the names of the funds are not provided to participants to avoid the potential for marketing to impact the decision. The two factors for both experiments were presentation format (table vs. graph) and unit of measurement (dollars vs. percentage). The selection of factors is based on prior research indicating that, at least in the context of interpreting financial statement information accurately, tables outperform graphs (Rosdini et al., 2020) and that dollars versus percentage for fees could be an important distinction (Choi et al., 2010;Newall & Love, 2015).
The difference between the two experiments is the way the fee information is presented. Experiment 1 displays the fees in either dollars or percentages using simple tables and bar charts (Panel A of Appendix 1A) based on one-year of fees. Experiment 2 displays the fees associated with the funds in terms of an ending balance (Panel B of Appendix 1A) based on ten-year fees. Experiment 2 is motivated by the Newall and Parker (2019) finding of improved decision-making when fees are converted from annual to ten-year equivalents.

Subjects
We recruited a sample of 448 crowdsourcing subjects from Amazon Mechanical Turk using CloudResearch (Chandler et al., 2019;Palan & Schitter, 2018) for both experiments. Appendix 2A discusses how the sample size was determined. The average time for the researchers to read the survey instrument was 2 minutes and 44 seconds. Any subject that completed the survey instrument in less than 2 minutes and 44 seconds was excluded. We included an attention-check question that asked, "Which of the following is NOT a shape?" The options were "circle," "square," "blue," and "triangle." Subjects that did not select "blue" as the answer were removed from the analysis based on best practices for using MTurk subjects (Aguinis et al., 2021).

Experimental materials
The materials place the subjects in the position of an individual investor who has $10,000, which must be invested in their personal portfolio. Subjects are informed that they will be presented some information about four index funds and then will be asked to make a judgment about how they would allocate $10,000 into those four index funds. The first manipulated variable, the presentation format (table vs. graph), was designed to test the effect of varying levels of visual formats on investor allocations. The second manipulated variable, unit of measurement ($ vs. %), was designed to test the effect of varying levels of these two measurement frames on investor allocations. 1

Subjects
Experiment 1 consisted of 379 subjects. A total of 69 participants were excluded from Experiment 1 in the final analysis due to either completing the survey instrument too quickly or not passing an attention check. Eighteen subjects did not respond to the manipulation check, and 36% (128 of the total 360 manipulation check responses) correctly identified their presentation format when the manipulation check was performed.
The mean age of subjects in Experiment 1 was 42. A total of 62% of the subjects were female and 38% of the subjects were male. Out of these, 30% of the subjects had completed high school and 37% had an associate degree or higher.

Analysis of minimizing fees
According to past research on index funds, "In instances where fees differ, the only economically rational behavior is to make a 100% allocation to the lowest fee fund" (Mauck & Salzsieder, 2017, p. 49). Table 1 shows the proportion of subjects that minimized fees for all treatments.
In Table 2, Panel A, we report the mean amount allocated to each fund by treatment. On average, subjects allocated more money to the lowest fee fund (fund 2) versus the other funds. On average, for all treatments except the graph/dollar level, subjects allocated more money to the lowest fee fund (fund 2) versus the other funds.
Even if subjects did not make an optimal decision and allocate 100% of funds to the lowest fee fund, we expect most allocations to be skewed towards the lowest fee fund (Fund 2). A many-to-one multiple comparison analysis of treatment means is presented in Table 2, Panel B. This analysis performed two-sided and one-sided t-tests. The one-sided tests allow for testing if the mean amount allocated to the lowest fee fund is significantly greater than the mean amount allocated to all other funds across the different treatments. Since the goal of this multiple testing is to identify treatments with a significantly "better" outcome (that is, on average, more money allocated to the lowest fee fund), a one-sided test, in addition to the two-sided test, is appropriate (Rafter et al., 2002).
It is common practice to adjust the p-values in the presence of multiple comparison tests to reduce the Type I error rate (Benjamini & Hochberg, 1995;Lee & Lee, 2018). While the Bonferroni correction is used extensively in many disciplines, it is not always the most appropriate p-value adjustment (Blakesley et al., 2009;Carmer & Swanson, 1973;Castaiieda et al., 1993;Félix & Menezes, 2018). Whether or not to use the Bonferroni correction depends on the circumstances of the research study (Armstrong, 2014). Some studies have suggested that the Bonferroni method should be used when the number of multiple tests is small, that is, 5 tests or less (Bender & Lange, 2001). The main disadvantage of using the Bonferroni adjustment is that it reduces the power of the statistical tests (Armstrong, 2014;Nakagawa, 2004;Olejnik et al., 1997). Since this study will perform more than 5 multiple tests and keeping the power of the statistical tests is important, this study will employ the Holm p-value correction (commonly referred to as the Bonferroni-Holm procedure) (Holm, 1979). The Holm adjustment has been shown to be as effective as the Bonferroni correction and increases statistical power (Abdi, 2010;Aickin & Gensler, 1996;Eichstaedt et al., 2013).
The multiple comparison tests displayed in Table 2 Panel B indicate that, on average, the amount allocated to the lowest fee fund is significantly different from all the other funds for treatments using a table, except for the table/dollar level. However, the table/dollar level is quite close to being significant at the 0.05 level of significance. The multiple comparison tests also indicate that the mean amount allocated to the lowest fee fund is significantly greater than the average amount allocated to all other funds for all treatments that contained a table, regardless of the unit of measurement. Only one of the treatments with a graph (graph, %) resulted in the mean amount allocated to the lowest fee fund being significantly greater than the mean amount allocated to fund 1.
These results suggest that regardless of the unit of measurement, displaying the fees in a table form leads to more money being allocated to the lowest fee fund rather than presenting the fees in a simple bar chart based on the one-sided tests. *significant at the 0.05 level **this test was not performed since the mean amount allocated to fund 3 for this treatment was more than the mean amount allocated to the lowest fee fund (fund 2). Table 3 shows how the subjects allocated the $10,000 among the funds for each treatment.

Analysis of the optimal allocation
Overall, only 13% (48 of 379) allocated the entire $10,000 to the lowest fee fund (minimized fees). Overwhelmingly, across all treatments, subjects diversified their money (bold row in Table 3) by selecting multiple funds. Our results are consistent with Scholl and Fontes (2022) who find that only 14% of participants in their survey were able to correctly calculate fees. Subjects were specifically told in the "Key Points to Remember" and "Frequently Asked Questions" introductory sections of both experiments that mutual funds have fees associated with them that lower investment returns (Appendix 3A). This result illustrates the desire for subjects to diversify even when it is not financially beneficial. Indeed, in an untabulated analysis regarding a qualitative question asking participants why they made the allocation that they did, the most common category of answers was related to a desire to diversify.
To measure the extent to which the treatments affected the subjects' propensity to engage in a naïve diversification strategy, Table 4 compares the concentration of funds by treatment, using a concentration measure based on the Euclidean distance from the perfectly even distribution (Beshears et al., 2011). This measure ranges from 0 ($2500 allocated equally across four funds) to ffi ffi ffi ffi ffi ffi ffi 3 = 4 q = 0.87 ($10,000 allocated entirely to a single fund) (Beshears et al., 2011). The mean concentration for all subjects was 0.45, with a standard deviation of 0.30. The average concentration values for all treatments are roughly halfway between 0 and 0.87. Therefore, the treatments applied to the subjects do not seem to lead the subjects to deviate from the naïve diversification strategy of equal allocations among the four funds.   Table 5 shows how subjects rated the importance of various factors for their investment choices on a fivepoint Likert scale in the form of post-experimental questions (PEQs). The PEQs solicited responses about the subject's self-identification of confidence in their allocation and the influence to diversify and minimize fees in their selection among their investment choices. There were five possible responses, from "not confident at all" to "extremely confident" for the confidence question, and "not at all influential" to "extremely influential" for the influence to minimize fees and diversify. Integer values 1 through 5 were assigned to each possible response, with higher integers corresponding to greater importance. The choice of a five-point scale for the confidence, desire to diversify, and minimize fees PEQs was based upon the Financial Management Behavioral Scale. All questions on this seminal survey related to financial behaviors use a five-point scale (Dew & Xiao, 2011). Responses with five to seven points have been shown to have higher reliability measures associated with them versus two to three points (Boateng et al., 2018). The majority of subjects ranked the middle category as "somewhat important/influential" as their response to the three PEQs. Sixty-seven percent of subjects reported they were somewhat confident, slightly confident, or not confident at all in their allocation decision. This result supports that accuracy and confidence are associated (Tekin & Roediger, 2017). For Experiment 1, most subjects were not confident in their allocation decision and most subjects were not accurate in their allocation decision. That is, most subjects did not select to allocate the entire $10,000 to the fund with the minimum fee (see Table 3). For the questions with the influential Likert scale, 75% reported that a desire to diversify was either somewhat influential, slightly influential, or not influential at all. This self-reported result is a contradiction of the decisions the subjects made in Experiment 1. Table 3 displays how most subjects did diversify the $10,000. Lastly, 63% of subjects reported that minimizing fees was either somewhat influential, slightly influential, or not at all influential. Table 3 supports this selfreporting statistic due to most subjects not selecting the fund with the minimum fee. The PEQs were also analyzed based upon which subjects minimized fees and those who diversified. The one-way ANOVA (Analysis of Variance) test was performed to compare subjects' fee minimization and the PEQs. Applying parametric statistical tests to Likert-scale data is more robust than applying non-parametric tests, even when normality assumptions are grossly violated (Norman, 2010). Recommendations from the literature suggest that when a Likert scale has five or more categories, the scale can be considered continuous (Harpe, 2015). Table 6 displays the descriptive statistics for the PEQs and the results of the one-way ANOVA test. A one-way ANOVA revealed there was a statistically significant difference in the mean responses to the PEQ related to the desire to diversify between the subjects who did and did not minimize fees (i.e., diversified). This result indicates that subjects who realized there was no benefit to diversifying over multiple funds self-reported that diversifying was not as important to them (mean of 1.7 vs. 2.4).

Analysis of post experimental questions
It is interesting to note that the subjects that diversified did not rank, on average, that diversification was very important to them, with a mean of 2.4. However, it is somewhat counterintuitive to see that the mean influence of minimizing fees between both groups is not statistically significant. This result indicates that even among those that did minimize fees, the influence of the desire to minimize fees was no different from those who diversified. These results indicate that more work is needed to better understand why subjects are not minimizing fees when it is in their best interest to do so. One potential explanation is the Scholl and Fontes (2022) finding that roughly 80% of individual investors lack the knowledge to invest in mutual funds.

Subjects
Experiment 2 consisted of 384 subjects. A total of 64 participants were excluded from Experiment 2 in the final analysis due to either completing the survey instrument too quickly or not passing an attention check. A total of 33 subjects did not respond to the manipulation check, and 41% (143 of the total 351 manipulation check responses) correctly identified their presentation format when the manipulation check was performed.
The mean age of subjects in Experiment 2 was 42. A total of 62% of the subjects were female and 38% of the subjects were male. A total of 29% of the subjects had completed high school and 42% had an associate degree or higher.

Analysis of minimizing fees
As in Table 1 for Experiment 1, Table 7 shows the proportion of subjects that minimized fees for all treatments in Experiment 2. In Table 8 Panel A, we report the mean amount allocated to each fund by treatment. On average, subjects allocated more money to the lowest fee fund (fund 2) versus the other funds. On average, for all treatments, subjects allocated more money to the lowest fee fund (fund 2) than to the other funds.
The multiple comparison tests displayed in Table 8 Panel B are all significant for both the onesided and two-sided tests. The raw p-values from the tests were adjusted similarly to Experiment 1. All the raw p-values from the one-sided and two-sided tests were<0.0001. A p-value of 0.0001 was used when the adjustment was made, which resulted in an adjusted p-value of 0.0004 for all tests. Since all the adjusted p-values are same, an analysis of the magnitude of the t-statistics is useful to determine where the largest difference (standardized difference) between the mean amount allocated to the lowest fee fund (fund 2) and the other funds exists. The larger the t-statistic indicates that more money, on average, was allocated to fund two versus the other funds. The two largest t-statistics are associated with the treatments that consisted of a table with either unit of measurement, with the dollar unit of measurement being slightly higher. The majority of the larger t-statistics are associated with the treatments that contained tables. Again, these results suggest that displaying the fees in a table form leads subjects to allocate more, on average, to the lowest fee fund when compared to other funds across the different treatment levels. Table 9 shows how the subjects allocated the $10,000 among the funds for each treatment. Overall, only 23% (88 of 379) allocated the entire $10,000 to the lowest fee fund (minimized fees). Overwhelmingly, again with Experiment 2, across all treatments, subjects diversified their money (bold row in Table 9). This result from Experiment 2 illustrates the desire for subjects to diversify even when it is not financially beneficial. Table 10 compares the concentration of funds by treatment, using a concentration measure explained for Experiment 1. The mean concentration for all subjects was 0.46, with a standard deviation of 0.31. The lowest concentration value corresponds to the graph/percentage treatment. This treatment seems to have the least effect on moving subjects away from naïve diversification behavior. This effect can also be observed in Table 9. The graph/percentage treatment has the highest percentage of subjects that diversified their funds. Similarly, as in Experiment 1, the concentration measures for the other treatments are roughly halfway between 0 and 0.87. Therefore, the treatments applied to the subjects do not seem to lead the subjects to deviate from the naïve diversification strategy of equal allocations among the four funds. Table 11 shows how subjects rated the importance of various factors for their investment choices in a similar manner as described in Experiment 1. As in Experiment 1, the majority of subjects ranked the middle category "somewhat important/influential" as their response to all three PEQs. The self-reported results of the PEQs for Experiment 2 is similar to Experiment 1. Sixty-three percent of subjects reported they were somewhat confident, slightly confident, or not confident at all in their allocation decision. For Experiment 2, most subjects were not confident in their allocation decision and most subjects did not select to allocate the entire $10,000 to the fund with the minimum fee (see Table 9). For the questions with the influential Likert scale, 74% reported that a desire to diversify was either somewhat influential, slightly influential, or not influential at all. This self-reported result is a contradiction of the decisions the subjects made in Experiment 2. Table 9 displays how most subjects did diversify the $10,000. Fifty percent of subjects reported that minimizing fees was either somewhat influential, slightly influential, or not at all influential. This statistic is less than the self-reported statistic from Experiment 1. The lower self-reported minimizing fees percentage in Experiment 2 may be supported by the fact that more subjects did allocate all $10,000 to the lowest fee fund compared to Experiment 1. Table 12 displays the descriptive statistics for the PEQs and the results of the one-way ANOVA test for Experiment 2. A one-way ANOVA revealed there were statistically significant differences in the PEQs related to the confidence of the subject's allocation and their desire to diversify between the subjects who did and did not minimize fees (i.e., diversified).

Analysis of post experimental questions
Interestingly, a significant difference was found with regard to confidence for Experiment 2. Panel of Appendix 2A shows how the graphs and tables were presented in Experiment 2. The graphs and tables used in Experiment 2 are more "complex" than those used in Experiment 1. By "more complex," we mean more rows and columns in the tables and a stacke bar chart was used instead of a simple bar chart. In both groups (those that did and did not diversify), the confidence in the allocation decision is higher for Experiment 2 versus Experiment 1.  Similarly, as found in Experiment 1, it is somewhat counterintuitive to see that the mean influence of minimizing fees between both groups is not statistically significant. This result again indicates that even among those that did minimize fees, the influence of the desire to minimize fees was no different from those who did not minimize fees.

Results and discussion
As indicated by both the wide range of fees on S&P 500 Index Mutual Funds in the marketplace and experimental evidence, investors fail to minimize fees when selecting such funds. This behavior violates the Law of One Price, which indicates that commodity-like goods such as S&P 500 Index Mutual Funds should see convergence in fees. While this deviation from optimal behavior  has been well-studied, there has been less focus on the interventions that may improve investor decision-making.
In this study, we extend a growing literature on the potential for information presentation to improve general decision-making. To our knowledge, no such effort has been made in the context of S&P 500 Index Mutual Funds. This product provides a context ideal for experimentation because, contrary to actively managed mutual funds and other similar investment products, all index funds will have identical future returns. As such, the optimal decision is knowable a priori, and investors should seek to minimize fees.

Conclusions
Our experiments yield some promising conclusions. The use of tables leads to better investment allocations than the use of graphs when displaying fee information. Further, providing fees based on their tenyear costs rather than one-year costs improves investor allocations. Yet, the results also indicate a need for considerable additional research given that even in the best-case scenarios of our experiments, the vast majority of participants fail to minimize fees and often allocate substantial amounts of money to the highest fee fund. Our results do not address possible causes of the continued failure to minimize fees, but a likely driver is the aforementioned lack of mutual fund understanding amongst individual investors (Scholl & Fontes, 2022). Finally, our experiments provided much less information than investors would be faced with in the real world. Despite the relatively simple disclosure and decision to be made, the average result is strongly suboptimal. Thus, it will be important to better understand why investors misallocate when selecting index mutual funds to better provide disclosures that lead to wealth-enhancing decisions for individual investors.

Citation information
Cite this article as: An experiment on information presentation and investor mutual fund selection, Billie Anderson, Nathan Mauck & Leigh Salzsieder, Cogent Economics & Finance (2023), 11: 2190214. Note 1. A manipulation check was structured as a multiplechoice question as follows: In the scenario that you just read and completed, how was the fee information presented to you? The subjects were provided a link that showed them the four possible treatments (panels A and B of Appendix 1). The instructions stated to close the window with the treatments and navigate back to the survey instrument and select the presentation format the subject was presented. The analyses in this study are based on responses from all subjects for both experiments, regardless of whether they passed the manipulation check or not (Aronow et al., 2019).

Disclosure statement
No potential conflict of interest was reported by the authors.
sample size computations in sufficient detail to enable a knowledgeable reader with the original data to be able to verify the computations (Kang, 2021). It has been reported in the accounting discipline that more researchers need to conduct sample size studies for experiments (Kim et al., 2018). Another compelling reason to provide the details of a statistical design like sample size calculations is directly related to the reproducibility crisis that is occurring in many scientific disciplines (Al Rahahleh et al., 2019).
a power of 80% was chosen. A power value of 80% is a standard choice for statistical tests (Ellis, 2010).
This study uses a level of significance (α) of 0.05. The scientific community has a long history of using 0.05 as a standard for scientific inference (Kennedy-Shaffer, 2019;Manderscheid, 1965).

Effect size
The effect size is the difference in the mean response values among the different treatments that the researcher wishes to detect (O'brien & Castelloe, 2004). The effect size is determined by the hypothesized treatment means and standard deviations the researcher proposes. The hypothesized treatment means represent the desired possible scientific meaningful difference a researcher is willing to detect (High, 2007). These hypothesized means can be from past studies, the researcher's own empirical experience, or what the researcher hypothesizes will be the results of the designed experiment. In this study, the researchers are hypothesizing the desired hypothetical results. Table A2.2 displays the hypothesized means the authors expect subjects will allocate to the lowest fund fee for each treatment combination. Figure A2.1 provides the SAS code used to determine the sample size for Experiment 1, along with the output from PROC GLMPOWER. The SAS output indicates that for this study to achieve a power of 80%, a sample size of 448 subjects is required.