Multinomial models with linear inequality constraints: Overview and improvements of computational methods for Bayesian inference

doi:10.1016/j.jmp.2019.03.004

Journal of Mathematical Psychology

Volume 91, August 2019, Pages 70-87

https://doi.org/10.1016/j.jmp.2019.03.004 Get rights and content

Highlights

•
Psychological theory often leads to inequality-constrained multinomial models.
•
Constraints are defined by inequalities or the convex hull of a set of vertices.
•
We develop a Gibbs sampler for Bayesian estimation using either representation.
•
We offer improved methods for model testing using the encompassing Bayes factor.
•
The R package multinomineq implements the proposed methods.

Abstract

Many psychological theories can be operationalized as linear inequality constraints on the parameters of multinomial distributions (e.g., discrete choice analysis). These constraints can be described in two equivalent ways: Either as the solution set to a system of linear inequalities or as the convex hull of a set of extremal points (vertices). For both representations, we describe a general Gibbs sampler for drawing posterior samples in order to carry out Bayesian analyses. We also summarize alternative sampling methods for estimating Bayes factors for these model representations using the encompassing Bayes factor method. We introduce the R package multinomineq , which provides an easily-accessible interface to a computationally efficient implementation of these techniques.

Introduction

Multinomial random variables form the backbone of discrete and categorical data analysis within psychology and the behavioral sciences. The key to any viable data analysis is the successful translation of an abstract theoretical hypothesis into a concrete, statistical model. As a simple example, consider the hypothesis that overconsumption of drugs (i.e., taking more tablets than prescribed) decreases with the number of daily doses (Paes, Bakker, & Soe-Agnie, 1997). To assess the validity of this prediction, one could test the statistical hypothesis that overconsumption is identical across all dosage regimes. If this hypothesis is rejected, one could carry out subsequent analyses to determine if the rates differ across the dosage conditions in a pairwise fashion. Yet, testing the “straw-man” model of all dosage conditions resulting in identical rates of overconsumption is not necessarily a faithful translation of the original hypothesis, rather, it is a means to an end, serving only as a pretext to carrying out tests on multiple pairs of dosage conditions.

To make this example more concrete, suppose we have three dosage regimes of the drug (i.e., once, twice, and three times daily) in a between-subjects design (Paes et al., 1997). We model the number of participants showing overconsumption in each condition as a binomial random variable and define the parameters $θ_{1}$ , $θ_{2}$ , and $θ_{3}$ as the corresponding probabilities that an individual takes more tablets than prescribed. While we could test whether the three $θ_{i}$ parameters are equal across all conditions (i.e., $θ_{1} = θ_{2} = θ_{3}$ ), this does not directly follow from our original hypothesis which only specified a monotonic relationship between overconsumption and dosage regimen. Testing the hypothesis of interest requires specifying an ordering relationship imposed on the overconsumption rates for each of the three dosage conditions: $θ_{1} \geq θ_{2} \geq θ_{3} .$ Paired with the binomial likelihood function, these order constraints represent a more faithful statistical analysis of the hypothesis being tested (see also Hoijtink, 2011 for a full discussion). Testing order constraints such as these, and linear inequality constraints more generally, requires a bit more effort than simpler tests of equality, but, as we show, can be carried out efficiently and are more interpretable.

A key difficulty in analyzing inequality-constrained models and theories is that it can quickly become difficult to characterize the resulting restricted parameter space (e.g., Davis-Stober, 2012, Fishburn, 1992). Our drug dosage example is quite simple—indeed, for Eq. (1), there are only two non-redundant pairwise order constraints, namely, $θ_{1} \geq θ_{2}$ and $θ_{2} \geq θ_{3}$ . When combined with the inequality constraints that the probability of overconsumption must be between zero and one for all conditions (i.e., $0 \leq θ_{i} \leq 1$ ), this completely characterizes the ordering relationships of interest. However, not all interesting hypotheses are so simple in structure. As we illustrate in Section 5.3, the random preference model of Regenwetter and Davis-Stober (2012) is far more complex with 75,834 non-redundant linear inequalities.

In general, bounded, linearly restricted parameter spaces can be defined in two different, yet equivalent, ways (Brøndsted, 2012). First, the restricted parameter space can be defined as the solution space to a system of a finite number of linear inequalities and equalities — similar to our drug dosage example. Alternatively, the same restricted parameter space can be defined as the convex hull of a set of extremal points (vertices). Let $θ = (θ_{1}, θ_{2}, θ_{3})$ . For our simple dosage example, the set of all extremal points is the set of all vectors, $θ$ , where each entry is equal to 0 or 1 and satisfy the above inequalities, which yields the set: $(0, 0, 0)$ , $(1, 0, 0)$ , $(1, 1, 0)$ , and $(1, 1, 1)$ . Section 1.1 shows that it is often relatively easy to derive these vertices by enumerating all patterns that are predicted by a psychological theory even though it may be difficult to specify the corresponding system of inequality constraints (Regenwetter & Robinson, 2017).

Irrespective of how inequality constraints are formally specified, their statistical analysis has been a long-standing issue in mathematical psychology (Iverson & Falmagne, 1985) and statistics in general (Barlow et al., 1972, Robertson et al., 1988, Silvapulle and Sen, 2004). In classical statistics, basic results regarding the asymptotic distribution of the likelihood ratio test are valid when testing equality constraints, but are not when testing inequality constraints (Davis-Stober, 2009, Silvapulle and Sen, 2004). As a remedy, methods for inequality-constrained models have recently been developed in the Bayesian framework (Hoijtink et al., 2008, Karabatsos, 2005, Klugkist et al., 2005, Myung et al., 2005, Sedransk et al., 1985) or based on minimum description length (Heck et al., 2015, Klauer and Kellen, 2015, Rissanen, 1978). Multinomial models with inequality constraints have also been applied to the Bayesian analysis of contingency tables (e.g., Agresti and Hitchcock, 2005, Klugkist et al., 2010, Laudy and Hoijtink, 2007, Lindley, 1964). However, general-purpose software packages for Bayesian statistics such as JAGS (Plummer, 2003) or Stan (Stan Development Team, 2018) are often not suited for the analysis of models with complex inequality constraints. This is due to the fact that the boundary of the constrained parameter space is specified as a, typically complex, function of multiple parameters. As a result, the parameters are highly inter-dependent and often cannot be defined independently (for a counterexample with simple constraints, see Heck & Wagenmakers, 2016).

This article considers computational methods of carrying out Bayesian analyses on multinomial models with linear inequality constraints on the parameters. However, we go further than analyzing simple “toy” models such as the dosage example above and consider models defined by arbitrarily complex linear constraints on multinomial parameters. Analyzing this class of model is known to be computationally challenging, especially for highly complex linear constraints as those defined by random preference models (Smeulders, Davis-Stober, Regenwetter, & Spieksma, 2018) and the axioms of additive conjoint measurement (Karabatsos, 2018). In the following, Section 1.1 highlights the relevance of inequality-constrained multinomial models for testing psychological theories. In Section 2, we introduce the notation, likelihood, and prior for multinomial models and the two types of representations for inequality constraints. Section 3 extends existing computational methods for binomial models with specific order constraints (e.g., Karabatsos, 2005, Myung et al., 2005) to multinomial models with arbitrary sets of linear inequalities. More precisely, we develop a general Gibbs sampler for parameter estimation and offer improved computational methods for estimating the encompassing Bayes factor for carrying out Bayesian model selection. Section 4 develops these methods for models that are specified by a set of predicted patterns using the vertex representation. This is useful, as defining a restricted model may be straightforward for one type of representation but not the other, while switching between representations can be computationally infeasible (Avis, Bremner, & Seidel, 1997). In Section 5, we offer the R package multinomineq (Heck & Davis-Stober, 2019) and show how to apply inequality-constrained multinomial models in practice using concrete examples. Finally, Section 6 discusses the analysis of nested data, the choice of priors, and possible directions for future research.

Inequality constraints on multinomial parameters can arise in a number of ways. Similar to our drug consumption example, they can arise “organically” by directly instantiating the hypothesis of interest. For this example, the inequalities are implied by the natural hypothesis that the response categories should be ordered by dosage regimen. In this way, inequality constraints can provide a direct evaluation of the hypothesis of interest, in contrast to other, heuristic methods such as testing the equality of all three dosage condition parameters and then carrying out additional, post hoc analyses to determine directional differences. In later sections, we will consider other examples of linear inequality constraints that arise naturally from theoretic hypotheses that are more complex than simple order restrictions (Hilbig & Moshagen, 2014).

While not immediately obvious, linear inequality constraints can also arise when evaluating theories/models/axioms in which multiple predictions are made. Such theories are quite common, especially in the field of judgment and decision making. For example, consider the well-known transitivity of preference axiom (Regenwetter, Dana, & Davis-Stober, 2011). Depending upon an individual’s tastes, there are many ways for a decision maker to have transitive preferences over a set of choice alternatives. Evaluating multiple predictions of a theory simultaneously within a multinomial framework opens up additional ways to operationalize this theory of interest. As an example, we consider methods of stochastic specification for deterministic theories, although we note that the application of such methods (e.g., mixture methods) extends beyond the decision making domain (Davis-Stober, Morey, Gretton, & Heathcote, 2016).

Many psychological theories predict deterministic choice patterns across different contexts (e.g., different types of stimuli, items, conditions, measurement occasions, or pre-existing groups). For instance, a theory might provide a specific response pattern such as “participants prefer Option A over B in each of five choice scenarios” (Bröder & Schiffer, 2003). Often, however, theories predict more than one response pattern. As illustrated in Section 5.2 for the description-experience gap in the domain of risky gambles (Hertwig, Barron, Weber, & Erev, 2004), the hypothesis that participants assign more weight to small probabilities results in multiple predicted patterns. The complete set of predicted patterns can be obtained in different ways (Regenwetter & Robinson, 2017), for instance, by (a) translating a verbal theory into predicted patterns, (b) deriving algebraic implications of axioms or formal theories, and (c) brute force enumeration of all of the predictions made by the deterministic theory, typically under a set of theory-specific assumptions (e.g., theory parameter values).³ Irrespective of how the theoretical predictions are derived, observed choice frequencies are inherently noisy and exhibit a certain amount of variance both within and across persons or contexts. Hence, the question arises of how to define a stochastic model for empirical frequencies based on a set of deterministic predicted patterns (Carbone and Hey, 2000, Heck et al., 2017, Regenwetter and Davis-Stober, 2012, Regenwetter and Davis-Stober, 2018).

In multinomial models, each predicted choice pattern can be represented by a vector of probabilities of either one (an option is deterministically chosen) or zero (an option is not chosen; Bröder & Schiffer, 2003). Fig. 1 illustrates this for two independent binomial probabilities $θ = (θ_{1}, θ_{2})$ of preferring Option A over B in a control and an experimental condition, respectively. The three black points in Fig. 1A show three predicted patterns of a hypothetical theory that are represented by the vectors $v^{(1)} = (0, 1)$ , $v^{(2)} = (1, 1)$ , and $v^{(3)} = (1, 0)$ . For instance, the pattern $v^{(3)} = (1, 0)$ represents the prediction that Option A is chosen in the control condition (since $θ_{1} = 1$ ) whereas Option B is chosen in the experimental condition (since $θ_{2} = 0$ ).

To derive a stochastic model based on a set of predictions $v^{(s)}$ , it is important to consider why a psychological theory makes multiple predictions in the first place (Regenwetter & Robinson, 2017). A theory might assume that one of the predicted patterns consistently describes the “true” data-generating mechanism across all measurement occasions. According to this interpretation, theory-inconsistent responses merely emerge from unsystematic errors in responding (e.g., due to inattention) whereas latent preferences are stable. In our example, this assumption results in a stochastic model with two independent error probabilities for the two conditions. These error probabilities serve as free parameters and are usually constrained to be below a predefined, fixed threshold such as 20%. In Fig. 1B, this independent-error model is illustrated geometrically by square boxes around the three predicted patterns.

Alternatively, a theory might assume that latent preference states randomly fluctuate across measurement occasions (e.g., across time, persons, or situations), whereas the response process is error-free (Regenwetter & Robinson, 2017). This means that at each measurement occasion, one of the predicted patterns describes the “true” data-generating mechanism perfectly. However, since we do not know which latent states generated the responses in which trials, this error specification leads to a finite mixture model over the predicted patterns (Regenwetter et al., 2014). Fig. 1C shows the parameter space of this mixture model for our example. Essentially, the model permits only those probability vectors $θ$ that are inside the triangle obtained by connecting the three predicted preference patterns by straight lines (i.e., $θ_{11} \geq 1 - θ_{21}$ ). Geometrically, this area is the convex hull of the finite number of predicted patterns $v^{(s)}$ and defines a convex polygon in two dimensions (cf. Eq. (8)). More generally, for $D = 3$ choice probabilities, the convex hull results in a convex polyhedron, and for arbitrary number of probabilities $D$ , this geometric object is known as a convex polytope (Koppen, 1995, Suck, 1992).

The present paper is concerned with mixture models as that illustrated in Fig. 1C. Theoretically, these models assume random variation in the latent, data-generating process, which can be represented statistically as a mixture distribution over the finite set of predicted patterns $v^{(s)}$ (Regenwetter & Robinson, 2017). The parameter space of these models can equivalently be described by specifying explicit linear inequality constraints on choice probabilities (e.g., $θ_{i} \leq θ_{j}$ ), or by the convex hull of all response patterns $v^{(s)}$ that are predicted by a theory. These mixture models are quite general and, depending upon the experimental design, can provide a strong test of the theory/axiom of interest. For example, applied to a single individual with choice responses aggregated over multiple time points, a violation of a mixture model over a set of predictions provides evidence that this individual must have violated the theory of interest; as the model allowed for an arbitrary distribution over all possible theory-consistent preferences.

Section snippets

Multinomial models with linear inequality constraints

In this section, we outline the notation, likelihood function, and prior distribution of multinomial models and introduce the two equivalent formal representations of linear inequality constraints.

Bayesian inference using the inequality representation

In this section, we summarize and improve computational methods for the Bayesian analysis of multinomial models given a set of linear inequality constraints.

Bayesian inference using the vertex representation

In the following, we develop computational tools for obtaining posterior samples and computing the Bayes factor for inequality-constrained multinomial models that are defined by the $V$ -representation. Instead of providing a set of inequalities as in the $A b$ -representation, the $V$ -representation uses an $S \times D$ matrix that contains one vertex $v^{(s)}$ (e.g., a predicted pattern) per row as illustrated in Eq. (8). For many psychological theories, it is indeed easier to obtain a list of all admissible

The R package multinomineq

We implemented the above computational methods for multinomial models with convex, inequality constraints in C++ using the linear-algebra library Armadillo (Sanderson, 2010). This has the advantage that many of the sequential computations can efficiently be performed using precompiled code. To also make the methods available to a broad audience, the functions are embedded in the R package multinomineq, which is freely available on GitHub (www.github.com/danheck/multinomineq/; Heck &

Discussion

In mathematical psychology in general and judgment and decision making in particular, many theories can be formulated by a set of linear inequality constraints on multinomial models (Iverson, 2006). This includes representational measurement theory (Karabatsos, 2001, Krantz et al., 1971), state-trace analysis (Prince et al., 2012), decision axioms such as transitivity (Myung et al., 2005, Regenwetter et al., 2011), random utility models (for a review, see Marley & Regenwetter, 2017), and

References (87)

AvisD. et al.
How good are convex hull algorithms?
11th ACM symposium on computational geometry
Computational Geometry
(1997)
BamberD. et al.
How to assess a model’s testability and identifiability
Journal of Mathematical Psychology
(2000)
CyrusM. et al.
Generalized two- and three-dimensional clipping
Computers & Graphics
(1978)
Davis-StoberC.P.
Analysis of multinomial models under inequality constraints: applications to measurement theory
Journal of Mathematical Psychology
(2009)
Davis-StoberC.P.
A lexicographic semiorder polytope and probabilistic representations of choice
Journal of Mathematical Psychology
(2012)
Davis-StoberC.P. et al.
Individual differences in the algebraic structure of preferences
Journal of Mathematical Psychology
(2015)
Davis-StoberC.P. et al.
Extended formulations for order polytopes through network flows
Journal of Mathematical Psychology
(2018)
Davis-StoberC.P. et al.
Bayes factors for state-trace analysis
Journal of Mathematical Psychology
(2016)
DoignonJ.-P. et al.
Primary facets of order polytopes
Journal of Mathematical Psychology
(2016)
FishburnP.C.
Induced binary probabilities and the linear ordering polytope: a status report
Mathematical Social Sciences
(1992)

HeckD.W. et al.

From information processing to decisions: formalizing and comparing probabilistic choice models

Cognitive Psychology

(2017)

HeckD.W. et al.

Adjusted priors for Bayes factors involving reparameterized order constraints

Journal of Mathematical Psychology

(2016)

HeckD.W. et al.

Testing order constraints: qualitative differences between Bayes factors and normalized maximum likelihood

Statistics & Probability Letters

(2015)

IversonG.J.

An essay on inequalities and order-restricted inference

Journal of Mathematical Psychology

(2006)

IversonG. et al.

Statistical issues in measurement

Mathematical Social Sciences

(1985)

KarabatsosG.

The exchangeable multinomial model as an approach to testing deterministic axioms of choice and measurement

Journal of Mathematical Psychology

(2005)

KlauerK.C. et al.

The flexibility of models of recognition memory: the case of confidence ratings

Journal of Mathematical Psychology

(2015)

KlauerK. et al.

Parametric order constraints in multinomial processing tree models: an extension of knapp and batchelder (2004)

Journal of Mathematical Psychology

(2015)

KlugkistI. et al.

The Bayes factor for inequality and about equality constrained models

Computational Statistics & Data Analysis

(2007)

KoppenM.

Random utility representation of binary choice probabilities: critical graphs yielding critical necessary conditions

Journal of Mathematical Psychology

(1995)

McCauslandW.J. et al.

Prior distributions for random choice structures

Journal of Mathematical Psychology

(2013)

MyungJ.I. et al.

A Bayesian approach to testing decision making axioms

Journal of Mathematical Psychology

(2005)

RissanenJ.

Modeling by shortest data description

Automatica

(1978)

SmeuldersB. et al.

Testing probabilistic models of choice using column generation

Computers & Operations Research

(2018)

StephanK.E. et al.

Bayesian model selection for group studies

NeuroImage

(2009)

SuckR.

Geometric and combinatorial properties of the polytope of binary choice probabilities

Mathematical Social Sciences

(1992)

WetzelsR. et al.

An encompassing prior generalization of the Savage–Dickey density ratio

Computational Statistics & Data Analysis

(2010)

AgrestiA. et al.

Bayesian inference for categorical data analysis

Statistical Methods & Applications

(2005)

AssarfB. et al.

Computing convex hulls and counting integer points with polymake

Mathematical Programming Computation

(2017)

BarlowR.E. et al.

Statistical inference under order restrictions: Theory and application of isotonic regression

(1972)

BröderA. et al.

Bayesian strategy assessment in multi-attribute decision making

Journal of Behavioral Decision Making

(2003)

BrøndstedA.

An introduction to convex polytopes

(2012)

CarboneE. et al.

Which error story is best?

Journal of Risk and Uncertainty

(2000)

CavagnaroD.R. et al.

A model-based test for treatment effects with probabilistic classifications

Psychological Methods

(2018)

ChristofT. et al.

Porta - polyhedron representation transformation algorithm

(1997)

Davis-Stober, C. P., Brown, N., & Cavagnaro, D. R. (2018). Erratum to Davis-Stober et al. (2015): individual...

DevroyeL.

Non-uniform random variate generation

(1986)

EfronB. et al.

Data analysis using Stein’s estimator and its generalizations

Journal of the American Statistical Association

(1975)

FukudaK.

Frequently asked questions in polyhedral computation

(2004)

GelfandA.E. et al.

Bayesian analysis of constrained parameter and truncated data problems using Gibbs sampling

Journal of the American Statistical Association

(1992)

GhoshM.

Objective priors: an introduction for frequentists

Statistical Science

(2011)

HaafJ.M. et al.

Some do and some don’t? Accounting for variability of individual difference structures

Psychonomic Bulletin & Review

(2019)

HeckD.W.

A caveat on the Savage-Dickey density ratio: the case of computing Bayes factors for regression parameters

British Journal of Mathematical and Statistical Psychology

(2019)

Cited by (26)

Order-constrained inference to supplement experimental data analytics in behavioral economics: A motivational case study
2023, Journal of Behavioral and Experimental Economics
A common approach to theory testing in behavioral and experimental economics relies on null hypothesis significance testing via (generalized) linear regression models. Here, we showcase order-constrained inference as an alternative route to theory testing. Order-constrained inference can improve the precision and nuance of behavioral decision analytics. For example, the method can be leveraged to quantify the evidence in support of, or against, a given hypothesis. It also offers advanced model selection tools for quantitative competition among multiple theories. To illustrate our case for order-constrained methods, we re-analyze data from a pre-registered experiment on incentives, cognitive reflection, and dishonest behavior. Building on this publicly available dataset, we further highlight the advantages of Bayesian order-constrained inference. We discuss how the method can deliver more convincing and more nuanced evidence than frequentist null hypothesis significance testing, pointing to new research avenues for supplementing and expanding on experimental designs in behavioral economics.
An illustrated guide to context effects
2023, Journal of Mathematical Psychology
Three context effects pertaining to stochastic discrete choice have attracted a lot of attention in Psychology, Economics and Marketing: the similarity effect, the compromise effect and the asymmetric dominance effect. Despite this attention, the existing literature is rife with conflicting definitions and misconceptions. We provide theorems relating different variants of each of the three context effects, and theorems relating the context effects to conditions on discrete choice probabilities, such as random utility, regularity, the constant ratio rule, and simple scalability, that may or may not hold for any given discrete choice model. We show how context effects at the individual level may or may not aggregate to context effects at the population level. Importantly, we offer this work as a guide for researchers to sharpen empirical tests and aid future development of choice models.
Cultural consensus theory for two-dimensional location judgments
2023, Journal of Mathematical Psychology
Cultural consensus theory is a model-based approach for analyzing responses of informants when correct answers are unknown. The model provides aggregate estimates of the latent consensus knowledge at the group level while accounting for heterogeneity in informant competence and item difficulty. We develop a new version of cultural consensus theory for two-dimensional continuous judgments which are obtained when asking informants to locate a set of unknown sites on a geographic map. The new model is fitted using hierarchical Bayesian modeling. A simulation study shows satisfactory parameter recovery for realistic numbers of informants and items. We also assess the accuracy of the aggregate location estimates by comparing the new model against simply computing the unweighted average of the informants’ judgments. A simulation study shows that, due to weighing judgments by the inferred competence of the informants, cultural consensus theory provides more accurate location estimates than unweighted averaging. The new model also showed a higher accuracy in an empirical study in which individuals judged the location of 57 European cities on maps.
Bayesian inference for generalized linear model with linear inequality constraints
2022, Computational Statistics and Data Analysis
Bayesian statistical inference for Generalized Linear Models (GLMs) with parameters lying on a constrained space is of general interest (e.g., in monotonic or convex regression), but often constructing valid prior distributions supported on a subspace spanned by a set of linear inequality constraints can be challenging, especially when some of the constraints might be binding leading to a lower dimensional subspace. For the general case with canonical link, it is shown that a generalized truncated multivariate normal supported on a desired subspace can be used. Moreover, it is shown that such prior distribution facilitates the construction of a general purpose product slice sampling method to obtain (approximate) samples from corresponding posterior distribution, making the inferential method computationally efficient for a wide class of GLMs with an arbitrary set of linear inequality constraints. The proposed product slice sampler is shown to be uniformly ergodic, having a geometric convergence rate under a set of mild regularity conditions satisfied by many popular GLMs (e.g., logistic and Poisson regressions with constrained coefficients). One of the primary advantages of the proposed Bayesian estimation method over classical methods is that uncertainty of parameter estimates is easily quantified by using the samples simulated from the path of the Markov Chain of the slice sampler. Numerical illustrations using simulated data sets are presented to illustrate the superiority of the proposed methods compared to some existing methods in terms of sampling bias and variances. In addition, real case studies are presented using data sets for fertilizer-crop production and estimating the SCRAM rate in nuclear power plants.
TUTORIAL: “With sufficient increases in X, more people will engage in the target behavior”
2020, Journal of Mathematical Psychology
Citation Excerpt :
This uncertainty about convergence, together with the potentially slow convergence overall, is the main price to pay for having an algorithm that applies to a very broad collection of models, including models that combine inequality with equality constraints. For these hypotheses, none of which involve equality constraints, the user who requires very high accuracy and confidence in the Bayes factors can either supplement or replace this Bayes factor calculation with a draw-and-test algorithm that is currently available only in the Matlab code version of QTest (as well as the R package of Heck & Davis-Stober, 2019).15 The most parsimonious hypotheses, such as Hypotheses 1 & 2 generate very small Bayes factors that unambiguously rate the evidence against these hypotheses as decisive in Targets 1 & 3, and strong in Target 2.
Psychological theory should guide the method. A method should not dictate theory. Extraneous assumptions entering psychological theories through the backdoor of a method may differentially affect the analysis of different data sets. This introduces noise and jeopardizes successful replication of valid theoretical claims. Auxiliary theoretical assumptions can also bias substantive conclusions (including across replications). It is therefore becoming ever more crucial that theoretical claims genuinely represent the given theory, no more, no less. Recent work has highlighted a disconnect between some theories and their ‘predictions,’ questioned the scope of theories in the presence of heterogeneity in hypothetical constructs, and developed methods to avoid extraneous assumptions. This tutorial merges these strands of research using a simple, illustrated case study on formulating and testing order-constrained theories. The tutorial applies to empirical paradigms in which scholars can state ordinal constraints on the outcome probabilities for several binary variables such as binary responses or the presence/absence of symptoms, and where the collection of binary variables is associated with a finite set of distinct conditions, such as group membership, treatment condition, or discrete levels of an independent variable. The goal is to let scholars spell out very precise hypotheses that (1) areunadulterated reflections of their theory, (2) provide exceptional theoretical nuance, (3) formally accommodate substantive heterogeneity and (4) offer rigorous and strong quantitative diagnosticity.
Bayesian hypothesis testing for Gaussian graphical models: Conditional independence and order constraints
2020, Journal of Mathematical Psychology
Gaussian graphical models (GGM; partial correlation networks) have become increasingly popular in the social and behavioral sciences for studying conditional (in)dependencies between variables. In this work, we introduce exploratory and confirmatory Bayesian tests for partial correlations. For the former, we first extend the customary GGM formulation that focuses on conditional dependence to also consider the null hypothesis of conditional independence for each partial correlation. Here a novel testing strategy is introduced that can provide evidence for a null, negative, or positive effect. We then introduce a test for hypotheses with order constraints on partial correlations. This allows for testing theoretical and clinical expectations in GGMs. The novel matrix- $F$ prior distribution is described that provides increased flexibility in specification compared to the Wishart prior. The methods are applied to PTSD symptoms. In several applications, we demonstrate how the exploratory and confirmatory approaches can work in tandem: hypotheses are formulated from an initial analysis and then tested in an independent dataset. The methodology is implemented in the R package BGGM.

View all citing articles on Scopus

^☆: The R package multinomineq can be installed from https://github.com/danheck/multinomineq/. Data and R code for the analyses are available at the Open Science Framework at https://osf.io/xv9u3/.

¹: The first author was supported by the research training group Statistical Modeling in Psychology (GRK 2277), funded by the German Research Foundation (DFG).

²: The second author was supported by the National Science Foundation, United States (grant SES 14-59866) and the National Institute of Health, United States (grant K25AA024182).

View full text

Multinomial models with linear inequality constraints: Overview and improvements of computational methods for Bayesian inference☆

Highlights

Abstract

Introduction

Section snippets

Multinomial models with linear inequality constraints

Bayesian inference using the inequality representation

Bayesian inference using the vertex representation

The R package multinomineq

Discussion

Computational Geometry

Journal of Mathematical Psychology

Computers & Graphics

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Mathematical Social Sciences

Cognitive Psychology

Journal of Mathematical Psychology

Statistics & Probability Letters

Journal of Mathematical Psychology

Mathematical Social Sciences

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Computational Statistics & Data Analysis

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Journal of Mathematical Psychology

Automatica

Computers & Operations Research

NeuroImage

Mathematical Social Sciences

Computational Statistics & Data Analysis

Bayesian inference for categorical data analysis

Statistical Methods & Applications

Computing convex hulls and counting integer points with polymake

Mathematical Programming Computation

Statistical inference under order restrictions: Theory and application of isotonic regression

Bayesian strategy assessment in multi-attribute decision making

Journal of Behavioral Decision Making

An introduction to convex polytopes

Which error story is best?

Journal of Risk and Uncertainty

A model-based test for treatment effects with probabilistic classifications

Psychological Methods

Porta - polyhedron representation transformation algorithm

Non-uniform random variate generation

Data analysis using Stein’s estimator and its generalizations

Journal of the American Statistical Association

Frequently asked questions in polyhedral computation

Bayesian analysis of constrained parameter and truncated data problems using Gibbs sampling

Journal of the American Statistical Association

Objective priors: an introduction for frequentists

Statistical Science

Some do and some don’t? Accounting for variability of individual difference structures

Psychonomic Bulletin & Review

A caveat on the Savage-Dickey density ratio: the case of computing Bayes factors for regression parameters

British Journal of Mathematical and Statistical Psychology