Abstract
The voter model is a simple agent-based model to mimic opinion dynamics in social networks: a randomly chosen agent adopts the opinion of a randomly chosen neighbour. This process is repeated until a consensus emerges. Although the basic voter model is theoretically intriguing, it misses an important feature of real opinion dynamics: it does not distinguish between an agent's publicly expressed opinion and her inner conviction. A person may not feel comfortable declaring her conviction if her social circle appears to hold an opposing view. Here we introduce the Concealed Voter Model where we add a second, concealed layer of opinions to the public layer. If an agent's public and concealed opinions disagree, she can reconcile them by either publicly disclosing her previously secret point of view or by accepting her public opinion as inner conviction. We study a complete graph of agents who can choose from two opinions. We define a martingale M that determines the probability of all agents eventually agreeing on a particular opinion. By analyzing the evolution of M in the limit of a large number of agents, we derive the leading-order terms for the mean and standard deviation of the consensus time (i.e. the time needed until all opinions are identical). We thereby give a precise prediction by how much concealed opinions slow down a consensus.
Export citation and abstract BibTeX RIS
Original content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
1. Introduction
The voter model, introduced in the 1970s [1, 2], has became a paradigmatic model for the theoretical study of opinion dynamics [3, 4]. It describes the emergence of an ordered state (e.g. a consensus) through the local interactions of individual agents (e.g. voters). This phenomenon occurs in various contexts. Therefore, the original voter model and its extended versions have been broadly applied not only in social and political sciences [4, 5], but also in chemistry (e.g. in the study of catalytic reactions [6]) and biology (e.g. for modelling ecological competition [7–9] and prey-predator interaction [10]).
Most voter models share the features that
- (i)each agent is in one state (e.g. has one particular opinion) out of two alternatives and
- (ii)the state can only be changed through pairwise interactions between agents.
(For exceptions from rules (i) and (ii), see [11] and [12], respectively.) In each interaction, a focal agent is selected at random together with a randomly chosen adjacent agent. The graph that describes the adjacency can be complete, regular (e.g. a square lattice) or more complex (see examples in [13] and [14]). The focal agent can adopt or reject the opinion of the adjacent agent. In the original version of the voter model, which we call the basic voter model (BVM), the rule of interaction is simple: the focal agent must adopt the neighbour's opinion.
The BVM on a finite, strongly connected graph (i.e. a graph in which there is a directed path between every pair of agents) inevitably reaches a global consensus [15] (i.e. an absorbing state in which every agent holds the same opinion). Various versions of the model have applied more complex rules [4], some of which may not guarantee that all agents ultimately share the same opinion. However, in this paper we only consider models that must end in a consensus. In this context, an important question is how long it takes until all agents agree. The consensus time (also called exit time, hitting time or first-passage time) depends on the rules of interaction and graph structure [16].
In this article, we introduce a new kind of model, the concealed voter model (CVM), which differs from the BVM by distinguishing between an agent's publicly expressed opinion and her inner conviction about the particular subject. For this reason, we add a second, concealed layer of opinions to the public layer (figure 1). The duality between inner conviction and publicly expressed opinion is an important phenomenon in every society (see examples in [17]). To our knowledge, the first theoretical model that explicitly considered this duality was the 'partisan' voter model [18, 19]. Every agent in that model is a 'partisan' with a fixed and innate preference for one of the two opinions, which influences the agent's publicly expressed opinion. Thus, the partisan model distinguishes between two layers, but the interaction between the layers is only unidirectional: the internal opinion influences the external one, but the partisan voter model does not contain any feedback mechanism from the external to the internal opinion. The main novelty in the CVM is that it permits a bidirectional interaction between both layers.
We compare the mean and the standard deviation of the consensus time in the BVM and CVM. Thus, we investigate whether the existence of concealed opinions increases the coexistence time of alternative opinions in a group of people and, if yes, to what extent. In real societies, concealed opinions are ubiquitous on many kinds of issues [17], for example on political votes, debated social norms or consumption habits. Understanding the composition of opinions in the hidden layer, and how it influences the public, is important for making reliable predictions about collective opinion formation.
2. The basic and the concealed voter model
Let N denote the number of agents in the system. In the BVM, the state of each agent α at time t is defined as her publicly expressed opinion . The subscript 'ext' emphasizes that this is an external opinion in the sense that any adjacent agent can learn that α's opinion is . In our model, t is continuous and opinions are updated asynchronously (see [20] for a discussion of differences between voter models with synchronous and asynchronous updates). The opinion is binary. We denote the options as red (R) or blue (B). The agent updates her opinion by selecting one of the adjacent agents, say β, randomly and copying β's opinion . The time intervals between two consecutive copying events by agent α are exponentially distributed with a rate c. All time intervals are independent of each other so that the times between two successive copying events in a system composed of N agents is exponentially distributed with a rate .
In the CVM, the state of each agent is given by a pair of states , where is the external and the internal opinion (figure 1). We denote the two possible external opinions with a capital letter R or B, whereas the lower-case letter r or b stands for her internal opinion. The agents' four possible states are denoted as Rr, Rb, Br and Bb. The rules of updating in the external layer are the same as in the BVM. An additional process, that is specific to the CVM, is introspection. When (i.e. there is a discordance between the external and the internal opinion), the agent can relax it by either of two processes: externalization, whereby the formerly concealed internal state becomes public; or internalization, when the publicly voiced opinion becomes an internal conviction. Altogether, there are three competing processes in the system: copying, externalization, and internalization. The time intervals between two consecutive events of each kind are independent and exponentially distributed with rates , and , respectively, in the group of N agents. Copying other agents' opinions promotes consensus within the external layer whereas externalization and internalization advance consensus between the two layers.
In this article, we study the global outcome of these processes for large N in complete graphs. Because complete graphs are strongly connected, the agents must reach a consensus if N is finite. We include the reflexive relation (i.e. there is a loop from an agent to herself) for mathematical convenience. For large N, the difference between the dynamics with and without the self-loop is negligible. In the BVM, we describe the state of the system at any given time t by the fraction of agents whose opinion is red in the only (i.e. external) layer. In the CVM, we describe the state by (i) in the external layer, (ii) the fraction of red opinions in the internal layer, and (iii) the fraction of agents whose opinion is red in both layers. We study the time evolution of these observables and examine the consensus time (i.e. the first time when only a single opinion is present in both layers).
In section 3, we review some published results about the BVM concerning the main statistical properties of the consensus time. In order to investigate the same characteristics in the CVM, we first introduce an intermediate model in section 4 (the two-layered voter model), in which we also assume the existence of two connected layers. However, unlike in the CVM, the event is independent of so that the equations are easier to solve. Finally, we show in section 5 that the CVM equations can be solved similarly.
3. The basic voter model (BVM) on the complete graph
The mean consensus time of the BVM on the complete graph has been the subject of earlier publications (e.g. [13, 21]). Here we briefly review these results and also state the equation for higher moments of the consensus time distribution. We include this review because the BVM acts as a base case for comparison with the CVM and motivates the analytic techniques we apply below.
We describe the state S of the system by the fraction of agents whose opinion is red. The probability that the next copy increases by equals the product of the probability that the copying agent's opinion is blue and the probability that the copied agent holds the red opinion. Because copying events happen with rate cN, the transition rate from to is
Similarly, the rate with which decreases to is the product of the copy rate cN, the probability that the copying agent is red and the probability that the copied opinion is blue,
A comparison between (1) and (2) reveals that both transition rates are in fact equal.
The probability that in one copy we increase or decrease by more than is zero. The diagonal element of a transition rate matrix is conventionally defined as . Therefore, the only nonzero matrix elements in the row corresponding to state are and
The relatively simple structure of allows us to verify that the state at time t is a martingale,
where denotes the conditional expectation value and u is an arbitrary nonnegative time. Equation (4) follows directly from the fact that for all . If we start a series of Monte Carlo simulations always from a fixed initial fraction S(0) = s0 of red agents, the martingale property (4) manifests itself as follows. As we increase the number of simulations, the sample mean of —averaged over different simulations, but at a fixed t—converges to s0 although each individual run is likely to differ from s0 [22].
We denote the consensus time by . For every realization, we can tell whether is true without having to know any of the states at times >t. Therefore, in the parlance of stochastic processes, is a 'stopping time'. Because we also know that S is a martingale with time-independent lower and upper bounds 0 and 1, we must have (see [23]). Moreover, the only two possible states at are either (i.e. a red consensus) or (i.e. a blue consensus) so that the probability of a red consensus is given by the expected initial fraction of red agents. In the special case where all simulations start with S(0) = s0, we reach a red consensus with probability s0.
How long does it take on average to reach a consensus from s0? If we denote the expected consensus time by
where the superscript '(1)' indicates that the left-hand side is the first moment of the conditional consensus time distribution, then must satisfy [23]
Inserting (1)–(3) into (5), we obtain
We approximate by a smooth function and substitute its Taylor expansion into (6). After dropping terms O(N−3), which are negligible if , we obtain the differential equation
We can generalize this approach to higher moments of the consensus time. Let denote the nth moment of the consensus time conditioned on the initial state s0,
With the method described in section 1.6.2.2 of [21], one can obtain the following generalization of (7),
For the second moment, in particular, we find the following result that we have not seen explicitly stated in the previous literature,
where is the dilogarithm.
Our objective is to compare the mean and the standard deviation
of the BVM consensus time with those of the CVM. We will derive that, for , the CVM's mean and standard deviation differ by a factor that depends on the rates of copying, externalization and internalization. We will give the explicit equation (31) for this factor below.
4. An intermediate, two-layered voter model (TLVM)
4.1. Motivating and defining the TLVM
The CVM is more complicated than the BVM because we need more than one variable to describe its state. The agents can have four possible combinations of external and internal opinions (Rr, Rb, Br and Bb). Because the sum of agents in these four states is constrained by the total number N of all agents, the state space in the CVM is three-dimensional. We can, for example, choose the following set of variables to uniquely characterize the current state,
- : fraction of agents whose external opinion is red (i.e. Rr and Rb agents),
- : fraction of agents whose internal opinion is red (i.e. Rr and Br agents),
- : fraction of Rr agents.
In the CVM, the events (i.e. agent α's external opinion at time t is red) and (i.e. her internal opinion is red) generally depend on each other. That is, . As we will show below, we can calculate the leading-order term of the CVM consensus time for , but the inequality complicates the solution. For this reason, we will first solve a simplified model where we ignore the dependence between the external and internal layer, i.e. we set . We include the simpler model in this article because it demonstrates, with fewer intermediate steps, the analytic techniques that we will apply later to the CVM. Although the model does not yet fully capture the CVM dynamics, we will still be able to transfer some of the results directly to the CVM.
As in the CVM, we still distinguish between an external and an internal layer of opinions and, therefore, call this model the Two-Layered Voter Model (TLVM). We keep two of the CVM's features.
- Each agent copies the external opinion of a random agent with rate c.
- There is no direct opinion exchange in the internal layer.
We can impose the condition by introducing links between all N2 pairs formed by one external and one internal opinion.
- With rate , every pair externalizes (i.e. the internal opinion becomes the external opinion).
- With rate , each pair internalizes.
We divide e and i by N so that the mean time between externalization and internalization events is equal in the TLVM and CVM. We list all transitions in the TLVM together with their rates in table 1.
Table 1. Transitions from the state in the TLVM and their rates.
New state | How is the new state reached? | Transition rate matrix element |
---|---|---|
A B agent copies a neighbour with external opinion R or a Br pair externalizes. | ||
An R agent copies a neighbour with external opinion B or an Rb pair externalizes. | ||
An Rb pair internalizes. | ||
A Br pair internalizes. | ||
Negative sum of all rates above. |
Similar to the BVM, the TLVM possesses a martingale, albeit a slightly more complex one. We define the function
which maps the two-dimensional input onto a real number in . We can view m as the relative proportion of the red opinion that is present in the combination of the external and internal layer, prorated by the weights and , respectively. Let SR(t) be the random variable that equals the fraction of R agents4. Similarly, Sr(t) equals the fraction of r agents. We show in lemma A.1 in the appendix that is a martingale. In the appendix, we also prove a corollary of this lemma (corollary A.2): if the initial state is and , then the probability of reaching a red consensus is . As we will see shortly, is a convenient summary statistic to derive the time needed to reach a consensus.
4.2. The consensus time in the TLVM
Suppose we are in the state , at time t. What can we infer about the state at a later time t + u? In appendix B, we prove the following property (theorem B.1): after a transient of duration O[(e + i)−1], SR(t + u) approximately equals Sr(t + u). If e and i are independent of N, the relative error we commit by the approximation is negligible if because the consensus time turns out to be (see equation (13)). In other words, we can separate two time scales. On a fast time scale, the expected value of quickly decays to zero. On a slow time scale, the system diffuses along the line towards one of the absorbing states. In figure 2, we show a typical run of the TLVM that confirms this conjecture. We calculated the data shown in figure 2 and all other numerical results in this article with the Gillespie algorithm, an exact implementation of the stochastic dynamics [24].
Download figure:
Standard image High-resolution imageBecause the TLVM spends most of the time in states with , we can approximately describe the dynamics in terms of the one-dimensional random variable , where m is the function defined in (12). In this simplified picture, we use the symbol for the transition rate matrix. The nonzero elements of are
which we can derive by substituting (12) into the transition rates in table 1.
In analogy to the BVM, we define the mean consensus time in the TLVM as
It must satisfy the equivalent of (5),
for all with and . As in the BVM, we Taylor expand to second order and obtain the differential equation
The solution, subject to the boundary conditions , is
The calculation of the higher moments of the consensus time
is similar to (10) in the BVM. Induction on n shows that
Because , the dynamics of the external and internal opinions both behave like BVMs, but with a time that is rescaled by a factor
In particular, the standard deviations of TLVM and BVM are related by
In figure 3, we show, as a numerical confirmation of (13) and (15), the mean and standard deviation of the TLVM consensus time obtained from Monte Carlo simulations. The numerical results are in excellent agreement with the theoretical predictions (i.e. the finite size effect at N = 1000 does not modify the results substantially).
Download figure:
Standard image High-resolution imageThere are four main conclusions from the TLVM that we will be able to transfer to the CVM. (1) Although the state space is two-dimensional, the system spends most of the time near the one-dimensional manifold . (2) We can express the position along this manifold in terms of a function m that has the property that is a martingale. (3) After expressing the transition rate matrix in terms of m, we can derive the moments of the TLVM consensus time distribution. (4) The TLVM moments are related to those of the BVM by for a scale factor that is independent of n. We will encounter similar rules in the CVM.
5. Solving the concealed voter model (CVM)
5.1. Short-term evolution of the states in the CVM
In the CVM, we denote the random variables that map a stochastic configuration at time t to the fractions , and (defined in section 4.1) by SR(t), Sr(t) and SRr(t) respectively. Compared to the TLVM, the CVM not only has a three- instead of a two-dimensional state space, but the transition rate matrix also has more nonzero elements (listed in table 2). Despite the added complexity, some of the TLVM results remain unchanged for the CVM. In particular, the TLVM martingale , where m is defined in equation (12), is also a martingale of the CVM (see lemma A.1 in the appendix). Moreover, as in the TLVM, equals the probability of reaching a red consensus if the initial condition satisfies and (see corollary A.2). Curiously, m does not depend on . If we arrange the four possible combinations of external and internal opinions Rr, Rb, Br and Bb in a contingency table, the marginal frequencies and fully determine the probability that all agents ultimately agree with the red opinion. The joint distribution of internal and external opinions, which can be derived with the help of , does not add more information about the probable consensus opinion.
Table 2. Transitions from the state in the CVM and their rates.
New state | How is the new state reached? | Transition rate matrix element |
---|---|---|
A Br agent externalizes or copies a neighbour with external opinion R. | ||
An Rb agent internalizes. | ||
An Rr agent copies a neighbour with external opinion B. | ||
A Bb agent copies a neighbour with external opinion R. | ||
An Rb agent externalizes or copies a neighbour with external opinion B. | ||
A Br agent internalizes. | ||
Negative sum of all rates above. |
Besides identical martingales, the TLVM and the CVM also have equations (B.1)–(B.4) in common (see theorem B.1 in the appendix). We can deduce from (B.1)–(B.4) that, after a transient of duration O[(e + i)−1], the state of the CVM satisfies . If e and i are independent of N and , the transient is negligible compared to the consensus time so that we can approximate the dynamics of the CVM with a simplified two-dimensional state space. Upon setting , the matrix elements in table 2 become
With this two-dimensional approximation of , we can deduce the short-term evolution of SRr (see theorem B.2 in the appendix): after a transient that lasts no longer than O[(c + e + i)−1], the states in the CVM satisfy and . We show a typical trajectory of a CVM simulation in figure 4 that confirms this approximation.
Download figure:
Standard image High-resolution image5.2. The consensus time in the CVM
Figure 4 demonstrates that we can neglect the transient (i.e. the initial approach towards the black curve) compared to the consensus time. We can thus approximate the CVM dynamics by a one-dimensional model whose transition rate matrix follows from setting in (16)–(22),
We use a double tilde in the symbol for the transition rate matrix to express that (23)–(25) are approximations of , which in turn is an approximation of the exact CVM transition rate matrix given by table 2. The relative error that we introduce with these approximations is negligible if .
With the one-dimensional approximation , the derivation of the mean consensus time is now analogous to that of the TLVM in section 4.2. We call the mean CVM consensus time
so that
if and
Upon inserting (23)–(25) into (26) and taking the continuum limit, we obtain the differential equation
The solution, subject to the boundary condition (27), is
Comparing (28) with the BVM mean consensus time in (8), we find that
which generalizes to the higher moments as
It follows that the standard deviation of the consensus time obeys
where can be calculated from (10) and (11). Monte Carlo simulations of the CVM are in excellent agreement with equations (28) and (30), see figure 5.
Download figure:
Standard image High-resolution imageEquation (29) has the remarkable consequence that the CVM consensus time distribution differs from that of the BVM only by the prefactor
We plot the function in figure 6. Interestingly, for all c, e and i for the following reason. Because c, e and i are rates and hence positive numbers, we must have
Download figure:
Standard image High-resolution imageNext we add to the last two terms in the inequality,
Hence, the denominator in (31) is positive and smaller than the numerator, proving . By contrast, the corresponding TLVM scale factor , given by equation (14), can be larger or smaller than 1.
6. Discussion
Having concealed opinions is ubiquitous on various subjects from politics to personal habits. An agent may choose to misrepresent her privately held opinion when it appears to be socially unacceptable. For example, the agent may disapprove of the latest fashion trend, but still adopt it just to blend in with her acquaintances. Kuran [17] argues that such preference falsification can slow down changes in social norms. The CVM is a simple model to investigate whether the existence of concealed opinions indeed prolongs the consensus time and, if yes, to what extent.
Our study is related to earlier models on 'partisan' voters [18, 19]. Partisans have an innate and fixed preference for one of the opinions. In the limit , the CVM resembles the partisan model: although agents externally accept other opinions, the internal layer never changes. Models with 'zealot' agents (also called stubborn or inflexible agents [25–27]) are even more restrictive. Zealots are agents that never change their opinions. In these models, typically only a small number of agents are zealots, but even as few as two zealots with opposite opinions are enough to prevent a consensus. In the limiting case and , the CVM corresponds to a zealot model so that the consensus time goes to infinity. This explains the high values of in the front left corner of figure 6.
The partisan model permits the flow of opinions from the internal to the external layer, but only in this direction. In the CVM, the flow is bidirectional (provided that i > 0 and e > 0) so that agents can hold contrarian opinions for long but not infinite times. The longer an agent has been holding an external opinion, the more likely it is that she has adopted it as an internal one. Thereby, the CVM takes into account Kuran's remark that social norms can become genuinely accepted simply because they have persisted for a long time.
A characteristic feature of the CVM is that, for all externalization and internalization rates, the consensus is on average slower than in the BVM. Masuda et al [18] have shown that one way to slow down the convergence to consensus in the BVM is the introduction of heterogeneous copying rates. In their model, each agent k has a different copying rate ck chosen from a heavy-tailed probability distribution. The CVM gives an alternative mechanism for the prolongation of the consensus time. It keeps the assumption of homogeneity as in the original BVM (i.e. neither c nor e nor i depend on the agent). The deceleration in the CVM is instead caused by the requirement to reconcile the two layers with each other. The process that leads to reconciliation is fairly complex. Even when the external layer has reached an apparent consensus, the alternative opinion may lurk in the internal layer. This model feature in itself prolongs the consensus time. However, the main reason for the slower consensus is that the lurking opinion can come to the fore by externalization and then spread by copying, returning the system to a state of discord in the external layer.
In spite of this complexity, we were able to find some simple relations for a complete graph if . We defined a martingale M that determines the probability that the agents eventually agree on one opinion ('red') rather than the alternative ('blue'). By analyzing the stochastic dynamics of M, we obtained the leading-order terms for the mean and standard deviation of the consensus time. Notably, the consensus times in the CVM only differ from the BVM by a factor that is independent of N. It would be an interesting task for future research to study how different network topologies (e.g. Erdős–Rényi graphs or spatial lattices) or heterogeneity in the rates c, e, and i change the consensus time in the CVM.
Acknowledgments
We are grateful to Károly Takács and Zsuzsanna Szvetelszky for exciting discussions about the topic and helpful comments on this manuscript. The work was supported by the European Commission (project number FP7-PEOPLE-2012-IEF 6-4564/2013, M T Gastner), the European Research Council under the European Unions Horizon 2020 research and innovation programme (grant agreement No 648693, M Gulyás) and NKFIH-OTKA (grant agreements K109215 and K124438, B Oborny, and K112929, M Gulyás).
Appendix A. Derivation of the probability of a red consensus
We want to calculate the probability that all agents finally agree on the red opinion. The crucial step is to find a suitable martingale.
Lemma A.1. The random variable is a martingale of both the TLVM and CVM. That is, M satisfies
for any , where is the state at time t,
Proof. Let be the transition rate matrix or , respectively. Because S is a Markov chain with transition rate matrix , we must have
where the right-hand side is the th element in the matrix . Hence,
where is the identity matrix and m is defined by (12). Next we take the time derivative of (A.4),
If we can show
for all states in the TLVM and in the CVM, then is independent of u so that (A.1) must be true. We can verify (A.5) by substituting the elements of or from tables 1 or 2, respectively, into (A.5). □
The probability of a red consensus now follows from the following corollary.
Corollary A.2. If the initial state is
then the probability of reaching a red consensus is , where m is defined by (12).
Proof. The time until the consensus is reached is a stopping time and M is a martingale with time-independent upper and lower bounds, namely 0 and 1. Thus, . Because can only be 0 (blue consensus) or 1 (red consensus), we must have . □
Appendix B. Derivation of the short-term evolution of the TLVM and CVM
In this appendix, we derive the conditional mean and variance of a future state shortly after the system was in state S defined by (A.2).
Theorem B.1. We define in the TLVM and in the CVM. In both the TLVM and the CVM, the conditional expectation of a future state after a time is given by
independent of N, whereas the conditional variances satisfy
Therefore, if , the exponential decay in (B.1) and (B.2) is much faster than the increase in the variances.
Proof. We outline the proof of (B.1). Equations (B.2)–(B.4) can be derived similarly.
From (A.3), it follows that the conditional expectation is
where is the transition rate matrix or from table 1 or 2, respectively. The summation in (B.5) is over all states in the TLVM or in the CVM. Induction on k proves that
for Substituting (B.6) into (B.5), we obtain
□
Our next result characterizes the short-term evolution of SRr in the two-dimensional approximation of the CVM defined by the transition rate matrix given by equations (16)–(22). Because in this approximation , it is sufficient to characterize the state by two values: the value of m defined in equation (12) and the value of assumed by the random variable SRr.
Theorem B.2. The conditional expectation of SRr satisfies
Thus, in the limit of large N, the conditional expectation approaches after a transient of duration O[(c + e + i)−1]. The conditional variance satisfies
so that, during the transient, the increase in the conditional variance is negligible if .
Proof. One can show by induction on k that
for all nonnegative integers a and For the special case a = 0, it follows that
Rearranging the terms in (B.8) proves (B.7). A similar argument proves that the conditional variance is . □
Footnotes
- 4
SR should not be confused with . SR is the function that maps a stochastic configuration of opinions to the fraction of red external opinions. Thus, SR is a random variable whereas is a number between 0 and 1.