1 Introduction

Interest in understanding and explaining the nature and causes of comparative health outcomes across and within countries and by socioeconomic and demographic sub-group has expanded dramatically during the last three decades. The rapid increase in the use of individual and household population surveys for collecting socioeconomic, demographic and health data has given rise to self-rated health becoming one of the most frequently utilised health measurements in social science research. Its rise is not only due to the financial and logistical practicalities associated with the collection of biomarkers or other clinically oriented health measures but also reflects its predictive association with mortality (Idler and Benyamini 1997; De Salvo et al. 2005; Doiron et al. 2015) and its potential comprehensiveness as a measure of health (Jylha 2009). In 1996, the World Health Organization (WHO) recommended the employment of self-rated health as a principal tool for monitoring the health and quality of life of the population (Health interview 1996) and self-rated measures are now widely used in Health Technology Assessments to inform the allocation of health resources. In this context it is therefore important that the properties of self-rated health data—both in general and for particular populations—are well understood.

The typical population survey question on self-rated health (hereafter SRH) asks respondents: “In general, how would you rate your health?—very good, good, average, bad or very bad”. Results from these surveys have been used to examine population health patterns in relation to income (Frijters et al. 2005), socioeconomic status (Contoyannis et al. 2004), social capital (O’Doherty et al. 2017), retirement (Eibach 2015) and the life-course (Bauldry et al. 2012). It is assumed that the underlying components that inform SRH somehow proxy ‘true’ health by reflecting the presence (or absence) of, physical and mental illnesses, disabilities or constraints in physical functioning, pain and discomfort, and other medical conditions. Yet, in reality, responses to SRH questions are a product of subjective, cognitive-emotional processes that are specific to prevailing social, economic and cultural settings. That is, in self-rating health in this ordinal manner, an individual’s response is informed by myriad factors, other than their true health, including education, environment, gender relations, social networks, labour market status, the presence or absence of long-term chronic conditions and culture. If this is so, then SRH is as much a health-related social construct (Kaplan et al. 1976) as it is a measure of ‘true’ underlying health. This is not to deny that SRH is linked in important ways to underlying physiological and psychological states but simply to contend that an individual’s comprehension of these states is itself a function of socioeconomic and demographic characteristics, health behaviours and expectations, and national, neighbourhood and peer comparisons.

It is because of these internal evaluative processes and the context specific frames of reference within which they occur that SRH necessarily incorporates an unknown element of subjectivity which, if non-random and variable across distinct sub-groups, could give rise to systematic biases when comparing SRH across time, place and population sub-group. This phenomenon, of systematic differences in SRH, according to population markers (e.g. gender, religion, age, region, education level) is known as “reporting heterogeneity” (Shmueli 2003; Lindeboom and van Doorslaer 2004). It captures the idea that, for a given true underlying but unobserved health state, individuals self-rate their health in ways that are systematically conditioned by observed or unobserved characteristics. If this is the case, then two individuals—with the same underlying true health states—may self-rate their health differently.

While this form of reporting heterogeneity may have limited consequences for tracking an individual’s health over time, the consequences are much more serious for aggregated comparisons. For example, if poorer respondents, with otherwise identical medical conditions, report higher levels of SRH than richer respondents, then the true level of health inequality will be underestimated (Bago D’Uva et al. 2008; Murray et al. 2003). Equally, cross group comparisons of SRH, that fail to account for context, may be misleading as well as overlooking aspects of health and functioning that respondents themselves would prioritise (Hahn 1999). Therefore, if subjective health measures are used to inform the allocation of health resources, it is essential that reporting heterogeneity and its relationship with the measurement and determinants of SRH are well understood (Cubí-Mollá, et al. 2017).

In testament to its importance, a burgeoning literature providing empirical evidence of reporting heterogeneity has now emerged. In an American study, Idler (1993) found that elderly respondents persistently overestimate their health compared to other age groups. Lindeboom and van Doorslaer (2004) argued that, in undertaking self-assessment, respondents situate themselves alongside a particular reference group defined by age, gender and education. Similarly, Dowd and Zajacova (2010) found evidence of systematic income-based differences in SRH which were inconsistent with respect to objective biomarkers. Van Doorslaer and Gerdtham (2003), Lindeboom and van Doorslaer (2004), Etilé and Milcent (2006) and Bago D’Uva et al. (2008) present evidence that cultural and semantic understandings, as well as socioeconomic and demographic characteristics (e.g. age, education, gender, income), influence reporting heterogeneity.

As this literature was emerging, it began to draw influence from another strand of health research that related to the broader social and environmental determinants of health (WHO 2008). In this spirit, Etilé and Milcent (2006) and Bago D’Uva et al. (2008) argue that, independently of health, reporting behaviour is related more broadly to the environment in which individuals live. The former use information on regional mortality rates to capture local conditions, while the latter argue that the self-rating of health may be lower for individuals “living within a community in which a large proportion of the population suffers poor health” and, moreover, that an individual’s willingness to tolerate ill-health is itself related to variations in access to effective health care. This captures the spirit of the work of Hildebrand and Van Kerm (2009) who examine the relationship between regional income inequality and individual self-rated health across a number of European Union countries.

The existing research has explored reporting heterogeneity for a range of developed and less-developed countries. Among the former are Germany (Schneider et al. 2012), France (Etilé and Milcent 2006), the UK (Hernández-Quevedo et al. 2004), Canada (Linderboom and van Doorslaer 2004) and, through a cross-country analysis, 10 European countries (Jürges 2007). Among the latter, Bago D’Uva et al. (2008) examine data for Indonesia, India and China. Russia, with its vast territory, geographical location between Europe and Asia, large multi-ethnic population, high level of socioeconomic inequality and variation in regional healthcare provision, represents an interesting case for studying reporting heterogeneity. To date, we have found only two Russia-based studies in this field, both published in Russian. The first (Kuchakov and Raskina 2017) uses anchoring vignettes, while a more recent paper (Kaneva and Baidin 2018) adopts an approach more similar to ours but does not attempt to capture the impact that regional environmental factors (e.g. regional socioeconomic characteristics) have on individual reporting heterogeneity. Yet, there is also evidence emerging of the relationship between regional level characteristics and individually reported outcomes for Russia. Carlson (2005) found that “regional health differences among men are in part explained by regional income differences” (p. 389). More recently, Lyytikainen and Kemppainen (2016) concurred, finding that persistent regional differences in self-rated health across Russia could not be explained by individual level differences. Watson et al. (2013), analyzing socio-economic determinants of obesity, noted that there are significant correlations between individual subjective health status and physical environments, including morbidity.

In this paper we therefore seek to extend the existing literature in two main ways. First, following Lindeboom and van Doorslaer (2004), we provide an evaluation of the nature of reporting heterogeneity in Russia. Second, following Etilé and Milcent (2006) and Bago D’Uva et al. (2008), we incorporate proxies for the individual’s local social and economic environment on the basis that the latter may influence self-rating behaviour in systematic ways. In doing so, we draw out the vital connections between the voluminous literature that examines the link between social and environmental determinants of individual health (e.g. WHO 2008; Hilderbrand and van Kerm 2009; Marmot et al 2012) and the literature, referred to above, that seeks to distinguish true health effects from reporting effects. Our empirical results provide evidence that self-rating behaviour is systematically influenced by individual characteristics but also by regional socioeconomic characteristics. In contrast to other literature, we find that all of these effects are broadly similar across gender. While identifying the precise mechanisms linking aggregate level variables with either individual health outcomes or health reporting behaviour is beyond the scope of this paper, we suggest that both the relative income mechanism and a mechanism linking the cognitive process of choice with the sense of optimism that individuals’ have for the local institutional environment are both plausible points of departure for future research.

The paper proceeds as follows. Drawing on Lindeboom and van Doorslaer (2004), in Sect. 2, we briefly summarise the formal approach for estimating and testing the presence of reporting heterogeneity in SRH and outline our identifying assumptions. Section 3 describes the Russian data that we use and presents the objective disease index employed in our identification strategy. Section 4 describes the results, which are then discussed in greater depth in Sect. 5. Section 6 concludes by identifying important policy implications and areas in which this research can be developed.

2 Methods: Estimating Reporting Heterogeneity

The essential logic underpinning this research is that, observed changes or differences in SRH between individuals could be due to changes in underlying ‘true’ health, or (and) could result from a measure-specific variation, holding true health constant. It is the latter variation which is referred to as reporting heterogeneity (Shmueli 2003; Gerry and Baidin 2017). The challenge of distinguishing these changes from fluctuations in true underlying health is well-established and there are various methods available. The WHO has advocated the use of anchoring vignettes (Tandon et al. 2003; Bago D’Uva et al. 2008; Kuchakov and Raskina 2017) which involve respondents in subjectively rating the health of the same set of hypothetical individuals. The responses to these vignettes are then used to anchor the respondents’ self-rating of the components captured by the vignettes. In effect, by enabling inter-respondent comparability, this approach seeks to disconnect reporting behaviour from the underlying health concepts under investigation. As an alternative Lindeboom and van Doorslaer (2004) proposed a statistical model that facilitates the disaggregation of observed differences in SRH into (i) variation in underlying health and (ii) variation in reporting behaviour. In this approach, an objective health measure (rather than vignettes) anchors the responses and allows for cross group comparison. The major complication revolves around the identification problem. Lindeboom and Van Doorslaer (2004) and Etilé and Milcent (2006) provide excellent and detailed accounts of the issues around identification and so, below, we provide a summary outline of the model drawing attention to our key identification assumptions.

Taking a typical ordinal SRH question as our point of departure, define HS and H* as individual SRH and true underlying (but unobserved) individual health. For simplicity, we treat H* as a single (unobserved) health index and argue below that there are certain observable health measures which can be used to construct a direct proxy for this unobserved index. First, in estimating SRH, we define an ordered probit model for the categorical variable HS and its correspondence with the latent variable H* as:

$${H^S} = j \Leftrightarrow {c_{j - 1}} < H* \leqslant {c_j},\;j = 1, \ldots ,n,$$
(1)
$$H^* = f\left( {{H^0};\alpha } \right) + X_2^{\prime}{\beta_2} + {\varepsilon_2}.$$
(2)

where H* is a function of X2, a set of factors affecting true unobserved health (e.g. environmental, medical and genetic factors) and H0 is the unobserved initial health level. It follows from (2) that, if X2 is empty, H* will correspond to the initial true health H0.

The ordered probit model rests on the proportional odds (PO) assumption which means that, for each variable estimated in the model, the 'slope' estimates between each pair of outcomes across two response levels (e.g. SRH categories) are assumed to be the same regardless of which partition we consider (see Kim (2003) for a detailed analysis of the PO assumption). That is, the cut points (cj in (1)), which are determined during the estimation of the model, are constant across the n categories and are independent of the factors included in X2. The set of variables in X1 do not affect true health, H*, but instead influence individual level health perceptions. As such, these become our key variables of interest in exploring reporting heterogeneity. Crucially, by relaxing the PO assumption the cut points are able to vary conditional on this set of perception influencing variables, X1, which may also overlap with X2. Thus, the formula for cj is:

$${c_j} = {g_j}\left( {{X_1};{\beta_{1j}}} \right) = X_1^{\prime}{\beta_{1j}} + {\varepsilon_{1j}},\;j = 1, \ldots ,n - 1,\;{c_0} = - \infty ,\;{c_n} = \infty .$$
(3)

Substituting (2) and (3) into (1) and then combining (1) with (3) yields the generalized ordered probit model of (4)–(6) below. This model extends the previous one by incorporating an additional set of variables, associated with cut-point shifts. That is, the β1j parameters are allowed to vary across the partitions. This allows us to distinguish between f1, in (5), which captures the reporting heterogeneity determinants of SRH and f2, in (6), which describes the determinants of true unobserved health. In combination, (5) and (6) therefore describe the relationship between true health (H*), self-rated health (HS), the factors influencing health reporting behaviour (X1, e.g. low socio-economic status) and the factors influencing true underlying health (X2, e.g. individual morbidities), with \({\varepsilon_{1j}}\) and \({\varepsilon_2}\) representing random errors.

$${H^S} = j \Leftrightarrow {g_{j - 1}}\left( {{X_1};{\beta_{1,j - 1}}} \right) - X_2^{\prime}{\beta_2} < f\left( {{H^0};\alpha } \right) + {\varepsilon_2} \leqslant {g_j}\left( {{X_1};{\beta_{1j}}} \right) - X_2^{\prime}{\beta_2}.$$
(4)
$${H^S} = {f_1}\left( {H^*,{X_1},{\varepsilon_{1j}};{\beta_{1j}}} \right),$$
(5)
$$H^* = {f_2}\left( {{H^0},{X_2},{\varepsilon_2};{\beta_2}} \right).$$
(6)

Following Lindeboom and van Doorslaer, we focus on the existence of a cut-point shift and test the null hypothesis (7) that each k factor influences each cut point cj in a similar way (so called ‘index shift’). If the hypothesis is not rejected, we are unable to identify whether observed changes in SRH are due to an index shift (i.e. the k factor belongs to X2 and reflects changes in true underlying health) or if they reflect changes arising from reporting heterogeneity (i.e. the k factor belongs to X1). In contrast, rejection of (7) indicates that the k factor effects the reporting thresholds in heterogenous ways and is therefore evidence of reporting heterogeneity (i.e. the k factor belongs to at least X1), since true underlying health is a linear function of its determinants (6) and so is only consistent with a parallel index shift.

$${H_0}:\;{k_j} = const,\;\;\;\forall j.$$
(7)

However, estimation of (4)–(6) and the testing of (7), does not entirely resolve the identification problem since, though it provides evidence of non-parallel cut-point shifts, it doesn’t allow us to disentangle these from concurrent parallel cut-point shifts. For example, a localised environmental deterioration can affect true underlying health differentially (i.e. X2, in (6)) while also provoking a generalised index shift (i.e. X1, in (5)) as all residents become less optimistic about the future. In other words, the identification problem persists if we are unable to distinguish between variables in X1, which affect reporting behaviour, and those in X2, which affect only true underlying health.

In addressing this identification problem, Lindeboom and van Doorslaer note that “In practical situations one does not observe H* but a range of more objective measures. For example, whether an individual suffers from various health problems, diseases and self-related impediments in performing a range of daily activities” (Lindeboom and Doorslaer 2004, p. 1086). They argue that these variables are relevant only to underlying health outcomes and that, conditional on these effects, all other variables capture reporting heterogeneity. In this spirit, Etilé and Milcent (2006) construct a synthetic measure of clinical health, that is assumed to have no effect on the cut points, to “decompose the effect of income on SAH into an effect on clinical health (which is called a health production effect) and a reporting heterogeneity effect”. In other words, the identification problem is resolved by including objective observable health variables in X2 which, by assumption, are not present in X1.

We mirror this approach by constructing an objective health index which we use as a proxy variable for unobservable true health H*, conditional on which, we are able to interpret the impact of other variables on HS as reporting heterogeneity. More formally, we implement (4)–(6) by including an objective disease index (\({\tilde H}^*\)~H*) as a proxy for H* and thus estimate the following generalized ordered random effects probit model (Pfarr et al. 2011) for our three category dependent variable:

$$\begin{array}{*{20}{l}} {\Pr \left[ {{H^S} = 1|{X_1},{H^*} = {{\tilde H}^*}} \right] = 1 - F\left( {{X_1}{\beta_{11}} + {{\tilde H}^*}{\beta_2}} \right)} \\ {\Pr \left[ {{H^S} = 2|{X_1},{H^*} = {{\tilde H}^*}} \right] = F\left( {{X_1}{\beta_{11}} + {{\tilde H}^*}{\beta_2}} \right) - F\left( {{X_1}{\beta_{12}} + {{\tilde H}^*}{\beta_2}} \right)} \\ {\Pr \left[ {{H^S} = 3|{X_1},{H^*} = {{\tilde H}^*}} \right] = F\left( {{X_1}{\beta_{12}} + {{\tilde H}^*}{\beta_2}} \right)} \end{array}.$$
(8)

where F is the cumulative distribution function of the normal distribution. In estimating SRH (8) conditional on a proxy for true underlying health, it follows that the remaining explanatory power of the model is that associating each k factor with a measure-specific variation in SRH (given true underlying health). That is, the coefficient for each k factor represents the SRH reporting heterogeneity attributable to this factor, while the difference in the beta coefficients, across the partitions, represents the cut-point reporting heterogeneity discussed above (Schneider et al. 2012). In addition, in the panel data framework of the Russian data, we are also able to attenuate the effects of unobserved individual heterogeneity as conventionally captured by the random effects specification of the generalized ordered probit model.

3 Data

This research draws principally on data from Phase II of the Russia Longitudinal Monitoring Survey—Higher School of Economics population survey (hereafter RLMS-HSE), yielding a nationally representative household sample from a repeated cross-section sample undertaken each autumn since 1994 (other than in 1997 and 1999).Footnote 1 The survey collects extensive information on the socioeconomic, demographic, behavioural, attitudinal and health characteristics of the Russian population.Footnote 2 We further augment these data with regional macroeconomic data, from the Russian State Statistical Agency, and public health expenditure, from The Federal Treasury, for the period 2006–2014. Our overall analysis is therefore based on a longitudinal dataset covering the period 2006–2014, corresponding to a total sample size of 113,676 (47,144 males and 66,532 females). Descriptive statistics of all the variables used in the analysis are presented in Appendix 1 and are further elaborated below.

As with all empirical work, our results are reliant on the correct specification of our estimates. The dependent variable, self-rated health—known to be a good predictor of future mortality (Bago D’Uva et al. 2008)—is based on a standard question, from the RLMS-HSE, allowing respondents to choose between five possible answers: “very bad”, “bad”, “average”, “good” and “very good”. Interestingly, the answers “very bad” and “very good” attract very few responses, with less than 2% of respondents rating their health in these categories. We therefore merge the “bad” and “very bad” categories and the “good” and “very good” categories to produce a 3-category dependent variable capturing good health (= 1), average health (= 2) and bad health (= 3).

The first subset of independent variables represents a standard set of survey-based variables in line with those used in previous theoretical and empirical studies of health, including age (Grossman 1972; Cairney 2000; Kim 2015), marital status (Bos and Bos 2007; Monden and Uunk 2013), education (Grossman 1972; Kaneva et al. 2018), (log) income (Grossman 1972; Lokshin and Ravallion 2008), settlement type (Gerry and Papadopoulos 2015) and regional location (Lyytikäinen and Kemppainen 2016). The second subset of independent variables are the regional macroeconomic indicators and are made up of (log) gross regional product, share of the labour force with tertiary education, regional morbidity (measured as the number of patients with newly diagnosed diseases per 100,000 population) and public expenditure on healthcare. While it is the case that the range of regional level indicators available is rather narrow for Russia, we focused our selection of variables on those areas that provided symmetry with the factors we know are important at the individual level (e.g. income, education, health). We draw these aggregate indicators, covering 80 Russian regions, from the Federal State Statistic Service of the Russian Federation (Rosstat) yearbooks (Regions of Russia 2017; Health in Russia 2017).Footnote 3 The inclusion of these variables marks one of our core contributions and recognises the seminal literature on the social determinants of individual health (WHO 2008; Marmot et al. 2012). This literature identifies—among other things—the crucial role played by the welfare system, the tax framework, gross domestic product and other socio-economic, cultural, political and environmental conditions in shaping individual health experiences and outcomes.

Finally, we turn to the objective disease index, for which there are several plausible approaches identified in the literature (Schneider et al. 2012; Pfarr et al. 2012). The two fundamental requirements for this index are firstly, availability of objective health variables which are relevant only to the determination of true health and; secondly, that the index is “correctly specified as a homogenous function of covariates” (Bago D’Uva et al. 2008, p.357). If these conditions are satisfied then, by including the index in the estimate of (8), we can net out the true health impact on SRH and observe reporting heterogeneity through the remaining covariates. The RLMS-HSE data contains information on six diseases diagnosed by a medical specialist: diabetes, myocardial infarction, stroke, anaemia, hepatitis and tuberculosis. The index, scaled to take values between 0 and 100, is constructed from the predicted values of a bivariate random effects panel probit regression which includes the six diagnosed diseases as independent variables regressed on a binary variable indicating whether the respondent has experienced health problems in the last 30 days.

In implementing Eq. (8) we estimate the models separately for males and females. First, prior research shows that there are substantial differences in health reporting between genders in Russia (Kaneva et al. 2018) that can bias the construction of the objective disease index. Second, the existing literature also points to systematic gender differences in reporting behaviour: Murray et al (2003) suggest that “men may be more likely to deny declines in health so that their cut-points may be systematically shifted as compared to women” (p. 710) while Schneider et al. (2012) provide quantitative estimates of the gender differences in the generalized ordered probit model for SRH.

4 Results

Before turning to the results of estimating Eq. (8), we briefly present the outcome of the bivariate random effects panel probit regressions used to generate the objective health index (Appendix 2). In line with the discussion above, we estimate these models separately for males (ind6m) and females (ind6f) over the period 2006–2014. Appendix 2 provides very strong evidence of a positive relationship between the diagnosed diseases and the recent reporting of health problems, with a higher index score for females compared with males (Appendix 1). The latter is consistent with the means for the two dependent variables, which also show females reporting less good health than males. The respective indices are also strongly significant and with the expected sign in the SRH estimates.

Turning to the results of the SRH estimates, Table 1 presents the regression results separately for males and females, with two sets of estimates in each case. In the first, category 1 (good health) is contrasted with categories 2 and 3 (average health and bad health) to produce a set of βk1 estimates which represent the impact that a unit change in the independent variable k has on the probability of being in bad or average health rather than good health. In the second, category 3 (bad health) is contrasted with categories 1 and 2 (good health and average health) to produce a set of βk2 estimates which capture the impact of a unit change in the independent variable k on the probability of being in bad health rather than average and good health. Where the coefficients are identical, we only report them in the first column of each case. Since, through the objective disease index, the estimates condition for true health, we are able to identify the way in which the explanatory variables influence the self-rating process at different points of the SRH distribution. Tables 2 and 3 present the average marginal effects for each level of SRH, depicting how the probability of choosing a particular outcome responds to a marginal change in the explanatory variable. Table 1 therefore presents the full results for both constrained and unconstrained variables while Tables 2 and 3 present the marginal effects for the variables where the coefficients are not identical.

Table 1 Random effects generalized ordered probit regression, SRH, 2006–2014
Table 2 Average marginal effects for females, 2006–2014
Table 3 Average marginal effects for males, 2006–2014

We start by presenting the results of the first group of variables (i.e. the socioeconomic and demographic variables used in related literature), before moving on to present our main variables of interest—the aggregate level variables. For both men and women, each year of additional age is associated with an approximate 1 percentage point reduction in the likelihood of reporting good health and a consequent smaller but significant likelihood of reporting bad or average health. Being unmarried decreases the likelihood of reporting good health by 2–4 percentage points, depending on whether single, divorced or widowed. Both age and marital status are associated with cut-point reporting heterogeneity, since the parallel-lines assumption is rejected and while, for age, the lower probability of reporting good health is balanced by the higher probability of reporting both average and bad health, not being married impacts particularly on the reporting of bad (rather than average or good) health.

Secondary and vocational education, compared to higher education, are strongly significant and associated, as expected, with less good SRH. The size of the effect is similar for males and females and, representing an index shift, is identical across the SRH distribution. However, we find strong evidence that having the lowest level of education is linked with cut-point reporting heterogeneity: females and males are 5–6 (4–7) percentage points less (more) likely to report good (bad) health in this category. Interestingly, females are also less likely to report average health and therefore more inclined to report bad health (7 percentage points), whereas males are more likely to report average health and therefore only 3.8 percentage points more likely to report bad health. In a similar vein to education, income drives reporting behaviour and is positively associated with good health and negatively associated with reporting bad health although in both cases the effects are roughly twice as large for males.

The regional and settlement type variables are also associated with cut-point reporting heterogeneity and in differing ways for males and females. For males, being outside of urban areas is associated with a significant increase in the propensity to report good health and a corresponding decrease in the likelihood of reporting bad health. Being outside of Moscow and St Petersburg increases the likelihood of reporting average health and substantially lowers the probability of reporting either good or bad health. For women, the likelihood of reporting average health is also significantly higher outside of Moscow and St Petersburg, while being outside of urban areas per se strongly reduces the likelihood of reporting average health; although, for good and bad health reporting, there is no significant difference between urban and rural areas. In contrast to males, females in small urban areas are more likely to report bad health.

Turning now to the aggregate level variables, we find that regional government health spending, for which the parallel lines assumption is not rejected, is associated with better SRH for males but with no significant impact on female SRH. For the remaining three aggregate variables there is evidence of cut-point reporting heterogeneity. First, increases in the level of regional morbidity are strongly linked with a lower (higher) likelihood of reporting good (bad and average) health. Second, for both males and females, higher gross regional product reduces the probability of reporting good health substantially—by 9 percentage points and 6 percentage points respectively. For females there is a corresponding 4 percentage points (2 percentage points) increase in the likelihood of reporting average (bad) health, while for males there is an 8 percentage points increase in the probability of reporting average health, but no significant impact on the probability of reporting bad health. The larger the share of the labour force with higher education, the more likely are both females and males to evaluate their health positively and, correspondingly, the lower the likelihood of reporting bad SRH. Finally, the probability of reporting good health each year, increased by between 4 and 10 percentage points per year for females and between 2 and 6 percentage points for males.

As explained in the previous section, in arguing that it is possible to identify reporting heterogeneity, we lean heavily on the reliability of the disease index. As a robustness check therefore, and within the constraints of the RLMS-HSE survey data, we repeat the above exercise with an expanded disease index incorporating three additional self-diagnosed conditions, which we consider are the least likely to be falsely self-diagnosed: oncological disease, allergies and joint disease. Information on these health conditions is only available from 2010 and therefore we have a substantially reduced sample, representing about 40% of the full sample. The results of this exercise are presented in Appendix 3 and show relatively little qualitative difference to our main Table 1 results. In particular, the sign and scale of the coefficients is reassuringly similar, and the evidence of reporting heterogeneity is substantively overlapping. We are therefore as confident as we can be that our objective disease index is a reasonable proxy for true underlying health.

5 Discussion

The empirical approach adopted in this paper seeks to identify reporting heterogeneity by controlling for true underlying health via the inclusion of an objective health index in the estimation of SRH. In an echo of the ground-breaking literature on the social determinants of health, our main empirical contribution is to further augment the estimated model of SRH by incorporating regional socioeconomic characteristics. Specifically, we argue that, it is not only individual characteristics that can impact both real underlying health and self-reported health, but that aggregate, environmental characteristics also have this attribute. To our knowledge, this fact has been overlooked to date in the reporting heterogeneity literature. However, the logic is powerful; for example, a large increase in public health spending may directly affect the health of those benefitting from the expenditure but it is likely also to have an indirect reporting effect through its impact on the confidence that the population have in the public health environment.

We argue that there are at least three broad channels through which an ‘aggregate’ reporting effect might operate. First, is the ‘immediate effects’ channel, which mirrors the manner in which individual characteristics, such as income, may impact on reporting. For example, reductions in pollution, or improvements in weather (like increases in personal income) can immediately lift an individual’s mood in ways that impact upon their SRH. Second, the ‘communication channel’ captures the impact that interaction with third parties can have on self-perceived health. For example, working in an environment in which co-workers are known to be unwell, impacts—through the cognitive reorientation of risk—on the individual’s self-assessment. Finally, the ‘relative position’ channel captures the impact that changes in a comparative position can have on an individual. For example, relocating a hypothetical individual from a high-income environment into a lower income environment, in which their relative income is now much higher, may impact upon SRH and well-being, independently of any objective changes in underlying health.

Turning to our findings, consistent with the ‘relative position’ channel, an increase in regional income results in a deterioration in SRH beyond its effect on underlying health. In other words, as the relative income of the individual falls, some cognitive adjustment takes place which links the perception of being poorer with a less optimistic assessment of health. This macro level relationship therefore operates in precisely the opposite direction to the individual micro relationship. That is, while increases in individual income provide enhanced opportunities for accessing health care, holding individual income constant, a deterioration in relative income is associated with a significantly reduced likelihood of reporting good health, a greater probability of reporting average health and, for females only, an increased likelihood of reporting bad health. This is potentially a very important finding as it suggests that, beyond individual socioeconomic and demographic characteristics, an individual’s relative standing in their local economy conditions their SRH status. While consistent with a rich and well-established literature examining the link between income inequality and health outcomes (e.g. Gravelle and Sutton 2009; Hilderbrand and van Kerm 2009; Wilkinson and Pickett 2010), we contend that, in the first instance, the important mechanism linking aggregate social effects with SRH is a reporting effect. Understanding the relationship between this reporting effect and subsequent changes in underlying health conditions (e.g. via status anxiety) is a matter for further research.

The second important aggregate finding relates to the positive impact that the local education level has on SRH. This is consistent with the communication channel outlined above. By operating in a higher human capital environment, either at work or in local neighbourhoods, the individual internalises some of the external higher education effects. This education effect then augments the, separate, positive relationship between individual education and SRH, which is consistent with the so-called education gradient, implying an association between health and education, with the latter endowing people with the knowledge and individual agency to improve their health.

Turning to the aggregate health indicators, while we don’t find evidence of cut-point reporting heterogeneity in the case of regional government healthcare spending, our results suggest that it does impact on SRH—a finding that merits further investigation. We find strong evidence that higher regional morbidity levels are associated with lower levels of SRH, even after controlling for individual health through the objective health index. In other words, the higher the levels of illness prevailing in the surrounding area are, the less positive individuals feel about their own health, independently of their objective underlying health. Once again, we argue that this is more likely to be a consequence of cognitive recalibration (e.g. regarding the perception of risks) rather than reflecting a necessary link with objective underlying health conditions. That is, the effect operates via the ‘communication channel’.

Briefly, we also confirm that, even with the inclusion of the aggregate factors, the results relating to the individual level characteristics are consistent with both the related literature and with previous evidence for Russia (Kaneva and Baidin 2018). There is strong evidence that age contributes to reporting heterogeneity and that, independent of objective health, each year that passes increases the likelihood of reporting bad health and decreases the likelihood of reporting good health. If this is the case then recalibrating age-specific health expectations could be an important policy instrument (see, for example, Kaneva et al. 2019). In concert with earlier research for Russia (Sinelnikov 2011) our results also show that being in a relationship is likely to decrease the probability of reporting bad health (and average health for men). This could stem from a positive expectation that being in a partnership will offer both financial and emotional support and therefore reduce feelings of vulnerability. Of course, it is plausible that healthier individuals self-select into successful relationships and if this is the case then the additional reporting effect is particularly interesting. Our results also show that individuals outside of Moscow and St Petersburg are less likely to evaluate their health as either good or bad, suggesting that there is less inequality in expectation, understanding and reality than in the metropolis areas of Russia’s two big cities.

Finally, one interesting divergence relates to gender. Broadly speaking, the variables determining reporting heterogeneity in these Russian data are the same for males and females. While this is consistent with Lindeboom and van Doorslaer (2004), it contrasts with other research (e.g. Schneider et al. 2012) suggestive of more substantive gender differences. In Russia, as in most countries, we know that there are large reporting differences between males and females and so, because we can’t explain this through observed characteristics, this research suggests that gender may capture some other, perhaps culturally shaped, behavioural traits and attitudes which also merit further research.

This brings us to consider the limitations of the study. Foremost among these is that we are not able, with these data, to do much beyond speculate as to the underlying mechanisms that drive the data generating process. Secondly, we are also constrained by the data with regards to the objective health indices. Ideally, we would like to have included still more ‘objective’ health data including blood pressure, grip strength, biomarkers and the like. Given the importance of the objective index for resolving the identification problem in the estimated model, this is an obvious caveat. Thirdly, the aggregated data are not at the optimum level of aggregation, but they are the only such variables available for Russia at this time. Finally, the results we present are associations and the claims we can make to establishing causality are inevitably limited. All this having been said, we are confident that our research provides an important contribution to the literature on reporting heterogeneity in SRH and motivates a number of further interesting lines of enquiry.

6 Conclusion

Using an objective health index as a proxy for true health enables us to distinguish between explanatory variables which give rise to reporting heterogeneity in SRH. We find that the causes of observed reporting heterogeneity are broadly similar for males and females and that therefore, since females report substantially higher SRH than males, gender itself is an important dimension underscoring reporting heterogeneity in Russia. For both males and females, socio-economic, demographic and settlement/region characteristics all give rise to important forms of reporting heterogeneity. Notwithstanding these important additions to the literature, our single most important contribution is to provide the first assessment of the relationship between the broader social and economic environment and SRH. We show that aggregate level environmental characteristics can not only impact directly on health, but can also separately and systematically influence reporting behaviour, conditional on underlying true health. We provide some tentative suggestions as to the possible mechanisms at play, which augment those identified in the literature linking income inequality per se with social and health outcomes (e.g. relative income hypothesis, status anxiety etc.). Further work is necessary to disentangle what appear to be reporting effects from what might be more ‘real’ effects.