1 Introduction

Subjective wellbeingFootnote 1 (SWB) has attracted growing interest in empirical economics’ research in recent years. Regressing life satisfaction (LS) scores against a set of individual predictors including economic and non-economic (socio-demographic) factors is a means to investigating SWB. Several studies have also included contextual factors like socio-economic disadvantage or income inequality (Alesina et al., 2004; Kubiszewski et al., 2019b; Oishi et al., 2011), environmental factors (Bertram & Rehdanz, 2015; Kubiszewski et al., 2019a) or climate variables (Brereton et al., 2008; Lignier et al., 2023). However, much of this empirical research ignored the different levels of interaction between contributors and wellbeing and thus risk potential endogeneity, i.e., some factors influence LS at an individual (age, education) or household level (income, house ownership), while other are macro-level factors impacting at a neighbourhood or regional level. Research that used a multilevel analysis approach to examine those different levels of interaction has been undertaken in various countries including Europe (Aslam & Corrado, 2012; Neira et al., 2018; Pittau et al., 2010), the United Kingdom (UK) (Ballas & Tranmer, 2012), the United States (US) (Fernandez & Kulik, 1981) and New Zealand (Aminzadeh et al., 2013); however to our knowledge, this approach has never been applied to urban neighbourhoods in Australia.

The term social capitalFootnote 2 is used by social scientists to refer to the social networks and associated effects such as trust and norms of reciprocity that influence human wellbeing (Coleman, 1988; Helliwell & Putnam, 2004). At an empirical level, it has been described as “the shared knowledge, norms, rules and networks that facilitate collective experience within a neighbourhood” (Vemuri et al., 2011, p. 6). Social capital (alongside natural capital, human capital and built capital) is also one of the domains contributing to human wellbeing identified by Costanza et al. (2013) and its importance has been documented in LS studies at various scales: country (Helliwell & Putnam, 2004; Lawless & Lucas, 2010; Layard, 2005); regional (Ballas & Thanis, 2022; Rentfrow et al., 2009) and local (Aminzadeh et al., 2013; Vemuri et al., 2011). While human and social capital variables have often been considered in Australian LS research (Kubiszewski et al., 2019a; Shields et al., 2009), we are missing a systematic analysis of the impact of different dimensions of social capital (Helliwell & Putnam, 2004) on LS. This investigation seeks to address this gap, placing a particular focus on social capital influence at individual and neighbourhood levels.

Geographic clustering of LS scores has been reported in Europe (Jokela et al., 2015; Okulicz-Kozaryn, 2011; Rentfrow et al., 2015), the US (Rentfrow & Jokela, 2016) and Canada (Helliwell et al., 2019; Ziogas et al., 2023). In this research, we focus on geographic variations between neighbourhoods within an urban/ suburban context. For this purpose, we selected two metropolitan areas of Australia. One is on the East coast, in southeast Queensland centred around Brisbane, the other on the West coast around Perth (Fig. 3, Appendix C). Brisbane and Perth are both classified as “Beta” cities according to the Globalisation and World Cities Research Network (GaWC, 2020). These two regions are comparable in population and geographical spread and have growing demographic and economic significance. Larger metropolitan areas could have been chosen for this study, however, results from previous SWB research in Australia point to a different pattern of relationship between LS and LS predictors in Greater Perth compared to elsewhere across the country (Lignier et al., 2023) and we sought to put this particular assumption to the test.

Our large sample (nearly 4,000 respondents) mean that our findings have relevance beyond the two regions investigated here. They may be of interest when developing public policies that aim to enhance social cohesion and trust within local communities, specifically, public policies on urban social infrastructure.

The rest of this article is structured as follows: Section 2 briefly reviews the relevant literature; the methodology used for our project is described in Section 3, while results are presented in Section 4. We discussed our findings in Section 5, while in Section 6, we summarise our findings and discuss their practical implications.

2 Literature Review

2.1 Geography of Wellbeing

Geography of wellbeing refers to the study of spatial variations in the level of wellbeing and the factors that impact wellbeing (Weckroth et al., 2022). Findings from this body of research show that even after adjusting for differences in individual backgrounds and characteristics, significant differences exist between countries (Layard, 2005; Veenhoven, 2008), regions (Aslam & Corrado, 2012; Helliwell et al., 2019; Okulicz-Kozaryn, 2011; Pittau et al., 2010) or neighbourhoods (Ballas & Tranmer, 2012). There is some indication that geographical contexts affect LS more than momentary happiness (Schwanen & Wang, 2014).

Studies in the US and Canada report that people living in rural areas are happier than urban residents (Fernandez & Kulik, 1981; Helliwell et al., 2019), however evidence from Europe shows little difference in the level of quality of life between rural and urban areas in the richest countries of the European Union, while in the poorer countries of Eastern and Southern Europe, people residing in rural areas appear to have a lower perceived quality of life (Shucksmith et al., 2009). Others have emphasised that the degree of remoteness may be a better predictor of wellbeing than rurality: Gilbert et al. (2016) found that while residents living in remote rural areas in Scotland had a higher level of LS, there was no significant difference between accessible rural areas and urban areas.

Studies that investigated the variations in LS within urban areas found that variables such as accessibility, commuting time, safety, level of pollution and climate variables were contributing factors (Brereton et al., 2008; Ferrer-i-Carbonell & Gowdy, 2007; Stutzer & Frey, 2008; Weckroth et al., 2022). The influence of population density on LS is contested: some argue that overall district density has no significant impact (Ala-Mantila et al., 2018; Ferreira et al., 2013), others report that high density at census block level has a negative effect (Cramer et al., 2004; Helliwell et al., 2019; Li & Kanazawa, 2016). According to Ettema and Schekkerman (2016) and Ala-Mantila et al. (2018), subjective variables such as perceived quality of neighbourhood, perceived safety and subjective distance to amenities appear to be more significant predictors of wellbeing than objective characteristics.

The spatial attributes described above are often intertwined with socio-economic factors such as income inequality and unemployment level in the neighbourhood (Ala-Mantila et al., 2018). The relationship between income inequality and wellbeing is not clear. Higher inequality has been associated with lower reported levels of wellbeing in Europe (Alesina et al., 2004) and in the US (Alesina et al., 2004; Oishi et al., 2011). However when using state level data, Glaeser et al. (2016) found that inequality had a small positive influence on happiness while Florida et al. (2013) noticed that it had no influence in metropolitan areas. It seems however that areas where there is a high inequality in LS have lower levels of average LS (Kubiszewski et al., 2019a; Ziogas et al., 2023). The influence of area level unemployment on individual wellbeing has also been examined: the consensus is that contextual unemployment has a positive influence on individual wellbeing as it acts as a social norm and softens the negative impact of individual unemployment (Clark, 2003; Clark & Oswald, 1994; Powdthavee, 2007).

2.2 The Importance of Social Capital

The significance of local or regional social capital as a predictor of individual wellbeing has been confirmed by studies in Europe (Aslam & Corrado, 2012; Mouratidis, 2019; Neira et al., 2018; Weckroth et al., 2022), the US (Vemuri & Costanza, 2006; Vemuri et al., 2011) and New Zealand (Aminzadeh et al., 2013). Social capital is sometimes included in the analysis as a single proxy factor (Florida et al., 2013; Subramanian et al., 2000), however others integrate different dimensions of social capital as distinct variables: frequency of social contacts (Yuan, 2016), social trust (Subramanian et al., 2000) and community involvement (Gilbert et al., 2016). Trust in institutions is sometimes distinguished from trust in people and social networks from formal networks.

Social trust is the belief that those around you (neighbours, family etc.,) can be trusted and is considered a strong indicator of social capital at aggregate level (Helliwell & Putnam, 2004; Yuan, 2016). The positive relationship between institutional trust, freedom and happiness has been documented in many studies (Bruni, 2006; Frey & Stutzer, 2000; Veenhoven, 2000), while the national average level of interpersonal trust has been shown to have a positive effect on SWB (Helliwell, 2003). When aggregated at the regional level, social trust and institutional trust were positively correlated with LS, but the influence of the aggregate regional mean variables was found to be stronger than the individual effect (Aslam & Corrado, 2012). Social networking is strongly associated with both higher levels of LS and higher levels of happiness (Aslam & Corrado, 2012; Neira et al., 2018), but not with eudaimonic wellbeing (Gilbert et al., 2016).

Individual perceptions about the neighbourhood social and physical attributes have also been used as measures of social capital (Aminzadeh et al., 2013). A psychological sense of community, that is the perception that neighbours are helpful and could be relied upon if necessary, the feeling that one belongs to the community has been found to be associated with higher levels of LS (Ma et al., 2018; Shields et al., 2009; Vemuri et al., 2011). Conversely, negative perceptions about the level of crime (Ambrey et al., 2014) and physical deterioration such as derelict buildings, litter and noise (Ettema & Schekkerman, 2016; Mouratidis, 2019) or pollution (MacKerron & Mourato, 2009) were reported as negative drivers of LS.

2.3 Multi-Level Analysis and Heterogeneity

Multi-level analysis is a way to control for endogeneity caused by grouping (nested) behaviour of individuals. Multi-level analysis investigating geographic heterogeneity has been applied to a variety of topics including: self-rated health in US states (Subramanian et al., 2000); SWB across census districts in the UK (Ballas & Tranmer, 2012); adolescent wellbeing in New Zealand neighbourhoods (Aminzadeh et al., 2013); SWB across regions of Europe (Aslam & Corrado, 2012; Neira et al., 2018; Pittau et al., 2010) and in neighbourhoods of Finland (Weckroth et al., 2022).

Multi-level Modelling (MLM) is the technique of statistical analysis that examines the influence of contextual factors on LS using the hierarchical structure of the data. MLM is particularly suitable for studying the role of contextual factors such as geography based cultural and socio-economic differences, in shaping up lifestyle behaviour (Ballas & Tranmer, 2012; de Leeuw & Meijer, 2008). MLM allows the group coefficients (intercepts and slopes) to be modelled and provides separate estimates of individual and contextual effects at different levels. (Pittau et al., 2010). MLM estimates variations both within and between the groups by allowing intercepts and slopes to vary simultaneously (Gelman & Hill, 2007).

The effects of contextual factors on the dependent variable have been described in various literatures as heterogeneity, however it is important to distinguish between two types of heterogeneity: between-context from between-individual heterogeneity (Duncan et al., 1998). Between-context heterogeneity is a higher-level form of heterogeneity that reflects differences between groupings or regions. On the other hand, between-individual refers to differences at a micro-level and reflects the different characteristics of the people within the grouping/region (Duncan et al., 1998; Neira et al., 2018).

3 Methodology

3.1 Data

This study uses data from Wave 18 (2018) of the Household Income and Labour Dynamics in Australia (HILDA) survey. HILDA is a nationally representative study designed to examine various aspects of Australian households that periodically collects information relating to social capital. Special permission is required to access the restricted version of the HILDA survey.

The descriptive summary of the database used for this study is as follows: 3,869 individual respondents nestedFootnote 3 into 2,176 households nested into 390 level 2 statistical areas. A level 2 statistical area (SA2) is determined by population size (average 10,000). This means that the geographic size of a district may vary from a small neighbourhood in densely populated inner suburbs, to a relatively large area (several hundred km2) in outer metropolitan districts (Australian Bureau of Statistics, 2016).

The comparative statistics between the reference populations and our sample (Table 1) indicate that both metropolitan regions share similar key demographic and socio-economic characteristics except for the proportion of people born overseas and people speaking a Language Other than English (LOTE) that is higher in Greater Perth than in SE Queensland. Respondents in our sample are older on average and households have a much higher median income compared to the reference populations. People born overseas in particular people who speak a LOTE, are significantly under-represented. This selection bias could have implications for the generalisation of our findings, especially the under-representation of immigrants, as empirical evidence suggests that ethnicity has a significant influence on SWB (Bruna, 2021; Helliwell et al., 2019).

Table 1 Statistical profile of the study sample compared to reference populations

3.2 Statistical Analysis

We adopt a three-level MLM for our analysis, with Level 1 representing the individual component, Level 2 the household component and Level 3 the SA2 component. Apart from modelling group coefficients, the use of MLM decomposes the total random variation into individual and group components. While group level predictors are themselves of interest, their inclusion may also reduce the unexplained group level variation which can be interpreted as a measure of the importance of the predictor (Pittau et al., 2010). The number of clusters should be large enough (> 30) and the groups heterogeneous (Hox, 1998). MLM can be applied even where some groups have a size of 1 as long as there is a sufficient number of larger groups (Snijders & Berkhof, 2008). In this research, households with one observation were all retained, however all SA2 districts with less than 2 observations were discarded from the sample as groups with a single observation were problematic when estimating a model with random slope coefficients. The different metropolitan regions could arguably have been treated as an additional level of analysis. However, considering that we only have two clusters, we opted to treat regions as a fixed effect in our estimation.

3.3 Variables

The selection of variables (Table 2, Appendix A) reflects the three-level nested design: individual level variables, household level variables, statistical area level variables. Social capital variables are discussed separately.

3.3.1 Individual and Household Level Variables

The selection of relevant individual socio-demographic variables was guided by the literature: age, gender, health, employment, marital status, and education level. We also include variables specific to the Australian context such as indigenous status, and ‘speak a LOTE at home’ as a proxy for non-English speaking background (Ambrey & Fleming, 2014; Shields et al., 2009).

Household type was included as a potential predictor of LS (Ballas &Tranmer, 2012), with ‘lone person household’ being the baseline and three identified types: ‘couple without children’ ‘couple with children’ and ‘single parent’. Household income is represented by ‘relative income’ based on the evidence that relative income matters more than absolute income as predictor of LS (Clark et al., 2008). House ownership is also retained as a potential predictor (Ballas & Tranmer, 2012; Kubiszewski et al., 2018).

3.3.2 Statistical Area (SA2) Level Variable

Contextual variables representing SA2 socio-economic conditions were selected based on their potential relevance according to the literature. Income inequality is best represented by the Gini coefficient. In the absence of Gini coefficient data at SA2 level, two statistics were retained as a proxy for the measure of inequality: percentage of low-income households (with a weekly income < A$650) and percentage of high-income households (with a weekly income ≥ $A3,000). These variables can be assumed to represent respectively the bottom and the top of the Lorenz curve that determine the Gini coefficient (Florida & Mellander, 2016). We also include a contextual variable representing the area average unemployment level for 2018Footnote 4. Rather than incorporating raw neighbourhood population density as an independent variable, we distinguish between urban and suburban neighbourhoods by using a dummy identifying neighbourhoods that are more sparsely populated (less than 100 people per km2) (Neira et al., 2018). Some previous studies have identified a small influence of neighbourhood ethnicity on individual LS (Fernandez & Kulik, 1981). We identify percentage of people speaking a LOTE at home (Kubiszewski et al., 2019b) as a proxy for ethnic diversity in the neighbourhood.

The contextual influence of the environment and the climate is also considered. To this purpose, we include the following variables: natural vegetation index (NDVI) representing “greenness” (Kubiszewski et al., 2019b), average rainfall, and maximum temperature in summer (Florida et al., 2013).

3.3.3 Social Capital Variables

Following the methodology adopted in Aslam and Corrado (2012), Aminzadeh et al. (2013) and Neira et al. (2018), social capital was captured by the survey instrument at respondent level. District averages for each variable were then calculated using sample design weights.

Informed by the relevant literature discussed in Section 2, six dimensions of social capital are initially identified for this project: social trust, social connection, social engagement, psychological sense of community, neighbourhood perceived crime and safety, neighbourhood perceived shabbiness. Psychological sense of community reflects the individual’s perception about the neighbourhood social cohesion and social harmony. The construct was first introduced by Vemuri et al. (2011) who labelled it social capital index. A similar variable was used by Aminzadeh et al. (2013) who combined it with neighbourhood perceived safety into an overall ‘social cohesion’ construct. The details for each social capital construct are shown in Table 2 and underlying survey questions can be found in Table 7, Appendix A. Cronbach Alpha calculations reveal a high level of internal consistency for each construct. Institutional trust was not believed to be relevant here as the neighbourhoods investigated are subject to very similar political and governance structures.

Table 2 Initial social capital constructs

Pairwise correlations between the different constructs were estimated. A high level of correlation was found between social connection and social engagement at both individual (0.51) and district level (0.75). To avoid multicollinearity issues, the two constructs were merged into an ‘engagement and connection’ composite variable. Similarly, neighbourhood perceived safety and neighbourhood perceived shabbiness with correlations at 0.60 and 0.70, were merged into ‘neighbourhood safety issues & shabbiness’.Footnote 5

3.4 Unexplained Group Level Heterogeneity

Some of the group level random effects identified by the analysis could be correlated with the regressors, for instance heterogeneity in social capital variables between SA2 could be correlated with the corresponding individual regressors. While using a MLM model, this heterogeneity can be resolved by including the group means for these variables in the regression (Mundlak, 1978). To avoid problematic multicollinearity between the individual level variables and the group mean variables, we use a mean centred level 1 covariate as an instrumental variable (Snijders & Berkhof, 2008), this makes it possible to assess the relative position of the individual as well as the effect of the absolute group factors.

When calculating the district mean for social capital variables, design weights reflecting the difference in probability of selection of different individuals are used to correct for sampling bias (Aslam & Corrado, 2012). As the HILDA weight represents the probability of an individual being selected by reference to the entire population (Watson, 2012), a corrected weight is calculated by reference to the statistical district to reflect the probability of being selected within the district. The aggregate mean social capital variable \(\stackrel{-}{{(C}_{k}})\) is determined as follows:

$$\stackrel{-}{{C}_{k}}=\frac{1}{{k}_{D}}\times \sum _{i=1}^{{k}_{D}}{w}_{di}\times {C}_{ijk}$$

where \({w}_{di}\) is respondent i weight in the district, \({k}_{D}\) is the number of respondents in the district and \({C}_{ijk}\) is the social capital variable measure of respondent i.

3.5 Estimated Regression Models

Six successive models are estimated. Model 0 is an empty model of individuals nested within households nested within SA2 areas with no independent variables:

$${y}_{ijk}={\beta }_{000}+{v}_{00k}+{u}_{0jk}+{e}_{ijk}$$
(0)

where \({\beta }_{000}\) is the overall LS mean, \({v}_{00k}\) is the intercept adjustment for each SA 2 area, \({u}_{ojk}\) is the intercept adjustment for each household and \({e}_{ijk}\) is an individual error term. \(i=1\dots I\) represents individual respondents, \(j=1\dots .J\) represents households, and \(k=1\dots .K\) represents SA2 areas.

Model 1 extends Eq. (0) by including individual level socio-demographic variables:

$${y}_{ijk}={\beta }_{000}+{\gamma }_{100}{X}_{ijk}+{v}_{0k}+{u}_{0jk}+{e}_{ijk}$$
(1)

where \({X}_{ijk}\) is a vector of individual socio-demographic variables.

Model 2 extends Eq. (1) by including household level variables:

$${y}_{ijk}={\beta }_{000}+{\gamma }_{100}{X}_{ijk}+{\gamma }_{010 }{Z}_{jk}+{v}_{0k}+{u}_{0jk}+{e}_{ijk}$$
(2)

where \({Z}_{kj}\) is a vector of household level variables.

Model 3 extends Eq. (2) by including mean-centred individual social capital variables as well as the aggregate value of the social capital variables at SA2 levels, and contextual socio-economic and environmental variables.

$${y}_{ijk}={\beta }_{000}+{\gamma }_{100}{X}_{ijk}+{\gamma }_{010 }{Z}_{jk}+{\delta }_{100}({C}_{ijk}-\stackrel{-}{{C}_{k})}{+{\delta }_{001}\stackrel{-}{{C}_{k}}+{\theta }_{001}{D}_{k}+v}_{0k}+{u}_{0jk}+{e}_{ijk}$$
(3)

where \({C}_{ijk}\) is a vector of social capital variables at individual level, \(\stackrel{-}{{C}_{k}}\) is the aggregate mean of a social variable for area k and \({D}_{k}\) is a vector of contextual variables for area k.

Model 4 extends Eq. (3) and accounts for between-group heterogeneity by making the coefficients of the individual social variables dependent on the SA2 area. Snijders and Bosker (2011, p. Chapter 5) recommend using the primary variable rather than the centred variable as the control individual level variable.

$${y}_{ijk}={\beta }_{000}+{\gamma }_{100}{X}_{ijk}+{\gamma }_{010 }{Z}_{jk}+{\delta }_{100}{C}_{ijk}{+{\delta }_{001}\stackrel{-}{{C}_{k}}+{\theta }_{001}{D}_{k}+v}_{0k}+{v}_{1k}{C}_{ijk}+{u}_{0jk}+{e}_{ijk}$$
(4)

where \({v}_{1k}\) is the slope adjustment term for the social capital variable 1 in area k.

Model 5 modifies Eq. (4) and instead of using random slope coefficients for \({C}_{ijk}\), introduces cross-interaction terms between the individual social capital variables and their mean at SA2 level.Footnote 6 This interaction term for social capital variables is \({\delta }_{101}{C}_{ijk}\stackrel{-}{{C}_{k}}\). This means that the slope of the individual social capital variables effectively vary depending on the SA2 as it does for Model 4; however instead of producing one coefficient estimate per area as in Model 4, it produces two unique estimates: \({\delta }_{100}\) and \({\delta }_{001}\) (Neira et al., 2018).

$${y}_{ijk}={\beta }_{000}+{\gamma }_{100}{X}_{ijk}+{\gamma }_{010 }{Z}_{jk}+{\delta }_{100}{C}_{ijk}{+{\delta }_{001}\stackrel{-}{{C}_{k}}+{\delta }_{101}{C}_{ijk}\stackrel{-}{{C}_{k}}{+\theta }_{001}{D}_{k}+{\theta }_{101}{X}_{ijk}{D}_{k}+v}_{0k}+{u}_{0jk}+{e}_{ijk}$$
(5)

To minimise the possibility of heteroskedasticity, models were estimated using robustness checks such as robust variances and independent residuals (Pek et al., 2018). We checked for normality of residuals using visual tests including kernel density and standardised P-P plot (Figs. 1 and 2, Appendix B); given the large sample limitation of the Shapiro-Wilk test (Royston, 1982).

4 Results

4.1 Descriptive Statistics

Descriptive statistics are summarised in Table 3. The average LS in our sample is 7.94 on a 0–10 scale which is very close to the 7.91 Australian within-person average for the period 2000-17 reported by Kubiszewski et al. (2020). Figure 4 (Appendix C) provides some visual evidence of the geographical clustering in both metropolitan regions. Comparison of social capital average scores with results from prior research is often meaningless because of differences in scales and methodologies. Variations of aggregate scores between neighbourhoods remain high with a standard deviation well over 50% of the variation at individual level. Figures 567 and 8 (Appendix C) visually illustrate the diversity in social capital variables across neighbourhoods in both metropolitan areas.

Table 3 Summary descriptive statistics for selected variables

4.2 Model 0 (Empty Model)

Results for Model 0 (null hypothesis) are shown in Table 4. Random intercept effects are significant at all levels. The intraclass correlation coefficient (ICC) measures the correlation between observations within a particular class. It is calculated as the ratio of the between-cluster variance and the sum of between and within cluster variances (Raudenbush & Sampson, 1999). The ICC at SA2 level is small compared to the ICC at household level revealing an important variance between household clusters.

Table 4 Intercept and random effect parameters for the null hypothesis model

4.3 Model 1 and Model 2

Model 1 incorporates level 1 socio-demographic variables while household variables are added in Model 2. Results for both models are shown in Table 5. The significant drop in the SA2 random intercept variance (from 0.109 for Model 0 to 0.051 for Model 2) shows that a large portion of the initial between-group heterogeneity in LS between SA2 is explained by socio-demographic variables.

Table 5 Summary statistics for Model 1 and Model 2

Being healthy is by far the most important positive contributor of LS. Age (in quadratic form) is another significant contributor; being female is a positive factor; being unemployed is associated with lower LS, as is having a higher education. Among household type variables, living as a couple either with or without children has a positive effect on LS compared to living in a single person household. Relative income measured as the ratio between household income and median income in the statistical area is a positive contributor as is house ownership. Finally, the fixed effect dummy variable for the Greater Perth region is highly significant indicating a different pattern of relationship for that region.

4.4 Models 3, 4 and 5

These models introduce social capital variables at individual and SA2 levels as well as contextual variables at SA2 level. Summary statistics for each model are shown in Table 6. Model 3 allows for random intercepts, i.e., it captures between-context heterogeneity. Model 4 allows for random slope coefficients for the social variables. This captures between-individual heterogeneity through group dependence (Snijders & Bosker, 2011). Model 5 includes interactions terms for the social capital variables without random slopes. This may provide explanations for the geographical variability of the individual social capital coefficients (Schynz, 2002). We also include interactions terms for ethnicity and unemployment.

The introduction of social capital variables causes a further drop in the random intercept variance at SA2 level from 0.051 (Model 2) to 0.036 (Model 3). Random effects of intercept at SA2 level remain significant for Models 3 and 5. Coefficients for most individual and household level factors remain stable across Models 3, 4 and 5 with little variation from Model 2. However, the coefficients for ‘speaking LOTE’ and ‘being unemployed’ lose their significance in Model 5 when an interaction term with the area level variable is introduced.

Results for Model 3 show that individual social trust, ‘engagement and connection’ and ‘psychological sense of community’ are significant positive contributors of LS. Conversely, ‘perception about safety issues and shabbiness’ has a significantly negative relationship with LS. Area level aggregate for social trust and ‘social engagement and connection’ are also positively associated with individual LS, but aggregate ‘psychological sense of community’ has no significant influence. The aggregate mean ‘neighbourhood safety issues and shabbiness’ has a significant negative coefficient. Coefficients for aggregate social capital means tend to be somewhat larger that coefficients for corresponding variables at individual level.

In Model 4 slope coefficients for individual social capital variables vary across SA2. The only coefficient for which the variation is significant is ‘neighbourhood safety issues and shabbiness’. The average coefficients for individual social capital variables are remarkably similar to those obtained for Model 3. Coefficients for SA2 aggregate are much smaller than in Model 3 and become non-significant. This outcome, similar to the results in Neira et al. (2018) suggests that the heterogeneity in the effect of social capital variables between SA2 is likely to be attributable to between-individual heterogeneity rather than between-group heterogeneity (Duncan et al., 1998).

Model 5 introduces cross-level social capital interaction terms and explores the possible causes for between-individual heterogeneity (Duncan et al., 1998). The results suggest that ‘engagement and connection’ is the only social capital variable, for which the contextual term and the individual term have a combined effect on LS. In other words, the impact of individual social engagement and connection on individual LS is influenced by the overall level of social engagement and connection in the area. Both the individual and mean variables have higher coefficients than in the previous models, however the interaction term coefficient is significantly negative.

None of the other contextual socio-economic variables appear to have a significant influence on LS. The introduction of an interaction term between the individual variable and the contextual variable for ‘speak LOTE’ and unemployment in Model 5 does not fundamentally alter this pattern. The interaction term for ‘speak LOTE’ has a weakly significant negative effect. Likewise, the environmental contextual variables representing green vegetation and climate are not found to significantly impact LS. The fixed effect identifier for Greater Perth is weakly significant in Model 3 and 5 and below significance level in Model 4.

Table 6 Summary statistics for Models 3, 4 and 5

5 Discussion

The small unexplained random effect at SA2 level in Model 3 suggests that socio-demographic variables and social capital variables explain much of the variations in LS between SA2. It is much larger than the random effect at group level in the Aslam and Corrado (2012) and Neira et al. (2018) studies where the clusters were large regions in the EU, but it is comparable to the variance at neighbourhood level in Aminzadeh et al. (2013) regression models.Footnote 7 The unexplained random effect at household level remains relatively large as in Ballas and Tranmer (2012) indicating the presence of idiosyncratic differences between households.

The importance of factors representing social capital at individual level is confirmed by this research. Specifically, social trust (Helliwell & Putnam, 2004; Neira et al., 2018; Subramanian et al., 2000) and ‘engagement and connection’ (Aminzadeh et al., 2013; Gilbert et al., 2016; Yuan, 2016) are significantly positive contributors of LS. The variable ‘psychological sense of community’ that reflects individual perception about the level of social connection and harmony within the neighbourhood also has a positive impact on LS. The same outcome was reached by Aminzadeh et al. (2013) (social cohesion) in their study of adolescent wellbeing in New Zealand. Conversely, Vemuri et al. (2011)’ social capital index based on the same dimensions was not found to be a significant driver of LS in metropolitan Baltimore.

Individual perception of safety issues and shabbiness in the neighbourhood has a significant negative impact on LS reflecting existing evidence (Ettema & Schekkerman, 2016; Mouratidis, 2019). This outcome might be surprising as the actual rate of violent crimeFootnote 8 in both metropolitan areas is around 1,200 per 100,000 which is close to the Australian average (Australian Bureau of Statistics, 2023), but much lower than for Philadelphia, a comparable Beta ranked city in the US.Footnote 9 This result is consistent with previous evidence that incidence of crime has little relationship with people’s perception of crime (Veenhoven, 2002) and that perceived crime levels are more significant negative contributors to LS than actual crime levels (Ambrey et al., 2014; Larson, 2010).

Our study finds little evidence of contextual influence for social capital factors on LS: the results from the random slope coefficients model suggest that differences between SA2 are generally attributable to compositional effects rather than contextual effects. The exception is ‘social engagement and connection” where the aggregate variable has a strong positive coefficient when an interaction term with the individual level variable is included. This suggests that the neighbourhood level of social connection and engagement interacts with the corresponding variable at individual level. The negative interaction factor can be interpreted as meaning that where aggregate social engagement and connection is high, the influence of individual level engagement and connection will be reduced. Only very few studies have considered the influence of area level social connection and engagement on individual LS. Aminzadeh et al. (2013) found a positive influence for membership in community organisations, Neira et al. (2018) reported a positive association for informal and formal networks, but no significant influence for civic engagement, a result also reached by Aslam and Corrado (2012). When interpreting these results, it is important to bear in mind that the two latter studies examined large regions (NUTS1-3) in different countries in Europe rather than urban neighbourhoods.

The influence of urbanity/ rurality was measured through the ‘sparsely populated’ dummy. It appears that this variable was not a significant contributor confirming earlier outcomes that a ‘rural’ residence has no significant influence on wellbeing when the area is accessible (Gilbert et al., 2016; Weckroth et al., 2022). Neither of the two proxy variables for income equality are significant. This outcome aligns with some earlier results (Florida et al., 2013; Weckroth et al., 2022), but challenges others, for instance Ala-Mantila et al. (2018) found that people had a higher Quality of life where the proportion of high income earners was higher.

Individual ethnicity has often been identified as a possible determinant of LS, but very few studies have considered the influence of neighbourhood ethnicity. The consensus is that being of a non-English background (Kubiszewski et al., 2018) or belonging to an ethnic minority (Aminzadeh et al., 2013; Oswald & Wu, 2011) is associated with lower LS levels. A recent Canadian study also found that neighbourhoods with a high proportion of foreign born had lower average LS (Ziogas et al., 2023). The analysis of our data shows no significant influence for ethnicity on LS at either individual or neighbourhood level. However, the negative coefficient for the interaction term suggests that being a person of non-English speaking background living in a neighbourhood with a high level of non-English speakers may have a negative impact on LS. We need to interpret this result with caution considering that non-English speakers were under-represented in our sample.

Our study confirms earlier findings that being unemployed tends to be associated with lower levels of LS. There is also strong empirical support in the literature for the argument that contextual unemployment might mitigate the effect of individual unemployment by acting as a social norm (Ballas & Tranmer, 2012; Clark, 2003; Powdthavee, 2007). This finding for contextual unemployment is not replicated in our study even when we include an interaction term. This might be explained by the relatively low level of contextual un-employment in our area of study (5.9%) at the time of survey. By comparison the unemployment rate in South Africa for the Powdthavee study was around 13% (Powdthavee, 2007) and it was 8.6% in the UK in 1991 (World Bank, 2023) when the data used by Ballas & Tranmer was collected.

Vegetation index was found to be a statistically significant LS factor in previous research (Kubiszewski et al., 2019b); similarly, temperature and to a lesser extent rainfall, have been found to influence average LS (Brereton et al., 2008; Florida et al., 2013; Lignier et al., 2023). Neither of these environmental variables significantly impact LS in our sample. While previous research that examined the influence of climate and the environment on LS covered regions with different climates, our study covered areas where climate variations are small. A similar conclusion to ours was reached from data collected in the urban area of Baltimore (US) (Vemuri et al., 2011).

Individual socio-economic variables mostly behave as expected from similar research in Australia. Age is a consistent predictor with a U shape non-linear relationship (Frijters & Beatton, 2012); being female is associated with higher levels of LS. Evidence about the influence of sex on LS is mixed, with some studies predicting a positive association for female (Kubiszewski et al. 2019; Neira et al., 2018) others a negative association (Aminzadeh et al., 2013; Ballas & Tranmer, 2012). The negative association between having a university degree and LS that was noted in some studies is confirmed here (Ambrey & Fleming, 2014).

According to our results, household structure matters: couples with or without children seem to have higher LS levels compared to single person households. This differs somewhat from earlier MLM studies in the UK where couples without children had higher levels of LS but couples with children had lower levels (Ballas & Tranmer, 2012).Footnote 10 House ownership is consistently a significant positive predictor of LS. This aligns with earlier results from the UK (Ballas & Tranmer, 2012) and Australia (Kubiszewski et al., 2018), but a US study found that home ownership was a negative predictor of metropolitan wellbeing (Florida et al., 2013). Relative household income is a significant positive contributor of LS in all models reflecting predictive models by Clark et al. (2008). Household income was also found to be a positive contributor in many LS/ social capital studies (Aslam & Corrado, 2012; Neira et al., 2018; Rentfrow et al., 2009).

A secondary objective of this study was to test whether the seemingly different pattern of relationship between LS and LS determinants in the Greater Perth region was confirmed. The two regions were identified through the inclusion of a fixed effect dummy variable. The significantly negative coefficient for the Greater Perth dummy in Models 1 and 2 seem to support previous findings that ceteris paribus a certain configuration of LS determinants would result in lower predicted LS in that region (Lignier et al., 2023). The coefficient remains negative when social capital and contextual variables are introduced but with a much weaker level of significance. This might suggest that the positive effect of social capital variables on LS in Greater Perth somewhat mitigates the differences with other regions.

6 Conclusion

The primary objective of this study was to analyse the influence of individual and contextual social capital variables on LS within a metropolitan context in Australia using MLM. This study is the first one, to the best of our knowledge, that uses MLM with three levels of aggregation to investigate LS and social capital in metropolitan regions within a single country.

We find that only a moderate proportion of the unexplained variation in LS prediction was attributable to differences between spatial clusters, while the difference between households is much more significant. These results reflect similar findings from MLM studies at neighbourhood scale conducted in the UK and in New Zealand. We also find that most of the spatial heterogeneity is probably attributable to compositional effects (i.e., different characteristics of respondents living in different area) rather than contextual effects (variations linked to the specific social capital characteristic of the area).

Our results corroborate previous findings that factors such as social trust, social engagement connection, and psychological sense of community representing individual contribution to social capital are strong positive contributors of individual LS. Negative individual perceptions about the neighbourhood such as safety issues and physical deterioration seem to have a deeper (negative) impact on LS, than positive perceptions about the neighbourhood social attributes. The influence of contextual social capital on individual LS appears to be limited to the interaction between individual and aggregate levels of social connection and engagement. Overall, our findings are consistent with the hypothesis that social capital will shape individual wellbeing (Coleman, 1988; OECD, 2001); however they do not support previous findings of a significant influence for contextual socio-economic variables such as income inequality, contextual un-employment and neighbourhood ethnicity.

Our research has several limitations. Firstly, as noted in our methodology section, our sample is somewhat biased towards older people with higher income, and households from non-English speaking background are under-represented. This may explain the non-significance of some contextual variables such as income inequality, unemployment and ethnicity. Secondly, the number of respondents in some of the SA2 is very small, consequently idiosyncratic individual data may have a disproportionate effect on the area means (Helliwell et al., 2019). Thirdly, we do not account for possible spillover effects where average level of LS and social capital for an area could be influenced by neighbouring areas (Ziogas et al., 2023). Finally, as noted by Neira et al. (2018), the lack of consistency in the definition of social capital dimensions and the absence of independently measured aggregate social capital indicators make inter-research comparisons hazardous.

Notwithstanding the above caveats, we believe our research contributes to knowledge about the impact of social capital on individual wellbeing. Our methodology can be replicated to other metropolitan areas anywhere in the world and applied to other wellbeing indicators such as happiness or mental health. Our findings will inform government social policy in urban areas, for instance the building of urban infrastructure that promote social access and encourage social activities such walking paths, community playgrounds, and the remedying of physical urban deterioration and crime to reinforce the perception of personal safety. As argued by Wilkinson and Pickett (2009), the positive impact of community engagement and connection and individual psychological sense of community on SWB suggest that access to urban and social infrastructure needs to be accompanied with better inclusion of all groups within the community to achieve better social outcomes.