Retirement Consumption Puzzle in Malaysia : Evidence from Bayesian Quantile Regression Model

The objective of this study is to use the Bayesian quantile regression for studying the retirement consumption puzzle, which is defined as the drop in consumption upon retirement, using the cross-sectional data of theMalaysianHousehold Expenditure Survey (HES) 2009/2010. Three different measures of consumption, namely, total expenditure, work-related expenditure, and nonworkrelated expenditure, are suggested for studying the retirement consumption puzzle. The results show that the drop in consumption upon retirement is significant and has a regressive distributional effect as indicated by larger drops at lower percentiles and smaller drops at higher percentiles. The smaller drops among higher consumption retirees (or higher income retirees) may imply that they have more savings and/or retirement benefits than the smaller consumption retirees (or lower income retirees). Comparison between the three types of consumption shows that the work-related expenditure has a uniform drop across the distribution. The drop under the nonwork-related expenditure varies across the distribution, implying that it is the source behind the variation of the consumption drop.


Introduction
A life cycle theory is a major economic theory that relates consumption and saving behavior, which states that individuals desire to maintain their level of consumption in their entire lifetime [1].The marginal utility consumption should remain smooth throughout retirement transition because the change in income during retirement should be predictable [2].On the contrary, a number of previous studies found a one-time significant drop in consumption in the early years of retirement, a situation which is known as retirement consumption puzzle.
Over the past three decades, several studies focusing on the smoothing or stable path of consumption have been carried out.Hamermesh [3], who was among the first to study retirement consumption puzzle in the United States, showed that an individual was unable to sustain the level of real consumption prior to retirement due to inadequate retirement savings.Later, several more studies have been conducted in other countries such as Banks et al. [4], Hurd and Rohwedder [5], Schwerdt [6], Wakabayashi [7], and Battistin et al. [8].A review on several studies related to the retirement consumption puzzle can be found in Attanasio and Weber [9].There are a number of studies which found that the decline in consumption upon retirement is due to several reasons.As examples, Haider and Stephens [10], Smith [11], and Blau [12] found that the consumption drop is due to the unexpected retirement which resulted from illnesses, disabilities, or involuntarily unemployment.Blau [12] also developed a modified life cycle model which incorporated the uncertainty in the timing of retirement.
In terms of consumption measures, several studies utilized food expenditure as a prominent substitution for the actual consumption during retirement.However, Aguiar and Hurst [13] found that food expenditure is a poor proxy for the actual consumption because retirees consumed the same quantity of food, as well as its quality, even when food expenditure has declined.The main reason is that the retirees consumed home production food in their retirement since they have more time to prepare their meal and survey for cheaper food.Later, Hurst et al. [14] and Fisher et al. [15] proposed a broader measure of consumption which can be categorized according to several different types of expenditure and showed that a broader consumption measure can be used to eliminate retirement consumption puzzle.
Previous studies on the drop in retirement consumption were mostly concentrated at the mean distribution, which covered only certain parts of distribution, and may lead to poor estimation of parameters especially in long-tailed distributions.The traditional mean model estimates only the average effects of the whole data and do not allow for an understanding of any potential distributional impacts (or heterogeneity potential).Furthermore, the mean regression model is based on least squares estimation and thus has a significant sensitivity (or is not robust) to outliers.Several studies have been carried out to study the distributional aspects of retirement consumption puzzle, including Bernheim et al. [16] who estimated the retirement consumption drop using subgroups of wealth and income replacement rate and Aguila et al. [17] who used low and high consumption households to examine the drop in retirement expenditures.Recently, Fisher and Marchand [18] used the quantile regression model to investigate the drop in consumption upon retirement.
Quantile regression model is different from the traditional mean regression model as it uses the Least Absolute Deviation (LAD), instead of the least square error, which is able to rectify the weaknesses prevalent in the usual regression framework.Quantile regression model also allows the impact of each regression parameter to be analysed based on different selected quantiles.In short, quantile regression model has several advantages; it is a distribution-free model which does not adhere to any distributional assumptions, it is robust to outliers, it does not require independence assumption, and it allows the analysis of regression parameters to be extended beyond central locations.
Quantile regression model is based on the works of Koenker and Bassett [19] and Koenker and Hallock [20] and is gaining rapid interests by other researchers.The model has been developed for linear models with continuous responses and has been applied in various fields such as finance and economics [21], ecology [22], environmental epidemiology [23], criminology [24], and climate change [25].Several extensions to the quantile regression model have been suggested and applied in other areas, such as Yu and Moyeed [26] who proposed Bayesian quantile regression model, Machado and Silva [27] who proposed quantile regression model for discrete data, Hewson and Yu [28] who suggested quantile regression model for binary data within the Bayesian framework, Reich et al. [25] who introduced Bayesian spatial quantile regression model, and Fuzi et al. [29] who applied Bayesian quantile regression model for claim count data in insurance area.
In this paper, we use the Bayesian quantile regression model to examine the drop in consumption upon retirement.The Bayesian quantile regression has the combined advantages of both the quantile regression and the Bayesian approach.The quantile regression is a distribution-free model and robust to data, while the Bayesian approach allows the complete univariate and joint posterior distribution of each parameter to be generated by the MCMC simulations.
Several motivational examples are worth mentioning here.For instance, in the field of environmental study, the quantile regression model allows the investigation of whether the effects of environmental exposure change depending on the level of respiratory health of the population [23], while in the area of insurance pricing and ratemaking, the Bayesian quantile regression models handle the parameters of covariates (or the risk factors) as random variables [29].We use a cross-sectional data of Household Expenditure Survey (HES) 2009/2010 in Malaysia to investigate the changes in consumption drop across the population.We also expand the consumption measure into three categories, namely, total expenditure (TE), work-related expenditure (WRE), and nonwork-related expenditure (NWRE), to identify the source behind the variation of consumption drop.

Mean Regression.
In our study, the drop in mean consumption upon retirement is estimated using the ordinary least square (OLS): where ln(  ) is the log of response variable (consumption measures),   is the binary variable which equals one for retirees and zero for working households, and x  is the vector of control variables consisting of demographic and socioeconomic variables (gender, marital status, ethnic, etc.).The regression parameter, , represents the mean consumption difference between working and retired households.

Frequentist Quantile Regression.
The frequentist quantile regression model from Koenker and Bassett [19] is used in our study for comparison purposes.The same equation shown in ( 1) is used, but the quantile regression is now fitted to the conditional differences in the log of consumption between working households (preretirement) and retirees (postretirement) at the th quantile.Let Y be the vector of continuous response variables, which in our case can be represented by the three different consumption measures, and x  be the associated row-vector of covariates consisting of demographic and socioeconomic characteristics of households.The classical regression model focuses on the expectation of variable , conditional on the values of variables X, which can be summarized as (  | x  ) = x   .On the other hand, the quantile regression model extends this approach to    ( | x  ) = x     , where the quantile  are fixed values between 0 and 1, allowing us to study the conditional distribution of  on X at different locations.
The regression parameter,   , can be obtained using min where the loss function   () = ( − [ < 0]) is a piecewise linear function and [⋅] is the indicator function.
Equivalently, the loss function can be written as Equation ( 3) can be minimized using linear programming, while the confidence interval can be obtained using bootstrap method.R statistical program with quantreg package [30] is used in this study to fit the frequentist quantile regression model.

Bayesian Quantile Regression.
The Bayesian approach for quantile regression model was introduced by Yu and Moyeed [26] who formed the likelihood function using the asymmetric Laplace distribution (ALD).A random variable  follows the ALD when the density function is given by where 0 <  < 1 and   (⋅) is the loss function in equation ( 3).Under the Bayesian approach, the MCMC can be used to obtain the posterior distributions of the unknown parameters.The posterior distribution of parameter  is given by where () is the prior distribution of  and (y | ) is the likelihood function which is formed by joining the independently distributed ALD.The joint likelihood function can be written as Since there is no specific conjugate prior distribution for generating the posterior distribution, we use the uniform prior distribution for all () in our study.The prior distribution for the scale parameter is the inverse-gamma distribution, or () ∼inverse-gamma(, ), which allows the Gibbs sampling algorithm in the MCMC to update and tune () for obtaining good acceptance rates.

Results and Discussion
3.1.Sample Data.The sample data from Household Expenditure Survey (HES) 2009/2010 is used in our study.The data contains information on monthly expenditure, together with demographic and socioeconomic characteristics of each household.The selected sample is a cross-sectional data consisting of 6480 household heads.Three different measures of consumption are used for the response variable, namely, total expenditure (TE), workrelated expenditure (WRE), and nonwork-related expenditure (NWRE).TE consists of food at home, alcohol and cigarettes, home appliances and furniture, clothing, education, entertainment and recreation, health, insurance, outside food (restaurant and café), transportation (own vehicle and public transport), personal care, rental, utility, and other services (such as legal services, tax services, and government agency).WRE and NWRE are constructed using the definitions in Aguiar and Hurst [31] and Fisher and Marchand [18].WRE consists of outside food, personal care, public transport, and clothing, whereas NWRE contains food at home, alcohol, utilities, and entertainment.

Model Development.
The MCMC simulations via Gibbs sampling algorithm are used to generate 5,000 posterior samples of each regression parameter.The first 1,000 runs of posterior samples are discarded as burn-ins to lessen the effect of initial simulations, and the process resulted in 4,000 final posterior samples for each regression parameter.The values of  are initialized at zero, while the inverse-gamma parameters are set at a = 0.01 and b = 0.01.

Summary Statistics.
Table 1 provides the summary statistics for the sample.More than half of household heads (69%) are located in urban areas, a large majority (78%) are married, and more than half (64%) belongs to Bumiputera ethnic.In terms of educational attainment, 14% of household heads are university or college graduates, 35% are high school graduates, and the balances are below high school education (16%), and others (35%) (attend informal or religious education).For occupational groups, almost 13% of household heads are professionals, while 19% and 13% are administrative supports and technicians, respectively.In terms of employment type, almost half of household heads are private sector employees, followed by self-employed (19%) and government employees (13%).The proportions of age group are quite equally divided, with the exception of younger household heads (age less than 25).With regard to the status of living quarters, more than half (67%) are homeowners, followed by renters (24%).

Mean Regression.
Table 2 provides the estimate, , for the mean regression model, which can be used to show the difference (in mean consumption) between working and retired households.Three response variables are considered, namely, total expenditure (TE), work-related expenditure (WRE), and nonwork-related expenditure (NWRE).
The mean consumption among retirees is lower compared to the working households for all expenditure measures as shown by the negative coefficient.The regression estimate for TE is significant and indicates that the consumption for retirees is 14% lower than the working households.The estimates for WRE and NWRE are also significant, showing that the retirees' consumptions are 39% and 5% lower than the working households, respectively.As expected, the WRE has a relatively larger drop than the NWRE, indicating that retirees are no longer involved in employment.The results also agree with studies from Fisher and Marchand [18] and Fisher et al. [15] who found that the WRE has the highest consumption drop.The results indicate that a broader measure of consumption may diminish the retirement consumption puzzle, as indicated by the smaller drop in the NWRE.

Bayesian Quantile Regression.
The trace plots for the regression estimate under the Bayesian median regression ( = 0.5) are provided in Figure 1.The trace plots for other quantiles are also obtained but are not shown here.The trace plots show that the MCMC simulations mix well, the convergence of the posterior distributions took place, and there were no significant problems in the chain simulations.Table 3 shows the regression estimates (and standard deviations) under the Bayesian quantile regression.It can be seen that the estimates for the three consumption measures are significant at all quantiles.The drops in consumption also differ across the distribution.All three consumption measures show larger drops at lower percentiles and smaller drops at higher percentiles (regressive trend), where the smallest and highest drops are at the highest and lowest percentiles respectively.The regressive trend disagrees with the results of Fisher and Marchand [18] who found that the estimates are more negative (progressive trend) when moving towards the upper distribution.However, the regressive trend is consistent with the results of Aguila et al. [17] who found that the retirement consumption has larger drops at lower percentiles.
The TE displays the largest drop (22%) at the 10th percentile, a drop of 15% at the median, and the smallest drop (3%) at the 90th percentile.Similar patterns are also seen in the WRE and NWRE: 60%, 36%, and 17% drops, respectively, at the 10th, median, and 90th percentiles for the WRE and 11%, 5%, and 2% drops, respectively, at the 10th, median, and 90th percentiles for the NWRE.In terms of magnitude, the WRE shows the largest drop at all quantiles, and the result agrees with the mean model which shows that the largest drop is from the WRE.As expected, the consumption drops under the Bayesian median model (=0.50)that is quite comparable to the OLS model.The drops for TE, WRE, and NWRE are 14%, 39%, and 5%, respectively, under the OLS and 15%, 36%, and 5%, respectively, under the Bayesian median model, indicating that the median model can be used as an alternative to the mean model (OLS).
It can be observed that the drop in WRE follows a uniform trend across the distribution, while the NWRE has more variations at lower percentiles (from 0.10 to 0.50).The variations show that the NWRE is the source behind the variation in the consumption drop.Our result agrees with Fisher and Marchand [18] who found that the WRE displays a uniform drop across the distribution, and the NWRE is the source behind the variation of the consumption drop.
Figure 2 exhibits the plots of regression estimates with their respective 95% credible intervals under the Bayesian quantile regression.For comparison purpose, the estimates from the OLS are also included, represented by the dashed horizontal line.It can be seen that the estimates under the NWRE have more variations in the lower percentiles (from 0.10 to 0.50), while the estimates under the WRE are uniformly increasing.The consistently small widths of the credible intervals throughout the quantiles indicate that the estimates for the three consumption measures are significant throughout the distribution.
3.6.Frequentist Quantile Regression.For comparison purpose, Table 4 shows the estimates (and standard errors) under the frequentist quantile regression model for the three different measures of consumption.Comparison between Tables 3 and 4 shows that the estimates under the Bayesian and frequentist quantile regression models are similar for all quantiles.The main difference between both models is shown by the standard deviations and standard errors; the standard deviations under the Bayesian model are different from the standard errors under the frequentist model.The differences are expected since the estimates under both approaches are obtained under different estimation methods; the frequentist intervals are estimated via bootstrap method, whereas the Bayesian intervals are obtained from the MCMC simulation.
The smaller standard deviations under the Bayesian regression suggest that the model has more significant estimates.The estimates are statistically significant at all quantiles for all three consumption measures under the Bayesian model, while the frequentist model has several insignificant estimates at several quantiles.
Similar to the Bayesian quantile regression, all three consumption measures under the frequentist quantile regression show larger drops at lower percentiles and smaller drops at higher percentiles (regressive trend).The TE displays the largest drop (22%) at the 10th percentile and the smallest drop (12%) at the 75th percentile.The drop at the 90th percentile is insignificant.The WRE displays the largest drop (60%) at the 10th percentile and the smallest drop (16%) at the 90th percentile, while the NWRE displays the largest drop (11%) at the 10th percentile and the smallest drop (5%) at the median.The drops after the median are insignificant.The drop in WRE also follows a uniform trend, while the drop in NWRE has more variations.

Conclusions
In this study, we applied the Bayesian quantile regression to investigate the consumption drop upon retirement which is an area where, currently, the quantile regression model is of limited utilization.The Bayesian quantile regression has the combined advantages of both quantile regression and Bayesian approach.In particular, the quantile regression is a distribution-free model and robust to data, while the Bayesian approach allows the complete univariate and joint posterior distribution of each parameter to be generated by the MCMC simulations.Our study also compared the estimates from the Bayesian quantile regression with the OLS (mean) and the frequentist quantile regression.We also considered three different consumption measures, namely, total expenditure (TE), work-related expenditure (WRE), and nonwork-related expenditure (NWRE).
The consumption drops in TE, WRE, and NWRE are 14%, 39%, and 5%, respectively, under the OLS (mean model), which agree with studies from Fisher and Marchand [18] and Fisher et al. [15] who found that the WRE has the highest drop, and the NWRE has the lowest drop.The results also prove that a broader measure of consumption may diminish the retirement consumption puzzle, as indicated by the smaller drop in the NWRE.
As expected, the drops in TE, WRE, and NWRE under the Bayesian median regression model ( = 0.50) are quite comparable to the OLS, indicating that the median model may be used as a substitute for the mean model.
The consumption drops upon retirement are statistically significant at all quantiles under the Bayesian quantile regression, where larger drops at lower percentiles and smaller drops at higher percentiles indicate a regressive distributional effect (regressive trend).The WRE shows a relatively uniform drop, while the drops in NWRE have more variations at lower percentiles (from 0.10 to 0.50), indicating that the NWRE drops are the source behind the variations of drops.Our study agrees with Aguila et al. [17] who found larger consumption drops at lower percentiles (regressive trend) but disagrees with Fisher and Marchand [18] who found larger consumption drops at higher percentiles (progressive trend).It should be noted that different data may provide different results.
The smaller drop among higher consumption retirees (or retirees with higher income) under the Bayesian quantile regression imply that the retirees with higher consumption have more savings and/or retirement benefits.The results are consistent with the expectations of life cycle theory which states that higher income households save more than lower income households.The larger drop at lower percentiles under the Bayesian quantile regression implies that the smaller consumption responses (or retirees with lower income) are exposed to larger consumption shocks.
Comparison between the Bayesian and the frequentist quantile regressions shows that the estimates are similar at all quantiles.The main difference between both models is that the standard deviations under the Bayesian model are different than the standard errors under the frequentist model.The differences are expected since the estimates under both approaches are obtained under different estimation methods.

Table 1 :
Summary statistics for HES Sample 2009.

Table 2 :
Regression estimate for mean regression model.

Table 3 :
Regression estimate for Bayesian quantile regression model.

Table 4 :
Regression estimate for frequentist quantile regression model.