Introduction

The second half of the twentieth century is an era of higher education expansion that access to college has been greatly increased in nearly every country in the world (Trow, 1972; Schofer & Meyer, 2005; Shavit et al., 2007; Carnoy et al., 2013). The worldwide average gross enrollment ratio in higher education increased dramatically from 3% in 1950 to 10% in 1970 and to 38% in 2018. The number of college students increased from 29 million to over 141 million in the world between 1970 and 2006 (Freeman, 2010). While most developed countries entered the mass higher education stage before the new century, many middle- and low-income countries have been largely expanding college enrollment in the past decades (Marginson, 2016a; Özoǧlu et al., 2016).

However, how higher education expansion affects educational inequality in access and production remains both theoretically and empirically unclear. On college access, existing literature presents two competing theories. Some researchers argue that expansion is a diversion process of social stratification that working class is diverted to the second-tier universities, while elite institutions remain the bastion of the privileged (Raftery & Hout, 1993; Lucas, 2001). Despite dramatic higher education expansions as well as various federal and state college-enrollment policies, large racial and socioeconomic gaps persist over decades (Perna et al., 2008). Some others insist that expansion brings the inclusion of higher education that the lower class students obtain more college opportunities for social mobility (Arum et al., 2007). On production (i.e., students’ post-college outcomes), as one of the best known evidence in higher education since the seminal work by Weisbrod and Karpoff (1968), we know that college quality substantially improves labor market outcomes and intergenerational social mobility (e.g., Brewer et al., 1999; Black & Smith, 2004; Zhang, 2005b; Dale & Krueger, 2014; Chetty et al., 2017), but we know very little about how policy controlled dramatic increases in enrollment would change college production as measured by a college’s value-added on students’ postgraduation employment and earnings.

This paper presents new evidence on the impacts of enrollment expansion on students’ college access and labor market outcomes by exploiting the world’s largest higher education expansion policy in China from 1999 to 2012. Similar to the global trend, China increased its annual undergraduate enrollment from 1 million to about 7 million in only thirteen years between 1999 and 2012. The organization of higher education also changed dramatically. The expansion was accompanied by differentiation and stratification that a stable higher education hierarchy comes into being (Li, 2004; Yang & Wang, 2020), which resulted in five selectivity tiers of higher education institutions, ranging from the top 39 universities to more than 1,000 three-year vocational colleges in the lowest selectivity tier.

This exogenous, unprecedented national expansion policy enables us to credibly answer three research questions: (1) How does enrollment expansion affect access to (elite) college differently between students from high and low SES families? (2) How does enrollment expansion change college production as measured by a college’s value-added on students’ labor market outcomes? And (3) how do the expansion effects vary between selective and non-selective colleges in a highly institutionally stratified higher education system?

To answer these questions, we combine nationally representative student-level survey data with newly available confidential institution-level data. The National Survey on College Graduates’ Employment, conducted by the Institute of Economics of Education at Peking University, surveyed a nationally representative sample of college graduates bi-annually from 2003 to 2013 (sample size = 113,435). This is the only available multi-period college graduates’ survey data that spanned a long time period during the expansion in China, starting from the first cohort that experienced enrollment expansion. Additionally, we use enrollment and input information on every college from the newly available China Educational Finance Statistics - Postsecondary data from 1998 to 2013, which were provided by the Ministry of Education of China. Using these unique data, this paper presents the first evidence on the differential changes in higher education access, inputs, and production by institutional stratification before and after the expansion policy.

Consistent with theories and the policy practice in other countries (e.g., the recent U.S. enrollment expansion during the Great Recession, Barr & Turner, 2013), our empirical analyses provide compelling evidence and explanations on the increased inequality and stratification in higher education access, resources, and outcomes during enrollment expansion. This paper makes four contributions to higher education policy research. First, we find evidence of the enlarging unequal access to elite higher education for students from different family backgrounds during the enrollment expansion, which supports the effectively or expanding maintained inequality theory (Lucas, 2001; Alon, 2009). These results speak to a large body of literature on higher education enrollment expansion and access in many countries, for instance, the United States (Taubman et al., 1972; Walters, 1984; Juhn et al., 2005; Barr & Turner, 2013; Soliz, 2018), France (Deer, 2005), the United Kingdom (Walker & Zhu, 2008; Boliver, 2011; Devereux & Fan, 2011), Italy (Bratti et al., 2008; Oppedisano, 2011)), Germany (Reimer & Pollak, 2010), Turkey (Özoǧlu et al., 2016), Brazil (McCowan, 2007; Boliver, 2011; Dias et al., 2011), Ireland (McCoy & Smyth, 2011), and China (Luo et al., 2018; Ou & Hou, 2019). In particular, this paper shows new evidence of the expanding socioeconomic gaps in college access in the presence of higher education selectivity stratification. Students from high SES families have substantial advantages in the access to elite higher education. These students are more likely to gain additional access to selective colleges by responding to the expansion policy in academic preparation and college choice behaviors.

Second, we present the first direct evidence of the differential changes in enrollment, inputs, and student outcomes by institutional stratification. Nearly all the previous research on higher education expansion uses national aggregate data or focuses exclusively on the quantity of higher education opportunities. We show how the highly selectivity-based stratified higher education system in China changed over time during the expansion policy with a focus on the differences between selectivity types. Carnoy et al. (2013) note that all the four BRICs countries become increasingly differentiated with an emphasis on strengthening a limited number of elite institutions. We show that the rapid, controlled expansion even differentiated less-selective institutions. Though all types of colleges have experienced similar trends in the reduced per-student inputs (such as student-faculty ratio and expenditures), there is large, persistent differentiation and stratification between college selectivity levels (Kyvik, 2008).

Third, we examine how returns to different selectivity levels of colleges (or college value-added) changed during the expansion. Previous studies present mixed findings. Using the 1940-1990 Census data, Juhn et al. (2005) find that the college wage premium reduced during the U.S. higher education expansion. In the United Kingdom, Walker and Zhu (2008) find no significant fall for men and even a large, but insignificant, rise for women in college premium during higher education expansion. Li et al. (2014) document that college expansion in China sharply increased the unemployment rate among college graduates. Our results indicate persistent or even expanding college selectivity premiums in earnings but declining college selectivity premiums in employment when colleges dramatically expanded their enrollments.

Lastly and most importantly, given that evidence on the impacts of increased enrollment or cohort size on college and labor market outcomes is limited (Stapleton & Young, 1988; Barr & Turner, 2013; Kyui, 2016; Bianchi, 2019), we provide new evidence on the causal effect of enrollment expansion on college quality. The expansion policy in China was unexpectedly initiated by the Central Government. Undergraduate enrollment was suddenly expanded by more than 47% in the first year. Using a difference-in-differences framework based on this natural policy experiment, we compare the variations in college quality by college-cohort that faced different degrees of policy-induced enrollment expansion. On average, enrollment expansion negatively impacted college quality as measured by value-added on graduates’ employment and earnings. Each 100% increase in undergraduate enrollment from 1999 decreases college value-added on wages by about 0.15 standard deviations. These findings are consistent with the evidence from the USA (Stapleton & Young, 1988), Russia (Kyui, 2016), and Italy (Bianchi, 2019). Moreover, we show that this negative shock would be fully compensated by a comparable increase in investments (especially in faculty resources), which is consistent with the resource overcrowding theory (Bound & Turner, 2007; Bianchi, 2019).

Background: Chinese higher education expansion in 1999–2012

There have been three large-scale expansions of college enrollment in the history of Chinese higher education, but the 1958 and 1978 expansions were trivial compared with the 1999-2012 expansion, which is rare in both scale and speed throughout the world. In only thirteen years, undergraduate freshman enrollment increased from 1,083,600 in 1998 to 6,888,300 in 2012. The national gross enrollment ratio more than tripled from 9.76% to 30%. The number of higher education institutions increased from 1,002 to 2,442.

The 1999 higher education expansion started as an exogenous, unexpected national policy. The worldwide higher education has been often driven by factors such as state policies, economic development, and global competition (Marginson , 2016a). It has been well documented that the China Government initiated this radical expansion policy to stimulate domestic consumption and decrease the labor supply of high school graduates as a response to the Asian Financial Crisis in 1997 (Wan, 2006; Wang, 2014). On January 13, 1999, the State Council ratified The Plan to Revitalize Education in the 21st Century and set a target of 15% higher education enrollment ratio by 2010. This was the earliest policy issued by the Central Government to set an explicit target index for the mass stage of higher education. On June 16, 1999, three weeks before the National College Entrance Exam, the State Development Planning Commission and the Ministry of Education jointly announced the decision to enroll 337,000 more freshmen, in addition to the previously announced 230,000 increased admissions quotas in January, into higher education institutions. Since the College Entrance Exam of 1999 was held during July 7–8, 1999, this unexpected expansion of college enrollment was decided and announced only three weeks before the exam and students already submitted their college applications.

Eventually, college enrollment increased by 513,200 in 1999, a 47.4% increase from 1998. Ever since that, undergraduate enrollment increased by more than 400,000 each year, reaching 6.85 million freshmen in 2012 from 1.08 million in 1998. Slots for new entrants had risen by six times over a decade. After thirteen year’s massive increase in college enrollment, the expansion policy ended unexpectedly. On March 16, 2012, the China Ministry of Education announced that “the scale of higher education enrollment needs to be held constant and the enrollment of public four-year institutions will remain unchanged.”

Similar to the higher education expansion experiences in many other countries, the enrollment expansion during only thirteen years has largely reshaped the Chinese higher education system in three primary ways. First, access to higher education has been mechanically expanded at a massive scale. In a few years before 1999, about 35% of high school graduates could enter college. The proportion of college enrollment among high school graduates dramatically reached 56% in 1999. The admission rate continued climbing as the number of high school graduates remained stable in the first few years after 1999. The admission rate reached 75% in 2012 even though families started to respond to the college expansion that more students applied to college. According to Trow (1972) and Trow (1976), a gross enrollment rate between 15%-50% indicates a mass stage of higher education. Chinese government announced in 2010 that the Chinese higher education entered the mass stage with the gross enrollment rate of higher education rising from less than 5% in 1990 to 25% in 2010. The Central Government also targeted that the rate would reach 40% by 2020.

Second, the organizational structures of the higher education system also transformed. The enrollment expansion was through two channels. On the intensive margin, each existing college largely increased its admissions quotas from about 4,000 per college in 1998 to over 10,000 per college in 2012. On the extensive margin, while there were 1,022 colleges in 1998, more than 1,500 new colleges were founded in the next decade. The higher education system in China has stratified into five selectivity tiers of institutions. Among the 2,688 higher education institutions in 2019, at the top are 112 elite public universities of the 211 Program (including the most selective 39 universities of the 985 Program) that receive intensive public funding and policy supports. Next to the elite colleges are non-elite four-year colleges founded before 1999. The fourth tier colleges are newly established public or private four-year colleges in or after 1999. At the bottom of the pyramid are three-year vocational colleges (similar to the U.S. community colleges). The centralized College Entrance Exam system assigns admissions based on students’ one-time test scores and applications. The tier-specific admissions cutoffs further differentiate students into different tiers of colleges that students could only apply to colleges within a tier if they have test scores higher than that tier’s cutoff.

Third, the dramatic expansion in enrollment without sufficient complementary inputs in teaching resources and finance has negatively affected college quality. Faculty-student ratio, a key indicator of higher education quality indicating class size and faculty availability (Black & Smith, 2006; Long, 2010), immediately decreased due to the shortage of the supply of high-quality professors in the short and medium terms. In 2007, the Ministry of Education openly admitted that the decision to expand college enrollment in 1999 was too hasty as investments in finance, faculty, and facility did not keep in pace. Furthermore, college tuition largely increased in response to the exceeding needs of financial revenues; however, financial aid was only accessible to a small number of students and at a low level throughout the decade.Footnote 1

Theoretical framework

The expanded enrollment increased access to higher education. However, not all students would benefit from this policy reform. The maximally maintained inequality (MMI) theory and the effectively maintained inequality (EMI) theory have been widely applied to examine the impacts of higher education expansion on the inequality in college access (Raftery & Hout, 1993; Lucas, 2001). The MMI theory suggests that class barriers will persist until privileged families have attained the desired level of education (a saturation point). Moreover, the EMI theory argues that, even a saturation point is reached, the social background-related inequality will persist as privileged groups use their status advantages to secure quantitatively similar but qualitatively superior educations. Alon (2009) extends the two theories into three prototypes of class inequality - effectively maintained, declining, and expanding - according to the degrees in admissions competition and the adaptation of the privileged to exclusion barriers. In a highly stratified higher education system where competition for access to college (particularly elite college) is high, and high-SES families heavily invest in preparing their children to meet the exclusion criteria, the class divide in higher education is then effectively expanding inequality.

The China case fits the effectively maintained/expanding inequality theory. First, higher education was not universal before and even after the expansion. The college attendance rate was low prior to the expansion. For example, the gross enrollment rate was 9.79% in 1998. Given the highly stratified higher education system, the level of competition in admissions remained high after the expansion. Second, the centralized college admissions system in China relies solely on the College Entrance Exam, which “restrict(s) access to selective institutions and magnify qualitative differences” (Alon, 2009, p.735). As students from low-SES backgrounds on average have lower scores, they are at a disadvantage even when selective colleges increased slots. Third, the increase in college tuition and fees during expansion enlarged the class divide due to the gaps in the ability to pay for college. In addition, the cumulative effect of many inequalities before college (Wu, 2019) would also add to the socioeconomic gaps, such as informational and behavioral barriers (see summaries in Page & Scott-Clayton, 2016), proximity to college opportunities (Hillman, 2016), and heterogeneous college-major choice preferences due to limited information or socioeconomic background (Teranishi et al., 2004; Wells & Lynch, 2012; Hoxby et al., 2013).

Incorporating these macro-level factors into the college choice model that students make choices to maximize their expected returns to college (Manski et al., 1983; Perna, 2006; DesJardins and Toutkoushian, 2005), we hypothesize [1-3] the following: (1) students from high-SES families are at an advantage in access to higher education in all years during the expansion; (2) the advantage increases with college quality tiers; and (3) the advantage increases over time as high-SES families respond to the policy change by increasing test-preparation investment in K-12 education.

The next research question of interest is to examine how enrollment expansion affects higher education production. An extensive literature has examined college quality and its impacts on student outcomes (e.g., Brewer et al., 1999; Dale & Krueger, 2002, Zhang, 2005b; Black & Smith, 2006; Dale & Krueger, 2014; Chetty et al., 2017). All these studies build on the standard education production function: An individual’s outcome (e.g., earnings) is a function of the college quality and her own pre-college characteristics. College quality measures the differences in college-level value-added or production to students’ outcomes, which is attributable to various college inputs, including measurable resources, faculty, peers, and other unmeasured inputs. We hypothesize [4] that the enrollment expansion decreased college production by changing the quality of inputs. Because the expansion was completed in a very short period and without massive complementary supply, the dramatic increase in enrollment largely reduced per-student resources (as “cohort crowding” in Bound & Turner, 2007). In particular, since the supply of high-quality faculty members was fixed in the short term, faculty-student ratio - a key college input - declined during the expansion that might have resulted in the popularly criticized “college credential inflation” in recent years (Levin & Xu, 2005). The other important change in inputs was peer quality. Given the fixed supply of college applicants in the short term, increasing enrollment would mechanically decrease the average peer quality (similar to the US experience, Taubman et al., 1972), which might also negatively impact college quality.

However, it is an empirically open question how college quality changed during the expansion as China started to invest heavily in higher education at the same time, particularly to elite universities (Carnoy et al., 2013; Wang, 2014; Yang & Wang, 2020). Given the inequality in investment and the high selection in the College Entrance Exam, we hypothesize [5] that the stratification in higher education institutions would persist and even enlarge; but the negative shock in college quality by increased enrollment in those non-selective colleges might not be sufficiently compensated by investment.

Regarding the policy impact heterogeneity, the rising selectivity in the elite colleges would serve as “the great equalizer” (Hout, 2012; Zhou, 2019). We hypothesize [6] that students benefit equally from attending a small number of elite colleges (Hoxby, 2009). Moreover, “hard science majors” with higher average costs were more difficult to expand (Hemelt et al., 2020) and in practice less likely to expand enrollment, we hypothesize [7] that engineering-focused colleges had higher college quality premiums during the expansion.

Data and descriptive evidence

College-level data

We use college-level data from the newly available China Educational Finance Statistics-Postsecondary data (CEFS-Postsecondary) from the Ministry of Education and the Ministry of Finance in China. The CEFS-Postsecondary data, similar to the U.S. Integrated Postsecondary Education Data System (IPEDS), contain yearly information of every higher education institution in China except for military and Sino-foreign joint institutions. We primarily use the following three types of data. Enrollment data include end-of-year and beginning-of-year unduplicated headcount of students, separately for undergraduate and graduate students. Human resources data measure the number of employees by faculty status and retirement status. Finance data include revenues by source and expenses by function. As one of the few researchers with access to the confidential NEFS data, by data agreement, we only have access to data in 1998 and the years that match the enrollment years in the student survey.

The focus of this paper is the stratification of higher education institutions: top, elite, four-year (existing prior to 1999), four-year (new), and vocational colleges. We use additional information from the Ministry of Education to identify the types of colleges. First, we obtain the identifiers of the top (39 universities in the 985 Program) and elite (73 universities in the 211 Program but not in the 985 Program) colleges, as well as four-year colleges and three-year vocational colleges from the institutional characteristics catalog. Next, we append the yearly report list of newly built colleges or renamed colleges from 1998 to 2012 to identify four-year colleges that were built before or after the expansion.

We use three primary college inputs: total expenditure, operating expenditure, and personnel expenditure (i.e., faculty salary). All of these variables are measured as per student each year. We use per-student faculty salary information to control for both the quantity and quality of faculty inputs. Results are similar but nosier when using faculty-student ratio as the input measure. As will be discussed below, the small number of college-year observations in the survey data limit the analysis on a wide array of college-level characteristics. However, existing literature shows that these three input measures are among the key college quality proxies (Dale & Krueger, 2002; Black & Smith, 2004).

Student-level data

Student-level data are from the National Survey on College Graduates’ Employment (NSCGE) by the Institute of Economics of Education at Peking University. The NSCGE surveyed a nationally representative sample of college graduates in June of 2003, 2005, 2007, 2009, 2011, and 2013, which covered all tiers of higher education institutions in China. See details about the stratified probability sampling at the college-major level in 2003-2009 in Yue (2015). The later two rounds followed the same survey designs. Similar to other college student surveys (e.g., the China College Student Survey in Luo et al. (2018), the Beijing College Students Panel Survey in Wu (2019)), the NSCGE used a two-stage stratified sampling process. First, colleges were randomly selected within strata by geographic area, type, and institutional tiers. There were 45 colleges in the 2003 survey and about 30 colleges in the subsequent surveys. The college characteristics within strata are comparable across years. Second, 500-1000 senior students were randomly selected from each sampling college. Table 1 reports the number of respondents by survey year, which varies due to different colleges sampled in different years. The 18,467 respondents in 2003 represent about 1% of the two million graduates. Since only about 4% of students attended elite institutions, these institutions were purposely over-sampled (lower-ranked colleges were under-sampled): 15% top college students, 10% elite, 30% four-year (existing), 20% four-year (new), and 15% three-year.

The NSCGE data include detailed student-level information on demographics, family background (parental education and occupations), pre-college and college experience, and early labor market outcomes upon graduation. Importantly, the survey questionnaire design and data structure of the six surveys are comparable. The code system of various categorical measures remained consistent in the 14 years, such as the definition of employment, parental occupations, and family residence levels (metropolis, medium city, rural county, rural). We match the survey sample to the institutional data by college and year. The final cross-sectional sample of six cohorts includes 113,435 student-level observations (393 observations from the original sample were dropped due to non-match) in 259 college-year cells (176 unique college observations).

The NSCGE data are the only available multi-period, nationally representative random sample survey data. The survey spanned a long time period during the higher education expansion in China, starting from the first cohort that experienced enrollment expansion. With detailed information on students’ college attendance and postgraduation outcomes, we are able to estimate the policy impacts on both access and production. However, we cannot identify the sharp changes before and after the expansion policy started in 1999 as we do not have data on students who enrolled in college before 1999; instead, we examine the changes in access and production during the expansion.

We measure the quality-differential access to higher education by examining the socioeconomic gap in the probability of attending colleges at the five stratified selectivity tiers. Using information from detailed parental education (7 categories) and occupations (11 categories), we define a student is from an “upper SES” household if she is (1) not a first-generation college student (i.e., parents do not have a college degree) and (2) with parents having upper-level jobs including managers and government officials.

The production outcome of interest is initial wage, measured as the monthly self-reported wage in a student’s first job upon graduation. Wage is the most widely used measure of college student labor market outcomes (e.g., Zhang, 2005b; Gaertner et al., 2014; Chetty et al., 2017). There are two concerns regarding the wage data. First, the wage information might be censored as students with extremely high or low wages were usually less likely to report their true wages. We censored the data in the range of 500-20,000. Results are robust when we vary the top and bottom limits. We also perform robustness checks using the Tobit model. Results are qualitatively unchanged compared with the OLS estimates.

Second, though most of the students who had received job offers (99.5%) reported their initial wages, those who had pending decisions or in self-employment at the time of survey did not report this information. Moreover, many students choose to continue graduate studies and college quality matters in their decisions (Zhang, 2005a; Posselt & Grodsky, 2017). To supplement the wage analysis, we construct several “employment” indicators that include or exclude those students and found similar results: (1) a dichotomous variable indicating nonzero wage, (2) a dichotomous variable indicating having received a job offer or graduate school admissions, and (3) a dichotomous variable indicating non-unemployment that includes all types of employment and graduate studies. We do not separately analyze graduate studies. This outcome measure is imprecise as we do not have information of which graduate school a student was going to attend. Graduate schools are also highly stratified in China as in many other countries (Posselt & Grodsky, 2017). Simply looking at whether a student attended graduate school will mask the quality differentiation.Footnote 2

Descriptions of the higher education expansion

Previous research has already shown the sharp increase in the total college enrollment from 1999 using aggregate data from the China National Bureau of Statistics (Yeung, 2013; Li et al., 2014; Ou & Hou, 2019). We present the first evidence on the differential changes in enrollment, inputs, and student access outcomes by institutional stratification.

Figure 1 shows the enrollment expansion from 1997 to 2013 using the CEFS-Postsecondary data. Overall, there was little change prior to the expansion policy in 1999. Panel (a) shows the extensive margin of expansion that the numbers of both four-year and three-year vocational colleges increased dramatically after 2001, which took about two years to build new colleges since the expansion policy started. The number of three-year vocational colleges experienced a much larger increase than that of four-year colleges. Panel (b) shows the difference in the intensive margin between four-year and three-year colleges. The average college size as measured by the number of undergraduate students increased immediately after 1999. While four-year colleges expanded enrollment more than three-vocational colleges at the absolute level, both types of colleges nearly tripled their numbers of undergraduate students.

Fig. 1
figure 1

College enrollment expansion at the national scale. Notes: This figure describes the college enrollment expansion at the national scale, and by institutional selectivity in China using the China Educational Finance Statistics - Postsecondary data (CEFS-Postsecondary) compiled by the China Ministry of Education and the China Ministry of Finance. Panel (a) plots the number of four-year universities and Three-year Vocational colleges from 1997 to 2013, the extensive margin of expansion. Panel (b) plots the number of undergraduate students per college, the intensive margin of expansion. Panel (c) plots the number of undergraduate students in total. Panel (d) plots the number of undergraduate students per college. Top includes the 39 universities in the “985 Program.” Elite includes the 73 universities in the “211 Program” but not in the “985 Program.” Four-year (existing) includes four-year universities that were founded before 1999. Four-year (new) includes four-year universities that were founded in or after 1999, some of which were three-year vocational colleges before 1999. Three-year vocational includes all the three-year vocational colleges. The vertical gray line indicates the year 1999 when the expansion started

Combining the increases at the extensive and intensive margins, Panel (c) plots the total number of undergraduate students by the stratified institutional selectivity. The small number of top and elite colleges were relatively stable in their shares of undergraduate students, while the other three lower-tier colleges greatly increased their total numbers of undergraduate students. On average, as Panel (d) suggests, there was clearly stratification in college size that higher-tier colleges had more undergraduate students and all types of colleges increased their enrollment sizes during the enrollment expansion policy.

Meanwhile, there were also large changes in higher education inputs. As discussed earlier, student-faculty ratio, a proxy of college instructional quality, immediately increased due to the fixed faculty supply in the short term. Panel (a) of Fig. 2 shows the changes in student-faculty ratio by institutional stratification. All colleges experienced immediate, large increases in the number of students per faculty after 1999. Lower-tier colleges had a much higher student-faculty ratio that also increased at a higher rate, partly due to their limited financial resources as suggested by the differences in per-student expenditures by institutional tiers in Panel (b). The top and elite colleges had largely increased their spending levels, which may have compensated the negative shocks of enrollment expansion on college quality. Given the limited resources, lower-tier colleges that experience rapid increases in enrollment, were more likely to experience a large decline in their college quality.

Table 1 presents the summary statistics of student demographics and labor market outcomes by enrollment cohorts in the student survey data. Though sampling weight is not available for analysis, the unweighted statistics show important information of changes in Chinese higher education given the representativeness of the sample. The first panel presents the mean proportion of students from high-SES families in all types of colleges, which were quite stable during the expansion. Panels (a) and (b) of Fig. 3 show the highly stratified access to different institution selectivity tiers and the average proportion of high-SES students in selective institutions increased in the early years after the enrollment expansion started, which supports the effectively expanding inequality theory. The second panel of Table 1 suggests that there is no gender difference in the access to college, but fewer ethnic minority students gained access to college in later years. The decline of students with low-SES or ethnic minority backgrounds motivated China to implement a set of affirmative action admissions policies (Zhou & Hill, 2009).

The last panel of Table 1 reports the mean labor market outcomes by cohorts. The average starting monthly wage slightly increased from about $300 in 2003 to $350 in 2011, and a large jump to $450 in 2013. This rise of wage should not be interpreted as causal results of the enrollment expansion but is mainly driven by the economic growth changes in China (Li et al., 2017), which will be controlled for by year fixed effects in the estimation models. About half of the college graduates reported to have nonzero wages. Exceptions are in year 2003 (for the 1999 cohort) when the SARS delayed students job search and in year 2009 (for the 2005 cohort) when the Financial Crisis of 2008 negatively shocked the labor market. About another 20% of students continued graduate studies and 10% of students had pending job offers by the time of graduation or chose to be self-employed. These unweighted average employment outcomes are consistent with national statistics provided by the Chinese Ministry of Education.

Unequal access to (elite) higher education

We use a multinomial logistic regression to model the types of colleges that a student enrolled at, including top, elite, four-year (existing), four-year (new), and three-year vocational colleges. The probability of a student enrolling at college type i (vs. three-year vocational colleges as the reference group, \(Y=1\)) \(\mathbf {P}(Y=i)\) is:

$$\begin{aligned} \mathbf {P}(Y=i) = \frac{exp(\alpha _i Upper \ SES + X \beta _i)}{\sum _{k=1}^{J} exp(\alpha _k Upper \ SES + X \beta _k)}, \end{aligned}$$
(1)

where X is a vector of covariates, including gender, race, age, family residence, and enrollment year fixed effects. The parameter of interest \(\alpha _i\) estimates the difference between students in upper SES families and those in lower SES families in the log of relative risk ratios of enrolling at college type i compared with enrolling at the reference group - three-year vocational colleges.

In addition to estimating the overall SES gap during the entire college expansion period, we also estimate the heterogeneous trends overtime between upper SES students and their counterparts. Theoretically, the highly hierarchical categories of institutional selectivity from least to most selective can be modeled using an ordered logit model that will be more efficient than a multinomial logit model. However, similar to many US studies that use multinomial instead of ordered logit regression, violating the parallel slopes assumption makes the ordinal model is not appropriate (Perna & Titus, 2004; Rodriguez et al., 2018).

Table 2 presents the multinomial logit model results. The results provide clear evidence of the unequal access to elite higher education during the enrollment expansion. Each panel reports estimates from a separate model, and each column reports estimates for one college category i within a regression. Panel A estimates the basic model in Eq. 1. Relative to students from lower SES backgrounds, upper SES students had a statistically significantly higher probability of enrolling at a more-selective college compared with enrolling at a three-year vocational college. This SES gap is much larger in selective college opportunities. For instance, compared with lower SES students, upper SES students had a 217% (=\(exp(1.154)-1\)) higher probability of enrolling at the top colleges compared with enrolling at a three-year vocational college. The difference between three-year vocational colleges and four-year colleges newly built after 1999 was smaller, but still economically and statistically significant.

In Panel B, we add the interaction terms between the upper SES indicator and enrollment cohort indicators to examine the heterogeneous time trends. Coefficients on “upper SES” show that the SES gap existed in the first year of expansion. Nearly all of the coefficients on the interaction terms are positive, which suggests that the SES gap was likely to increase over time during the entire expansion period. The gap largely widened for the 2009 freshman cohort. One potential reason is that most of Chinese provinces changed their college admissions mechanisms that students from upper SES families may have better information and strategy to gain the entry into more selective colleges, even holding College Entrance Exam scores equal (Ye, 2018; Ha et al., 2020).

To increase power by reducing the number of parameters, Panel C fits a linear year trend model. Panels A and B of Fig. 3 show that a linear year trend model is a good approximation of the changes in access to different types of colleges. Results show a similar story that the SES gap expands over years. Finally, we decompose the SES gap into the two dimensions: (1) parents had manager-/government-level jobs, and (2) parents received college degrees. Consistently, advantaged students had a much higher chance of entering elite higher education, which increased with the expansion of the slots for new college entrants.

We should note that we do not have data on students’ College Entrance Exam scores. Therefore, as discussed in the theoretical framework, the estimated SES gap in the access to elite higher education consists of differences in both academic achievements and other factors between high- and low-SES students. However, the estimated SES gap, which meaningfully captured the inequality in college opportunities, would not be entirely attributed to the difference in academic achievements. First, due to the sudden, exogenous policy change, students’ academic achievements were not able to respond to the enrollment expansion sharply at least in the short term. Second, even having the same test scores, disadvantaged students were still less likely to enter colleges (especially elite colleges) due to college choice preferences and informational barriers (Li et al., 2015; Ye, 2018).

Stratified production during higher education expansion

Measuring the impacts of college quality on labor market outcomes

Following the long literature that measures the impacts of college quality on labor market outcomes (Brewer et al., 1999; Black & Smith, 2004; Zhang, 2005b; Long, 2010; Borgen, 2014; Dale & Krueger, 2014), we use the following OLS model:

$$\begin{aligned} Y_{ijt}=\beta * X_{ijt} + \gamma _{jt}+ \delta _t+\varepsilon _{ijt} \end{aligned}$$
(2)

where \(Y_{ijt}\) is the labor market outcome for student i who graduated from college j in year t. Note that the data are cross-sectional that we can only observe each student once. The key parameter of interest \(\gamma _{jt}\) measures college j’s impact on the labor market outcomes of students in cohort t, holding a set of pre-college covariates \(X_{ijt}\) (gender, race, age, family residence) and common time shocks \(\delta _t\) constant. \(\varepsilon _{ijt}\) is a stochastic term representing unmeasured impact factors of labor market outcomes.

Unlike the studies that identify the impacts of institutional quality characteristics, we focus on identifying \(\gamma _{jt}\), as similar to a value-added model (Hanushek & Rivkin, 2010; Cunha & Miller, 2014; Chetty et al., 2014a). We then study how college quality \(\gamma _{jt}\) changed over time during the enrollment expansion. To address the potential omitted variable bias, we include flexible controls of family backgrounds by adding three-way interactions of detailed parental education levels (8 levels), parental job categories (12 levels), and family residence urbanicity levels (4 levels). These flexible controls are sufficient statistics for the inequality in family resources and individual earnings potential (Wu, 2019). Furthermore, since the Chinese centralized college admissions system requires students to submit college-and-major admissions, we also include major categories that students enrolled at to control for preference heterogeneity.

To present the evidence of stratified higher education production, we estimate Eq. 2 by replacing the college-year fixed effects with college group-year fixed effects in \(\gamma _{jt}\). In Table 3, we start with a baseline model that does not allow the college group fixed effects to vary across years in column (1). Results show that, on average, higher education institutional selectivity is statistically significantly and positively correlated with an increase in postgraduation earnings. For example, students graduating from the top colleges earned 92% (\(=exp(0.654)-1\)) more than students graduating from three-year vocational colleges. Similar to the differential access to college, the differences between less selective four-year colleges and three-year colleges are smaller, but still statistically significant. In column (2), we add the linear time trends interactions to examine how the college selectivity/quality premium changed overtime during the expansion policy. We find that the stratified production existed from the first year of higher education expansion, particularly between selective and non-selective colleges. The coefficients of the linear time trends interactions are positive, which suggests that the stratification in production might have expanded, although imprecisely estimated. For instance, the gap in log of monthly wage between elite and three-year vocational colleges was 0.213 for the 1999 freshman cohort, which increased by 0.043 (\(p<0.05\)) each year and nearly doubled for the 2009 cohort. We further control for detailed family background information and major groups in columns (3) and (4) to minimize potential omitted variable bias. Estimates remain unchanged.

Panel (c) of Fig. 3 shows the results using the full model of Eq. 2 with 30 parameters of \(\gamma _{jt}\) (6 cohorts and 5 college selectivity categories; three-year vocational in 1999 is normalized as the reference point). Consistent with the estimates as discussed above in Table 3, the more selective colleges had higher value-added in boosting students’ initial wages upon graduation. Notably, three-year vocational colleges experienced a large decrease in their value-added quality in the first few years of the expansion. These vocational colleges increased students’ wages more than most of the four-year colleges at the beginning of the expansion. This might be because that vocational colleges were better at preparing students for the labor market employment. But this advantage quickly disappeared as all colleges expanded their enrollments.

Columns (5)-(7) in Table 3 report the college value-added estimates on postgraduation employment, using different measures of employment from including nonzero wages only to including employment, graduate studies, and self-employment. The institutional stratification in employment persisted during the expansion. The stratified gaps reduced between students in selective institutions and those in less selective institutions, particularly in initial labor market employment (in column 5). Panel (d) of Fig. 3 shows the time-series changes. Consistently with the theoretical framework and numerous anecdotal evidence, students from all types of colleges, except those in four-year newly built colleges, experienced drops in employment in the short term after the expansion policy as it was too short for colleges to invest and adjust for the dramatic enrollment increase (Li et al., 2014).

We also explore the heterogeneity in the estimated college value-added (results are available in an appendix table). Overall, there were no large differences between high SES and low SES students. Nevertheless, if students from low SES background could make their way into the top colleges, they might benefit more than higher SES peers. In contrast, high SES students benefited largely from four-year newly built colleges compared with enrolling at three-year colleges. Female students were more likely to benefit from attending high-quality colleges, but the gender gap reduced during the expansion period. Finally, engineering-focused colleges across all the institutional selectivity levels had a higher wage premium than non-engineering-focused colleges. This is because engineering-focused colleges have higher average costs and are less likely to expand their enrollment sizes.

We acknowledge several estimation limitations. Potential omitted variable bias might rise as we are not able to control for the College Entrance Exam scores. Using a rich set of pre-college covariates, the identification assumption is to compare students with exactly the same demographics and family background, and studying the same major, but enrolling at different colleges, which might be close-to-random due to the exam score-based admissions and dramatic changes in admission quotas each year. The remaining potential upward bias should be minimal. Furthermore, the key research question is how different types of colleges had stratified production trends. The potential bias needs to be sufficiently large to reverse the conclusions, which is very unlikely. To formally test this argument, we conduct the bounding analysis proposed by Oster (2019) and the results are very robust. Furthermore, results are qualitatively unchanged when we control for college entrance exam scores for the 2003 and 2005 cohorts that we have the relevant information.

Estimating the impacts of expansion on college quality: a natural experiment

The final analysis is to examine how much the changes in college quality \(\gamma _{jt}\) could be causally attributed to the rapid enrollment expansion. Using the exogenous expansion policy as a natural experiment, our research design compares the variations in college quality by college-cohort that faced different degrees of policy-induced enrollment expansion. We estimate the following OLS regression:

$$\begin{aligned} \hat{\gamma _{jt}} = \pi * \Delta Enrollment_{jt}+\lambda *Z_{jt}+\nu _{jt} \end{aligned}$$
(3)

where \(\hat{\gamma _{jt}}\) are college-cohort value-added estimates from Eq. 2, standardized within the whole college-cohort sample. \(\Delta Enrollment_{jt}\) is the relative changes in undergraduate enrollment in college j of cohort t compared with the enrollment of the 1998 cohort. We also add additional controls \(Z_{jt}\), including enrollment in 1998, and year and college group fixed effects. We also control for expenditures to test the underlying mechanisms. The identification assumption of \(\pi\) is that, conditional on observables, the enrollment change was not associated with other college unmeasured variables that affected college quality. One possible example is that colleges might endogenously set their annual enrollment quotas. However, colleges in China have very little autonomy to change their enrollment plans, which are determined by the Ministry of Education and state-level Department of Education. Furthermore, as a national policy, the expansion policy was implemented so dramatically and unpredictably that colleges could not endogenously respond to the policy changes. We have matched 154 college-year observations in the estimated college quality \(\gamma _{jt}\) and spending data. The unmatched cases are mostly due to non-reporting in the spending data, particularly in the early years. However, we don’t find systematically non-random unmatch problems.

Column (1) of Panel A in Table 4 examines the raw correlation between enrollment expansion and the estimated college value-added on the outcome of initial wages, which suggests that college quality was negatively correlated with the increase in undergraduate enrollment. Results on other value-added outcomes and using different measures of the enrollment expansion are quite consistent that we do not report all of those estimates due to the page limit. Without controlling for any other covariates, each 100% increase in undergraduate enrollment from 1998 is associated with a 0.069 standard deviations decrease in college quality. The estimated negative impact of enrollment expansion enlarges and becomes statistically significant when we exclude differences between years in Column (2).

Column (3) uses our preferred specification that controls for year fixed effects and the original college size in 1998. Results show that each 100% increase in undergraduate enrollment from 1999 decreases college value-added on wages by about 0.143 standard deviations. Since the average college size nearly tripled as shown in Table 1, the results imply that the average college quality would drop by about 0.43 standard deviations if there were no complementary investments in higher education. The estimates are qualitatively unchanged when we compare colleges within institutional stratification in Column (4).

We further examine how inputs in higher education mitigate the negative shocks of expansion on college quality in Panel B. All the specifications control for enrollment in 1998 and year fixed effects as in Column (3). Results in Columns (5)-(8) consistently show that the impact of enrollment expansion could be fully compensated by the increased investment, particularly the investment in faculty. For example, Column (6) suggests that each 100% increase in personnel expenditure per student - capturing both quantity and quality of faculty resources - is statistically significantly associated with a 0.755 standard deviations increase in college quality. Note that this is not a causal estimate since the spending might be correlated with unobservables. Holding per-student personnel spending equal, enrollment expansion did not affect college quality at all. Column (8) shows that investment in faculty is far more important than investment in operation to have impacted college quality, although the correlation coefficient of the two variables is 91%.

We have presented compelling evidence that the enrollment expansion might not negatively affect college quality if the college investments (especially in faculty resources) kept the same pace. However, as we show earlier that many colleges were limited to increase such investments, and the public funding has been concentrated on the selective institutions in China (Wang, 2014; Yang & Wang, 2020), the increase in enrollment and the differential college investments might have contributed to the expanding stratification in higher education. In Panel C, we explore the differential changes across college selectivity levels. To ease comparison, we report the main effects for each college group rather than interaction terms. Column (9) shows that the enrollment expansion had much larger negative impacts on less selective colleges. This college stratification gap could be nearly explained by the resource gap, as suggested by the results in Column (10). If less selective colleges had the same increases in inputs, the enrollment expansion would not have largely reduced the college quality. Column (11) shows that the marginal effect was about 40% larger in engineering-focused colleges than that in non-engineering-focused colleges. Similarly, holding college investments equal, the negative shocks by enrollment expansion were fully compensated.

Our analyses have limitations. First, as the National Survey on College Graduates’ Employment changed the survey institutions each year, we cannot use a college-cohort panel data that further eliminate time-invariant factors. An alternative approach is to collapse data to college type-cohort and estimate a panel data fixed effects model. Estimates are similar but under-powered due to the small number of observations. Second, we can only observe a student’s initial wage. Different colleges may have different value-added on wage growth (Thomas & Zhang, 2005). However, entry conditions (e.g., initial earnings) have lasting impacts on long-term career success (Oyer, 2006; Altonji et al., 2016). Future research can look at these long-term impacts.

Discussion and conclusion

Over the past decades, enrollment expansion has been an important, common national policy to increase access to higher education opportunities all over the world. A key question among researchers, policymakers, and practitioners is whether and how expansion affects inequality in the access to higher education as well as the production of higher education (Arum et al., 2007). Using unique institution-level and student-level data, this paper presents new evidence on how expansion affects access, resources, and production of higher education by studying the dramatic higher education expansion in China from 1999 to 2012.

Similar to the higher education expansion experiences in many other countries, higher education expansion in China creates new college opportunities but does not reduce the inequality in the access to (elite) college. Students from high SES families are persistently at an advantage in college opportunities. Moreover, expansion is often accompanied by differentiation and stratification of higher education institutions. Without sufficient per-student investments and resources, particularly in less selective colleges, expanding enrollment may have diminished a college’s value-added quality. The policy practice in China shows that the inequality in access between high- and low-SES students and the stratified production between colleges persisted or even enlarged during the dramatic expansions.

These findings have important policy implications. Colleges have long been seen as “the great equalizer,” which motivates the policy option that an expansion in higher education would promote social mobility. However, simply expanding the pool of college students is unlikely to boost intergenerational income mobility (Zhou, 2019). Our findings of the unequal access to elite colleges during the expansion policy are consistent with studies conducted by other Chinese scholars using various data sources and methods, including Liu (2006) using the Chinese General Social Survey (CGSS) data of 2003, Ding (2007) using data from urban household surveys conducted by the National Statistics Bureau of China in 1991 and 2000, Li (2010) using 1% census data in 2005, Ye and Ding (2015) using data from college student survey in Beijing in 2011, and Luo et al. (2018) using the 2012 China College Student Survey. The persisting and expanding gap in the access to elite college opportunities suggests that students from low SES families may not have fully benefited from the expansion policy. Other college-enrollment policies should be considered in order to expand college opportunities, including promoting academic preparation in K-12 school period, expanding financial aid, and providing information and guidance in college applications (Perna et al., 2008; Page & Scott-Clayton, 2016). In recent years, China has been implementing special affirmative action programs for rural students in low-income regions that have the potential to expand elite higher education opportunities for those disadvantaged students.

Beyond access, for those students who have already made their way into selective institutions, the possible diminished value of college induced by enrollment expansion may reduce college premiums in labor market outcomes. In this paper, we have shown that expanding enrollment without sufficient per-student resources would have harmed college quality and therefore student outcomes. In response to the possible negative impacts of college production, China ended the expansion policy in 2012 in order to “improve higher education quality.” Two years later, the Chinese Ministry of Education decided to re-structure the higher education with a focus on shifting more than 600 local four-year colleges to “modern vocational education,” most of which are newly built colleges during the expansion policy. These policy changes are consistent with the implications in this paper: Policymakers and college administrators have to make sure that institutional resources per student (e.g., faculty and facility) will not decline during enrollment expansion (Bound & Turner, 2007; Bound et al., 2010).

In the most recent years, China has been using the expansion policy again to mitigate the supply pressures in the labor market. The three-year vocational colleges expanded enrollment by 1.16 million in 2019 and will increase enrollment by another 2 million in the next two years. Graduate programs also expanded enrollment by 189,000 in 2020 in response to the COVID-19 pandemic. While both expansion policies mentioned “qualitative expansion” that implies a series of policies would be used to ensure that the higher education quality will not decline, future research is needed to evaluate how these new expansion policies will affect Chinese higher education.

Finally, from the international comparative perspective, lessons from the Chinese higher education expansion show that simply expanding higher education enrollment is not an optimal development strategy and will not solve the unequal college access problem. Rather, without complementary policies and inputs, there are likely widened SES gaps in access and decreased college production. Moreover, the expansion of higher education in both developed and developing countries has been changing the landscape of global higher education market. In particular, colleges in developed countries are competing heavily for international students from developing countries (Chellaraj et al., 2005; Freeman, 2010; Bound et al., 2016). Future work is needed to examine how the worldwide enrollment expansion affects the supply and demand in the global higher education market.

Fig. 2
figure 2

Changes in higher education inputs during the expansion. Notes: This figure describes the changes in higher education inputs (student-faculty ratio in Panel a); total expenditure per student in Panel b) during the expansion by institutional selectivity using the national data

Fig. 3
figure 3

Stratified access and production in higher education by institutional selectivity. Notes: The top two panels show the share of college enrollment from upper SES family background by institutional selectivity using data from the National College Student Graduation Survey, 2003–2013. Panel (a) presents the three-group comparisons and Panel (b) presents the five-group comparisons. The bottom two panels plot the estimated college value-added across years by institutional selectivity, separately for outcomes of initial wages in Panel (c) and initial employment in Panel (d). To foresee the yearly trends, each line is plotted using the locally weighted smoothing approach

Table 1 Summary statistics
Table 2 Stratified access to higher education during expansion
Table 3 Stratified production of higher education during expansion
Table 4 Impacts of enrollment expansion on higher education production (value-added on wage)