Evolution of the Exoplanet Size Distribution: Forming Large Super-Earths Over Billions of Years

The radius valley, a bifurcation in the size distribution of small, close-in exoplanets, is hypothesized to be a signature of planetary atmospheric loss. Such an evolutionary phenomenon should depend on the age of the star–planet system. In this work, we study the temporal evolution of the radius valley using two independent determinations of host star ages among the California–Kepler Survey (CKS) sample. We find evidence for a wide and nearly empty void of planets in the period–radius diagram at the youngest system ages (≲2–3 Gyr) represented in the CKS sample. We show that the orbital period dependence of the radius valley among the younger CKS planets is consistent with that found among those planets with asteroseismically determined host star radii. Relative to previous studies of preferentially older planets, the radius valley determined among the younger planetary sample is shifted to smaller radii. This result is compatible with an atmospheric loss timescale on the order of gigayears for progenitors of the largest observed super-Earths. In support of this interpretation, we show that the planet sizes that appear to be unrepresented at ages ≲2–3 Gyr are likely to correspond to planets with rocky compositions. Our results suggest that the size distribution of close-in exoplanets and the precise location of the radius valley evolve over gigayears.


Introduction
By far the most intrinsically common planets known are small (<4 R ⊕ ), close-in (<1 au) planets. NASA's Kepler mission (Borucki et al. 2010) revealed the surprising abundance of these planets; some 30%-60% of Sun-like stars host a small, close-in planet, depending on assumptions about the intrinsic multiplicity and inclination dispersion within planetary systems (Fressin et al. 2013;Petigura et al. 2013;Zhu et al. 2018;He et al. 2019). An enduring mystery posed by small planets is how some accreted sizable atmospheres while others appear to have avoided runaway accretion altogether (e.g., Ikoma & Hori 2012;Lee et al. 2014;Lee & Chiang 2016). Oftentimes Kepler multiplanet systems host planets both with and without atmospheres, in some cases separated from one another by only a hundredth of an astronomical unit (Carter et al. 2012).
Recent progress in understanding small planets has been fueled by improved precision in stellar and planetary parameters. Through homogeneous spectroscopic characterization of >1300 Kepler planet hosts, the California-Kepler Survey (CKS; Johnson et al. 2017;Petigura et al. 2017) revealed that the size distribution of close-in (P < 100 days) small planets is bimodal, with a valley in the completeness-corrected radius distribution between 1.5 and 2 R ⊕ . The radius valley is widely believed to be a signature of atmospheric loss. This belief is bolstered by determinations of planet densities on either side of the valley; planets below the valley, dubbed super-Earths, have densities consistent with a rocky composition, while planets above the valley, known as sub-Neptunes, require atmospheres of a few percent by mass to explain the low measured densities (e.g., Weiss & Marcy 2014;Rogers 2015). In the atmospheric loss model, some fraction of super-Earths are the remnant cores of planets that shed their primordial envelopes, which potentially alleviates the issue of neighboring planets with dissimilar densities (e.g., Lopez & Fortney 2013;Owen & Morton 2016). While exploration of the radius valley among planets orbiting low-mass stars has provided support for an alternative hypothesis (formation in a gas-poor disk without the need for atmospheric loss; Cloutier & Menou 2020), atmospheric erosion remains the leading theory for planets around Sun-like stars.
Atmospheric loss requires energy. Energy deposited into a planet's atmosphere from an internal or external source can heat gas to velocities exceeding the planet's escape velocity. External mechanisms of energy deposition include photoevaporation (heating of the planet's thermosphere by X-ray and extreme ultraviolet radiation; e.g., Owen & Jackson 2012) and impacts by planetesimals or planetary embryos (Liu et al. 2015;Inamdar & Schlichting 2016;Chatterjee & Chen 2018;Wyatt et al. 2020). Internal energy deposition can be provided by the luminosity of a planet's cooling core (e.g., Ginzburg et al. 2016). Notably, planetary evolution models studying the effect of photoevaporation predicted the existence of a radius valley before it was observed (Lopez & Fortney 2013;Owen & Wu 2013;Jin et al. 2014;Chen & Rogers 2016). However, subsequent studies considering the effects of core-powered mass loss were also able to reproduce the bimodal radius distribution of small planets (Ginzburg et al. 2018;Gupta & Schlichting 2019. Photoevaporation and core cooling remain the two leading explanations for the radius valley, and both processes may well be important, but to determine the relative importance of the two effects will require a better understanding of the dependence of the valley on other key parameters. Determining how empty the radius gap is represents an important step toward understanding its origins. In the initial CKS sample, typical planet radius uncertainties were comparable to the width of the gap so that an intrinsically empty gap would not have been resolved . Van Eylen et al. (2018) studied planets orbiting a subset of Kepler host stars with precise asteroseismic parameters (including ages ranging from ∼2 to 10 Gyr) and found a gap considerably wider and emptier than that found in the initial CKS sample. Including trigonometric parallaxes from Gaia DR2, Fulton & Petigura (2018) were able to improve the median R * errors by a factor of 5 in the CKS sample, but the gap remained populated. Those authors presented simulations that suggest that the gap is not empty (i.e., solely filled in by noisy data), and that there are real planets in the gap. More recently, however, Petigura (2020) showed that a sizable number of planets in and around the gap have poorly determined radii due to high impact parameters, indicating that the gap may be emptier than previously appreciated.
It also appears that the gap, which is a 1D projection of a higher-dimensional manifold, is partially filled in due to a dependence of the gap center on orbital period (or stellar light intensity) and host star mass. The gap center is anticorrelated with orbital period Van Eylen et al. 2018;MacDonald 2019;Martinez et al. 2019;Loyd et al. 2020), which is considered compatible with both the photoevaporation (e.g., Jin & Mordasini 2018;Lopez & Rice 2018;Owen & Wu 2013 and core-powered mass-loss models (e.g., Gupta & Schlichting 2019 but incompatible with formation in a gas-poor disk (Lopez & Rice 2018); at larger orbital periods, only the smallest and least massive cores are susceptible to total atmospheric loss, driving the gap to smaller radii. The length of the radius valley, i.e., its outer boundary in either period or insolation, may also provide clues to its origin, though this parameter remains poorly studied. In the photoevaporation model, the radius valley should not extend beyond orbital periods of 30-60 days, as the incident X-ray and ultraviolet (XUV) flux is believed to be too low to drive substantial mass loss (Owen & Wu 2017). However, the low completeness of the Kepler data set for small planets at these orbital periods presents a challenge for detecting such a transition point.
The gap center is positively correlated with stellar mass (Fulton & Petigura 2018;Wu 2019;Berger et al. 2020a;Cloutier & Menou 2020;Hansen et al. 2021;Van Eylen et al. 2021), although it has been suggested that this trend is due to the relationship between stellar mass and planetary insolation (Loyd et al. 2020). The measured mass dependence of the gap has been used to argue support for photoevaporation (e.g., Wu 2019) but requires that the average planet mass scale approximately linearly with host star mass, an assertion that has not been verified for small planets. By comparison, in the corepowered mass-loss model, the dependence of the radius gap location on stellar mass is a natural consequence of the dependence of planet equilibrium temperature (which partially determines the mass-loss rate in the Bondi-limited regime) on the stellar mass-luminosity relation (e.g., Gupta & Schlichting 2020).
As for metallicity, there is tentative evidence for a wider radius valley for metal-rich stars (Owen & Murray-Clay 2018). Such a dependence could result if the core mass distributions, core bulk densities, or initial atmospheric mass fractions of small planets depend sensitively on the metallicity of the host star, and hence the protoplanetary disk. There is evidence that large Kepler planets (2-8 R ⊕ ) are more common around highermetallicity stars Petigura et al. 2018) and that planets at short orbital periods are preferentially larger around higher-metallicity stars (Owen & Murray-Clay 2018). Both findings are compatible with a scenario in which metalrich stars form more massive cores, on average. It has also been suggested that metal-rich stars host planets with higher atmospheric metallicities, which increases the efficiency of atomic line cooling in photoevaporative flows and decreases mass-loss rates (Owen & Murray-Clay 2018). In the corepowered mass-loss model, the rate at which sub-Neptunes cool and contract is anticorrelated with the opacity of the envelope, which is assumed to be proportional to the stellar metallicity (Gupta & Schlichting 2020). Thus, in both the photoevaporation and core-cooling models, larger sub-Neptunes and a consequently wider radius valley are expected around more metal-rich stars (for fixed mass and age and neglecting any potential scaling between metallicity and core mass distributions).
The characteristic timescale for atmospheric loss among close-in exoplanets has been proposed as a key parameter for assessing the relative importance of photoevaporation and corepowered mass loss. Firm observational constraints on that timescale, however, are lacking. Constraining this timescale through exoplanet population studies may provide a means for discerning the relative importance of proposed mass-loss mechanisms. Core-powered mass loss is believed to operate over gigayear timescales (Ginzburg et al. 2016(Ginzburg et al. , 2018Gupta & Schlichting 2019. By comparison, photoevaporation models predict that the majority of mass loss occurs during the first 0.1 Gyr (e.g., Lopez et al. 2012;Owen & Jackson 2012;Lopez & Fortney 2013;Owen & Wu 2013, corresponding roughly to the length of time a Sun-like star spends as a saturated X-ray emitter (e.g., Jackson et al. 2012;Tu et al. 2015). However, a more recent study found that the majority of the combined X-ray and extreme UV emission of stars occurs after the saturated phase of high-energy emission, implying that XUV irradiation of exoplanet atmospheres continues to be important over gigayear timescales (King & Wheatley 2021). If valid, then observational constraints on exoplanet evolution timescales may not provide a conclusive means for discerning the relative importance of photoevaporation and core-powered mass loss. Nevertheless, there is evidence that the detected fraction of super-Earths to sub-Neptunes increases over gigayears, suggesting that the sizes of at least some planets evolve on these long timescales (Berger et al. 2020a;Sandoval et al. 2021).
A basic prediction of atmospheric loss models is that the radius gap is wider at younger ages and fills in over time; at a fixed value of high-energy incident flux and initial atmospheric mass fraction, photoevaporation models predict that sub-Neptunes with the least massive cores (and smallest core sizes) will cross the gap first, with more massive cores crossing the gap at later times, if at all. As a result, the radius valley is expected to be wider and emptier at early times, progressively filling in with stripped cores of ever larger masses and sizes (e.g., Rogers & Owen 2021). In the core-powered mass-loss model, Gupta & Schlichting (2020) suggested that the average size of sub-Neptunes is expected to decline with age while the average size of super-Earths remains relatively constant, again leading to a wider and emptier radius valley at earlier times.
While the specific theoretical predictions for the age dependence of the radius valley morphology are uncertain, the fundamental prediction from atmospheric loss models that this feature should weaken with increasing age is a firm conclusion. We aim to investigate this hypothesis using the CKS sample. Here we investigate the time evolution of the exoplanet radius gap. In Section 2, we describe our sample selection process, including several filters intended to rid our sample of stars or planets with unreliable parameters. Our analysis procedures are discussed in Section 3, and finally, we interpret our results and summarize our primary findings in Section 4.

Sample Selection
We began with the CKS VII sample published in Fulton & Petigura (2018, hereafter F18). The CKS VII sample is a wellcharacterized subset of all Kepler planet candidates. Stellar characterization for these stars was performed in a homogeneous manner, with spectroscopic T eff , g log , and [Fe/ H] derived from high signal-to-noise ratio (S/N), high-dispersion Keck/HIRES spectra (Johnson et al. 2017;Petigura et al. 2017). Using their spectroscopic T eff and bolometric luminosities computed from Gaia DR2 parallaxes (Gaia Collaboration et al. 2018), extinction-corrected Two Micron All Sky Survey (2MASS) K s magnitudes (Cutri et al. 2003), and theoretical bolometric corrections from the MESA Isochrones and Stellar Tracks (MIST; Choi et al. 2016;Dotter 2016), F18 derived stellar radii from the Stefan-Boltzmann law. They additionally computed ages for the CKS sample using the isoclassify package (Huber et al. 2017), which also depends on the MIST models. We use the F18 median posterior isochrone ages as one source of age in the analysis that follows.
We constructed several filters, many motivated by the cuts outlined in F18, to refine the sample and select those planets and stars with the most reliable parameters. The filters are enumerated as follows.
1. Planet orbital period. We restricted our analysis to planets with orbital periods <100 days. At larger periods, Kepler suffers from low completeness, particularly for small planets. 2. Planet size. We restricted our analysis to planets with sizes <10 R ⊕ . 3. Planet radius precision. We restricted our analysis to planets with fractional radius uncertainties s < R 20% R P P . 4. Planet false-positive designation. We excluded planets identified as false positives in Table 4 of the CKS I paper , which synthesized dispositions from Mullally et al. (2015), Morton et al. (2016), and the NASA Exoplanet Archive (as accessed on 2017 February 1; Akeson et al. 2013). 5. Stellar radius (dwarf stars). We restricted our analysis to dwarf stars with the following condition: The left-hand side of this condition excludes a small number of stars far below the main sequence that may have erroneous parameters. The right-hand side excludes stars that have evolved considerably away from the main sequence. This cut is depicted in Figure 1. We additionally excluded cool stars elevated from the main sequence that would result in unrealistically old ages. This cut was performed by requiring T eff > T isoc , where T isoc is the temperature of a log(age) = 10.25, [Fe/H] = +0.25 MIST v1.1 nonrotating isochrone with an equivalent evolutionary point (EEP) <500. 6. Stellar mass. We wish to isolate the effect of stellar age on the exoplanet radius gap while minimizing the effects of stellar mass as much as possible. We restricted our sample to stars with masses 0.75 < M å /M e < 1.25, where the masses were derived from stellar evolution models in F18. 7. Stellar metallicity. For the same reason we confined our sample in stellar mass, we restricted our analysis to stars with spectroscopically determined metallicities in the range −0.3 < [Fe/H] < +0.3. 8. Isochrone parallax. We removed stars where the Gaia and F18 "spectroscopic" or "isochrone" parallaxes differed by more than 4σ, where the latter quantities were computed in Table 2 of CKS VII. It was speculated by F18 that such discrepancies may be due to flux contamination from unresolved binaries. This cut also removes all stars where the CKS VII isochrone-derived radius, R iso , differed by more than 10% from the radius derived from the Stefan-Boltzmann law. 9. Stellar dilution (Gaia). We used the r 8 column in Table 2 of CKS VII to exclude stars with closely projected sources detected by Gaia that contribute a nonnegligible fraction of the optical flux in the Kepler aperture. We excluded stars where additional sources in an 8″ radius (two Kepler pixels) contribute more than 10% of the cumulative G-band flux (including the target). 10. Stellar dilution (imaging). As in F18, we excluded Kepler Objects of Interest (KOIs) with closely projected stellar companions bright enough to require corrections to the planetary radii of 5% or more. Like F18, we use the radius correction factor (RCF) computed by Furlan et al. (2017) based on high-resolution imaging from several authors, accepting planets for which RCF < 1.05. 11. Unresolved binaries (Gaia). We excluded stars with Gaia renormalized unit weight error (RUWE) values >1.4, where these values were queried from the Gaia archive. 8 The RUWE is a goodness-of-fit metric for a single-star astrometric model (Lindegren et al. 2018). It is a sensitive indicator of unresolved binaries (Belokurov et al. 2020), which are a concern for planet radius studies due to potential flux dilution or misidentification of the planet host. 12. Discrepant photometry. We removed stars with discrepant optical brightnesses, |G − K P | > 1 mag, indicating a potentially erroneous cross-match between the Kepler and Gaia sources. 13. Reddening. We removed stars with reddening estimates of A V > 0.5 mag, where these estimates were sourced from Lu et al. (2021). Stars with high reddening are more susceptible to erroneously determined stellar parameters. 14. Planets with grazing transits. Due to degeneracies inherent to light-curve modeling, planets with grazing transits can have poorly constrained radii. Petigura (2020) showed that there is some level of contamination of the radius valley from planets with grazing transits. Since impact parameters measured from long-cadence photometry are unreliable, we follow Petigura (2020) and exclude planets with R τ < 0.6, where R τ is the ratio of the measured transit duration to the duration of a b = 0, e = 0 transit with the same period around the same star.
The overall CKS VII sample contains 1913 planets orbiting 1189 unique stellar hosts. In the analysis that follows, we will refer to the base sample (constructed from the first five cuts enumerated) and the filtered sample (constructed from all of the filters). After applying the filters, the base sample consists of 1443 planets orbiting 871 unique stellar hosts. The filtered sample consists of 732 planets orbiting 466 unique hosts. Later, we find that our analysis is insensitive to many of these restrictions and relax most of them.

Rotation Period Vetting
We supplemented the CKS sample with stellar rotation periods, which we use in Section 3.2 to empirically age rank planet hosts, compiled from the literature. In order to perform an accurate rotation-based selection, it is imperative to have reliable rotation periods. To this end, we performed visual vetting of the full Kepler light curves for each star in the CKS sample. For each star, our period vetting procedure consisted of the following steps. . 3. Perform a Lomb-Scargle (L-S) periodogram analysis of the Kepler PDCSAP light curve using the LombScargle class in the astropy.timeseries package. 4. Phase-fold the PDCSAP light curve on the L-S peak power period and any published period, as well as on the first harmonic and subharmonic of each of the previously mentioned periods. 5. Generate a vetting sheet including all phase-folded light curves, the L-S periodogram, a 120 day segment of the light curve, and the full light curve. 6. Visually examine each vetting sheet, recording the preferred period source and assigning a reliability flag to each period determination (3: highly reliable; 2: reliable; 1: period could not be unambiguously determined; and 0: no periodicity evident). 11 The results of our rotation period vetting are summarized in Table 1. Of the 1189 unique planet hosts in the CKS VII sample, which are predominantly FGK main-sequence stars, we found that approximately 22% have highly reliable rotation periods, 23% have reliable periods, 34% could not have periods determined unambiguously from the light curve, and 21% had no clear periodicity evident in the light curve.

Evolution of the P-R Diagram: Isochrone Ages
We first plotted the period-radius (P-R) diagram for CKS planets in four bins of log(age): <9. 25, 9.25-9.5, 9.5-9.75, and 9.75 for both the filtered and base samples (Figures 2 and 3, respectively). 12 This binning scheme was in part chosen because there are few planets with log(age) < 9 or >10. From these figures, we observed a conspicuous void of planets around the radius valley in the youngest age bin (1.8 Gyr). Moreover, the slope of the void appears to be very close to the slope of the radius valley determined in Van Eylen et al. (2018, hereafter V18) from a subset of planets orbiting stars with precise asteroseismic parameters. However, while the slope of the young planet void appears to be consistent with that of the radius valley, the intercept appears to be different. This is evident from the fact that the lower boundary of young sub-Neptunes straddles the V18 line, while the super-Earths are well separated from the V18 valley. In other words, there appears to be a dearth of large super-Earths at younger ages, resulting in an apparent shift in the peak of the super-Earth radius distribution to larger radii at older ages. Our interpretation of this shift is discussed in Section 4.
While age, mass, and metallicity are correlated in the CKS sample, we show that the distribution of masses in each age bin is not changing drastically. The age-metallicity gradient is stronger, and essentially all stars in the youngest age bin are metal-rich. However, it is also clear that stars hosting planets in the radius valley are not exclusively metal-poor but rather have a wide range of metallicities. Additionally, while the [Fe/H] distributions in the two youngest age bins are broadly similar, the distributions of planets in the P-R diagram are markedly different. While these observations offer some degree of assurance that observed features in the P-R diagram are due to age rather than mass or metallicity, we explore the effects of stellar mass and metallicity further in Sections 3.5 and 3.7.

Evolution of the P-R Diagram: Gyrochronology
To this point, we have only considered isochrone ages from F18 in our analysis. While we present a qualitative validation of the F18 ages in Appendix B, there are substantial uncertainties associated with isochrone ages. It is also possible to empirically age rank the CKS sample with a gyrochronology analysis. Recently, Curtis et al. (2020) presented empirical gyrochrones for several open clusters that enable high-fidelity, model-independent age ranking of solar-type stars with welldetermined T eff and rotation periods.
We separated the CKS sample into "fast" and "slow" rotators using the hybrid NGC 6819 + Ruprecht 147 gyrochrone of Curtis et al. (2020), corresponding to an age of ∼2.7 Gyr. To perform this cut, we first converted the gyrochrone from a (B P -R P )-P rot relation into a T eff -P rot relation using the colortemperature polynomial relation presented by those authors (valid for the temperatures considered here). 13 We constructed the young planet sample from the CKS base sample (Section 2) by choosing host stars placed between the 0.12 (Pleiades) and 2.7 Gyr gyrochrones in the T eff -P rot diagram, high-reliability P rot flags, RUWE < 1.4 (to remove unresolved binaries with unreliable rotation periods), and T eff < 6000 K. The T eff < 6000 K cut is motivated by the fact that gyrochrones cluster closely for hotter stars and small (This table is available in its entirety in machine-readable form.) 11 An example vetting sheet is presented in Appendix A, and all sheets are available through the journal. 12 The completeness curve in Figures 2 and 3 was computed for CKS stars using CKS stellar parameters and the methodology of Burke et al. (2015) as implemented in Python at https://dfm.io/posts/exopop/. 13 We opted not to perform the gyrochronology classification in color space because we noted increased scatter in the (B P -R P )-P rot diagram for CKS stars, possibly a result of reddening, metallicity effects, or both.
temperature uncertainties can translate to large uncertainties in age from a gyrochronology analysis. The old rotation-selected planet sample was selected in the same fashion from stars lying above the 2.7 Gyr gyrochrone, except for the high-reliability P rot flag requirement. We found that we assigned the high-reliability flag more often to faster rotators that tend to exhibit higher amplitude and more stable brightness modulations (presumably due to larger, longer-lived spots), while the older, more slowly rotating stars exhibit smaller-amplitude, more sporadic Sun-like variations potentially due to smaller, short-lived spots. Thus, in the slow rotator sample, we accepted stars with either reliable or highly reliable P rot flags.
The distributions of the young and old rotation-selected planet hosts in the T eff -P rot and Hertzsprung-Russell (H-R) diagrams are shown in Figure 4, along with the 1D distributions of their median isochrone ages. We observe that the fast-rotating sample does indeed correspond to stars that lie closer to the zero-age main sequence with median isochrone ages strongly skewed toward younger ages (mostly below 3 Gyr) relative to the CKS sample. The more slowly rotating stars show a distribution of median isochrone ages that is practically indistinguishable from the bulk of the CKS sample but with a significant number of stars in the ∼1-3 Gyr range. However, this is likely to be the result of isochrone clustering on the main sequence and not because those stars are actually young. We also note that only 46% (546/1189) Figure 2. First row: evolution of the Kepler planet population in the P-R diagram for the filtered CKS sample (age bins indicated above each panel). Contours show Gaussian kernel density estimates of planets in the overall CKS sample. The black dashed line indicates the radius valley derived by V18. The gray shaded region indicates the 25% pipeline completeness contour calculated from the CKS sample. Second row: 1D distributions of planet radii for the samples plotted above in each case. The nominal location of the radius gap from Fulton et al. (2017) is indicated by the vertical gray stripe. Third row: our base CKS planet host sample in the T eff -R * plane (gray) with the host stars in the age bins indicated at the top (pink). Stars hosting planets in the radius range 1.6-1.9 R ⊕ are outlined in black. Fourth row: as in the third row, the distribution of planet hosts in the mass-metallicity plane. of the stars in our sample have assigned rotation periods from our period vetting procedure. For the remaining stars, it was not possible to unambiguously assign a period. Those stars are preferentially more evolved relative to the periodic sample, though they are observed across the H-R diagram. Thus, the modest decrement at old ages in the age distribution of the slow rotators relative to the overall CKS sample may be the result of a finite active lifetime for solar-type stars.
After separating the planet hosts into the fast and slow rotator samples, we examined the distributions of the corresponding planet populations in the P-R diagram ( Figure 5). We observe qualitatively similar behavior as to what was found when using isochrone ages to perform age cuts. That is, there is a dearth of exoplanets in the radius valley among planets empirically determined to be younger than ∼2.7 Gyr from a gyrochronology analysis. Among the planets older than ∼2.7 Gyr, the radius valley appears more filled in. Moreover, the slope and boundaries of the radius valley in the left panel of Figure 5 appear to be very close to those derived from the isochrone age-selected sample (as described in Section 3.3).

Measuring the Slope of the Void
As the slope of the radius valley contains information about the mechanism(s) responsible for producing it, we proceeded to characterize the void for four planetary samples described as follows. Each sample is a subset of the base sample, sharing the following cuts: planets orbiting dwarf stars (described in Section 2) with P < 100 days, R P < 10 R ⊕ , and s < R 20 R P P %. The isoc_fgk_1to2 sample also employs the age restriction ( ) <  9 log age 9.25, while the isoc_fgk_lt2 sample is produced from the more inclusive criterion ( )  log age 9.25. As isochrone ages for hotter stars are more reliable than those of cooler stars, the isoc_fg_lt2 sample combines the criteria ( )  log age 9.25 and T eff > 5500 K. Finally, the gyro_ gk_lt3 sample combines the common cuts with the following criteria: T eff < 6000 K, RUWE < 1.4, high-reliability rotation periods (reliability flag of 3), and positions in the T eff -P rot plane between the empirical Pleiades and NGC 6819 + Ruprecht 147 gyrochrones of Curtis et al. (2020). The distributions of these planetary samples in the P-R and insolation-radius planes are depicted in Figure 6.
Following the approach of V18, we used support vector machines (SVMs) to find the decision boundary that maximizes the margins between two distinct classes of planets in the P-R and insolation-radius planes. To label the planets, we found that shifting the V18 radius valley equation downward by 0.07 dex in ( ) provided an unambiguous separation of planets into two classes for the isoc_fgk_1to2 sample. Thus, we used the equation to label planets as sub-Neptunes or super-Earths. To implement the  (Curtis et al. 2020). In the middle panels, the gray curves show solar-metallicity, nonrotating MIST v1.1 isochrones from log(age) = 9-10 in steps of 0.25 dex. Figure 5. The P-R diagram for exoplanets orbiting stars rotating more rapidly (left) or more slowly (right) than an empirical 2.7 Gyr gyrochrone. The shaded contours represent a 2D Gaussian kernel density estimation for the overall CKS sample. The black dashed lines indicate the margins of the young planet void derived from the isochrone-selected sample in Section 3.3.
SVM classification, we used the sklearn.svm.SVC module in Python with a linear kernel (Pedregosa et al. 2011).
We explored the sensitivity of our results to the regularization parameter, , finding that for <  5, the SVM misclassifies a large fraction of planets and fails to trace the center of the void that is so readily visible by eye (Figure 7). In determining the equation of the void, we ultimately adopt the slope and intercept derived from the =  10 case but recommend =  1000 for determining the upper and lower boundaries of the void. To calculate the uncertainties on the slope and intercept of the radius valley, we performed 10 3 bootstrapping simulations, selecting 50 planets (with replacement) randomly from the young planet samples and recording the slope and intercept resulting from the SVM classification for each bootstrapped sample. Table 2 lists the slopes and intercepts for the young planet void inferred from the SVM bootstrapping simulations. Figures 8 and 9 show the derived radius valley from the bootstrapping simulations, and Figure 10 shows the distributions of slopes and intercepts from this analysis. For the  values explored here, the inferred slopes and intercepts of the radius valley are relatively constant. We find in almost all cases that the slope of the valley is consistent with the slope found in V18 at the 1σ level. However, we find an intercept that is systematically smaller than that found by V18 and Martinez et al. (2019) by at least 2σ and, in some cases, as much as 10σ using the quoted uncertainties from those works. While the statistical significance of this difference is highly dependent on the adopted uncertainty (where ours appears to be generally larger), it is clear from Figure 6 that the void we observe is offset from previous determinations of the radius valley. We note that previous works characterized the radius valley among samples with a broader range of ages, while the focus of this analysis is on the younger planets in the CKS sample. In Section 4, we discuss our interpretation of this difference. The level of agreement between the radius valley slopes derived here and in V18 is noteworthy, given that the samples we characterize are ≈30%-100% larger and selected on the basis of age rather than radius precision, which was the impetus for the V18 sample.
The upper and lower boundaries of the radius valley are given by the equation   . In the insolation-radius diagram, it is ( 10 inc . Adopted values are in bold. in the insolation-radius plane. We use the highest  parameter explored here for determination of the radius valley boundaries, as it provides the closest match to the data (i.e., the fewest planets inside those boundaries).

Calculation of False-alarm Probability
To determine whether the void we observe in the P-R diagram could be due to chance, we performed simulations to determine the probability of finding n itv or fewer planets in the void from N planets selected at random (without replacement) from the CKS sample. Here N is the total number of planets in each of the samples described in Section 2, i.e., 156, 238, 124, and 190 for the isoc_fgk_1to2, isoc_fgk_lt2, isoc_fg_lt2, and gyro_gk_lt3 samples, respectively. The definition of the radius valley boundaries and hence the true number of planets in the valley, n itv,true , are sensitive to the regularization parameter, , and the specific sample used in the SVM bootstrapping analysis (see Figure 11). For each sample and  value, we performed 10 4 simulations, selecting N planets randomly (without replacement) from the overall CKS sample, modeling the planet period and radius uncertainties with normal distributions, and recording the number of planets in the valley, n itv,sim . The false-alarm probability was then computed as the fraction of total trials that satisfied the condition  n n itv,sim itv,true . The results of these simulations are tabulated in Table 3. For =  5, we find false-alarm probabilities in the range of 18%-30%, but from Figure 11, it is clear that the SVM margins in this case are so wide as to not provide an accurate description of the void boundaries. The same may be argued for the =  10 case, but even then we find false-alarm probabilities of <10%. Finally, for the =  100, 1000 cases, for which the SVM margins precisely trace the boundaries of the void, we found false-alarm probabilities of =1%.
We also performed Monte Carlo simulations to estimate the probability that the void could be produced from random selection among those planets orbiting hosts rotating more slowly than the empirical 2.7 Gyr gyrochrone (see Figure 4).  We started by performing the same generic cuts on the overall CKS sample as were performed on the rotation-selected sample (main-sequence stars, no false positives, RUWE < 1.4, T eff < 6000 K, and planets with P < 100 days, R P < 10 R ⊕ , and radius precision <20%). Then, in 10 4 simulations, we modeled the uncertainties in the planet periods and radii using normal distributions, selected 190 planets at random (without replacement) from the slow rotator sample, and computed the number of planets in the void. Here 190 is the total number of planets in the fast rotator sample. We found that the probability of finding a comparably empty void from the slow rotator sample is <2% when using the gyro_gk_lt3 margins and =  100, 1000. Using the same  values but margins derived from the isochrone samples raises the false-alarm probability, but only to ∼2%-6% at most. Taken together, we conclude that the emptiness of the observed void is not due to chance.

Effects of Stellar Mass and Metallicity
Our analysis separates the data set into age bins in order to understand the evolution of planets on a population level. Since age, mass, and metallicity are correlated in the CKS sample, it is difficult to entirely disentangle the effects of each parameter on the distribution of planets in the P-R diagram. We explored how sensitive our analysis is to specific binning schemes by recording the fractional number of planets in the valley over a 2D grid of bin centers and widths in age, mass, and metallicity. We used the definition of the radius valley boundaries expressed in Section 3.3 for this purpose (specifically, the margins given by the =  1000, isoc_fgk_1to2 sample case). Figure 12 shows the results of this exercise; the young planet void is apparent as the light, broad diagonal stripe in the left panel. For small bin widths, the minimum density of the radius valley is achieved for a bin center in the range 9 < log (age) < 9.25. This is unsurprising, as the boundaries of the radius valley were identified in and derived for just such a binning strategy. However, as the bin width in log (age) increases, the minimum density of the valley shifts systematically toward younger bin centers. This suggests that the filling of the radius valley is due to preferentially older planets. Figure 12 also reveals that there is no binning strategy in mass or metallicity that can produce a comparably empty void (in a fractional sense) except in finely tuned regions of parameter space where sample sizes are small. However, while not as pronounced, we do note that the radius valley (as defined in this exercise) appears emptier for lower-mass and metal-rich stars. The latter observation is consistent with the finding of an apparently wider radius valley for metal-rich hosts within the CKS sample (Owen & Murray-Clay 2018), though we note that age and [Fe/H] are anticorrelated.
We also examined the 1D radius and period distributions for CKS planets in the extremes of the stellar age, mass, and metallicity axes ( Figure 13). The purpose of this exercise was to highlight any exoplanet demographic trends as a function of these key stellar parameters. Interestingly, even without completeness corrections or occurrence rate calculations, several of the now well-established trends in the Kepler planet population are evident from Figure  , and the rising occurrence of short-period planets (1 day < P < 10 days) of all sizes with increasing metallicity . The other trend that is apparent is the dearth of planets in the radius valley for young stars. The trend of a wider radius valley around more metal-rich stars found by Owen & Murray-Clay (2018) is not immediately obvious, but relative to that study, we use updated planetary radii from F18, do not include completeness corrections, and perform slightly different cuts.

Accounting for Age Uncertainties
In Section 3.4, we assessed the probability that the observed void was due to a chance selection of planets from the overall CKS data set, and in Section 3.5, we explored the sensitivity of the void occupancy to binning schemes in mass, age, and metallicity. Here we attempt to account for stellar age uncertainties in examining the void occupancy as a function of age. As discussed in Appendix B, there may be substantial uncertainties in stellar ages, particularly if those ages originate from isochrones. As a result, when binning in stellar age, there is considerable uncertainty in the degree of contamination by stars with inaccurate ages.
To mitigate the effect of stars with spurious ages, we performed Monte Carlo simulations in which the ages were modeled as normal distributions in log(age) centered on the median values published in F18 with widths taken as the maximum of the lower and upper age uncertainties for each Table 3 False-alarm Probabilities 22% 4.5% 0.03% 0.08% Figure 12. Effects of binning schemes (in age, mass, and metallicity, from left to right) on the fractional occupancy of the radius valley. Note that the color scaling is the same for each panel.
star. While this is not the same as drawing from the empirical posterior probability density functions in age (which are not available), it is a crude proxy. For 50 bin centers in log (age) from 8.25 to 10.25, we then performed 10 3 Monte Carlo simulations in each bin, randomly generating ages as described above, to measure the fraction of planets in the valley as a function of age. We measured the fraction of planets in the valley relative to the total number of planets in each age bin for bin widths of 0.125, 0.25, 0.5, and 1.0 dex. The results of this analysis are shown in Figure 14. The scarcity of planets with log(age) < 9 and >10 leads to large uncertainties in the trend at both extremes, in addition to the larger age uncertainties at younger ages. However, we observe a marginally significant increase in the fraction of planets located in the radius valley in the range 8.75 < log(age) < 9.75. This is in agreement with Figures 2 and 3, which show that the radius valley appears weaker among the oldest planets in the CKS sample. Computing planet occurrence rates in the valley as a function of age might lead to a more robust conclusion on the trend noted here but is outside the scope of the present work.

In What Ways Are Planets in the Valley Different?
In an effort to quantify the parameters that are most important in contributing to the filling of the radius valley, we performed a k-sample Anderson-Darling (A-D) test (Scholz & Stephens 1987) with scipy.stats.anderson_ksamp to test the null hypothesis that the distribution of a given variable for stars hosting planets in the valley was drawn from the same distribution of that parameter among our base CKS sample.
The results of this exercise are summarized in Table 4, and select parameter distributions are shown in Figure 15. In this exercise, we have assumed that the equation for the equations for the radius valley and its boundaries are given by the fourth row of Table 2. This choice is motivated by the fact that higher regularization parameters correspond to tighter boundaries of the valley, offering a cleaner separation between planets in, above, or below the valley. Additionally, for each parameter, we restrict our analysis to those CKS stars/planets for which that parameter is defined (i.e., we exclude targets missing data for a given variable). We also apply the common cuts described  in Section 3.3 before performing the A-D tests. In the case of P rot , we also restricted the sample to T eff < 6000 K, where rotation periods are more reliable indicators of age.
Of the nine parameters with the highest normalized k-sample A-D test statistics (and p-values <0.05), all but two pertain directly or indirectly to the star's evolutionary state: P rot from various sources, P rot flag, R var (a measure of the photometric variability amplitude), median posterior age from isochrones, and R * . The other two parameters are fractional R P precision and r 8 (a measure of flux dilution). Thus, of the parameters investigated, those that contribute most to the filling of the radius valley either relate to stellar age or may be associated with erroneous measurements of the planetary radii. Inspection of the parameter distributions (like those shown in Figure 15) reveals that planets in the valley tend to orbit stars that are older, larger, less likely to have a securely detected rotation period, rotating more slowly, and photometrically quieter. Planets in the valley also have lower r 8 values relative to the CKS base sample. Naively, one might expect higher r 8 values among planets in the radius valley, as flux dilution can lead to erroneous planet radius measurements. However, planets in the radius valley are by definition small, so it is perhaps not surprising that there is a preference for stars not affected by crowding.
We also found that the stellar mass and metallicity distributions for stars hosting planets in the radius valley are statistically indistinguishable from those of the CKS base sample. This lends further support to the notion that the feature identified in the CKS data set from age selections is not due to correlations between stellar age, mass, and metallicity.
The parameter that appears to be most important in contributing to the filling of the radius valley is planet radius precision. This suggests that the radius valley may be emptier than is suggested by current data. The fractional stellar radius precision does not, however, contribute to the filling of the valley. This is not surprising, as the typical error budget for a planet's radius in the CKS sample is dominated by the R P /R å uncertainty from light-curve fitting rather than the stellar radius uncertainty (Petigura 2020). In Section 3.8, we examine the possibility that a correlation between planet radius precision and age could conspire to produce the observed void.

Confounding Scenarios
A possibility not yet explored is that the radius valley is inherently empty but, for some reason, planets orbiting stars with younger assigned ages in the CKS sample have more precise radii. In Section 3.7, we established that the planet radius precision is the most important parameter contributing to the filling of the radius valley. We quantified the correlation between fractional planet radius precision and log(age) by computing the Spearman rank correlation coefficient for planets with P < 100 days, R P < 10 R ⊕ , non-false-positive dispositions, and main-sequence host stars (filters 1, 2, 4, and 5 from Section 2). We used the scipy.stats.spearmanr function for this purpose and found a small p-value (2 × 10 −4 ) but a very weak correlation coefficient (ρ < 0.1).
To further investigate the impact of radius precision, we computed the fraction of planets in the valley for young and old samples as a function of fractional radius precision allowed. For a given sample of planets and over a grid of radius precision thresholds, we selected the planets with fractional radius uncertainties smaller the threshold value and computed the ratio of planets in the valley to the total number of planets meeting the radius precision requirement. We performed 10 3 bootstrapping simulations (including modeling of the planet radii as normal distributions) to determine the uncertainties on these trends, which are shown in Figure 16. We found that the radius valley is comparably empty for young and old planets if the fractional radius precision is required to be better than ∼5%. However, this is not unexpected, as our CKS base sample size diminishes steeply below fractional radius uncertainties of 7%. For reference, the median fractional radius uncertainties for the CKS base sample, young isochrone ageselected sample (isoc_fgk_lt2 ), and old isochrone ageselected sample are 5.0%, 4.5%, and 5.2%, respectively.
Finally, we examined the 1D radius distributions for the young and old isochrone age-selected samples with planet radii known to better than 5%. We performed 10 3 bootstrapping simulations (again modeling the planet radii as normal distributions) to determine the uncertainties on these distributions. The results are shown in Figure 17. While radius precision is clearly an important parameter in determining the occupancy of the radius valley, we conclude that it is unlikely to explain the entire deficit observed for the young planet sample. ?   Figures 2, 3, and 5 give the impression that the radius valley progressively fills in over time, becoming weaker or disappearing   entirely among older planet populations. This interpretation appears to be at odds with the results of V18, who observed a clean gap in the P-R distribution of planets orbiting asteroseismic host stars, which are preferentially older than the stars in our young sample. 14 Notably, planets with ages 3 Gyr in the CKS base sample have a median radius precision of 5.3%, while planets in the V18 sample have a median radius precision of 3.3%. Similarly, in Section 3.8, we found that age-dependent differences in the radius valley filling factor can be resolved at least partially by restricting the analysis to planets with the most precise radii.

Is the Radius Gap Empty
To further investigate this issue, we constructed a new sample, the gold sample, which implements several reliability cuts. In addition to the cuts of the base sample, the gold sample is restricted to planets with a fractional radius precision of <6%, a fractional R P /R * precision of <6%, nongrazing transits (R τ > 0.6), RCF < 1.05, A V < 0.5 mag, RUWE < 1.1, agreement between the F18 isochrone-derived and trigonometric parallaxes, and a KOI reliability score >0.99 from the Q1-Q17 DR25 catalog. We then split this sample into young, gold_lt3, and old, gold_gt3, samples. The young sample includes the restriction that the stellar age inferred from both isochrones and gyrochronology is <3 Gyr, while the old sample requires a planet host to have a median isochrone age >3 Gyr and does not have a rotation period consistent with an age <3 Gyr. For both the gold_lt3 and gold_gt3 samples, we confirmed that the corresponding distributions in R P precision, R P /R * precision, and singletransit S/N (defined as ( ) R R CDPP3 P 2 * ) were not statistically different either from each other or from the overall distributions in the gold sample, yielding p-values > 0.25 in each case from k-sample A-D tests. The gold_lt3 sample in comparison to the V18 and gold samples in the P-R diagram are shown in Figure 18. From that figure, it appears that the reliability cuts have a significant impact on how well defined the super-Earth and sub-Neptune distributions are, as well as how empty the gap appears at all ages, though it is not entirely devoid of planets. Furthermore, it is clear from Figure 18 that the gap in the gold_lt3 sample is offset from the gap in both the V18 and gold samples, indicating that the difference is unlikely to be due to systematic differences in planet radii between the two studies. For reference, we also show the P-R distribution of V18 planets using the CKS radii, which highlights the importance of the precise (R P /R å ) values used by V18 in resolving the gap (for a detailed discussion, see Petigura 2020).
We proceeded to perform the same SVM analysis as was presented in Section 3.3 with one difference: classification of the samples into super-Earths and sub-Neptunes was performed using the threshold R P = 1.8 R ⊕ rather than a period-dependent classification scheme. The reason for this choice is because this scheme clearly works well for the gold_lt3 sample and allows us to test the sensitivity of the analysis to the classification step. The results of our analysis are presented in Figure 19 and Table 5. We find that despite the simplified classification scheme, a negative slope in the P-R diagram is still preferred (though the data are also consistent with no orbital period dependence). Furthermore, we observe that, independent of regularization parameter, there is a persistent offset in the center of the gap for the gold_lt3 sample, compared to both the gold and gold_gt3 samples.
Related to this last point, we emphasize that the gap identified in this work is primarily due to a lack of large super-Earths at young ages, as opposed to a difference in the sub-Neptune size distribution or some combination of the two. This is most apparent in Figures 17 and 18. We note that the young planet samples are always smaller than the control samples, and the dearth of large super-Earths at young ages could be due in part to small number statistics. To assess the probability of this scenario, we performed 10 4 simulations and measured the fraction of outcomes in which the number of >1.5 R ⊕ planets in a control sample was equal to or fewer than the number of >1.5 R ⊕ planets in the young sample. In each simulation, we selected 40 planets at random from the control sample, corresponding to the size of our young super-Earth sample. We accounted for planet radius uncertainties in both the young and control samples by modeling the radii as normal distributions given their published uncertainties. For the control samples, we used the CKS gold and V18 samples, where we used the V18 radius valley equation to select only the super-Earths in each. In both cases, we found that 2% of the simulations resulted in outcomes where the number of >1.5 R ⊕ super-Earths was greater in the young sample than in the control sample. We also compared the young and control super-Earth size distributions with a k-sample A-D test, finding that the null hypothesis can be rejected at the 1% level.
In conclusion, we propose a solution to resolve the apparent tension described in the beginning of this section and to explain all of the observations to date: the radius gap is intrinsically empty, or at least emptier than previously appreciated, but its precise location shifts with the age of the planetary population. Since the radius gap appears to have an orbital period dependence, a gap that is intrinsically empty in the P-R plane will always appear filled in when projected along the radius axis, even if the radii are perfectly known. Similarly, if the location of the gap also depends on host star mass, age, or metallicity, as has been suggested, then the gap will only appear empty in sufficiently narrow projections of parameter space. While this proposed solution would help explain some of our observations, we emphasize that we have not conclusively shown it to be the case. Confirming or rejecting this hypothesis may be possible with (1) a larger sample providing sufficient coverage of the P-R plane across the Figure 17. Small planet size distributions among CKS planets with fractional radius uncertainties better than 5%. Uncertainties (the 16th and 84th percentiles) are determined from bootstrapping simulations with planet radii modeled as normal distributions.
variables of interest and/or (2) a thorough, multivariate investigation of the radius gap in order to find the projection of the data resulting in the emptiest gap. We leave such an investigation to future works and emphasize that planet radius uncertainties (resulting from inaccurate light-curve fits, stellar radius uncertainties, or more pernicious sources, such as flux dilution from unresolved binaries) remain an obstacle to our understanding of the radius gap.

Discussion and Conclusions
We observe a nearly empty void in the P-R plane for close-in (P < 100 days) exoplanets orbiting stars younger than ∼2-3 Gyr. The void was first identified among a sample of planets with median posterior isochrone ages <1.8 Gyr but is also present among planets with stars rotating faster than an empirical 2.7 Gyr gyrochrone. The difference between these two timescales could conceivably be due to systematic offsets Figure 18. Planet distributions in the P-R diagram for the samples described in Section 3.9. In the middle panel, planets in the V18 asteroseismic sample are plotted using the F18 radii. Lines connect each planet in that sample to its radius as determined by V18. between the CKS isochrone ages and ages implied from a gyrochronology analysis. Because the gyrochrone used to perform our sample selection is calibrated to open clusters with main-sequence turnoff ages, the longer timescale may more accurately reflect the lifetime of this feature in the P-R diagram.
We derived equations for the center of this void, which we refer to as the young planet gap, as a function of orbital period, P, and insolation, S inc : For periods in the range of 3-30 days, describing the bulk of our sample, this places the center of the radius valley at 1.87-1.56 R ⊕ . Over this same period range, the lower boundary of the void is in the range of ∼1.6-1.4 R ⊕ , while the upper boundary is at ∼2.1-1.8 R ⊕ . From a subset of the CKS sample created using reliability and precision cuts, we similarly derived equations for the radius valley valid for all ages: The slope of the young planet gap in the P-R diagram is consistent at the 1σ level with the slope of the radius valley measured from the asteroseismic sample in V18 but with an intercept that is smaller by ∼3σ using the uncertainty reported by those authors. The smaller intercept among the "young" planet sample corresponds to a shift in the radius valley toward smaller radii and would be compatible with a prolonged massloss timescale for the sub-Neptune progenitors of the largest observed super-Earths. An alternative explanation could be the late-time formation of secondary or "revived" atmospheres through endogenous or exogenous processes (e.g., Kite & Barnett 2020;Kite & Schaefer 2021). Differentiating between these two hypotheses might be achieved with detailed composition modeling or atmospheric studies of the largest super-Earths.
The shallow, negative slope of the void is compatible with models of atmospheric loss through photoevaporation (e.g., Owen & Wu 2013Jin & Mordasini 2018;Lopez & Rice 2018) or core cooling (Gupta & Schlichting 2019 but incompatible with the steeper negative slope implied for one model of impact-driven atmospheric erosion (Wyatt et al. 2020). The negative slope we find is also incompatible with the positive slope predicted by models of late-stage formation in a gas-poor disk (Lopez & Rice 2018). However, we note that the void is only marginally inconsistent with being flat when adopting a more conservative regularization parameter in the SVM analysis. The slope of this void in the insolation-radius plane is shallower (by 2σ-3σ) than the slope found by Martinez et al. (2019).
Both rotation-selected and isochrone-selected planet samples show the same qualitative trend: an absence of large super-Earths at young ages. We estimate that the probability of this feature being due to chance is <1% for both the isochrone-selected and rotation-selected samples. We also showed that this feature is relatively insensitive to various data reliability filters and unlikely to be the result of correlations between stellar age, mass, and metallicity. Simulations accounting for age and planet radius uncertainties show an increasing fraction of planets residing in this gap as a function of age (see Figure 14). The occupancy of the radius valley is also clearly dependent on the precision of planetary radii, and we note that the differences between the young and old planetary samples diminish with more stringent precision requirements (Figure 16). However, resolving the discrepancies entirely requires discarding more than half of the CKS sample. A larger sample size and higher-precision planetary radii for the entire CKS sample would help to more securely determine how much of the discrepancy between young and old planet populations is astrophysical and how much is due to noise.
While a more detailed study of planet radius demographics as a function of age, mass, and metallicity is left for future works, we note that our findings are broadly consistent with expectations from both the core-powered mass loss and photoevaporation theories. This is most evident from Figure 20, where the sub-Neptune size trends with age, mass, and metallicity among those CKS planets with the most precise radii are shown in relation to scalings that approximately, though not exactly, mimic those presented in Gupta & Schlichting (2020). The slope in the stellar mass versus planet radius plane is shallower than predicted in core-cooling models but more similar to that predicted by photoevaporation models, provided that planet mass scales approximately linearly with stellar mass (Wu 2019). The gigayear timescale we find for the evolution of the radius valley is more compatible with core-powered mass-loss models than the canonical timescale of 0.1 Gyr from photoevaporation. However, although photoevaporation models predict that the radius gap will emerge on a timescale of 0.1 Gyr, some small fraction of planets are expected to cross the gap on timescales of ∼1 Gyr or more (Rogers & Owen 2021), which is compatible with our observations. The difference in the radius distributions between young and old planets is primarily driven by an absence of large super-Earths (1.5-1.8 R ⊕ ) at young ages, rather than an absence of small sub-Neptunes (see Figure 17). As a result, the precise location of the radius valley is shifted to larger planet sizes at older ages. To better understand the compositions of planets missing from the young planet radius distribution, we compiled data for well-characterized, confirmed exoplanets from the NASA Exoplanet Archive (Akeson et al. 2013). We selected planets with masses known to 25% precision or better, radii with 10% precision or better, orbital periods <100 days, and host stars with 4500 K < T eff < 6500 K to match the CKS sample. We computed the bulk densities of these planets and compared the distribution of planets in the radius-density plane to composition curves from Zeng et al. (2019). Among these well-characterized planets, we observe a clean separation in the radius-density plane (also observed by Sinukoff 2018) between planets that are consistent with rocky compositions and those that require a significant volatile component (such as an H 2 -He atmosphere, H 2 O-dominated ices/fluids, or some combination of the two) to explain their bulk compositions (Figure 21). We also observed that the radius valley identified in Van Eylen et al. (2018) bridges the gap between planets in these two  exist for a far larger portion of the Kepler sample, we restrict our analysis here to only those stars that overlap with CKS VII. Figure 23 shows the comparison of ages and other parameters from F18 and B20. For 80% of the stars with age estimates in both catalogs, the age estimates agree to within ∼0.4 dex. The median offset in ages is 0.075 dex, with the F18 ages being systematically older, but this shift is smaller than the typical age uncertainties from either catalog. The age discrepancies should not be due to differences in the adopted stellar models; both F18 and B20 use the isoclassify package (Huber et al. 2017) to compute ages from MIST v1.1 models (Choi et al. 2016;Dotter 2016).
To better understand the origins of the age discrepancies between the two studies, we searched for correlations between ( ) D log age yr 10 1 and other parameters in the data set. We found that ( ) D log age yr 10 1 is most strongly correlated with ΔM * and ΔT eff (see Figure 24). As both F18 and B20 determine mass and age simultaneously from stellar models, and these two parameters are intrinsically related, the ΔM * -( ) D log age yr 10 1 correlation is unsurprising. The correlation  with ΔT eff , however, is more informative. Here F18 derived T eff from high-resolution spectroscopy, while B20 derived T eff from isochrones using photometry (specifically Sloan g and 2MASS K s ) and parallaxes as input.
We examined the dependence of T eff -color relations on [Fe/H] and A V and found, when using the CKS spectroscopic parameters, that [Fe/H] can explain most of the dispersion in the T eff -color relations. By contrast, when using the photometric T eff and [Fe/H] from B20, there is no clear metallicity gradient in the T eff -color relations.
We find that ΔT eff is more strongly correlated with the reddening values (sourced from either B20, L21, or Gaia) than it is with any of the metallicity parameters. While reddening might help to explain temperature and age differences for some sources, we note that differences in photometric and spectroscopic temperature scales persist independent of reddening corrections (Pinsonneault et al. 2012).
We next compared the F18 and B20 ages with those determined from precise asteroseismic parameters. Silva Aguirre et al. (2015) determined ages for a sample of 33 Kepler planet candidate host stars with high-S/N asteroseismic observations, achieving a median statistical uncertainty of 14% on age. We compare the ages from F18 and B20 with the asteroseismic ages in Figure 25. We find reasonably good agreement with the asteroseismic ages for both F18 and B20. The residual scatter between the isochrone and asteroseismic ages is 0.11 dex for F18 and 0.22 dex for B20. In both cases, the residuals are comparable to the median age uncertainties from those catalogs.
Gyrochronology ages were computed using the stardate software package (Angus et al. 2019a(Angus et al. , 2019b. 16 The stardate ages were computed in the gyrochronology mode alone rather than in the combined isochrone-fitting and gyrochronology mode. The stardate gyrochronology relations are calibrated in Gaia color space. Using the rotation periods we vetted in Section 2, we noticed increased scatter in the (B P − R P )-P rot plane compared to the T eff -P rot plane, where T eff is the CKS spectroscopic temperature from F18. As such, rather than using the star's actual Gaia colors, which are susceptible to reddening, we converted the F18 spectroscopic T eff and B20 photometric T eff to the predicted Gaia colors using the relation in Curtis et al. (2020), which was calibrated for stars with negligible reddening. Using the vetted P rot and predicted (B P − R P ) colors, we then computed the gyrochronology ages (without uncertainties). Our comparison of the isochrone and gyrochronology ages is shown in Figure 25. We note that there is better agreement between F18 and the gyrochronology ages at young ages (<1 Gyr).
We note that the stardate model has not been updated to include recently determined open cluster rotation period sequences in its calibration. As such, we can compare the CKS sample to empirical gyrochrones from Curtis et al. (2020). This comparison is shown in Figure 26, which shows that the F18 isochrone ages do not always map predictably onto the T eff -P rot plane. For example, in the F18 log(age) bin of 8.75-9 dex (≈0.6-1 Gyr), approximately half of the stars fall below the 1 Gyr gyrochrone, and half lie above it. Similarly, in the F18 log(age) bin of 9.5-9.75 dex (≈3.2-5.6 Gyr), a nonnegligible number of stars fall below the log(age) ≈ 9.4 (≈2.5 Gyr) gyrochrone. However, we note that the vast majority of stars with F18 isochrone ages of log(age) < 9.25 fall below the log(age) ≈ 9.4 gyrochrone. This is in agreement with the comparison made to the stardate gyrochronology ages, in the sense that the majority of stars with F18 isochrone ages 1.8 Gyr appear to be younger than 2.7 Gyr from a gyrochronology analysis. Figure 26. In each panel, contours show the Gaussian kernel density estimate of CKS planet hosts in the T eff -P rot plane. Points depict stars with F18 isochrone ages indicated in the titles of each panel. The solid, dashed, dashed-dotted, and dotted lines indicate polynomial fits to the empirical gyrochrones of the Pleiades (log (age) ≈ 8.1), Praesepe (log(age) ≈ 8.8), NGC 6811 (log(age) ≈ 9), and NGC 6819 + Ruprecht 147 (log(age) ≈ 9.4) clusters, respectively (Curtis et al. 2020). Finally, we also examined the evolution of other physical parameters known to correlate with age, such as the variability amplitude R var , near-UV (NUV) excess, and velocity dispersion. We tracked velocity dispersion using v tan , the velocity tangential to the celestial sphere, and v b , the velocity in the direction of the galactic latitude, sourced from Lu et al. (2021). GALEX NUV magnitudes were obtained from Olmedo et al. (2015). For a crude approximation of the NUV excess, we performed a quadratic fit to the full Kepler Q1-Q17 DR25 sample in the (G − G RP ) versus (NUV − K s ) color-color diagram. The NUV excess was then defined as a star's (NUV − K s ) color minus the quadratic color-color trend. No dereddening was performed. Figure 27 shows the evolution of these parameters as a function of age. For both the F18 and B20 isochrone ages, we observe increasing dispersion in v tan and v b with age, as expected. The strongest expected correlation is observed for R var (sourced from Lu et al. 2021) and F18 age, with R var declining for the first ∼3 Gyr before plateauing. The average NUV excess appears to decline over a similar timescale when using the F18 ages, though that trend is less significant, and there may be residual systematics from the manner in which we computed the excess. Both the R var and NUV excess trends are expected, as starspot coverage, variability amplitudes, and chromospheric activity are known to decline with age. By comparison, when using the B20 ages, the behavior of R var and NUV excess with age is not as expected. We conclude by noting that, while substantial uncertainties remain for isochronal ages, there is qualitative agreement between the CKS ages and the ages (or age indicators) derived from independent methods. In some of the comparisons above, the CKS and GKSPC ages perform comparably well, though it is at the youngest ages (3 Gyr) where the GKSPC ages do not reproduce some expected trends. As the evolution of small planets at early times is a primary focus of this work, we adopt the CKS ages.