THE MINIMUM OF SOLAR CYCLE 23: AS DEEP AS IT COULD BE?

, , , , , , , and

Published 2015 May 4 © 2015. The American Astronomical Society. All rights reserved.
, , Citation Andrés Muñoz-Jaramillo et al 2015 ApJ 804 68 DOI 10.1088/0004-637X/804/1/68

0004-637X/804/1/68

ABSTRACT

In this work we introduce a new way of binning sunspot group data with the purpose of better understanding the impact of the solar cycle on sunspot properties and how this defined the characteristics of the extended minimum of cycle 23. Our approach assumes that the statistical properties of sunspots are completely determined by the strength of the underlying large-scale field and have no additional time dependencies. We use the amplitude of the cycle at any given moment (something we refer to as activity level) as a proxy for the strength of this deep-seated magnetic field. We find that the sunspot size distribution is composed of two populations: one population of groups and active regions and a second population of pores and ephemeral regions. When fits are performed at periods of different activity level, only the statistical properties of the former population, the active regions, are found to vary. Finally, we study the relative contribution of each component (small-scale versus large-scale) to solar magnetism. We find that when hemispheres are treated separately, almost every one of the past 12 solar minima reaches a point where the main contribution to magnetism comes from the small-scale component. However, due to asymmetries in cycle phase, this state is very rarely reached by both hemispheres at the same time. From this we infer that even though each hemisphere did reach the magnetic baseline, from a heliospheric point of view the minimum of cycle 23 was not as deep as it could have been.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

The solar magnetic cycle is a process that takes the Sun through subsequent periods of high (maximum) and low (minimum) activity. Since the pioneering work of Parker (1955), there has been a continuous effort to understand the mechanisms that keep it going and define its properties (see a review by Charbonneau 2010). Most of this effort has focused on understanding the periods of highest activity (solar maximum). However, this focus shifted after the arrival of the unexpectedly deep minimum of solar cycle 23, in which record lows were measured across the board in solar activity indices and solar wind properties.

One of the most direct proxies of solar activity, and the one most commonly used to understand cycle variability, is the presence of sunspots on the photosphere. One of its main advantages is that reliable sunspot records exist that span more than a century. Sunspot groups are associated with bipolar magnetic regions (BMRs), that are believed to originate from an underlying large-scale toroidal field that is critical for the evolution of the solar cycle (see a review by Fan 2009). Furthermore, thanks to the systematic orientation and tilt of BMRs, their emergence and decay seem to be the primary mechanism that regenerates the poloidal field from which the solar cycle can start again (Cameron et al. 2010; Dasi-Espuig et al. 2010; Muñoz-Jaramillo et al. 2013), a process known as the Babcock–Leighton mechanism (BL; Babcock 1961; Leighton 1964).

One of the main questions that has been in the forefront of our minds since the arrival of the deep minimum of cycle 23, is: Is there a baseline for solar magnetism beyond which activity indices and solar wind properties cannot be any weaker? (Svalgaard & Cliver 2007; Schrijver et al. 2011; Cliver et al. 2012; Wang & Sheeley 2013).

Both the length of the sunspot record and the direct connection between sunspots and the solar cycle make it an ideal proxy with which to probe this question. However, distilling information out of sunspot (and BMR) properties is non-trivial due to the large variability observed in their properties (area, flux, and tilt and time, latitude, and longitude of emergence). The main objective of this paper is to introduce a new technique for studying the cycle dependence of sunspot properties (specifically sunspot group area) and demonstrate how powerful it is for contextualizing the extended minimum of cycle 23.

In order to better explain this new approach, we begin with a brief overview of the traditional approaches for binning sunspot and BMR properties to study the solar cycle (by date, by cycle, and by cycle phase; see Section 2). Then we discuss why these approaches are sub-optimal when the object of interest is the sunspots themselves, and we define activity level and how to use it to bin sunspot data (see Section 3). In Section 4, we introduce our two databases and how to combine them by taking advantage of their statistical properties followed by our statistical model and our methods for fitting the data and assessing their relative performance (see Section 5). In Section 6 we separate our data according to activity level and fit each activity level bin separately. We also show how the area distribution related to small structures is independent of activity level. In Section 7 we show how the area distribution related to BMRs is strongly dependent on activity level and re-define our statistical model to take advantage of this relationship. In Section 8 we demonstrate quantitatively, that an activity-level-dependent statistical model is the best of all the models proposed in this paper for characterizing our data. In Section 9, we take advantage of this model to better understand the extended minimum of cycle 23 and put it in the context of the last 12 cycles. In Section 10 we discuss whether the existence of a composite distribution arises from different components of the dynamo (small-scale versus global), or whether an alternative interpretation of the results is necessary. We finish with a summary and conclusions in Section 11.

2. TIME: THE PREDOMINANT WAY OF BINNING DATA

Given that the solar cycle is a continuously evolving transient process, time has been the traditional primary focus of statistical analyses of sunspot and BMR properties. This means that comparative studies break down and compartmentalize data into chunks defined by when they occur. This kind of binning is illustrated in Figures 1(a) and (b), showing vertical markers that break down our data into separate cycles. There are essentially three ways of binning data in time; to illustrate them, in this section we make a very limited review of studies using each type (which should not be considered comprehensive). Example studies are chosen specifically to highlight the advantages of each approach.

Figure 1.

Figure 1. (a)–(b) Temporal binning of sunspot data vs. (c)–(d) data binning by activity level. In panels (a)–(c), northern (southern) hemispheric data are marked using a light blue (dark red) color. In panels (c) and (d), activity level is indicated using a color scale changing from light yellow to dark blue. Note that the vertical and color axes are the same in panel (c). Activity level, displayed in panels (b) and (c), is calculated by convolving the daily sunspot group area with a six-month Gaussian filter.

Standard image High-resolution image

2.1. Binning Data by Arbitrary Time Interval

The most straightforward way of time binning is to separate data into specific time intervals (by day, month, or year or by Carrington rotation). This kind of binning is the most natural form of data exploration (see, for example, the pioneering studies by Tang et al. 1984; Wang & Sheeley 1989), and it is still widely used today. It is particularly powerful for searching for evidence of long-term trends (spanning timescales longer than the solar cycle itself). More recent examples of this kind of binning are the thought-provoking papers by Penn & Livingston (2006, 2011) who reported a decrease in the average magnetic field of sunspots since 1998; they speculated that, if this trend continued, sunspots would become very rare by 2025. Very good examples of how best to use this kind of binning are the responses to Penn & Livingston by Nagovitsyn et al. (2012) and Pevtsov et al. (2014) where, using data going back to 1920, they demonstrate that there is no apparent long-term trend in the evolution of the average properties of sunspots; there is simply a cyclic modulation.

2.2. Binning Data by Cycle

The second way of binning data involves grouping data according to the cycle to which they belong. This kind of binning is very good for identifying significant changes between different cycles. Some examples of this kind of binning are the papers by Hathaway & Choudhary (2008) who looked at the decay rate of sunspots, McClintock & Norton (2013) who studied variations in sunspot group tilt angles and Joy's Law, and de Toma et al. (2013) who examined changes in sunspot area between cycles 22 and 23.

This type of time binning truly excels when used for studying the physical mechanisms responsible for sustaining and propagating the solar cycle. An excellent example of this kind of work was performed by Dasi-Espuig et al. (2010) who by looking at cyclic averages of tilt angles, found a correlation between weighted averages of sunspot group properties during a cycle and the amplitude of the following cycle (not to be confused with the reported same-cycle negative correlation between the amplitude of a cycle and tilt averages; see Ivanov 2012; Dasi-Espuig et al. 2013; McClintock & Norton 2013). This result has been further expanded by Muñoz-Jaramillo et al. (2013) who demonstrated that this connection exists because the average properties of BMRs determine the strength of the poloidal field at the end of the solar cycle. Together they represent observational evidence in favor of the BL mechanism and our current understanding of the solar cycle.

2.3. Binning Data by Cycle Phase

The last type of time binning we will review here is binning according to cycle phase. This involves either choosing specific phases of the solar cycle (rising, maximum, declining, and minimum phases) or binning the data relative to its position within a particular solar cycle. Some examples of this kind of binning are the papers by Mathew et al. (2007) who studied the dependence of umbral and penumbral brightness on the solar cycle, Zharkova & Zharkov (2008) who examined daily variations of tilt angles during rising and declining phases of cycle 23, and Watson et al. (2011) who looked at maximum magnetic field and umbral/penumbral areas during the different phases of cycle 23.

This type of time binning is very powerful for characterizing the general properties of the solar cycle. An excellent example of this kind of work was performed by Jiang et al. (2011), who performed a very detailed quantitative characterization of the relationship between cycle amplitude, cycle phase, and the properties of active latitudes (i.e., the shape, location, and width of the wings in the butterfly diagram). Taking advantage of this characterization, they laid a solid foundation for the construction of synthetic data sets based solely on sunspot number, which can be used to drive surface flux transport simulations.

3. ACTIVITY LEVEL: A NEW WAY OF BINNING DATA

Although there is undeniable value in using time to bin sunspot and BMR data (when the objective is to study and characterize the solar cycle), this kind of binning is sub-optimal for studying how the evolution of the solar cycle changes the properties of BMRs and their associated sunspots. The problem is conceptual: a single BMR and its associated sunspot group is believed to be the photospheric manifestation of a buoyant flux tube. These tubes take roughly a month to travel through the convection zone (Fan 2009; Weber et al. 2011) and have a typical lifetime (after eruption) of about a month. Considering that the solar cycle involves decadal timescales, this means that from the perspective of each BMR, the magnetic cycle is a quasi-static process (i.e., time-independent during a BMR's life cycle). It is commonly believed that the total number of BMRs and their combined flux is a direct indication of the strength of the underlying toroidal field. Furthermore, this magnetic field strongly determines the resulting properties of the emerged BMRs (see a review by Fan 2009). We propose that the amplitude of the solar cycle at any specific moment in time (which we assume to be directly related to the characteristics of the underlying toroidal field), is the true quantity determining the statistical properties of sunspots and BMRs, and that the time dependence of these properties is simply a direct consequence of the evolution of this toroidal magnetic field.

To avoid possible misunderstandings, from now on we will use the term activity level to refer to the mean level of hemispheric solar activity at any specific moment in time. We do this in order to avoid confusion with a cycle's peak amplitude. In order to bin our data according to activity level, we first calculate the total hemispheric daily sunspot area, remove high-frequency components by convolving it with a six-month Gaussian filter (Hathaway 2010; Muñoz-Jaramillo et al. 2013), and use the result to assign an activity level to each data point. This kind of binning is illustrated in Figure 1(c), showing horizontal lines demarcating different activity levels. Note that activity level is calculated separately for each hemisphere. It is also important to mention that we are assuming no intrinsic differences in sunspot and BMR properties across different hemispheres or across different cycles. Instead, we assume that what makes each cycle unique is the actual evolution of activity level in each hemisphere (i.e., its ups and downs).

Figures 1(c) and (d), showing a butterfly diagram in which each point is colored according to activity level, highlight some of the subtle but important aspects of binning by activity level. First, we are assuming that activity levels and their associated statistical properties are not unique to any given cycle. This means that sunspots and BMRs that appear during the absolute maximum of a weak hemispheric cycle have the same statistical properties as those that appear at the same activity level in stronger hemispheric cycles. Cycle 19, the strongest cycle ever observed, is an extreme example of this (as it samples the entire range of activity levels we have observed so far). Second, under this assumption, sunspots that appear simultaneously in the northern and southern hemispheres will have different statistical properties if there is a hemispheric asymmetry in activity level.

4. DATA SELECTION AND TRUNCATION

As the backbone of our analysis we use the sunspot group database compiled and published by the Royal Greenwich Observatory (RGO). This set includes heliographic positions and areas of sunspot groups observed from 1874 to 1976 by a small network of observatories: the Cape of Good Hope, Kodaikanal, and Mauritius. The RGO data, covering nine solar cycles (from cycle 12 to cycle 20), provide the longest and most complete record of sunspot group areas. We extract from this database a single area and position for each sunspot group. We assign to the group the single largest reported area in all its days of observation. The result is a set of 30,026 groups. Data are shown in Figure 2(a).

Figure 2.

Figure 2. Logarithmic plot of sunspot group size of the (a) RGO and (b) KMAS data sets. Dashed black horizontal lines indicate the threshold above which data is used to calculate activity level and also fitted to the composite Weibull plus log-normal distribution. This threshold is set an order of magnitude above the smallest structure of each set.

Standard image High-resolution image
Figure 3.

Figure 3. (a) KMAS and (b) RGO empirical distribution functions for the period between 1954 and 1976. (c) The multiplication of RGO sunspot group area by a factor of 1.06 maximizes the agreement between both empirical distribution functions. Empirical distribution functions show all data in each set, but only data in the dark shaded region are used in the cross-calibration.

Standard image High-resolution image

Since part of our objective is to study the transition between sunspot cycles 23 and 24, we supplement the RGO data using observations taken by the Kislovodsk Mountain Astronomical Station (KMAS) of the Central Astronomical Observatory at Pulkovo. The KMAS has been in continuous operation since 1948, making it one of the very few institutions performing a wide array of solar surveys through the entirety of the space age. This makes it quite valuable as a connecting set between modern missions and previous surveys. This database contains 108,364 sunspot group observations taken from 1954 February 9 to the present (covering 6.5 solar cycles, from cycle 18 to cycle 24), giving us a nice overlap with the RGO set that we can use to cross-calibrate them. As with the RGO set, we extract a single area and position for each sunspot group. We assign to the group the single largest reported area in all its days of observation. The result is a set of 19,221 groups. KMAS data are available at http://158.250.29.123:8000/web/Soln_Dann. Data are shown in Figure 2(b).

As recounted in detail by Muñoz-Jaramillo et al. (2015), there is a host of issues that can potentially distort the statistical properties of structures near the lower detection threshold. They include but are not limited to observational bias, artificial binning caused by resolution, convolution of instrumental cadence and feature lifetime, underestimation due to the quality of the observing conditions, and underestimation due to excessive complexity in the observed phenomenon. Considering that small structures are also the most numerous, here we follow the suggestion by C. DeForest (2014, private communication) and implemented by Muñoz-Jaramillo et al. (2015) of imposing a truncation limit one order of magnitude above the minimum size of detection in our databases. We only use data above this limit in our distribution fits and analysis. The location of these thresholds, shown in Figure 2 as dark horizontal lines, successfully isolates problematic data from the rest of each set.

4.1. Cross-calibration

Our first task is to cross-calibrate the RGO and KMAS data sets to form a composite spanning from 1874 to the present. Given that the KMAS survey is still active, we use it as our reference set. This way we will be able to extend our composite database into the future as KMAS continues to perform observations. Since for this study it is important for the sets to be statistically compatible, we find the proportionality calibration constant by matching the KMAS (Figure 4(a)) and RGO (Figure 4(b)) empirical size distribution functions. For this purpose, we use only data belonging to the overlapping interval between the RGO and KMAS data sets (i.e., between 1954 and 1976). This technique was used by Muñoz-Jaramillo et al. (2015) to reconcile and cross-calibrate 11 different sunspot group, sunspot, and bipolar magnetic flux data sets. It involves the following steps.

  • 1.  
    Choose a proportionality constant out of a range of possible values.
  • 2.  
    Multiply all sunspot group areas in the RGO database by this proportionality constant (effectively shifting the empirical distribution left or right in logarithmic scale).
  • 3.  
    Evaluate if the resulting empirical distribution overlaps with the reference KMAS empirical distribution.
  • 4.  
    Find the root mean square error (RMSE) between the overlaps.
  • 5.  
    After trying all possible proportionality values in a set, identify which one minimizes the RMSE.

Figure 4.

Figure 4. (a) Histogram using logarithmic binning of RGO/KMAS data. (b) Empirical PDF of RGO/KMAS data. Both represent different ways of looking at the same data, and both panels are overplotted with the same fit using a linear combination of Weibull (dashed blue line) and log-normal distributions (dotted yellow line). The composite fit is shown as a solid dark red line. Both panels include all data in the set, but only data shown in a dark shade are included in the fit.

Standard image High-resolution image

We find that multiplying RGO data by a factor of 1.06 maximizes the agreement between the KMAS and RGO empirical distribution functions (shown in Figure 4(c)). We construct our composite by using all RGO data (with areas multiplied by the 1.06 factor), and KMAS data from 1977 onward.

5. SIZE DISTRIBUTION, DISTRIBUTION FITTING, AND MODEL SELECTION

In the past, different characterizations of the size-flux distribution of magnetic structures have used different statistical distributions to fit the data: the exponential distribution (Tang et al. 1984; Schrijver et al. 1997), the log-normal distribution (Bogdan et al. 1988; Baumann & Solanki 2005; Zhang et al. 2010; Schad & Penn 2010), exponential polynomials (Harvey & Zwaan 1993), the Weibull distribution (Parnell 2002), the power-law distribution (Meunier 2003; Zharkov et al. 2005; Parnell et al. 2009), and linear combinations of these distributions (Kuklin 1980; Jiang et al. 2011; Nagovitsyn et al. 2012; Muñoz-Jaramillo et al. 2015).

In this work we build upon the results of Muñoz-Jaramillo et al. (2015) who found after an in-depth quantitative comparison between the different proposed distributions that a linear combination of Weibull and log-normal distributions fits observations best. This linear combination is used to define the probability-density function (PDF) of sunspot group sizes:

Equation (1)

where x is the area of a given magnetic structure, $k\gt 0$ and $\lambda \gt 0$ are the shape and scale parameters of the Weibull distribution, μ and σ are the logarithmic mean and standard deviation characterizing the log-normal, and $0\leqslant c\leqslant 1$ is the proportionality constant that blends these distributions together. Note that we introduced a small change with respect to the composite distribution defined by Muñoz-Jaramillo et al. (2015): here, the Weibull (log-normal) term is multiplied by $1-c$ (c). It is also important to highlight that Equation (1) is normalized so that its integral over the entire space is equal to one. This is necessary so that we can later compare the empirical and analytical PDFs associated with activity level bins that contain different amounts of data points.

Due to the fact that we are working with truncated sets, we use the following truncated form of our PDF on our fits:

Equation (2)

where CDF denotes the cumulative distribution function associated with Equation (1), and ${{x}_{{\rm trunc}}}$ denotes the limit value below which data is not used in the fit (see Section 4).

In order to fit this PDF to the data we use maximum likelihood estimation (MLE). Its basic idea is to find the set of parameters that maximizes the likelihood of a statistical model given the observed data. For this purpose, the user defines and maximizes a likelihood function constructed using the probability of observing all data in the set. This method is far superior to fitting functional forms to histograms because it is not sensitive to the details of data binning. A more detailed description of MLE can be found in Appendix A and in most modern statistics books (for example in Hoel 1984).

To quantify the relative improvement of our statistical model by separating our data according to activity level, we use Akaike's information criterion (AIC; Akaike 1983). The AIC is a powerful tool for discriminating between different models by making an estimate of the expected, relative distance between the fitted model and the unknown true mechanism that generated the observed data. It uses a combination of the likelihood of the data and the model's degrees of freedom (dof) to strike a balance between bias and variance (i.e., between underfitting and overfitting). A more detailed description of AIC can be found in Appendix B and in an excellent book by Burnham & Anderson (2002).

5.1. Fitting the Composite Distribution to the Entire RGO/KMAS Set

Our first task is to fit the composite PDF to our RGO/KMAS database. This helps us place this work in light of the results of Muñoz-Jaramillo et al. (2015) and gives us a reference against which we can evaluate the performance of PDF fitting of data binned by activity level. The results of the fit are shown in Figure 4 and tabulated in Table 1. They are in agreement with those found by Muñoz-Jaramillo et al. (2015).

Table 1.  Fitting Parameters of the Composite Distribution to RGO/KMAS Sunspot Group Data

Composite Fit to RGO/KMAS data
Weibull Log-Normal  
k ${{\lambda }^{a}}$ μ σ c
0.48 11.14 5.63 0.88 0.55
±0.15 ±3.98 ±0.13 ±0.04 ±0.10

Note.

aUnits of μHem, and other quantities are dimensionless. The first row contains the fitted parameters; the second row contains the values of their 95% confidence intervals.

Download table as:  ASCIITypeset image

6. FITTING THE COMPOSITE DISTRIBUTION TO THE BINNED RGO/KMAS SET

After fitting our PDF to the entire data set, we now separate data according to activity level. An inspection of the empirical size PDF functions associated with different activity levels (see Figures 5(a) and (b)) shows a striking relationship between the abundance of large sunspot groups and activity level—with groups bigger than 1000 μHem being 30 times more likely to occur during high activity levels (e.g., the peak of cycle 19) than during a typical solar minimum. Additionally, as can be observed in Figures 5(d) and (e), the Weibull–log-normal composite is able to capture the overall shape of the empirical PDF in every case.

Figure 5.

Figure 5. (a), (b) Empirical size distribution associated with different activity levels. (d), (e) Composite PDFs fitted to data binned according to activity level. In order to enhance perception, we use two ways of displaying the PDFs. In (a) and (d), a solid color fills the area below each PDF and those associated with lower activity levels are plotted closer to the foreground. In (b) and (e), each PDF is plotted using a thin line colored according to activity level. Vertical dashed lines mark the limit below which data are not included in the fits. Panels (c), (f)–(i) show the relationship between the different fitting parameters in the composite PDF and activity level (see Equation (1)). Error bars indicate the 95% confidence intervals of each value. The Spearman's rank correlation coefficient (ρ) and its confidence level (P) are included as the title of each of these panels.

Standard image High-resolution image

In order to evaluate the relationship between the different fitting parameters of the composite PDF, we use Spearman's rank correlation coefficient ($\rho ;$ Spearman 1904), which assesses how well the relationship between two variables can be described using a monotonic function. The results are displayed in Figures 5(c) and (f)–(i). We find no correlation between activity level and the parameters characterizing the Weibull component of the composite PDF—with $\rho =0.0$ and $\rho =-0.19$ for the factor (λ) and shape parameter (k), respectively. On the other hand, the parameters that characterize the log-normal component are found to be correlated with activity level with a high degree of statistical significance (above 98%)–with $\rho =0.66$ and $\rho =0.57$ for the logarithmic mean (μ) and the logarithmic variance (σ) respectively. We find a moderate correlation between the proportionality constant that blends these distributions together (c) with $\rho =0.37$.

7. USING COMMON WEIBULL PARAMETERS FOR ALL ACTIVITY LEVELS

The apparent independence between activity level and the parameters characterizing the Weibull component of the composite PDF is in agreement with the results of Hagenaar et al. (2003, 2008) who found essentially no dependence between the distribution of ephemeral regions and the solar cycle. Furthermore, considering the values that these parameters assume, and the large width of their 95% confidence intervals (see Figures 5(c) and (f)), it is clear that leaving them unconstrained is being used by our fitting algorithm to over-fit the data.

To address this, we re-fit our data with the additional constraint that the parameters characterizing the Weibull component must be the same for all activity levels. We do this by maximizing total average log-likelihood (Tlk) for all activity levels:

Equation (3)

where $f(x;k,\lambda ,\mu ,\sigma ,c)$ is our composite PDF function (see Equation (1)); the index j denotes each activity level bin; the index i denotes each data  point in a bin; k and λ (the Weibull parameters) are free to vary but must be the same for all activity level bins; and ${{\mu }_{j}}$, ${{\sigma }_{j}}$, and cj (the log-normal parameters and the constant of proportionality) are allowed be different for different bins.

As can be seen in Figure 6(c), Tlk has a single global maximum located at ${{k}_{{\rm best}}}=0.46$ and ${{\lambda }_{{\rm best}}}=11.49\;\mu {\rm Hem}.$ These values are well within the 95% confidence intervals previously found for k and λ in both the unconstrained fit (see Figures 5(c) and (f)), and the fit to the unbinned RGO/KMAS Set (see Table 1).

Figure 6.

Figure 6. (a), (b) Empirical size distribution associated with different activity levels. (d), (e) Composite PDFs fitted to data binned according to activity level using the same Weibull parameters for all activity levels. Vertical dashed lines mark the limit below which data are not included in the fits. (c) Total average log-likelihood (Tlk) for all activity levels as a function of the Weibull parameters k and λ (see Equation (3)). The optimum values that maximize Tlk simultaneously for all activity level bins are ${{k}_{{\rm best}}}=0.46$ and ${{\lambda }_{{\rm best}}}=11.49\;\mu {\rm Hem}$. Panels (g)–(i) show the relationship between the remaining parameters in the composite PDF and activity level. Error bars indicate the 95% confidence intervals of each value. The Spearman's rank correlation coefficient (ρ) and its confidence level (P) are included as the title of each of these panels. Fits to the relationships between these parameters and activity level are shown as red dashed lines. The analytical expression for each fit is included in the legend of each panel.

Standard image High-resolution image

After forcing k and λ to have the same value for all activity level bins, there is a remarkable tightening of the relationship between activity level and the remaining PDF parameters (μ, σ, and c, which can be seen both qualitatively and as a significant improvement in the Spearman's rank correlation coefficients.

We perform a ${{\chi }^{2}}$ fit to this dependence using power functions (see Figures 6(g)–(i) for fitting values), finding a reduced ${{\chi }^{2}}$ lower than unity in all cases. Although in this work we fit these dependencies using power functions, due to their simplicity there are several functional forms that would fit the scatter plots equally well within the 95% confidence intervals (for example logarithmic and exponential forms). The true characterization of these dependencies would involve a large amount of tests that is beyond the scope of this paper and will be performed in a later work.

Nevertheless, using these results, one can define a PDF with constant k and λ, whose properties depend on activity level through the relationships shown in Figures 6(g)–(i), and in which binning by activity level is no longer necessary. This PDF is defined as:

where AL is the activity level in mHem on the day each sunspot group was observed.

8. QUANTITATIVE MODEL SELECTION

Now that we have characterized the dependence of the size-flux PDF on activity level, our task is to quantitatively identify the best of all the models that have been used so far. These are as follows.

  • 1.  
    The same PDF irrespective of activity level (see Muñoz-Jaramillo et al. 2015, and Section 5.1).
  • 2.  
    A different PDF for each activity level bin (see Section 6).
  • 3.  
    A different PDF for each activity level bin, but in which the values of k and λ are forced to be equal for every bin (see Section 7).
  • 4.  
    A PDF with constant k and λ, but in which μ, σ, and c depend on activity level through power functions (see Equation (4)).

For this purpose, we use the AIC (described in detail in Appendix B), and the results are shown in Table 2. A comparison between log-likelihood and dof (columns 3 and 4, respectively), shows that log-likelihood is the main factor determining AIC (not the dof; see Equation (8)). The reason is that our data set has significantly more data points than the dof in each of our models.

Table 2.  Comparative Performance of the Different Ways of Fitting the Data Presented in this Paper

Quantification of Distribution Performance Using AIC
Fit Characteristics Description Log-Likelihood Degrees of Freedom ${\Delta }_{j}^{{\rm AIC}}$ Aw
No dependence on activity level Section 5.1 −2.097 × 105 5 3,779 <0.001
Unconstrained, binned by activity level Section 6 −2.093 × 105 85 3,063 <0.001
Constrained, binned by activity level Section 7 −2.093 × 105 53 3,033 <0.001
No binning, analytical dependence on activity level Equation (4) 2.078 × 105 9 0 >0.999

Note. ${\Delta }_{j}^{{\rm AIC}}$ is the relative AIC difference described by Equation (B2). Aw is the Akaike weight described by Equation (B4). The lower ${\Delta }_{j}^{{\rm AIC}}$ is, the More a model is likely to be correct (Quantified Using Aw). Bold text indicates the best model according to AIC.

Download table as:  ASCIITypeset image

As expected, the worst fit corresponds to the PDF that does not depend on activity level (defined in Section 5.1). This is followed by both our constrained and unconstrained fits binned by activity level (defined in Sections 6 and 7, respectively). We find, with a very high degree of statistical significance, that the best model to fit our data is the unbinned PDF whose parameters depend analytically on activity level (defined by Equation (4)). It is important to highlight that AIC works only as a relative estimate. This means that it cannot tell us whether Equation (4) characterizes the true mechanism that generated the observed data. Instead, it allows us to rule out all the models proposed in this work (with near certainty) in favor of the model described by Equation (4). Taken together with the results of Muñoz-Jaramillo et al. (2015) who fitted five more models to similar data (including the model described in Section 5.1), we can take advantage of the relative nature of AIC to rule those models out as well.

9. THE SOLAR MAGNETIC FLOOR AND THE TIME DEPENDENCE OF SUNSPOT PROPERTIES

As was shown in Section 5.1, we find the Weibull component of the composite PDF to be independent of activity level. This result is in agreement with other studies arguing in favor of a magnetic baseline (Svalgaard & Cliver 2007; Schrijver et al. 2011; Cliver et al. 2012), since it suggests that small magnetic structures in the photosphere indeed arise from a cycle-independent process. However, the striking connection between the size-flux PDF and activity level (demonstrated in Section 7) allows us to do more than that; it allows us make a quantitative comparison of the depth of each minimum during the last 130 yr (12 solar cycles). For this purpose, we calculate the expected value of sunspot group area (i.e., the average size of magnetic structures) as a function of activity level:

Equation (4)

where Γ is the gamma function. This quantity can be used as a thermometer for solar magnetism, as it tells us the typical magnetic structure size that we can expect to see as a function of time.

As happens with the composite PDF, the expected value of the sunspot group area is also a linear combination of Weibull (second line, Equation (4)) and log-normal (third line, Equation (4)) components. Furthermore, almost all of the time dependence of the expected value can be attributed to the log-normal component. The consequence is that the magnetic baseline is defined by the Weibull component and is only truly visible in those times in which the emergence of cycle-related BMRs shuts down.

Figure 7(b) shows the time evolution of the log-normal component of the expected sunspot area and how it contrasts with its Weibull counterpart. Note that there is a weak modulation of the Weibull contribution to the expected value due to the fact that the proportionality constant (c) by definition depends on activity level. We find that the log-normal contribution drops below its Weibull counterpart in most hemispheric minima. Using a notation where $n.5$ corresponds to the minimum between cycle n and cycle n+1, the exceptions are 15.5N&S, 17N&S, 19.5 N, 20.5N&S, 21.5N&S, and 22.5N&S (note that most exceptions occur during the space age). This means that, from a hemispheric point of view, we have been able to observe that magnetic baseline. However, the results are different from a whole Sun point of view. We calculate the whole Sun expected value as a weighted average between the expected value of the northern and southern hemispheres. Due to its strong correlation with sunspot numbers, for simplicity we use activity level as our weighting coefficient:

Equation (5)

where ${{\mathbb{E}}_{{\rm WS}}}$, ${{\mathbb{E}}_{{\rm N}}}$, and ${{\mathbb{E}}_{{\rm S}}}$ are the whole Sun, northern, and southern expected values, respectively, and ALN (ALS) is the activity level in the northern (southern) hemisphere.

Figure 7.

Figure 7. (a) Total daily sunspot area smoothed using a six-month Gaussian filter; this is the quantity we use to define activity level. (b) Log-normal and Weibull contributions to the expected sunspot group area. Log-normal components are calculated separately for each hemisphere; the Weibull component is shown using a black dotted line. For both panels, the northern (southern) hemisphere is denoted by a solid light blue (dashed dark red) line. (c) Expected value of sunspot group area for the whole Sun (solid green line). For reference, the Weibull component is shown as well.

Standard image High-resolution image

Figure 7(c) shows the time evolution of the expected value for the whole Sun (including both log-normal and Weibull components) and the Weibull baseline for comparison. The story is quite different; there have been only two solar minima during the last 12 cycles in which observation of the baseline magnetism has been possible in both hemispheres simultaneously (i.e., for the whole Sun; 13.5 and 14.5). Apart from those, hemispheric asymmetries have conspired to raise the whole Sun level above the baseline magnetism.

One of the striking features evidenced in Figure 7(c) is how different the minimum of cycle 23 is when compared to other minima during the space age (19.5 through 23.5). It is no wonder why it seemed so unusual to us. This has important implications because the properties of the heliosphere, its current sheet, and the background solar wind are strongly determined at a global level. The fact that the minimum of cycle 23 had a very asymmetric current sheet is evidence that solar magnetism was not in its baseline state. Based on this result, we infer that even though each hemisphere did reach the magnetic baseline, the minimum of cycle 23 was not as deep as it could have been.

10. SMALL-SCALE VERSUS GLOBAL DYNAMO?

In our previous paper (Muñoz-Jaramillo et al. 2015), we proposed that the existence of two populations of sunspot groups originated from separate contributions by the small-scale and global components of the dynamo. However, with the evolution of our understanding we feel that further clarification is necessary. We find strong evidence that the size-flux distribution has two components that are discriminated by size, and only the properties of the larger structures are modulated by the solar cycle. However, in spite of the fact that the properties of small pores are independent of the solar cycle, their latitude of appearance is still modulated by active latitudes (as can be seen in Figure 1). This means that there must still be a connection between these structures and the cycle itself and that their formation cannot be attributed solely to the small-scale component of the dynamo (otherwise they would be observable throughout the photosphere).

A possible explanation is that these small structures arise from the re-processing of the decaying magnetic field of their large-scale counterparts and thus their relative numbers are governed by the properties of surface convection (and only loosely by the amount of available decaying field). In this case, the magnetic baseline found in the previous section is contingent on the emergence of a minimum amount of large-scale structures and cannot be taken as a hard lower limit for grand minima like the Maunder minimum. Unfortunately, it is difficult to make a further assessment of this connection without resorting to magnetic data. For this reason, we will look at this issue in more detail in future work involving magnetic structure catalogs compiled using SOHO/MDI and SDO/HMI data.

11. SUMMARY AND CONCLUDING REMARKS

In this work we have introduced a new way of binning sunspot group and BMR data with the purpose of better understanding the impact of the solar cycle on sunspot and BMR properties and how this defines the characteristics of the extended minimum of cycle 23. This approach hinges critically on our current understanding of BMRs as the photospheric manifestation of emergent buoyant flux tubes arising from a large-scale underlying toroidal field. In particular, we assume that from the point of view of each active region, the solar cycle can be approximated as a quasi-static process. This means that the properties of sunspots and BMRs are completely determined by the strength of the underlying toroidal field and have no additional long-term dependencies. In other words, we are assuming that the statistical properties of sunspots and BMRs do not depend on cycle phase (rising versus declining; maximum versus minimum), but on how strong the cycle is at each particular moment (something we refer to as activity level).

In this work we build upon the results of Muñoz-Jaramillo et al. (2015) who found after analyzing 11 different databases, that the solar size-flux distribution is better characterized by a linear combination of Weibull and log-normal distributions—where a pure Weibull (log-normal) characterizes the distribution of structures with fluxes below (above) 1021Mx (1022Mx). After binning our data according to activity level, we fit this composite distribution to each separate bin and look at the dependence of each parameter on activity level.

We find that the parameters that characterize the Weibull component have no dependence on activity level. This is in agreement with the results of Hagenaar et al. (2003, 2008) who found essentially no dependence between the distribution of ephemeral regions (below 1020Mx) and the solar cycle. We propose that the structures characterized by the Weibull component are what give the Sun a magnetic baseline.

In stark contrast to the Weibull component, we find a clear dependence between activity level and the parameters that characterize the log-normal component of the size-flux distribution (which we fit using power functions). This supports the interpretation of Muñoz-Jaramillo et al. (2015) who proposed that the log-normal component is directly connected to the global component of the dynamo (and the generation of bipolar active regions).

By taking advantage of our analytical characterization of the size-flux distribution and its dependence on activity level, we study the relative contribution of each component (small-scale versus large-scale) to solar magnetism. In order to do this, we calculate the expected value of sunspot group areas and study its evolution with time. We find that from a hemispheric point of view, almost every solar minimum (during the last 12 cycles) reaches a point where the only contribution to magnetism comes from the small-scale component. However, due to asymmetries in cycle phase, this state is very rarely reached by both hemispheres at the same time (according to our data, only during the minima of cycles 13 and 14). There is no question that the extended minimum of cycle 23 is deeper than any other minimum of the space age. However, based on our results, we infer that even though each hemisphere did reach the magnetic baseline, from a heliospheric point of view the minimum of cycle 23 was not as deep as it could possibly be.

We thank Karel Schrijver, María Navas Moreno, and an anonymous referee for useful discussions and suggestions. This research was supported by the NASA Living With a Star Jack Eddy Postdoctoral Fellowship Program, administered by the UCAR Visiting Scientist Programs, contract SP02H1701R from Lockheed-Martin to the Smithsonian Astrophysical Observatory, and the CfA Solar Physics REU program, NSF grant number AGS-1263241. Andrés Muñoz-Jaramillo is very grateful to George Fisher and Stuart Bale for their support at the University of California, Berkeley, and Phil Scherrer for his support at Stanford University. The National Solar Observatory (NSO) is operated by the Association of Universities for Research in Astronomy, AURA Inc. under cooperative agreement with the National Science Foundation (NSF).

APPENDIX A: MAXIMUM LIKELIHOOD ESTIMATION

The idea behind the MLE is to find the set of parameters that maximizes the likelihood of a statistical model M given the observed data $D=\{{{D}_{1}},{{D}_{2}},...,{{D}_{n}}\}$ by maximizing the likelihood (L) function:

Equation (6)

This process of maximization is typically performed by first taking the logarithm of both sides of Equation (A1), and maximizing the resulting log-likelihood (lk) function:

Equation (7)

APPENDIX B: AKAIKE'S INFORMATION CRITERION

The AIC for a model Mj is defined as:

Equation (8)

where ${\rm lk}({{M}_{j}})$ is the log-likelihood of model Mj (see Equation (A2)) and nj is the number of parameters of model j. The model with the minimum AIC is chosen as the best. In a sense, by minimizing the AIC one is looking for the model with the largest log-likelihood. However, log-likelihood alone is not sufficient to discriminate between models because it is biased as an estimation of the model selection target. This bias was found by Akaike (1983) to be approximately equal to each model's number of parameters (n) and thus the presence of the second term in Equation (B1). The significance of the AIC is strongly dependent on an appropriate choice of models. Applying the AIC to a set of very poor models will always select one estimated to be the best (even though that model may still be poor in an absolute sense).

The relative nature of the AIC is better represented by calculating the relative AIC differences:

Equation (9)

This in turn can be used to estimate the likelihood of a model given the data:

Equation (10)

and use it to calculate the Akaike weights:

Equation (11)

which are a measure of the probability that the model Mj is the best model given the data.

Please wait… references are loading.
10.1088/0004-637X/804/1/68