A publishing partnership

The following article is Open access

False Alarms Revealed in a Planet Search of TESS Light Curves

, , , , , , and

Published January 2023 © 2023. The Author(s). Published by the American Astronomical Society.
, , Citation Michelle Kunimoto et al 2023 Res. Notes AAS 7 7 DOI 10.3847/2515-5172/acb149

2515-5172/7/1/7

Abstract

In this paper, we investigate the impact of false alarms on planet searches of TESS data by performing a search of a large number of stars. We examine the period distribution of transit-like signatures uncovered in a Box-Least Squares transit search of TESS light curves, and show significant pileups at periods related to instrumental and astrophysical noise sources. Signatures uncovered in a search of inverted light curves feature similar structures in the period distribution. Automated vetting methods will need to remove these excess detections, and light curve inversion appears to be a suitable method for simulating false alarms and designing new vetting metrics. Automated vetting will be a significant step toward making TESS data useful for demographic studies.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Exoplanet demographics provide significant insight into exoplanet formation and evolution theory. Exoplanet catalogs, however, do not represent the true exoplanet population. Catalogs are not complete because they miss exoplanets, and may not be reliable because of contamination from false positives (e.g., nearby or bound eclipsing binaries) or false alarms (e.g., noise, systematics, and stellar variability). The Kepler mission (Borucki et al. 2010) was designed to produce demographic studies, yielding a catalog with well-measured completeness and reliability (Thompson et al. 2018), making it possible to infer the underlying planet population. The TESS mission (Ricker et al. 2015) is finding thousands of exoplanets, but the TESS Object of Interest catalog is not useful for demographics because it has unknown completeness and reliability.

Exoplanet catalogs are typically produced by a software pipeline that detects transit events which are vetted to determine if the events are caused by planets. The most straightforward way to produce a catalog with well-measured completeness and reliability is to use a fully automated detection and vetting pipeline. Designing an automated vetting pipeline is a non-trivial task requiring an understanding of false alarms found in observed data. Automated vetting (Thompson et al. 2018) becomes particularly challenging in the low signal-to-noise ratio (S/N) regime of smaller and/or longer period planets because most false alarms due to noise and systematics have low S/N.

We use light curves obtained from multiple sources to investigate whether or not systematics depend on how the light curves were generated. We identify periods with excess detections, or pileups, and speculate about their causes in TESS systematics. We also perform a preliminary exploration using light curve inversion to simulate false alarms, needed for characterizing catalog reliability for future TESS demographics studies.

2. Planet Search

We perform our investigation on M dwarfs (R < 0.6 R, M < 0.6 M, Teff < 3900 K), which are of high interest to exoplanet searches, especially for small planets. We selected all 92,899 M dwarfs brighter than T = 13.5 mag from the TESS Input Catalog (TICv8.2; Paegert et al. 2021) that were observed by TESS across sectors 1 to 52. Multi-sector light curves from the Quick-Look Pipeline (QLP; Huang et al. 2020; Kunimoto et al. 2021, 2022) were obtained for each star. To compare to QLP, light curves were also extracted using the eleanor pipeline (Feinstein et al. 2019). Both eleanor CORR_FLUX and PCA_FLUX systematics-corrected versions of each light curve were saved.

Long-term trends from all three sets of light curves (QLP, CORR_FLUX, and PCA_FLUX) were removed using a biweight detrending algorithm with a 0.5 day window length as implemented in wotan (Hippke et al. 2019). We then performed a multi-planet search of the detrended light curves with the Box-Least Squares (BLS) algorithm from cuvarbase. 7 Each star was searched from 0.5 to 100 days or the length of the longest consecutive stretch of TESS sectors comprising the light curve, whichever was shorter. This strategy was chosen to minimize the challenges of searching data with long gaps between sectors. If a detection had S/N > 9, data within one transit duration of the measured center of the transit time is masked out and the residuals were searched, up to five times per light curve. We define Threshold Crossing Events (TCEs) as signals with S/N > 9 and at least two events.

The same BLS procedure was re-run on the same light curves after inversion to produce inverted TCEs (invTCEs). Inversion should eliminate valid transits but preserve the general noise and systematic properties seen in the original time series, and was a strategy used by Kepler to simulate false alarms (Thompson et al. 2018).

3. Results

Figure 1 shows histograms of TCE and invTCE periods from all three light curve sources. In all cases, we see the same general structure of pileups, which are similar to those found by the SPOC pipeline (Jenkins et al. 2016) using postage stamp targets. 8 While the inverted light curves include detections of astrophysical phenomena such as stellar flares and some eclipsing binaries, detection pileups occur at periods not correlated with flare statistics. We conclude that light curve inversion is a suitable method for simulating TESS false alarms in future demographic studies.

Figure 1.

Figure 1. Period histograms of TCEs and invTCEs identified in various input TESS light curves by our detection pipeline, with insets showing inverted eleanor CORR_FLUX light curves with planet-like false alarms. Transits in the insets are indicated by short blue vertical lines, and the red vertical dashed lines delineate sectors. The filled histograms are TCEs and the line histograms are invTCEs. The vertical gray lines are aliases of the 13.7 days TESS orbital period. Most signals are false alarms. We see that, in all cases, pileups in TCEs correspond to pileups in invTCEs, indicating that many detections in the observed data are false alarms. Top panel: CORR_FLUX light curves, comparing two- and three-transit thresholds for transit detection. Middle panel: Comparing three-transit detections from CORR_FLUX and PCA_FLUX light curves. Lower panel: Comparing three-transit detections from CORR_FLUX and QLP light curves. The lower panel also shows "no sines" TCEs and invTCEs after the removal of sinusoidal signals identified by the SWEET test (see text).

Standard image High-resolution image

We see two classes of pileups:

  • 1.  
    Sharp peaks with broad shoulders at aliases of the 13.7 days TESS orbital period. Inspection of observed and inverted light curves reveals that most of these detections are easily identified excursions at the edges of data gaps, due to scattered light, that were not removed by detrending. Not all residual edge effects are easily identified, however, as shown by the inset light curve examples in Figure 1.
  • 2.  
    A general excess with a period <1 day. Inspection of the light curves reveals that most of these are highly periodic, which we believe are due primarily to scattered light from the Earth's rotation, and also stellar variability (e.g., spots and pulsations) and contact eclipsing binaries. We find that the Kepler SWEET test (Thompson et al. 2018), which measures the correlation of light curves with sinusoids, enables the removal of the highly sinusoidal detections, shown by the "no sines" histograms in Figure 1.

Comparing the pileup structures for the various light curve sources, we see little difference between eleanor CORR_FLUX and PCA_FLUX light curves. For periods ≳3 days, we see little difference between eleanor and QLP, but QLP light curves have a larger excess with periods below one day. We find that requiring three versus two transits for a transit detection significantly reduces the number of detections with period ≳3 days, but does not remove the pileup structure. While most excess detections below one day are sinusoidal, some appear more transit like, as shown in the inset example in Figure 1.

4. Conclusions

Creating an exoplanet catalog with well-measured reliability requires the ability to detect false positives and false alarms. Kepler effectively used automated vetting based on metrics that were tuned to remove detections from inverted Kepler data, and inverted data was used to measure Kepler reliability. The observations in this paper lead us to expect that a similar strategy, appropriately tuned for TESS using inverted TESS data, will be successful.

Footnotes

Please wait… references are loading.
10.3847/2515-5172/acb149