Machine-learning based lightning nowcasting data archive
Description
This data archive contains the derived data supporting the findings of article "Lightning nowcasting with aerosol-informed machine learning and satellite-enriched dataset". The paper is currently in the preprint version: https://doi.org/10.21203/rs.3.rs-2616886/v1
The prediction results in this data archive are generated by various models:
1. Current model. The model involves data input of aerosol observations together with meteorological variables and auxiliary datasets, as well as data enrichment by Geostationary Lightning Mapper (GLM). In the demo of the dataset, the year of 2020 is trained and predicted on a cross-validation scheme.
2. LMA model. The model acts as the baseline model considering only data label obtained from the ground-based Lightning Mapping Array (LMA), which observes accurate lightning occurrence in limited spatial range.
3. No-AOD model. The model acts as the baseline model considering no aerosol observation is utilized during the machine learning process.
The model results are demonstrated in a continuous value in 0-1. Trade-offs between Probability of Detection (POD) and False Alarm Ratio (FAR) can be optimized by selection of different thresholds.
Other datasets:
1. Dataset for training. It is for the public use of machine learning training for the current model and no-AOD model (training input features vary).
2. PM2.5 dataset. The real-time spatially continuous and hourly-level PM2.5 dataset is obtained following a published method by Zeng et al.. In this method, the fundamental in-situ measurements are obtained from Air Quality System (AQS) monitoring network operated by United States Environmental Protection Agency.
Reference:
Siwei Li, Ge Song, Jia Xing et al. Lightning nowcasting with aerosol-informed machine learning and satellite-enriched dataset, 14 March 2023, PREPRINT (Version 1) available at Research Square [https://doi.org/10.21203/rs.3.rs-2616886/v1]
Zeng, Z. et al. Estimating hourly surface PM2. 5 concentrations across China from high-density meteorological observations by machine learning. Atmospheric Research 254, 105516 (2021).
Files
Files
(991.9 MB)
Additional details
References
- Siwei Li, Ge Song, Jia Xing et al. Lightning nowcasting with aerosol-informed machine learning and satellite-enriched dataset, 14 March 2023, PREPRINT (Version 1) available at Research Square [https://doi.org/10.21203/rs.3.rs-2616886/v1]
- Zeng, Z. et al. Estimating hourly surface PM2. 5 concentrations across China from high-density meteorological observations by machine learning. Atmospheric Research 254, 105516 (2021).