Abstract
The effective use of available information in extreme value analysis is critical because extreme values are scarce. Thus, using the r largest order statistics (rLOS) instead of the block maxima is encouraged. Based on the four-parameter kappa model for the rLOS (rK4D), we introduce a new distribution for the rLOS as a special case of the rK4D. That is the generalized logistic model for rLOS (rGLO). This distribution can be useful when the generalized extreme value model for rLOS is no longer efficient to capture the variability of extreme values. Moreover, the rGLO enriches a pool of candidate distributions to determine the best model to yield accurate and robust quantile estimates. We derive a joint probability density function, the marginal and conditional distribution functions of new model. The maximum likelihood estimation, delta method, profile likelihood, order selection by the entropy difference test, cross-validated likelihood criteria, and model averaging were considered for inferences. The usefulness and practical effectiveness of the rGLO are illustrated by the Monte Carlo simulation and an application to extreme streamflow data in Bevern Stream, UK.
Similar content being viewed by others
Code availability
https://github.com/yire-shin/rGLO.git
References
Aarnes OJ, Breivik O, Reistad M (2012) Wave extremes in the Northeast Atlantic. J Clim 25(5):1529–1543. https://doi.org/10.1175/JCLI-D-11-00132.1
Ahmad MI, Sinclair CD, Werritty A (1988) Log-logistic flood frequency analysis. J Hydrol 98(3):205–224. https://doi.org/10.1016/0022-1694(88)90015-7
An Y, Pandey MD (2007) The \(r\) largest order statistics model for extreme wind speed estimation. J Wind Eng Indust Aerodynam 95(6):467–467. https://doi.org/10.1016/j.jweia.2007.02.026
Bader B, Yan J, Zhang XB (2017) Automated selection of \(r\) for the r largest order statistics approach with adjustment for sequential testing. Stat Comput 27(6):1435–1451
Bader B, Yan J (2020) Eva : extreme value analysis with goodness-of-fit testing. - R package version 0.2.6
Banfi F, Cazzaniga G, De Michele C (2022) Nonparametric extrapolation of extreme quantiles: a comparison study. Stoch Environ Res Risk Assess 36:1579–1596
Blum AG, Archfield SA, Vogel RM (2017) On the probability distribution of daily streamflow in the United States. Hydrol Earth Syst Sci 21(6):3093–3103. https://doi.org/10.5194/hess-21-3093-2017
Coles S (2001) An introduction to statistical modeling of extreme values. Springer-Verlag, London
Dupuis DJ (1997) Extreme value theory based on the r largest annual events: a robust approach. J Hydrol 200(1–4):295–306
Fauer FS, Rust HW (2023) Non-stationary large-scale statistics of precipitation extremes in central Europe. Stoch Environ Res Risk Assess 37:4417–4429
Feng J, Jiang W (2015) Extreme water level analysis at three stations on the coast of the Northwestern Pacific Ocean. Ocean Dyn 65(11):1383–1397
Fitzgerald D (2005) Analysis of extreme rainfall using the log-logistic distribution. Stoch Env Res Risk Assess 19:249–257
Fletcher D (2018) Model Averaging. Springer
Galavi H, Mirzaei M, Yu B et al (2023) Bootstrapped ensemble and reliability ensemble averaging approaches for integrated uncertainty analysis of streamflow projections. Stoch Environ Res Risk Assess 37:1213–1227
Gilleland E, Katz RW (2016) extRemes 2.0: an extreme value analysis package in R. J Stat Software 72(8):1–39
Hosking JRM (1994) The four-parameter kappa distribution. IBM J Res Devel 38(3):251–258. https://doi.org/10.1147/rd.383.0251
Hosking JRM, Wallis JR (1997) Regional frequency analysis: an approach based on L-moments. Cambridge University Press, Cambridge
Hosking JRM, Wallis JR, Wood EF (1985) Estimation of the generalized extreme-value distribution by the method of probability-weighted moments. Technometrics 27:251–261
Hussain T, Bakouch HS, Iqbal Z (2018) A new probability model for hydrologic events: properties and applications. J Agric Bio Envir Stat 23:63–82. https://doi.org/10.1007/s13253-017-0313-6
Ibrahim MN (2022) Four-parameter kappa distribution for modeling precipitation extremes: a practical simplified method for parameter estimation in light of the L-moment. Theor Appl Climat 150:567–591
Jeong BY, Murshed MS, Seo YA, Park JS (2014) A three-parameter kappa distribution with hydrologic application: a generalized Gumbel distribution. Stoch Environ Res Risk Assess 28(8):2063–2074
Johnson NL, Kotz S, Balakrishnan N (1995) Continuous univariate distributions, vol 2. John Wiley & Sons
Kim S, Shin H, Ahn H (2015) Development of an unbiased plotting position formula considering the coefficient of skewness for the generalized logistic distribution. J Hydrol 527:471–481
Kjeldsen TR, Jones DA (2004) Sampling variance of flood quantiles from the generalised logistic distribution estimated using the method of L-moments. Hydrol Earth Syst Sci 8:183–190
Kjeldsen TR, Ahn H, Prosdocimi I (2017) On the use of a four-parameter kappa distribution in regional frequency analysis. Hydrol Sci J 62:1354–1363. https://doi.org/10.1080/02626667.2017.1335400
Martins ES, Stedinger JR (2000) Generalized maximum-likelihood generalized extreme value quantile estimators for hydrologic data. Water Resour Res 36(3):737–744
Matsuda Y, Yajima Y, Tong H (2006) Selecting models with different spectral density matrix structures by the cross-validated log likelihood criterion. Bernoulli 12(2):221–249
Meshgi A, Khalili D (2008) Comprehensive evaluation of regional flood frequency analysis by L- and LH-moments. II. Development of LH-moments parameters for the generalized Pareto and generalized logistic distributions. Stoch Environ Res Risk Assess 23:137–152
Murshed MS, Seo YA, Park J-S (2014) LH-moment estimation of a four parameter kappa distribution with hydrologic applications. Stoch Environ Res Risk Assess 28(2):253–262
Naseef MT, Kumar SV (2017) Variations in return value estimate of ocean surface waves-a study based on measured buoy data and ERA-Interim reanalysis data. Nat Hazards Earth Syst Sci 17(10):1763–1778
Okoli K, Breinl K, Brandimarte L, Botto L et al (2018) Model averaging versus model selection: estimating design floods with uncertain river flow data. Hydrol Sci J 63(13–14):1913–1926
Pan X, Rahman A, Haddad K et al (2022) Peaks-over-threshold model in flood frequency analysis: a scoping review. Stoch Environ Res Risk Assess 36:2419–2435
Papukdee N, Park J-S, Busababodhin P (2022) Penalized likelihood approach for the four-parameter kappa distribution. J Appl Stat 49(6):1559–1573
Rieder H (2014) Extreme value theory: a primer. https://www.ldeo.columbia.edu/~amfiore/eescG9910_f14_ppts/Rieder_EVTPrimer.pdf
Salaki DT, Kurnia A, Sartono B et al (2023) Model averaging in calibration of near-infrared instruments with correlated high-dimensional data. J Appl Stat. https://doi.org/10.1080/02664763.2022.2122947
Salinas JL, Castellarin A, Viglione A et al (2014) Regional parent flood frequency distributions in Europe-Part 1: Is the GEV model suitable as a pan-European parent? Hydrol Earth Syst Sci 18(11):4381–4389
Saulo H, Vila R, Bittencourt VL et al (2023) On a new extreme value distribution: characterization, parametric quantile regression, and application to extreme air pollution events. Stoch Environ Res Risk Assess 37:1119–1136
Serinaldi F, Lombardo F, Kilsby CG (2020) All in order: distribution of serially correlated order statistics with applications to hydrological extremes. Adv Water Resour 144:103686
Shin H, Kim T, Kim S et al (2010) Estimation of asymptotic variances of quantiles for the generalized logistic distribution. Stoch Environ Res Risk Assess 24:183–197
Shin Y, Park J-S (2023) Modeling climate extremes using the four-parameter kappa distribution for r-largest order statistics. Weath Clim Extrem 39:100533
Silva RSD, Nascimento FFD, Bourguignon M (2022) Dynamic linear seasonal models applied to extreme temperature data: a Bayesian approach using the r-largest order statistics distribution. J Stat Comput Simul 92(4):705–723
Smith RL (1986) Extreme value theory based on the r largest annual events. J Hydrol 86(1):27–43
Smyth P (2000) Model selection for probabilistic clustering using cross-validated likelihood. Stat Comput 10:63–72
Soares CG, Scotto MG (2004) Application of the r-largest order statistics for long-term predictions of significant wave height. Coastal Eng 51(5–6):387–394
Stein ML (2017) Should annual maximum temperatures follow a generalized extreme value distribution? Biometrika 104(1):1–16
Stein ML (2021) Parametric models for distributions when interest is in extremes with an application to daily temperature. Extremes 24:293–323
Stein ML (2021) A parametric model for distributions with flexible bahavior in both tails. Environmetrics 32(2):e2658
Tawn JA (1988) An extreme-value theory model for dependent observations. J Hydrol 101(1):227–250
van Wieringen WN, Chen Y (2021) Penalized estimation of the Gaussian graphical model from data with replicates. Stat Med 40(19):4279–4293
van Wieringen WN, Binder H (2022) Sequential learning of regression models by penalized estimation. J Comput Graph Stat 31(3):877–886
Vogel RM, Wilson I (1996) Probability distribution of annual maximum, mean, and minimum streamflows in the United States. J Hydrol Eng 1(2):69–76
Wang J, Lu F, Lin K et al (2017) Comparison and evaluation of uncertainties in extreme flood estimations of the upper Yangtze River by the Delta and profile likelihood function methods. Stoch Environ Res Risk Assess 31:2281–2296
Wang J, Zhang X (2008) Downscaling and projection of winter extreme daily precipitation over North America. J Clim 21(5):923–937
Weissman I (1978) Estimation of parameters and large quantiles based on the k largest observations. J Am Stat Assoc 73(364):812–815
Yadav R, Huser R, Opitz T (2021) Spatial hierarchical modeling of threshold exceedances using rate mixtures. Environmetrics 32:e2662
Yoon S, Shin Y, Park JS (2023) Model averaging with mixed criteria for estimating high quantiles in the generalized extreme value distribution. Submitted manuscript
Zhang XB, Zwiers FW, Li GL (2004) Monte Carlo experiments on the detection of trends in extreme values. J Clim 17(10):1945–1952
Zin WZW, Jemain AA, Ibrahim K (2009) The best fitting distribution of annual maximum rainfall in Peninsular Malaysia based on methods of L-moment and LQ-moment. Theor Appl Climat 96:337–344
Acknowledgements
The authors would like to thank the reviewers and the Associate Editor for their valuable comments and constructive suggestions.
Funding
This research was supported by Basic Science Research Program (No.RS-2023-00248434, 2020R1I1A3069260) and BK21 FOUR (No. 5120200913674) through the National Research Foundation of Korea funded by the Ministry of Education.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shin, Y., Park, JS. Generalized logistic model for r largest order statistics, with hydrological application. Stoch Environ Res Risk Assess 38, 1567–1581 (2024). https://doi.org/10.1007/s00477-023-02642-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-023-02642-7