Lessons Learned in the Challenge: Making Predictions and Scoring Them

Kohonen, Jukka; Suomela, Jukka

doi:10.1007/11736790_7

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3944)

Abstract

In this paper we present lessons learned in the Evaluating Predictive Uncertainty Challenge. We describe the methods we used in regression challenges, including our winning method for the Outaouais data set. We then turn our attention to the more general problem of scoring in probabilistic machine learning challenges. It is widely accepted that scoring rules should be proper in the sense that the true generative distribution has the best expected score; we note that while this is useful, it does not guarantee finding the best methods for practical machine learning tasks. We point out some problems in local scoring rules such as the negative logarithm of predictive density (NLPD), and illustrate with examples that many of these problems can be avoided by a distance-sensitive rule such as the continuous ranked probability score (CRPS).
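The contrast between a local rule and a distance-sensitive rule can be illustrated with a minimal sketch. It is not taken from the paper: the Gaussian predictive distributions, the observed value, and the function names `nlpd_gaussian` and `crps_gaussian` are assumptions chosen for illustration. The sketch uses the standard closed-form CRPS of a normal distribution and compares a sharp forecast that narrowly misses the target with a wide forecast centred far from it.

```python
import math


def nlpd_gaussian(y, mu, sigma):
    """Negative log predictive density of y under N(mu, sigma^2); lower is better."""
    return 0.5 * math.log(2.0 * math.pi * sigma ** 2) + (y - mu) ** 2 / (2.0 * sigma ** 2)


def crps_gaussian(y, mu, sigma):
    """Closed-form CRPS of y under N(mu, sigma^2); lower is better."""
    z = (y - mu) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)   # standard normal density at z
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))          # standard normal CDF at z
    return sigma * (z * (2.0 * cdf - 1.0) + 2.0 * pdf - 1.0 / math.sqrt(math.pi))


# Hypothetical observation and forecasts (illustrative values only).
y = 0.1
forecasts = {
    "near, overconfident": (0.0, 0.01),  # mean close to y, far too narrow
    "far, wide": (5.0, 1.0),             # mean far from y, broad
}

for name, (mu, sigma) in forecasts.items():
    print(f"{name}: NLPD={nlpd_gaussian(y, mu, sigma):.3f}, "
          f"CRPS={crps_gaussian(y, mu, sigma):.3f}")
```

With these particular values, NLPD ranks the far-off forecast better, because it depends only on the density at the observed point, whereas CRPS ranks the near forecast better, because it accounts for how far the predicted probability mass lies from the observation.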

Author information

Authors and Affiliations

Helsinki Institute for Information Technology, Basic Research Unit, Department of Computer Science, University of Helsinki, P.O. Box 68, FI-00014, Finland
Jukka Kohonen & Jukka Suomela

Editor information

Editors and Affiliations

Max Planck Institute for Biological Cybernetics, Spemannstr. 38, Tübingen, Germany
Joaquin Quiñonero-Candela
Bar Ilan University, 52900, Ramat Gan, Israel
Ido Dagan
ITC-IRST, Trento, Italy
Bernardo Magnini
Université d’Evry-Val d’Essonne, IBISC CNRS FRE 2873 and GENPOLE, 523, Place des terrasses, 91000, Evry, France
Florence d’Alché-Buc

About this paper

Cite this paper

Kohonen, J., Suomela, J. (2006). Lessons Learned in the Challenge: Making Predictions and Scoring Them. In: Quiñonero-Candela, J., Dagan, I., Magnini, B., d’Alché-Buc, F. (eds) Machine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification, and Recognising Textual Entailment. MLCW 2005. Lecture Notes in Computer Science, vol 3944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11736790_7

DOI: https://doi.org/10.1007/11736790_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33427-9
Online ISBN: 978-3-540-33428-6
eBook Packages: Computer Science (R0)