Lessons Learned in the Challenge: Making Predictions and Scoring Them

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3944)

Abstract

In this paper we present lessons learned in the Evaluating Predictive Uncertainty Challenge. We describe the methods we used in regression challenges, including our winning method for the Outaouais data set. We then turn our attention to the more general problem of scoring in probabilistic machine learning challenges. It is widely accepted that scoring rules should be proper in the sense that the true generative distribution has the best expected score; we note that while this is useful, it does not guarantee finding the best methods for practical machine learning tasks. We point out some problems in local scoring rules such as the negative logarithm of predictive density (NLPD), and illustrate with examples that many of these problems can be avoided by a distance-sensitive rule such as the continuous ranked probability score (CRPS).
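The contrast between the two scoring rules named in the abstract can be made concrete with a small sketch. It is not taken from the paper: it assumes a Gaussian predictive distribution, for which both NLPD and CRPS have well-known closed forms, and the function names `nlpd_gaussian` and `crps_gaussian` are hypothetical. The loop holds the prediction error fixed and shrinks the predicted standard deviation, which is one way to see how differently the two rules treat an overconfident miss.

```python
import math


def nlpd_gaussian(mu, sigma, y):
    """Negative log predictive density of N(mu, sigma^2) at observation y."""
    return 0.5 * math.log(2.0 * math.pi * sigma ** 2) + (y - mu) ** 2 / (2.0 * sigma ** 2)


def crps_gaussian(mu, sigma, y):
    """Closed-form CRPS of N(mu, sigma^2) at observation y (lower is better)."""
    z = (y - mu) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)  # standard normal density at z
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))         # standard normal CDF at z
    return sigma * (z * (2.0 * cdf - 1.0) + 2.0 * pdf - 1.0 / math.sqrt(math.pi))


if __name__ == "__main__":
    # Assumed toy setting: predicted mean 0, observed value 1, shrinking sigma.
    y = 1.0
    for sigma in (1.0, 0.1, 0.01):
        print(sigma, nlpd_gaussian(0.0, sigma, y), crps_gaussian(0.0, sigma, y))
```

Under these assumptions the NLPD grows without bound as sigma shrinks, while the CRPS stays close to the absolute error |y - mu|, so a single overconfident miss cannot dominate an average CRPS the way it can dominate an average NLPD.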

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kohonen, J., Suomela, J. (2006). Lessons Learned in the Challenge: Making Predictions and Scoring Them. In: Quiñonero-Candela, J., Dagan, I., Magnini, B., d’Alché-Buc, F. (eds) Machine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification, and Recognising Tectual Entailment. MLCW 2005. Lecture Notes in Computer Science, vol 3944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11736790_7

  • DOI: https://doi.org/10.1007/11736790_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33427-9

  • Online ISBN: 978-3-540-33428-6

  • eBook Packages: Computer Science (R0)
