skip to main content
10.1145/1273496.1273573acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Asymmetric boosting

Published:20 June 2007Publication History

ABSTRACT

A cost-sensitive extension of boosting, denoted as asymmetric boosting, is presented. Unlike previous proposals, the new algorithm is derived from sound decision-theoretic principles, which exploit the statistical interpretation of boosting to determine a principled extension of the boosting loss. Similarly to AdaBoost, the cost-sensitive extension minimizes this loss by gradient descent on the functional space of convex combinations of weak learners, and produces large margin detectors. It is shown that asymmetric boosting is fully compatible with AdaBoost, in the sense that it becomes the latter when errors are weighted equally. Experimental evidence is provided to demonstrate the claims of cost-sensitivity and large margin. The algorithm is also applied to the computer vision problem of face detection, where it is shown to outperform a number of previous heuristic proposals for cost-sensitive boosting (AdaCost, CSB0, CSB1, CSB2, asymmetric-AdaBoost, AdaC1, AdaC2 and AdaC3).

References

  1. Chawla, N. V., Lazarevie, A., Hall, L. O., & Bowyer, K. (2003). Smoteboost: Improving prediction of the minority class in boosting. In Proceedings of Principles of Knowledge Discovery in Databases.Google ScholarGoogle ScholarCross RefCross Ref
  2. Domingos, P. (1999). Metacost: a general method for making classifiers cost-sensitive. Proceedings of the fifth ACM SIGKDD. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Duda, R., Hart, P. E., & Stork, D. (2001). Pattern classification. New York: Wiley and Sons. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Fan, W., Stolfo, S., Zhang, J., & Chan, P. (1999). Adacost: Misclassification cost-sensitive boosting. ICML. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Freund, Y., & Schapire, R. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55, 119--139. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Freund, Y., & Schapire, R. (2004). A discussion of "Process consistency for AdaBoost" by Wenxin Jiang, "On the Bayes-risk consistency of regularized boosting methods" by Gabor Lugosi and Nicolas Vayatis, "Statistical behavior and consistency of classification methods based on convex risk minimization" by Tong Zhang. Annals of Statistics.Google ScholarGoogle Scholar
  7. Friedman, J., Hastie, T., & Tibshirani, R. (2000). Additive logistic regression: A statistical view of boosting. Journal of Annals of Statistics.Google ScholarGoogle ScholarCross RefCross Ref
  8. Guo, H., & Viktor, H. L. (2004). Learning from imbalanced data sets with boosting and data generation: the databoost-im approach. SIGKDD Explor. Newsl. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hastie, Tibshirani, & Friedman (2001). The elements of statistical learning. New York: Springer-Verlag Inc.Google ScholarGoogle Scholar
  10. Mason, L., Baxter, J., Bartlett, P., & Frean, M. (2000). Boosting Algorithms as Gradient Descent. NIPS.Google ScholarGoogle Scholar
  11. Park, S.-B., Hwang, S., & Zhang, B.-T. (2003). Mining the risk types of human papillomavirus (hpv) by adacost. International Conference on Database and expert Systems Applications.Google ScholarGoogle ScholarCross RefCross Ref
  12. Schapire, R. E., Freund, Y., Bartlett, P., & Lee, W. S. (1998). Boosting the margin: A new explanation for the effectiveness of voting methods. The Annals of Statistics.Google ScholarGoogle Scholar
  13. Sun, Y., Wong, A. K. C., & Wang, Y. (2005). Parameter inference of cost-sensitive boosting algorithms. Machine Learning and Data Mining in Pattern Recognition, 4th International Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ting, K. M. (2000). A comparative study of cost-sensitive boosting algorithms. ICML. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Viaene, S., Derrig, R. A., & Dedene, G. (2004). Costsensitive learning and decision making for massachusetts pip claim fraud data. International Journal of Intelligent Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Viola, P., & Jones, M. (2001). Robust real-time object detection. Proc. 2nd Intl Workshop on Statistical and Computational Theories of Vision Modeling, Learning, Computing and Sampling. Vancouver, Canada.Google ScholarGoogle Scholar
  17. Viola, P., & Jones, M. (2002). Fast and robust classification using asymmetric adaboost and a detector cascade. NIPS.Google ScholarGoogle Scholar
  18. Zadrozny, B., Langford, J., & Abe, N. (2003). A simple method for cost-sensitive learning. Technical Report RC22666, IBM.Google ScholarGoogle Scholar
  1. Asymmetric boosting

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICML '07: Proceedings of the 24th international conference on Machine learning
        June 2007
        1233 pages
        ISBN:9781595937933
        DOI:10.1145/1273496

        Copyright © 2007 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 June 2007

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate140of548submissions,26%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader