Classification and Regression Trees

Johannes Gehrke
Copyright: © 2009 | Pages: 4
ISBN13: 9781605660103 | ISBN10: 1605660108 | EISBN13: 9781605660110
DOI: 10.4018/978-1-60566-010-3.ch031
Cite Chapter

MLA

Gehrke, Johannes. "Classification and Regression Trees." Encyclopedia of Data Warehousing and Mining, Second Edition, edited by John Wang, IGI Global, 2009, pp. 192-195. https://doi.org/10.4018/978-1-60566-010-3.ch031

APA

Gehrke, J. (2009). Classification and Regression Trees. In J. Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Second Edition (pp. 192-195). IGI Global. https://doi.org/10.4018/978-1-60566-010-3.ch031

Chicago

Gehrke, Johannes. "Classification and Regression Trees." In Encyclopedia of Data Warehousing and Mining, Second Edition, edited by John Wang, 192-195. Hershey, PA: IGI Global, 2009. https://doi.org/10.4018/978-1-60566-010-3.ch031


Abstract

The goal of classification and regression is to build a data mining model that can be used for prediction. To construct such a model, we are given a set of training records, each having several attributes. These attributes can be either numerical (for example, age or salary) or categorical (for example, profession or gender). One distinguished attribute is the dependent attribute; the other attributes are called predictor attributes. If the dependent attribute is categorical, the problem is a classification problem; if the dependent attribute is numerical, the problem is a regression problem. The constructed model is then used to predict the value of the dependent attribute for a record in which that value is unknown. (We call such a record an unlabeled record.) Classification and regression have a wide range of applications, including scientific experiments, medical diagnosis, fraud detection, credit approval, and target marketing (Hand, 1997). Many classification and regression models have been proposed in the literature; among the more popular are neural networks, genetic algorithms, Bayesian methods, linear and log-linear models and other statistical methods, decision tables, and tree-structured models, the focus of this chapter (Breiman, Friedman, Olshen, & Stone, 1984). Tree-structured models, so-called decision trees, are easy to understand; they are non-parametric and thus do not rely on assumptions about the data distribution, and they have fast construction methods even for large training datasets (Lim, Loh, & Shih, 2000). Most data mining suites include tools for classification and regression tree construction (Goebel & Gruenwald, 1999).
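To make the distinction between the two tasks concrete, the following minimal sketch (not part of the chapter) builds one tree of each kind. It assumes scikit-learn is installed and uses small, made-up numerical example data; the attribute names and target values are purely illustrative.

```python
# Illustrative sketch: a classification tree and a regression tree with scikit-learn.
import numpy as np
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

# Training records with two numerical predictor attributes: age and salary.
X = np.array([[25, 30000], [47, 52000], [35, 41000],
              [52, 60000], [23, 28000], [44, 49000]])

# Classification: the dependent attribute is categorical (e.g., credit approved or not).
y_class = np.array([0, 1, 1, 1, 0, 1])
clf = DecisionTreeClassifier(max_depth=3).fit(X, y_class)
print(clf.predict([[30, 35000]]))   # predicted class label for an unlabeled record

# Regression: the dependent attribute is numerical (e.g., a credit limit).
y_reg = np.array([1000.0, 5000.0, 3000.0, 6000.0, 900.0, 4500.0])
reg = DecisionTreeRegressor(max_depth=3).fit(X, y_reg)
print(reg.predict([[30, 35000]]))   # predicted numerical value for the same record
```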
