Skip to main content

An Introduction to Statistical Learning

with Applications in Python

  • Textbook
  • © 2023

Overview

  • Presents an essential statistical learning toolkit for practitioners in science, industry, and other fields
  • Demonstrates application of the statistical learning methods in Python
  • Covers regression, classification, tree methods, SVM, clustering, survival analysis, deep learning

Part of the book series: Springer Texts in Statistics (STS)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 89.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 119.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (13 chapters)

Keywords

About this book

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance, marketing, and  astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, deep learning, survival analysis, multiple testing, and more. Color graphics and real-world examples are used to illustrate the methods presented. This book is targeted at statisticians and non-statisticians alike, who wish to use cutting-edge statistical learning techniques to analyze their data.

Four of the authors co-wrote An Introduction to Statistical Learning, With Applications in R(ISLR), which has become a mainstay of undergraduate and graduate classrooms worldwide, as well as an important reference book for data scientists. One of the keys to its success was that each chapter contains a tutorial on implementing the analyses and methods presented in the R scientific computing environment. However, in recent years Python has become a popular language for data science, and there has been increasing demand for a Python-based alternative to ISLR. Hence, this book (ISLP) covers the same materials as ISLR but with labs implemented in Python. These labs will be useful both for Python novices, as well as experienced users.

Authors and Affiliations

  • Goizueta Business School, Emory University, Atlanta, USA

    Gareth James

  • Departments of Statistics and Biostatistics, University of Washington, Seattle, USA

    Daniela Witten

  • Department of Statistics and Biomedical Data Science, Stanford University, Stanford, USA

    Trevor Hastie

  • Departments of Biomedical Data Science and Statistics, Stanford University, Stanford, USA

    Robert Tibshirani

  • Department of Statistics, Stanford University, Stanford, USA

    Jonathan Taylor

About the authors

Gareth James is the John H. Harland Dean of Goizueta Business School at Emory University. He has published an extensive body of methodological work in the domain of statistical learning with particular emphasis on high-dimensional and functional data. The conceptual framework for this book grew out of his MBA elective courses in this area.

Daniela Witten is a professor of statistics and biostatistics, and the Dorothy Gilford Endowed Chair, at University of Washington. Her research focuses largely on statistical machine learning techniques for the analysis of complex, messy, and large-scale data, with an emphasis on unsupervised learning.

Trevor Hastie and Robert Tibshirani are professors of statistics at Stanford University and are co-authors of the successful textbook Elements of Statistical Learning. Hastie and Tibshirani developed generalized additive models and wrote a popular book with that title. Hastie co-developed much of the statistical modeling software and environment in R, and invented principal curves and surfaces. Tibshirani invented the lasso and is co-author of the very successful book, An Introduction to the Bootstrap. They are both elected members of the US National Academy of Sciences. 

Jonathan Taylor is a professor of statistics at Stanford University. His research focuses on selective inference and signal detection in structured noise.


Bibliographic Information

Publish with us