ABSTRACT
Approximating non-linear kernels with random feature maps has been successfully employed in large-scale data analysis, accelerating the training of kernel machines. While previous random feature mappings run in $O(ndD)$ time for $n$ training samples in $d$-dimensional space and $D$ random features, we propose a novel randomized tensor product technique, called Tensor Sketching, that approximates any polynomial kernel in $O(n(d + D \log D))$ time. We also derive both absolute and relative error bounds for our approximation, guaranteeing the reliability of the estimation algorithm. Empirically, Tensor Sketching achieves higher accuracy and often runs orders of magnitude faster than the state-of-the-art approach on large-scale real-world datasets.
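To make the idea concrete, here is a minimal NumPy sketch of the Tensor Sketching construction described above: each input is Count Sketched $p$ times with independent hash functions, and the sketches are combined by multiplication in the Fourier domain, so that inner products of the resulting $D$-dimensional features approximate the degree-$p$ polynomial kernel $(x^\top y)^p$. The function name, signature, and parameter choices below are illustrative, not the paper's reference implementation.

```python
import numpy as np

def tensor_sketch(X, D, p, seed=0):
    """Approximate the degree-p polynomial kernel (x.y)^p with D random features.

    X : (n, d) data matrix; returns an (n, D) feature matrix Z such that
    Z @ Z.T approximates (X @ X.T) ** p. Illustrative sketch only.
    """
    n, d = X.shape
    rng = np.random.default_rng(seed)
    # p independent Count Sketch hash pairs: bucket h in [D], sign s in {-1, +1}
    h = rng.integers(0, D, size=(p, d))
    s = rng.choice([-1.0, 1.0], size=(p, d))

    prod = np.ones((n, D), dtype=complex)
    for i in range(p):
        # Count Sketch of every row of X under (h[i], s[i])
        C = np.zeros((n, D))
        for j in range(d):
            C[:, h[i, j]] += s[i, j] * X[:, j]
        # circular convolution of the p sketches == product in the Fourier domain,
        # giving the O(D log D) term in the running time
        prod *= np.fft.fft(C, axis=1)
    return np.real(np.fft.ifft(prod, axis=1))
```

The FFT step is what replaces the explicit $O(d^p)$ tensor product: convolving the $p$ Count Sketches costs $O(pD \log D)$ per sample instead of materializing the degree-$p$ feature expansion.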
Fast and scalable polynomial kernels via explicit feature maps