Secure Semi-supervised Vector Quantization for Dissimilarity Data

Zhu, Xibin; Schleif, Frank-Michael; Hammer, Barbara

doi:10.1007/978-3-642-38679-4_34

Secure Semi-supervised Vector Quantization for Dissimilarity Data

Xibin Zhu¹⁹,
Frank-Michael Schleif¹⁹ &
Barbara Hammer¹⁹

Conference paper

2637 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7902))

Abstract

The amount and complexity of data increase rapidly, however, due to time and cost constrains, only few of them are fully labeled. In this context non-vectorial relational data given by pairwise (dis-)similarities without explicit vectorial representation, like score- values in sequences alignments, are particularly challenging. Existing semi-supervised learning (SSL) algorithms focus on vectorial data given in Euclidean space. In this paper we extend a prototype-based classifier for dissimilarity data to non i.i.d. semi-supervised tasks. Using conformal prediction the ‘secure region’ of unlabeled data can be used to improve the trained model based on labeled data while adapting the model complexity to cover the ‘insecure region’ of labeled data. The proposed method is evaluated on some benchmarks from the SSL domain.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chapelle, O., Schölkopf, B., Zien, A. (eds.): Semi-Supervised Learning. MIT Press, Cambridge (2006)
Google Scholar
Boeckmann, B., et al.: The swiss-prot protein knowledgebase and its supplement trembl in 2003. Nucleic Acids Research 31, 365–370 (2003)
Article Google Scholar
Gisbrecht, A., Mokbel, B., Schleif, F.-M., Zhu, X., Hammer, B.: Linear time relational prototype based learning. J. of Neural Sys. 22(5), 72–84 (2012)
Google Scholar
Gusfield, D.: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press (1997)
Google Scholar
Haasdonk, B., Bahlmann, C.: Learning with distance substitution kernels. In: Rasmussen, C.E., Bülthoff, H.H., Schölkopf, B., Giese, M.A. (eds.) DAGM 2004. LNCS, vol. 3175, pp. 220–227. Springer, Heidelberg (2004)
Chapter Google Scholar
Hebiri, M.: Sparse conformal predictors. Statistics and Computing 20(2), 253–266 (2010)
Article MathSciNet Google Scholar
Li, Y.-F., Zhou, Z.-H.: Towards making unlabeled data never hurt. In: Getoor, L., Scheffer, T. (eds.) ICML, pp. 1081–1088. Omnipress (2011)
Google Scholar
Neuhaus, M., Bunke, H.: Edit distance based kernel functions for structural pattern classification. Pattern Recognition 39(10), 1852–1863 (2006)
Article MATH Google Scholar
Rajadell, O., Garcia-Sevilla, P., Dinh, V.C., Duin, R.P.W.: Semi-supervised hyperspectral pixel classification using interactive labeling. In: 2011 3rd Workshop on WHISPERS, pp. 1–4 (June 2011)
Google Scholar
Sato, A., Yamada, K.: Generalized learning vector quantization. In: Touretzky, D.S., Mozer, M., Hasselmo, M.E. (eds.) NIPS, pp. 423–429. MIT Press (1995)
Google Scholar
Schleif, F.-M., Zhu, X., Hammer, B.: A conformal classifier for dissimilarity data. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds.) AIAI 2012. IFIP AICT, vol. 381, pp. 234–243. Springer, Heidelberg (2012)
Google Scholar
Singh, A., Nowak, R.D., Zhu, X.: Unlabeled data: Now it helps, now it doesn’t. In: Koller, D., Schuurmans, D., Bengio, Y., Bottou, L. (eds.) NIPS, pp. 1513–1520. Curran Associates, Inc. (2008)
Google Scholar
Trosset, M.W., Priebe, C.E., Park, Y., Miller, M.I.: Semisupervised learning from dissimilarity data. Computational Statistics and Data Analysis 52(10), 4643–4657 (2008)
Article MATH MathSciNet Google Scholar
Vovk, V., Gammerman, A., Shafer, G.: Algorithmic Learning in a Random World. Springer, New York (2005)
MATH Google Scholar
Zhu, X., Goldberg, A.B.: Introduction to semi-supervised learning. Synthesis Lectures on Artif. Intell. and Machine Learning 3(1), 1–130 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

CITEC - Centre of Excellence, Bielefeld University, 33615, Bielefeld, Germany
Xibin Zhu, Frank-Michael Schleif & Barbara Hammer

Authors

Xibin Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Frank-Michael Schleif
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Hammer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Architecture and Computer Technology, University of Granada,, Periodista Daniel Saucedo Aranda s/n,, 18071, Granada, Spain
Ignacio Rojas
Department of Electronics Technology, University of Malaga, 29071, Malaga, Spain
Gonzalo Joya
Department of Electronics Engineering, Universitat Politecnica de Catalunya, 08034, Barcelona, Spain
Joan Gabestany

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhu, X., Schleif, FM., Hammer, B. (2013). Secure Semi-supervised Vector Quantization for Dissimilarity Data. In: Rojas, I., Joya, G., Gabestany, J. (eds) Advances in Computational Intelligence. IWANN 2013. Lecture Notes in Computer Science, vol 7902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38679-4_34

Download citation

DOI: https://doi.org/10.1007/978-3-642-38679-4_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38678-7
Online ISBN: 978-3-642-38679-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics