Feature Selectionfor Unlabeled Data

Chen, Chien-Hsing

doi:10.1007/978-3-642-21524-7_32

Chien-Hsing Chen²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6729))

Included in the following conference series:

International Conference in Swarm Intelligence

1988 Accesses
4 Citations

Abstract

Feature selection has been explored extensively for several real-world applications. In this paper, we address a new solution of selecting a subset of original features for unlabeled data. The concept of our feature selection method is referred to a basic characteristic of clustering in thata data instance usually belongs in the same cluster with its geometrically nearest neighbors and belongs to different clusters with its geometrically farthest neighbors. In particular, our method uses instance-based learning for quantifying features in the context of the nearest and the farthest neighbors of every instance, such that using salient features can raise this characteristic. Experiments on several datasets demonstrated the effectiveness of our presented feature selection method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lee, C., Lee, G.G.: Information gain and divergence-based feature selection for machine learning-based text categorization. Information and Process Management 42, 155–165 (2006)
Article Google Scholar
Wang, H., Li, P., Zhang, T.: Histogram features-based fisher lineardiscriminant for face detection. In: Asian Conference on Computer Vision, pp. 521–530 (2006)
Google Scholar
Crone, S.F., Kourentzes, N.: Feature selection for time series prediction - A combined filter and wrapper approach for neural networks. Neurocomputing 73, 1923–1936 (2010)
Article Google Scholar
Hathaway, R.J., Bezdek, J.C., Huband, J.M., Leckie, C., Kotagiri, R.: Approximate clustering in very large relational data, in review. Journal of Intelligent Systems (2005)
Google Scholar
Feder, T., Greene, D.: Optimal algorithms for approximate clustering. In: Proceedings of the 20th Annual ACM Symposium on the Theory of Computing, pp. 434–444 (1988)
Google Scholar
Kaufman, L., Rousseeuw, P.: Finding groups in data. Wiley, Chichester (1990)
Book MATH Google Scholar
Haykin, S.S., Widrow, B.: Least-mean-square adaptive filters. Wiley, Chichester (2003)
Book Google Scholar
Boutemedjet, S., Bouguila, N., Ziou, D.: A hybrid feature extraction selection approach for high-dimensional non-Gaussian data clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 3(8), 1429–1443 (2009)
Article Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision 42, 145–175 (2001)
Article MATH Google Scholar
Otsu, N.: A threshold selection method from gray-level histogram. IEEE Transactions on Systems, Man, and Cybernetics 9(1), 62–66 (1979)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Management, Hwa Hsia Institute of Technology, 111 Gong Jhuan Rd., Chung Ho, Taipei, Taiwan
Chien-Hsing Chen

Authors

Chien-Hsing Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Key Laboratory of Machine Perception, Department of Machine Intelligence, School of Electronics Engineering and Computer Science, Peking University, 100871, Beijing, China
Ying Tan
Department of Electrical and Electronic Engineering, Xi’an Jiaotong-Liverpool University, 215123, Suzhou, China
Yuhui Shi
Automation College, Chongqing University, 400030, Chongqing, China
Yi Chai
Institute of Computer Science and Technology, Chongqing University of Posts and Telecommunications, 400065, Chongqing, P.R. China
Guoyin Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, CH. (2011). Feature Selectionfor Unlabeled Data. In: Tan, Y., Shi, Y., Chai, Y., Wang, G. (eds) Advances in Swarm Intelligence. ICSI 2011. Lecture Notes in Computer Science, vol 6729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21524-7_32

Download citation

DOI: https://doi.org/10.1007/978-3-642-21524-7_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21523-0
Online ISBN: 978-3-642-21524-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics