Abstract
Feature selection has been explored extensively for several real-world applications. In this paper, we address a new solution of selecting a subset of original features for unlabeled data. The concept of our feature selection method is referred to a basic characteristic of clustering in thata data instance usually belongs in the same cluster with its geometrically nearest neighbors and belongs to different clusters with its geometrically farthest neighbors. In particular, our method uses instance-based learning for quantifying features in the context of the nearest and the farthest neighbors of every instance, such that using salient features can raise this characteristic. Experiments on several datasets demonstrated the effectiveness of our presented feature selection method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lee, C., Lee, G.G.: Information gain and divergence-based feature selection for machine learning-based text categorization. Information and Process Management 42, 155–165 (2006)
Wang, H., Li, P., Zhang, T.: Histogram features-based fisher lineardiscriminant for face detection. In: Asian Conference on Computer Vision, pp. 521–530 (2006)
Crone, S.F., Kourentzes, N.: Feature selection for time series prediction - A combined filter and wrapper approach for neural networks. Neurocomputing 73, 1923–1936 (2010)
Hathaway, R.J., Bezdek, J.C., Huband, J.M., Leckie, C., Kotagiri, R.: Approximate clustering in very large relational data, in review. Journal of Intelligent Systems (2005)
Feder, T., Greene, D.: Optimal algorithms for approximate clustering. In: Proceedings of the 20th Annual ACM Symposium on the Theory of Computing, pp. 434–444 (1988)
Kaufman, L., Rousseeuw, P.: Finding groups in data. Wiley, Chichester (1990)
Haykin, S.S., Widrow, B.: Least-mean-square adaptive filters. Wiley, Chichester (2003)
Boutemedjet, S., Bouguila, N., Ziou, D.: A hybrid feature extraction selection approach for high-dimensional non-Gaussian data clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 3(8), 1429–1443 (2009)
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision 42, 145–175 (2001)
Otsu, N.: A threshold selection method from gray-level histogram. IEEE Transactions on Systems, Man, and Cybernetics 9(1), 62–66 (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, CH. (2011). Feature Selectionfor Unlabeled Data. In: Tan, Y., Shi, Y., Chai, Y., Wang, G. (eds) Advances in Swarm Intelligence. ICSI 2011. Lecture Notes in Computer Science, vol 6729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21524-7_32
Download citation
DOI: https://doi.org/10.1007/978-3-642-21524-7_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21523-0
Online ISBN: 978-3-642-21524-7
eBook Packages: Computer ScienceComputer Science (R0)