Abstract
Multi-class classification algorithms are very widely used, but we argue that they are not always ideal from a theoretical perspective, because they assume all classes are characterized by the data, whereas in many applications, training data for some classes may be entirely absent, rare, or statistically unrepresentative. We evaluate one-sided classifiers as an alternative, since they assume that only one class (the target) is well characterized. We consider a task of identifying whether a substance contains a chlorinated solvent, based on its chemical spectrum. For this application, it is not really feasible to collect a statistically representative set of outliers, since that group may contain anything apart from the target chlorinated solvents. Using a new one-sided classification toolkit, we compare a One-Sided k-NN algorithm with two well-known binary classification algorithms, and conclude that the one-sided classifier is more robust to unexpected outliers.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Tax, D.M.J.: One-Class Classification, PhD Thesis. Delft University of Technology (2001)
Hollas, J.M.: Basic Atomic and Molecular Spectroscopy. Royal Society of Chemistry (2002)
Gardiner, D.J., Graves, P.R., Bowley, H.J.: Practical Raman Spectroscopy. Springer, New York (1989)
Bulkin, B.: The Raman Effect: An Introduction. John Wiley, New York (1991)
Harwood, L.M., Moody, C.J., Percy, J.M.: Experimental Organic Chemistry: Standard and Microscale. Blackwell Publishing, Malden (1999)
Conroy, J., Ryder, A.G., Leger, M.N., Hennessey, K., Madden, M.G.: Qualitative and quantitative analysis of chlorinated solvents using Raman spectroscopy and machine learning. In: Opto-Ireland 2005, vol. 5826, pp. 131–142 (2005)
Madden, M.G., Ryder, A.G.: Machine Learning Methods for Quantitative Analysis of Raman Spectroscopy Data. In: International Society for Optical Engineering (SPIE 2002), vol. 4876, pp. 1130–1139 (2002)
O’Connell, M.L., Howley, T., Ryder, A.G., Leger, M.N., Madden, M.G.: Classification of a target analyte in solid mixtures using principal component analysis, support vector machines and Raman spectroscopy. In: Opto-Ireland 2005, vol. 5826, pp. 340–350 (2005)
Howley, T.: Kernel Methods for Machine Learning with Application to the Analysis of Raman Spectra. PhD Thesis. National University of Ireland, Galway (2007)
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Munroe, D.T., Madden, M.G.: Multi-Class and Single-Class Classification Approaches to Vehicle Model Recognition from Images. In: AICS 2005, Portstewart (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Glavin, F.G., Madden, M.G. (2010). Analysis of the Effect of Unexpected Outliers in the Classification of Spectroscopy Data. In: Coyle, L., Freyne, J. (eds) Artificial Intelligence and Cognitive Science. AICS 2009. Lecture Notes in Computer Science(), vol 6206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17080-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-17080-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17079-9
Online ISBN: 978-3-642-17080-5
eBook Packages: Computer ScienceComputer Science (R0)