Abstract
The selection of features plays an important role in traffic detection and mitigation. Feature selection methods can significantly improve the computational performance of traffic classification. However, most of the selected features cannot be used online since these features can only be calculated after the completion of a flow. Another requirement is that an optimum number of features must be chosen so as to classify the P2P traffic within the minimumtime possible. Out of more than ten feature selection algorithms, it was discovered that Chi-squared, Fuzzy-rough and Consistency-based feature selection algorithms were the best for P2P feature selection. The proposed algorithm gives better feature subset for online Peer-to-Peer (P2P) detection using machine learning (ML) techniques. The process of validation and evaluation were done through experimentation on real network traces. The performance is measured in terms of its effectiveness and efficiency. The experimental results indicate that J48 classifier with online subset feature selection produces a higher accuracy (99.23
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Auld, T., Moore, A.W., Gull, S.F.: Bayesian neural networks for internet traffic classification. IEEE Transactions on Neural Networks 18(1), 223–239 (2007)
Bernaille, L., et al.: Traffic classification on the fly. ACM SIGCOMM Computer Communication Review 36(2), 23–26 (2006)
Dash, M., Liu, H.: Consistency-based search in feature selection. Artificial Intelligence 151(1), 155–176 (2003)
Erman, J., et al.: Semi-supervised network traffic classification. In: ACM SIGMETRICS Performance Evaluation Review. ACM (2007)
Gomes, J.V.P., et al.: The Nature of Peer-to-Peer Traffic. In: Handbook of Peer-to-Peer Networking 2010, pp. 1231–1252. Springer (2010)
Gomes, J.V.P., et al.: Detection and Classification of Peer-to-Peer Traffic: A Survey (accessed April 2011)
Jamil, H.A., Zarei, R., Fadlelssied, N.O., Aliyu, M., Nor, S.M., Marsono, M.N.: Analysis of Features Selection for P2P Traffic Detection Using Support Vector Machine. In: ICoICT, March 20-22, IEEE (2013)
Jensen, R., Shen, Q.: Fuzzyrough attribute reduction with application to web categorization. Fuzzy Sets and Systems 141(3), 469–485 (2004)
Jun, L., et al.: P2P traffic identification technique. In: 2007 International Conference on Computational Intelligence and Security. IEEE (2007)
Lei, D., Xiaochun, Y., Jun, X.: Optimizing traffic classification using hybrid feature selection. In: The Ninth International Conference on Web-Age Information Management Web-Age Information Management, WAIM 2008. IEEE (2008)
Liu, H., Setiono, R.: Chi2: Feature selection and discretization of numeric attributes. In: Proceedings of the Seventh International Conference on Tools with Artificial Intelligence. IEEE (1995)
Moore, A.W., Zuev, D.: Internet traffic classification using bayesian analysis techniques. ACM (2005)
Moore, A.W., Zuev, D., Crogan, M.: Discriminators for use in flow-based classification, Technical report, Intel Research, Cambridge (2005)
Rezaee, M.R., et al.: Fuzzy feature selection. Pattern Recognition 32(12), 2011–2019 (1999)
Szabó, G., Orincsay, D., Malomsoky, S., Szabó, I.: On the validation of traffic classification algorithms. In: Claypool, M., Uhlig, S. (eds.) PAM 2008. LNCS, vol. 4979, pp. 72–81. Springer, Heidelberg (2008)
Van Der Putten, P., Van Someren, M.: A bias-variance analysis of a real world learning problem: The CoIL challenge 2000. Machine Learning 57(1), 177–195 (2004)
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann (2005)
Yang, Y., et al.: Solving P2P traffic identification problems Via optimized support vector machines. In: IEEE/ACS International Conference on Computer Systems and Applications, 2007. AICCSA 2007. IEEE (2007)
Zarei, R., Monemi, A., Marsono, M.N.: Retraining Mechanism for On-Line Peer-to-Peer Traffic Classification. In: Abraham, A., Thampi, S.M. (eds.) Intelligent Informatics. AISC, vol. 182, pp. 373–382. Springer, Heidelberg (2013)
Zhang, H.L., et al.: Feature selection for optimizing traffic classification. Computer Communications 35(12), 1457–1471 (2012)
Zhao, J.-J., et al.: Real-time feature selection in traffic classification. The Journal of China Universities of Posts and Telecommunications 15(suppl.), 68–72 (2008)
Zhen, L., Qiong, L.: A new feature selection method for internet traffic classification using ml. Physics Procedia 33, 1338–1345 (2012)
Cambridge data sets, http://www.cl.cam.ac.uk/research/srg/netos/nprobe/data/papers/sigmetrics/index.html (cited November 18, 2012)
Université Brescia data sets, http://www.ing.unibs.it/ntw/tools/traces/download/ (cited November 19, 2012)
WEKA. Data Mining Software in Java (2012), http://www.cs.waikato.ac.nz/ml/weka/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Jamil, H.A., Mohammed, A., Hamza, A., Nor, S.M., Marsono, M.N. (2014). Selection of On-line Features for Peer-to-Peer Network Traffic Classification. In: Thampi, S., Abraham, A., Pal, S., Rodriguez, J. (eds) Recent Advances in Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 235. Springer, Cham. https://doi.org/10.1007/978-3-319-01778-5_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-01778-5_39
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01777-8
Online ISBN: 978-3-319-01778-5
eBook Packages: EngineeringEngineering (R0)