Skip to main content

Selection of On-line Features for Peer-to-Peer Network Traffic Classification

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 235))

Abstract

The selection of features plays an important role in traffic detection and mitigation. Feature selection methods can significantly improve the computational performance of traffic classification. However, most of the selected features cannot be used online since these features can only be calculated after the completion of a flow. Another requirement is that an optimum number of features must be chosen so as to classify the P2P traffic within the minimumtime possible. Out of more than ten feature selection algorithms, it was discovered that Chi-squared, Fuzzy-rough and Consistency-based feature selection algorithms were the best for P2P feature selection. The proposed algorithm gives better feature subset for online Peer-to-Peer (P2P) detection using machine learning (ML) techniques. The process of validation and evaluation were done through experimentation on real network traces. The performance is measured in terms of its effectiveness and efficiency. The experimental results indicate that J48 classifier with online subset feature selection produces a higher accuracy (99.23

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Auld, T., Moore, A.W., Gull, S.F.: Bayesian neural networks for internet traffic classification. IEEE Transactions on Neural Networks 18(1), 223–239 (2007)

    Article  Google Scholar 

  2. Bernaille, L., et al.: Traffic classification on the fly. ACM SIGCOMM Computer Communication Review 36(2), 23–26 (2006)

    Article  Google Scholar 

  3. Dash, M., Liu, H.: Consistency-based search in feature selection. Artificial Intelligence 151(1), 155–176 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  4. Erman, J., et al.: Semi-supervised network traffic classification. In: ACM SIGMETRICS Performance Evaluation Review. ACM (2007)

    Google Scholar 

  5. Gomes, J.V.P., et al.: The Nature of Peer-to-Peer Traffic. In: Handbook of Peer-to-Peer Networking 2010, pp. 1231–1252. Springer (2010)

    Google Scholar 

  6. Gomes, J.V.P., et al.: Detection and Classification of Peer-to-Peer Traffic: A Survey (accessed April 2011)

    Google Scholar 

  7. Jamil, H.A., Zarei, R., Fadlelssied, N.O., Aliyu, M., Nor, S.M., Marsono, M.N.: Analysis of Features Selection for P2P Traffic Detection Using Support Vector Machine. In: ICoICT, March 20-22, IEEE (2013)

    Google Scholar 

  8. Jensen, R., Shen, Q.: Fuzzyrough attribute reduction with application to web categorization. Fuzzy Sets and Systems 141(3), 469–485 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  9. Jun, L., et al.: P2P traffic identification technique. In: 2007 International Conference on Computational Intelligence and Security. IEEE (2007)

    Google Scholar 

  10. Lei, D., Xiaochun, Y., Jun, X.: Optimizing traffic classification using hybrid feature selection. In: The Ninth International Conference on Web-Age Information Management Web-Age Information Management, WAIM 2008. IEEE (2008)

    Google Scholar 

  11. Liu, H., Setiono, R.: Chi2: Feature selection and discretization of numeric attributes. In: Proceedings of the Seventh International Conference on Tools with Artificial Intelligence. IEEE (1995)

    Google Scholar 

  12. Moore, A.W., Zuev, D.: Internet traffic classification using bayesian analysis techniques. ACM (2005)

    Google Scholar 

  13. Moore, A.W., Zuev, D., Crogan, M.: Discriminators for use in flow-based classification, Technical report, Intel Research, Cambridge (2005)

    Google Scholar 

  14. Rezaee, M.R., et al.: Fuzzy feature selection. Pattern Recognition 32(12), 2011–2019 (1999)

    Article  Google Scholar 

  15. Szabó, G., Orincsay, D., Malomsoky, S., Szabó, I.: On the validation of traffic classification algorithms. In: Claypool, M., Uhlig, S. (eds.) PAM 2008. LNCS, vol. 4979, pp. 72–81. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  16. Van Der Putten, P., Van Someren, M.: A bias-variance analysis of a real world learning problem: The CoIL challenge 2000. Machine Learning 57(1), 177–195 (2004)

    Article  MATH  Google Scholar 

  17. Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann (2005)

    Google Scholar 

  18. Yang, Y., et al.: Solving P2P traffic identification problems Via optimized support vector machines. In: IEEE/ACS International Conference on Computer Systems and Applications, 2007. AICCSA 2007. IEEE (2007)

    Google Scholar 

  19. Zarei, R., Monemi, A., Marsono, M.N.: Retraining Mechanism for On-Line Peer-to-Peer Traffic Classification. In: Abraham, A., Thampi, S.M. (eds.) Intelligent Informatics. AISC, vol. 182, pp. 373–382. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  20. Zhang, H.L., et al.: Feature selection for optimizing traffic classification. Computer Communications 35(12), 1457–1471 (2012)

    Article  Google Scholar 

  21. Zhao, J.-J., et al.: Real-time feature selection in traffic classification. The Journal of China Universities of Posts and Telecommunications 15(suppl.), 68–72 (2008)

    Article  Google Scholar 

  22. Zhen, L., Qiong, L.: A new feature selection method for internet traffic classification using ml. Physics Procedia 33, 1338–1345 (2012)

    Article  Google Scholar 

  23. Cambridge data sets, http://www.cl.cam.ac.uk/research/srg/netos/nprobe/data/papers/sigmetrics/index.html (cited November 18, 2012)

  24. Université Brescia data sets, http://www.ing.unibs.it/ntw/tools/traces/download/ (cited November 19, 2012)

  25. WEKA. Data Mining Software in Java (2012), http://www.cs.waikato.ac.nz/ml/weka/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haitham A. Jamil .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Jamil, H.A., Mohammed, A., Hamza, A., Nor, S.M., Marsono, M.N. (2014). Selection of On-line Features for Peer-to-Peer Network Traffic Classification. In: Thampi, S., Abraham, A., Pal, S., Rodriguez, J. (eds) Recent Advances in Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 235. Springer, Cham. https://doi.org/10.1007/978-3-319-01778-5_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-01778-5_39

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01777-8

  • Online ISBN: 978-3-319-01778-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics