ABSTRACT
At the heart of many computer network planning, deployment, and operational tasks lie hard algorithmic problems. Accordingly, over the last decades, we have witnessed a continuous pursuit for ever more accurate and faster algorithms. We propose an approach to design network algorithms which is radically different from most existing algorithms. Our approach is motivated by the observation that most existing algorithms to solve a given hard computer networking problem overlook a simple yet very powerful optimization opportunity in practice: many network algorithms are executed repeatedly (e.g., for each virtual network request or in reaction to user mobility), and hence with each execution, generate interesting data: (problem,solution)-pairs. We make the case for leveraging the potentially big data of an algorithm's past executions to improve and speed up future, similar solutions, by reducing the algorithm's search space. We study the applicability of machine learning to network algorithm design, identify challenges and discuss limitations. We empirically demonstrate the potential of machine learning network algorithms in two case studies, namely the embedding of virtual networks (a packing optimization problem) and k-center facility location (a covering optimization problem), using a prototype implementation.
Supplemental Material
- Sanjeewa Athuraliya, Steven H Low, Victor H Li, and Qinghe Yin. 2001. REM: Active queue management. IEEE Network 15, 3 (2001), 48--53. Google ScholarDigital Library
- Irwan Bello, Hieu Pham, Quoc V. Le, Mohammad Norouzi, and Samy Bengio. 2017. Neural Combinatorial Optimization with Reinforcement Learning. In ICLR Workshop (2017). https://openreview.net/pdf?id=Bk9mxlSFxGoogle Scholar
- Michele Berlingerio, Danai Koutra, Tina Eliassi-Rad, and Christos Faloutsos. 2012. NetSimile: A Scalable Approach to Size-Independent Network Similarity. CoRR abs/1209.2684 (2012).Google Scholar
- Andreas Blenk, Patrick Kalmbach, Patrick Van Der Smagt, and Wolfgang Kellerer. 2016. Boost Online Virtual Network Embedding: Using Neural Networks for Admission Control. In Proc. IFIP/IEEE CNSM.Google ScholarCross Ref
- Andreas Fischer, Juan Felipe Botero, Michael Till Beck, Hermann de Meer, and Xavier Hesselbach. 2013. Virtual Network Embedding: A Survey. IEEE Communications Surveys & Tutorials 15, 4 (Jan. 2013), 1888--1906.Google ScholarCross Ref
- Jim Gao. 2014. Machine learning applications for data center optimization. Google White Paper (2014).Google Scholar
- Thomas Gärtner, Peter Flach, and Stefan Wrobel. 2003. On graph kernels: Hardness results and efficient alternatives. In Learning Theory and Kernel Machines. Springer, 129--143.Google Scholar
- Long Gong, Yonggang Wen, Zuqing Zhu, and Tony Lee. 2014. Toward profit-seeking virtual network embedding algorithm via global resource capacity. In Proc. IEEE INFOCOM. 1--9.Google ScholarCross Ref
- Frank Hutter. 2014. Machine Learning for Optimization: Automated Parameter Tuning and Beyond. (2014).Google Scholar
- Frank Hutter, Holger H Hoos, and Kevin Leyton-Brown. 2010. Automated configuration of mixed integer programming solvers. In Proc. CPAIOR. Springer, 186--202. Google ScholarDigital Library
- Patrick Kalmbach, Johannes Zerwas, Michael Manhart, Andreas Blenk, Stefan Schmid, and Wolfgang Kellerer. 2017. Data on "o'zapft is: Tap Your Network Algorithm's Big Data!". (2017).Google Scholar
- Elias Boutros Khalil, Pierre Le Bodic, Le Song, George L Nemhauser, and Bistra N Dilkina. 2016. Learning to Branch in Mixed Integer Programming.. In Proc. AAAI. 724--731. Google ScholarDigital Library
- Simon Knight, Hung X. Nguyen, Nickolas Falkner, Rhys Bowden, and Matthew Roughan. 2011. The Internet Topology Zoo. IEEE J. on Sel. Areas in Communications 29, 9 (2011).Google ScholarCross Ref
- Geng Li, Murat Semerci, Bülent Yener, and Mohammed J Zaki. 2012. Effective graph classification based on topological and label attributes. Statistical Analysis and Data Mining 5, 4 (Aug. 2012), 265--283. Google ScholarDigital Library
- Ruoqian Liu, Ankit Agrawal, Wei-keng Liao, and Alok Choudhary. 2014. Search space preprocessing in solving complex optimization problems. In Proc. IEEE Big Data. IEEE, 1--5.Google ScholarCross Ref
- Hongzi Mao, Mohammad Alizadeh, Ishai Menache, and Srikanth Kandula. 2016. Resource Management with Deep Reinforcement Learning. In Proc. 15th ACM Hotnets. ACM, New York, NY, USA, 50--56. Google ScholarDigital Library
- Marcio Melo, Susana Sargento, Ulrich Killat, Andreas Timm-Giel, and Jorge Carapinha. 2013. Optimal Virtual Network Embedding: Node-Link Formulation. IEEE Trans. Network and Service Management 10, 4 (Dec. 2013), 356--368.Google Scholar
- Maria Carolina Monard and Gustavo E AP A Batista. 2002. Learning with skewed class distributions. Advances in Logic, Artificial Intelligence and Robotics (2002), 173--180.Google Scholar
- Keith Winstein and Hari Balakrishnan. 2013. TCP Ex Machina: Computer-generated Congestion Control. In Proc. ACM SIGCOMM (SIGCOMM '13). ACM, New York, NY, USA, 123--134. Google ScholarDigital Library
- Minlan Yu, Yung Yi, Jennifer Rexford, and Mung Chiang. 2008. Rethinking Virtual Network Embedding: Substrate Support for Path Splitting and Migration. SIGCOMM Comput. Commun. Rev. 38, 2 (Mar. 2008), 17--29. Google ScholarDigital Library
Index Terms
- o'zapft is: Tap Your Network Algorithm's Big Data!
Recommendations
Big data classification: problems and challenges in network intrusion prediction with machine learning
This paper focuses on the specific problem of Big Data classification of network intrusion traffic. It discusses the system challenges presented by the Big Data problems associated with network intrusion prediction. The prediction of a possible ...
Survey: Big Data Application in Biomedical Research
ICCAE 2018: Proceedings of the 2018 10th International Conference on Computer and Automation EngineeringIn recent years, the emergence of diverse applications created a plethora of data and immense sources that can be applied in varying areas of the industry worldwide escalating its capabilities including biomedicine. Subsequently, analytical tools loomed ...
A comparative between hadoop mapreduce and apache Spark on HDFS
IML '17: Proceedings of the 1st International Conference on Internet of Things and Machine LearningData is growing now in a very high speed with a large volume, Spark and MapReduce1 both provide a processing model for analyzing and managing this large data -Big Data- stored on HDFS. In this paper, we discuss a comparative between Apache Spark and ...
Comments