A Novel Hybrid Network Traffic Prediction Approach Based on Support Vector Machines

Network traffic prediction plays a major role in characterizing network performance. An approach that can appropriately capture the salient characteristics of network traffic is very useful for network analysis and simulation. Network traffic prediction methods can be divided into two classes: single models and hybrid models. Hybrid models integrate the merits of several single models and can consequently enhance network traffic prediction accuracy. In this paper, a new hybrid network traffic prediction method (EPSVM) based on Empirical Mode Decomposition (EMD), Particle Swarm Optimization (PSO), and Support Vector Machines (SVM) is presented. EPSVM first utilizes EMD to eliminate the impact of noise signals. Then, SVM is applied to model training and fitting, and the parameters of SVM are optimized by PSO. The effectiveness of the presented method is examined by comparing it with other methods, including basic SVM (BSVM), Empirical Mode Decomposition processed by SVM (ESVM), and SVM optimized by Particle Swarm Optimization (PSVM). Case studies demonstrate that EPSVM performs better than the other three network traffic prediction models.


Introduction
It is generally known that network traffic prediction can provide a variety of practical information for Internet organizations, for example, in travel, rental services, and smart search. Network traffic prediction is a procedure whereby a webmaster captures the network traffic and inspects it closely to discover what is going to happen on the network in the coming period. It can assist webmasters in establishing reasonable network planning and in controlling network traffic congestion effectively [1]. Precise network traffic prediction can capture the notable attributes of the traffic; it thus plays a vital role in network traffic analysis and simulation and helps customers understand the network dynamics. In recent years, to enhance network traffic prediction precision, researchers in China and abroad have therefore proposed numerous network traffic prediction methods.
In general, network traffic prediction methods can be divided into two categories: single models and combination (i.e., hybrid) models, which integrate the merits of several single models [2]. Dickinson [3] demonstrated that combined and hybrid models can achieve better forecasting results than individual models. Besides, the topology and geometry of a network are always very complex, which often influences the network traffic prediction accuracy. Network traffic complexity shows up in numerous circumstances; for instance, long-range dependence and self-similarity were found in a statistical analysis of traffic measurements. The complexity indicated by the traffic measurements has prompted the development of network traffic prediction and suggests that a single model cannot yield satisfactory prediction results [4][5][6]. The main reason is that network traffic displays numerous characteristics, such as trend, periodicity, self-similarity, and long-range dependence, and a single model cannot capture all of them. A combination model, however, can capture both the linear and the nonlinear characteristics of the network traffic data (NTD). Therefore, a combination model is applied in this paper.
Over the past few decades, researchers in China and abroad have presented many strategies to predict network traffic in diverse areas [7,8]. Some were inclined to improve existing models. For example, the literature [9] extended the widely used fractional Brownian traffic model. Qing-Fang et al. [10] used a BIC-based neighboring point selection method to choose the number of nearest neighboring points for the local Support Vector Machine. To obtain faster convergence in the training of the BiLinear Recurrent Neural Network (BLRNN), the literature [11] applied two procedures to the network. Other researchers preferred a combination of existing models. For example, the literature [1] developed a novel combined model to predict the network traffic at National Taitung University and Shu-Te University. Chen et al. [12] developed a new flexible neural tree structure using genetic programming, with the parameters optimized by the Particle Swarm Optimization algorithm. The literature [13] presented a novel approach that integrated the wavelet transform, grey theory, and chaos theory; the numerical experiment demonstrated that the proposed model can obtain better prediction results. Recent papers on network traffic prediction methods can be found in [14,15].
Although the approaches mentioned above can produce adequately precise prediction results in various cases, they have generally focused on evaluating the precision of the approaches without considering the internal characteristics of the network traffic data (NTD). In fact, NTD are normally influenced by uncertain factors, which cause noise signals that can increase the difficulty of forecasting. So, in this paper, EMD is first applied to eliminate the noise signals before SVM is applied to predict the network traffic. Besides, the parameters of SVM are optimized by PSO.
Therefore, the presented method integrates EMD, PSO, and SVM; hence its abbreviated name is EPSVM. To examine the effectiveness of EPSVM, we compare it with three other approaches, namely, (1) the original NTD directly processed by SVM (named BSVM), (2) NTD processed by EMD, with SVM then used to model the denoised data (named ESVM), and (3) the original NTD directly processed by SVM whose parameters are optimized by PSO (named PSVM). The NTD are gathered from the Network Center of Lanzhou University. The rest of this paper is organized as follows. The theoretical background of the EMD, PSO, and SVM models is given in Section 2. In Section 3, the presented approach is introduced. Section 4 presents the experimental results. Finally, Section 5 concludes this paper.

Theoretical Background of EMD, PSO, and SVM Models
In this section, the theories underlying the proposed method (EPSVM), namely EMD, PSO, and SVM, are introduced.

Empirical Mode Decomposition.
Empirical Mode Decomposition (EMD) is a nonlinear signal processing method developed by Huang et al. [15]. It decomposes a signal into a sum of functions called intrinsic mode functions (IMFs). These IMFs must satisfy two conditions: (1) the number of extrema and the number of zero-crossings are either equal or differ at most by one; (2) the mean value of the envelopes defined by the local maxima and the local minima is zero at all points. According to [16][17][18], any signal $y(t)$ can then be decomposed as follows:
(1) Identify all the local extrema, and then connect all the local maxima with a cubic spline as the upper envelope.
(2) Repeat the procedure for the local minima to produce the lower envelope. The upper and lower envelopes should cover all the data between them.
(3) The mean of the upper and lower envelopes is designated as $n_1(t)$, and the difference between the signal $y(t)$ and $n_1(t)$ is the first component $p_1(t)$:
$$p_1(t) = y(t) - n_1(t).$$
Ideally, if $p_1(t)$ satisfies the definition of an IMF, then it is the first IMF.
(4) If $p_1(t)$ is not an IMF, $p_1(t)$ is treated as the original signal, and by repeating steps (1), (2), and (3), $p_{11}(t) = p_1(t) - n_{11}(t)$ is acquired. After repeating the sifting process up to $k$ times, $p_{1k}(t)$ becomes an IMF, i.e.,
$$p_{1k}(t) = p_{1(k-1)}(t) - n_{1k}(t).$$
Then, it is designated as
$$c_1(t) = p_{1k}(t).$$
The first IMF component from the original data, $c_1(t)$, should contain the finest scale or the shortest period component of the signal.
(5) Separating $c_1(t)$ from the original signal $y(t)$, we get $r_1(t) = y(t) - c_1(t)$. By repeating the above process several times, the result is
$$y(t) = \sum_{i=1}^{n} c_i(t) + r_n(t),$$
where $r_n(t)$ is the final residue.

Particle Swarm Optimization.
Particle Swarm Optimization (PSO) was proposed in [19], inspired by the natural flocking and swarming behaviors of birds and insects. Consider an optimization problem of $M$ variables. A swarm comprises $q$ particles flying in an $M$-dimensional search space. Let $x$ denote a particle's position and $v$ denote the particle's flight velocity over the solution space. Each individual $x$ in the swarm is scored using a scoring function that yields a fitness value representing how well it solves the problem. Each particle records its own best previous position (pbest) and knows the best position found by all particles in the swarm (gbest). Then, the velocity and position of particle $i$ are updated as [20]
$$v_i^{k+1} = w v_i^{k} + c_1 r_1 \left(pbest_i - x_i^{k}\right) + c_2 r_2 \left(gbest - x_i^{k}\right), \quad (4)$$
$$x_i^{k+1} = x_i^{k} + v_i^{k+1}, \quad (5)$$
where $k$ is the iteration index, $w$ is the inertia weight factor, $r_1$ and $r_2$ are two independent random variables uniformly distributed in $[0, 1]$, and $c_1$ and $c_2$ are two positive constants called acceleration coefficients.
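The PSO velocity and position update described above can be sketched for a single particle as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `pso_step` and the default values `w = 0.7`, `c1 = c2 = 1.5` are chosen only for the example, and `r1` and `r2` can be passed in explicitly so that a step is reproducible.

```python
import random


def pso_step(x, v, pbest, gbest, w=0.7, c1=1.5, c2=1.5, r1=None, r2=None):
    """Apply one PSO velocity and position update to a single particle.

    x, v, pbest, and gbest are lists of equal length (one entry per dimension).
    r1 and r2 are drawn uniformly from [0, 1] when they are not supplied.
    """
    r1 = random.random() if r1 is None else r1
    r2 = random.random() if r2 is None else r2
    # Velocity: inertia term + pull toward pbest + pull toward gbest.
    new_v = [
        w * vj + c1 * r1 * (pj - xj) + c2 * r2 * (gj - xj)
        for xj, vj, pj, gj in zip(x, v, pbest, gbest)
    ]
    # Position: move by the freshly updated velocity.
    new_x = [xj + vj for xj, vj in zip(x, new_v)]
    return new_x, new_v


# Example with fixed random factors so the result is reproducible.
x, v = pso_step([0.0, 0.0], [1.0, -1.0], pbest=[1.0, 1.0], gbest=[2.0, 0.0],
                r1=0.5, r2=0.5)
print(x, v)
```

In this example the first coordinate is pulled toward both pbest and gbest, so its velocity grows, while in the second coordinate the inertia term and the pbest attraction nearly cancel.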

Support Vector Machines.
Support Vector Machine (SVM) [21] is a set of classification and regression techniques designed to systematically optimize its structure based on the input training data. More details about SVM can be found in [22][23][24].
Consider the training data $(x_1, y_1), \ldots, (x_n, y_n) \in W \times \mathbb{R}$, where $W$ denotes the space of the input patterns $x_i$ (e.g., $W = \mathbb{R}^d$) and $y_i$ is the output value associated with $x_i$. In ε-SVR, our goal is to produce a function $F(x)$ from the training data set that approximates the unknown target function. By introducing different slack variables for violating the "tube" constraint from above and from below, we arrive at the formulation stated in Vapnik's article [25] for ε-SVR:
$$\min_{w, b, \xi, \xi^*} \; \frac{1}{2} w^T w + C \sum_{i=1}^{n} \left(\xi_i + \xi_i^*\right)$$
subject to
$$y_i - \left(\langle w, x_i \rangle + b\right) \le \varepsilon + \xi_i, \quad \left(\langle w, x_i \rangle + b\right) - y_i \le \varepsilon + \xi_i^*, \quad \xi_i, \xi_i^* \ge 0, \quad i = 1, \ldots, n,$$
where $n$ denotes the number of samples, $\xi_i$ and $\xi_i^*$ are the allowed errors "above" and "below" the ε-insensitive tube $|y_i - (\langle w, x_i \rangle + b)| \le \varepsilon$, and $w^T w$ is the regularization term. The empirically selected constant $C > 0$ determines the tradeoff between the training error and the regularization term.
To preserve the sparse property of the solution, Vapnik used the ε-insensitive loss function $|\xi|_\varepsilon$ described by
$$|\xi|_\varepsilon = \begin{cases} 0, & |\xi| \le \varepsilon, \\ |\xi| - \varepsilon, & \text{otherwise}. \end{cases}$$
Instead of minimizing the observed training error, ε-SVR attempts to minimize the generalization error bound so as to achieve generalized performance, and this makes ε-SVR extremely robust to outliers. Finally, by introducing Lagrange multipliers, applying the kernel trick, and employing the optimality constraints, we get the following explicit form:
$$F(x) = \sum_{i=1}^{n} \left(\alpha_i - \alpha_i^*\right) K(x_i, x) + b,$$
where $\alpha_i$ and $\alpha_i^*$ are the Lagrange multipliers and $K(\cdot, \cdot)$ is the kernel function.
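The ε-insensitive loss and the kernel expansion obtained from the Lagrange multipliers can be illustrated with a few lines of code. This is only a sketch under stated assumptions: the source does not name the kernel, so the Gaussian (RBF) kernel and its parameter `gamma` are assumptions, and the names `eps_insensitive_loss`, `rbf_kernel`, and `svr_predict` are hypothetical helpers rather than part of any library.

```python
import math


def eps_insensitive_loss(residual, eps):
    """Return 0 inside the tube |residual| <= eps, else the excess |residual| - eps."""
    return max(0.0, abs(residual) - eps)


def rbf_kernel(a, b, gamma=1.0):
    """Gaussian (RBF) kernel; the kernel choice is an assumption of this sketch."""
    return math.exp(-gamma * sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def svr_predict(x, support_vectors, dual_coefs, b, gamma=1.0):
    """Evaluate the kernel expansion: sum_i coef_i * K(x_i, x) + b.

    dual_coefs[i] holds the difference of the two Lagrange multipliers
    belonging to support vector i.
    """
    return b + sum(
        coef * rbf_kernel(sv, x, gamma)
        for sv, coef in zip(support_vectors, dual_coefs)
    )


print(eps_insensitive_loss(0.05, 0.1))   # inside the tube: no penalty
print(eps_insensitive_loss(-0.3, 0.1))   # outside the tube: penalized by the excess
print(svr_predict([0.0], [[0.0]], [2.0], b=0.5))
```

Errors smaller than ε contribute nothing to the objective, which is why only the samples on or outside the tube end up as support vectors in the expansion.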

The Proposed Method
The proposed method (EPSVM) first uses EMD to eliminate the noise signal; the data after the EMD procedure are then fed into the SVM, whose parameters are optimized by PSO. In this section, SVM optimized by PSO is introduced in Section 3.1, and the specific prediction procedure of EPSVM is presented in Section 3.2.

SVM Optimized by PSO.
The parameters of SVM have a strong effect on the forecasting precision, so it is very important to optimize the two parameters in the forecasting procedure. PSO is therefore utilized to optimize the parameters of SVM (the resulting method is named PSVM). The detailed process of PSVM is depicted in Figure 1 and includes the following five steps:
(1) Initialization: the population size $q$ is initialized, and the initial position and velocity of each particle are randomly assigned.
(2) Fitness assessment: the fitness of each particle is assessed with a fitness function of the actual values $x_i$ and the forecast values $\hat{x}_i$.
(3) Update pbest$_i$ and gbest according to the fitness function results.
(4) Update the velocity $v_i$ of each particle according to Equation (4) and the position $x_i$ of each particle using Equation (5).
(5) Termination: the velocity and position of the particles are updated until the stop conditions are met.
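The five steps above can be sketched as a small search loop. The sketch makes several assumptions that are not in the source: the two searched parameters are treated as generic bounded values, the stop condition is a fixed iteration budget, positions are clamped to the bounds, and `toy_error` is a hypothetical synthetic surface standing in for "train an SVM with these parameters and return its forecasting error", so that the example runs without a machine learning library.

```python
import random


def pso_minimize(fitness, bounds, q=20, iters=60, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize fitness(position) over box bounds with a basic PSO loop."""
    rng = random.Random(seed)
    dim = len(bounds)
    # (1) Initialization: random positions and small random velocities.
    xs = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(q)]
    vs = [[0.1 * rng.uniform(-(hi - lo), hi - lo) for lo, hi in bounds]
          for _ in range(q)]
    # (2) Fitness assessment of the initial swarm.
    pbest = [x[:] for x in xs]
    pbest_fit = [fitness(x) for x in xs]
    g = min(range(q), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    history = [gbest_fit]
    for _ in range(iters):  # (5) Termination: fixed iteration budget (assumed).
        for i in range(q):
            # (4) Velocity and position update, with positions kept in bounds.
            for j in range(dim):
                r1, r2 = rng.random(), rng.random()
                vs[i][j] = (w * vs[i][j]
                            + c1 * r1 * (pbest[i][j] - xs[i][j])
                            + c2 * r2 * (gbest[j] - xs[i][j]))
                lo, hi = bounds[j]
                xs[i][j] = min(hi, max(lo, xs[i][j] + vs[i][j]))
            # (2)-(3) Re-assess fitness, then update pbest and gbest.
            fit = fitness(xs[i])
            if fit < pbest_fit[i]:
                pbest[i], pbest_fit[i] = xs[i][:], fit
                if fit < gbest_fit:
                    gbest, gbest_fit = xs[i][:], fit
        history.append(gbest_fit)
    return gbest, gbest_fit, history


def toy_error(params):
    """Hypothetical stand-in for the forecasting error of a trained SVM."""
    p1, p2 = params
    return (p1 - 1.0) ** 2 + (p2 + 2.0) ** 2


best, best_fit, history = pso_minimize(toy_error, bounds=[(-3.0, 3.0), (-5.0, 1.0)])
print(best, best_fit)
```

To turn the sketch into a PSVM-style search, `toy_error` would be replaced by a function that fits the SVM on the training window with the candidate parameters and returns the chosen error measure on held-out data.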

The Specific Process of the Proposed Method.
To address the noise signals caused by many uncertainties, EMD is first applied to remove the noise component of the NTD; its merits are discussed in Section 2.1. After the EMD processing, EPSVM applies the PSVM described above to the processed NTD series to obtain the final results. The detailed procedure of EPSVM is depicted in Figure 2:
(1) Noise reduction: utilize EMD to remove the noise component of the original data.
(2) Feed the data processed in the first step into the PSVM model to obtain the final forecasting result.
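The noise reduction step can be illustrated with a deliberately simplified EMD. The sketch departs from the description in Section 2.1 in ways that should be read as assumptions: the envelopes use linear interpolation instead of cubic splines, each IMF is extracted with a fixed number of sifting passes instead of testing the IMF conditions, and "denoising" is taken to mean dropping the first (highest-frequency) IMF, since the source does not state which components are treated as noise. All function names are hypothetical.

```python
import math


def _extrema(s):
    """Return the indices of the interior local maxima and minima of s."""
    maxima = [t for t in range(1, len(s) - 1) if s[t - 1] < s[t] >= s[t + 1]]
    minima = [t for t in range(1, len(s) - 1) if s[t - 1] > s[t] <= s[t + 1]]
    return maxima, minima


def _envelope(s, idx):
    """Linearly interpolate through s[idx], holding the end values flat."""
    env = [0.0] * len(s)
    for a, b in zip(idx, idx[1:]):
        for t in range(a, b + 1):
            env[t] = s[a] + (t - a) / (b - a) * (s[b] - s[a])
    for t in range(0, idx[0]):
        env[t] = s[idx[0]]
    for t in range(idx[-1], len(s)):
        env[t] = s[idx[-1]]
    return env


def sift(s, n_sifts=10):
    """Extract one IMF candidate by repeatedly subtracting the envelope mean."""
    p = list(s)
    for _ in range(n_sifts):
        maxima, minima = _extrema(p)
        if len(maxima) < 2 or len(minima) < 2:
            break
        upper, lower = _envelope(p, maxima), _envelope(p, minima)
        p = [pt - (u + l) / 2.0 for pt, u, l in zip(p, upper, lower)]
    return p


def emd(s, max_imfs=6):
    """Return (imfs, residue) such that s equals the sum of the IMFs plus the residue."""
    residue, imfs = list(s), []
    for _ in range(max_imfs):
        maxima, minima = _extrema(residue)
        if len(maxima) < 2 or len(minima) < 2:
            break  # too few extrema left to build envelopes
        c = sift(residue)
        imfs.append(c)
        residue = [r - ci for r, ci in zip(residue, c)]
    return imfs, residue


def denoise(s, drop=1):
    """Rebuild the series without its first `drop` IMFs (an assumed rule)."""
    imfs, residue = emd(s)
    kept = imfs[drop:]
    return [residue[t] + sum(c[t] for c in kept) for t in range(len(s))]


# Synthetic one-day series: slow oscillation plus a small fast component.
signal = [math.sin(2 * math.pi * t / 48) + 0.2 * math.sin(2 * math.pi * t / 4)
          for t in range(288)]
imfs, residue = emd(signal)
smooth = denoise(signal)
print(len(imfs), len(smooth))
```

The denoised series returned by `denoise` would then be passed to the parameter search and SVM fitting of the second step.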

Experimental Results and Discussion
Criteria for Measuring Accuracy.
In time series prediction, one must always ask which criterion can be used to correctly measure the accuracy of the predicted outcomes. Performance evaluation of time series prediction in fact depends heavily on which criteria are chosen to measure the accuracy of the predictions. To assess the accuracy of a time series forecast, three well-known criteria [26] are selected: the root mean square error (RMSE), the mean absolute error (MAE), and the mean absolute percentage error (MAPE). They are expressed as follows:
$$\mathrm{RMSE} = \sqrt{\frac{1}{N} \sum_{i=1}^{N} \left(x_i - \hat{x}_i\right)^2},$$
$$\mathrm{MAE} = \frac{1}{N} \sum_{i=1}^{N} \left|x_i - \hat{x}_i\right|,$$
$$\mathrm{MAPE} = \frac{1}{N} \sum_{i=1}^{N} \left|\frac{x_i - \hat{x}_i}{x_i}\right| \times 100\%,$$
where $N$ is the number of periods in forecasting and $x_i$ and $\hat{x}_i$ are the actual and forecasted values, respectively.
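The three criteria, which the paper later names as RMSE, MAE, and MAPE, can be computed as in the sketch below. It uses their standard definitions, which is an assumption insofar as the paper's exact formulas may differ in scaling (for example, whether MAPE is reported in percent); the sample numbers are made up for illustration and are not taken from the paper's data.

```python
import math


def rmse(actual, forecast):
    """Root mean square error."""
    n = len(actual)
    return math.sqrt(sum((a - f) ** 2 for a, f in zip(actual, forecast)) / n)


def mae(actual, forecast):
    """Mean absolute error."""
    return sum(abs(a - f) for a, f in zip(actual, forecast)) / len(actual)


def mape(actual, forecast):
    """Mean absolute percentage error, in percent; actual values must be nonzero."""
    n = len(actual)
    return 100.0 * sum(abs((a - f) / a) for a, f in zip(actual, forecast)) / n


# Made-up values for illustration only.
actual = [100.0, 200.0, 400.0]
forecast = [110.0, 190.0, 400.0]
print(rmse(actual, forecast), mae(actual, forecast), mape(actual, forecast))
```

RMSE penalizes large errors more heavily than MAE, while MAPE is scale-free, which is why the three are usually reported together.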

Network Traffic Data (NTD).
The presented EPSVM approach is evaluated on real NTD from the Network Center of Lanzhou University (LZU). These data are gathered every 5 minutes, so each hour has 12 NTD points, and one day amounts to 288 (12 × 24) NTD points. In addition, the NTD on both workdays and nonworkdays (7 days in total) are used to examine the effectiveness and feasibility of the proposed EPSVM method. Therefore, the total number of NTD points used in this paper is 2016 (288 × 7). To predict the network traffic fluctuations promptly and at the same time facilitate website monitoring, two types of NTD, namely, inflow and outflow NTD, are used. As a result, the prediction process contains two procedures: one is the inflow NTD prediction and the other is the outflow NTD prediction. Figures 3 and 4 show the 2016 inflow and outflow NTD points, respectively.
From Figures 3 and 4, it can be seen that the NTD used in this paper are divided into seven groups. Each group has 288 NTD points, which means that each group represents one day. Of the 288 data points, those from 12 p.m. to 7 a.m. (15 hours) are used for model training and fitting, and the trained and fitted model is adopted to predict the NTD from 8 p.m. to 9 p.m.
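The sample counts and the grouping into days can be checked with a short sketch. It assumes that each day's first sample starts at 00:00, which the source does not state; the helper names are hypothetical, the series holds placeholder values rather than real NTD, and the training and testing windows are left to the caller.

```python
SAMPLES_PER_HOUR = 60 // 5                 # one sample every 5 minutes
SAMPLES_PER_DAY = SAMPLES_PER_HOUR * 24    # 12 * 24
DAYS = 7


def split_into_days(series, per_day=SAMPLES_PER_DAY):
    """Cut a flat series into consecutive one-day groups."""
    if len(series) % per_day != 0:
        raise ValueError("series length must be a multiple of per_day")
    return [series[i:i + per_day] for i in range(0, len(series), per_day)]


def slot_index(hour, minute=0):
    """Index of a 5-minute slot within a day, assuming slot 0 starts at 00:00."""
    return hour * SAMPLES_PER_HOUR + minute // 5


series = list(range(SAMPLES_PER_DAY * DAYS))   # placeholder values, not real NTD
days = split_into_days(series)
print(len(series), len(days), len(days[0]))

# A one-hour window such as 20:00 to 21:00 covers 12 samples.
window = days[0][slot_index(20):slot_index(21)]
print(len(window))
```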

Prediction Results.
As discussed above, the EPSVM approach first applies Empirical Mode Decomposition to remove the noise interference from the original data. After the noise removal, the data are referred to as denoised NTD. Figures 5 and 6 show the noise elimination process for the inflow data and outflow data, respectively. It is worth noting that all of the NTD used were processed by EMD so that its effect can be observed. Comparing the original NTD with the denoised data in Figures 5 and 6 shows that the denoised data are slightly smoother. So, instead of the original series, the proposed approach EPSVM uses the denoised data for model training and fitting. After acquiring the denoised data, EPSVM uses the SVM model optimized by PSO to make the prediction. Here, each group of data was normalized, and every normalized group was divided into a training set and a testing set, where the training set was the NTD from 12 p.m. to 7 a.m. (15 hours) of each day and the testing set was the NTD from 8 p.m. to 9 p.m. The final seven-day prediction results were obtained for each day by the four approaches, respectively. Figures 7 and 8 roughly contrast the predicted results of the four methods with the original NTD to confirm that EPSVM performs better than the other three approaches. The three evaluation metrics (RMSE, MAE, and MAPE) defined in Section 4.1 were computed for the four methods and are recorded in Tables 1 and 2.
Tables 1 and 2 show the results of the three criteria for the four approaches (BSVM, PSVM, ESVM, and EPSVM) on each day, together with the average values of the three criteria. Because the results of the three metrics for the four forecasting methods in Table 1 differ from those in Table 2, we discuss the two tables separately. The following observations can be made from Table 1.
A comparison between BSVM and PSVM: Table 1 shows that, when the three evaluation metrics over the seven days are all taken into consideration, PSVM has lower values than BSVM except on Wednesday, Friday, and Saturday. BSVM has lower values than PSVM on only three days of the week; in terms of the average values, PSVM performs better than BSVM.
A comparison between PSVM and ESVM: Table 1 shows that, when the three metrics of each day are taken into consideration, ESVM has lower values than PSVM on almost every day of the week except Sunday. In terms of the one-week average, ESVM also has lower values than PSVM. To sum up, ESVM performs better than PSVM, and PSVM is better than BSVM on most days of the week. If the advantages of PSVM and ESVM are combined, the result should be even better. So, the proposed approach EPSVM, which combines PSVM and ESVM, could obtain better forecasting results.
Table 1 shows that, when the three metrics of each day are compared, EPSVM has lower values than the other three methods on most days; the exceptions are Sunday, on which the three evaluation metrics of EPSVM are higher than those of PSVM, and Tuesday, on which they are higher than those of ESVM. In addition, the average values of EPSVM are also lower than those of the other three methods.
The following observations can be made from Table 2.
A comparison between BSVM and PSVM: Table 2 shows that, when the three metrics of each day are all taken into consideration, PSVM has lower values than BSVM except on Monday and Sunday. BSVM has lower values than PSVM on only two days of the week; in terms of the average values, PSVM performs better than BSVM.
A comparison between PSVM and ESVM: Table 2 shows that, when the three metrics of each day are considered, ESVM has lower values than PSVM except on Saturday and Sunday. PSVM has lower values than ESVM on only two days of the week; in terms of the average values, ESVM performs better than PSVM.
All in all, from the above observations on Tables 1 and 2, the following conclusion can be reached: the proposed method EPSVM performs better than the other three methods, and all four methods have an acceptable performance on each day of the week.

Conclusions
Network traffic prediction offers useful data for website administrators to tailor the content hosted on Internet servers with a view to reaching a larger audience. To enhance real-time network traffic analysis, it is very important to develop a highly accurate network traffic prediction technique that helps the webmaster control bandwidth allocation effectively. In view of this, an artificial intelligence-based hybrid method, EPSVM, is presented in this article. EPSVM first uses EMD to process the original NTD so as to remove the noise component of the NTD. Then, it employs SVM to model the denoised network traffic series, with the parameters of SVM optimized by PSO. Experiments with the NTD from the LZU network center verify that EPSVM can enhance network traffic prediction accuracy. As part of real-time and reliable analysis of smart grids, EPSVM will help the webmaster better monitor websites or, in other words, help network engineers optimize their websites, maximize online marketing, track user behavior, and push ads to users.

Data Availability
The data used to support the findings of this study were supplied by Lanzhou University under license and so cannot be made freely available. Requests for access to these data should be made to the corresponding author.

Figure 5: The noise eliminating process of inflow data.

Table 1: The result of evaluation metrics of the four forecasting methods (inflow NTD).

Table 2: The result of the three evaluation metrics of the four forecasting methods (outflow NTD).