End-Point Static Control of Basic Oxygen Furnace (BOF) Steelmaking Based on Wavelet Transform Weighted Twin Support Vector Regression

. A static control model is proposed based on wavelet transform weighted twin support vector regression (WTWTSVR). Firstly, new weighted matrix and coefficient vector are added into the objective functions of twin support vector regression (TSVR) to improve the performance of the algorithm. The performance test confirms the effectiveness of WTWTSVR. Secondly, the static control model is established based on WTWTSVR and 220 samples in real plant, which consists of prediction models, control models, regulating units, controller, and BOF. Finally, the results of proposed prediction models show that the prediction error bound with 0.005% in carbon content and 10 ∘ C in temperature can achieve a hit rate of 92% and 96%, respectively. In addition, the double hit rateof 90% is the best result by comparing with four existing methods. The results of the proposed static control model indicate that the control error bound with 800 Nm 3 in the oxygen blowing volume and 5.5 tons in the weight of auxiliary materials can achieve a hit rate of 90% and 88%, respectively. Therefore, the proposed model can provide a significant reference for real BOF applications, and also it can be extended to the prediction and control of other industry applications.


Introduction
With the development of end-point control technology for basic oxygen furnace (BOF), the static control model can be established to overcome the randomness and inconsistency of the artificial experience control models.According to the initial conditions of hot metal, the relative calculations can be carried out to guide production.The end-point hit rate would be improved through this approach.Unfortunately, the control parameters would not be adjusted during the smelting process, which restricts the further improvement of the end-point hit rate.To solve this problem, the sublance based dynamic control model could be adopted by using the sublance technology.By combining the static model with the dynamic model, the end-point hit rate of BOF could be guaranteed.The establishment of the static control model is the foundation of the dynamic model.The accuracy of the static model will directly affect the hit rates of the dynamic control model, thus it plays an important role in the parameter optimization of BOF control process.Therefore, the static control model is still a practical and reliable technology to guide the production and improve the technology and management level of steelmaking plants.
In recent years, some significant developments of BOF prediction and control modelling have been achieved.Blanco et al. [1] designed a mixed controller for carbon and silicon in a steel converter in 1993.In 2002, three back propagation models are adopted to predict the end-blow oxygen volume and the weight of coolant additions [2].In 2006, a dynamic model is constructed to predict the carbon content and temperature for the end-blow stage of BOF [3].Based on multivariate data analysis, the slopping prediction was proposed by Brämming et al. [4].In 2014, the multi-level recursive regression model was established for the prediction of end-point phosphorus content during BOF steelmaking process [5].An antijamming endpoint prediction model 2 Complexity of extreme learning machine (ELM) was proposed with evolving membrane algorithm [6].By applying input variables selection technique, the input weighted support vector machine modelling was proposed [7], and then the prediction model was established on the basis of improving a casebased reasoning method [8].The neural network prediction modellings [9][10][11][12] were carried out to achieve aimed endpoint conditions in liquid steel.A fuzzy logic control scheme was given for the basic oxygen furnace in [13].Most of these achievements are based on the statistical and intelligent methods.
As an intelligent method, Jayadeva et.al. [14] proposed a twin support vector machine (TSVM) algorithm in 2007.The advantage of this method is that the computational complexity of modelling can be reduced by solving two quadratic programming problems instead of one in traditional method.It is also widely applied to the classification applications.In 2010, Peng [15] proposed a twin support vector regression (TSVR), which can be used to establish the prediction model for industrial data.After that, some improved TSVR methods [16][17][18][19][20][21][22] were proposed.By introducing a K-nearest neighbor (KNN) weighted matrix into the optimization problem in TSVR, the modified algorithms [16,19] were proposed to improve the performance of TSVR.A V-TSVR [17] and asymmetric V-TSVR [20] were proposed to enhance the generalization ability by tuning new model parameters.To solve the ill-conditioned problem in the dual objective functions of the traditional TSVR, an implicit Lagrangian formulation for TSVR [18] was proposed to ensure that the matrices in the formulation are always positive semidefinite matrices.Parastalooi et al. [21] added a new term into the objective function to obtain structural information of the input data.By comparing with the neural network technology, the disadvantage of neural network is that the optimization process may fall into the local optimum.The optimization of TSVR is a pair of quadratic programming problems (QPPs), which means there must be a global optimal solution for each QPP.All above modified TSVR algorithms are focused on the improvements of the algorithm accuracy and the computation speed.Currently, the TSVR algorithm has never been adopted in the BOF applications.Motivated by this, the TSVR algorithm can be used to establish a BOF model.
Wavelet transform can fully highlight the characteristics of some aspects of the problem, which has attracted more and more attention and been applied to many engineering fields.In this paper, the wavelet transform technique is used to denoise the output samples during the learning process, and it is a new application of the combination of wavelet transform and support vector machine method.Then, a novel static control model is proposed based on wavelet transform weighted twin support vector regression (WTWTSVR), which is an extended model of our previous work.In [23], we proposed an end-point prediction model with WTWTSVR for the carbon content and temperature of BOF, and the accuracy of the prediction model is expected.However, the prediction model cannot be used to guide real BOF production directly.Hence, a static control model should be established based on the prediction model to calculate the oxygen volume and the weight of auxiliary raw materials, and the accuracy of the calculations affects the quality of the steel.Therefore, the proposed control model can provide a guiding significance for real BOF production.It is also helpful to other metallurgical prediction and control applications.To improve the performance of the control model, an improvement of the traditional TSVR algorithm is carried out.A new weighted matrix and a coefficient vector are added into the objective function of TSVR.Also, the parameter  in TSVR is not adjustable anymore, which means it is a parameter to be optimized.Finally, the static control model is established based on the real datasets collected from the plant.The performance of the proposed method is verified by comparing with other four existing regression methods.The contributions of this work include the following.(1) It is the first attempt to establish the static control model of BOF by using the proposed WTWTSVR algorithm.(2) New weighted matrix and coefficient vector are determined by the wavelet transform theory, which gives a new idea for the optimization problem in TSVR areas.(3) The proposed algorithm is an extension of the TSVR algorithm, which is more flexible and accurate to establish a prediction and control model.(4) The proposed control model provides a new approach for the applications of BOF control.The application range of the proposed method could be extended to other metallurgical industries such as the prediction and control in the blast furnace process and continuous casting process.
Remark 1.The main difference between primal KNNWTSVR [16] and the proposed method is that the proposed algorithm utilizes the wavelet weighted matrix instead of KNN weighted matrix for the squared Euclidean distances from the estimated function to the training points.Also, a wavelet weighted vector is introduced into the objective functions for the slack vectors. 1 and  2 are taken as the optimized parameters in the proposed algorithm to enhance the generalization ability.Another difference is that the optimization problems of the proposed algorithm are solved in the Lagrangian dual space and that of KNNWTSVR are solved in the primal space via unconstrained convex minimization.
Remark 2. By comparing with available weighted technique like K-nearest neighbor, the advantages of the wavelet transform weighting scheme are embodied in the following two aspects: (1) The Adaptability of the Samples.The proposed method is suitable for dealing with time/spatial sequence samples (such as the samples adopted in this paper) due to the character of wavelet transform.The wavelet transform inherits and develops the idea of short-time Fourier transform.The weights of sample points used in the proposed algorithm are determined by calculating the difference between the sample values and the wavelet-regression values for Gaussian function, which can mitigate the noise, especially the influence of outliers.While KNN algorithm determines the weight of the sample points by calculating the number of adjacent points (determined by Euclidian distance), which is more suitable for the samples of multi-points clustering type distribution.
(2) The Computational Complexity.KNN algorithm requires a large amount of computations, because the distance between each sample to all known samples must be computed to obtain its K nearest neighbors.By comparing with KNN weighting scheme, the wavelet transform weighting scheme has less computational complexity, because it is dealing with one dimensional output samples, and the computation complexity is proportional to the number of samples l.KNN scheme is dealing with the input samples, the computation complexity of KNN scheme is proportional to  2 .With the increasing of dimensions and number of the samples, it will have a large amount of computations.
Therefore, the wavelet transform weighting scheme is more competitive than KNN weighting scheme for the time sequence samples due to its low computational complexity.

Background
2.1.Description of BOF Steelmaking.BOF is used to produce the steel with wide range of carbon, alloy, and special alloy steels.Normally, the capacity of BOF is between 100 tons and 400 tons.When the molten iron is delivered to the converter through the rail, the desulphurization process is firstly required.In BOF, the hot metal and scrap, lime, and other fluxes are poured into the converter.The oxidation reaction is carried out with carbon, silicon, phosphorus, manganese, and some iron by blowing in a certain volume of oxygen.The ultimate goal of steelmaking is to produce the steel with specific chemical composition at suitable tapping temperature.The control of BOF is difficult because the whole smelting process is only half an hour, and there is no opportunity for sampling and analysis in the smelting process.The proportion of the iron and scrap is about 3:1 in BOF.The crane loads the waste into the container and then pours the molten iron into the converter.The water cooled oxygen lance enters the converter, the high purity oxygen is blown into BOF at 16000 cubic feet per minute, and the oxygen is reacted with carbon and other elements to reduce the impurity in the molten metal and converts it into a clean, high quality liquid steel.The molten steel is poured into the ladle and sent to the metallurgical equipment of the ladle [13].
Through the oxidation reaction of oxygen blown in BOF, the molten pig iron and the scrap can be converted into steel.It is a widely used steelmaking method with its higher productivity and low production cost [3].However, the physical and chemical process of BOF is very complicated.Also, there are various types of steel produced in the same BOF, which means that the grade of steel is changed frequently.Therefore, the BOF modelling is a challenge task.The main objective of the modelling is to obtain the prescribed end-point carbon content and temperature.

Nonlinear
Twin Support Vector Regression.Support vector regression (SVR) was proposed for the applications of regression problems.For the nonlinear case, it is based on the structural risk minimization and Vapnik -insensitive loss function.In order to improve the training speed of SVR, Peng proposed a TSVR algorithm in 2010.The difference between TSVR and SVR is that SVR solves one large QPP problem and TSVR solves two small QPP problems to improve the learning efficiency.Assume that a sample is an -dimensional vector and the number of the samples is , which can be expressed as (x 1 ,  1 ), . . ., (x  ,   ).Let A = [x 1 , . . ., x  ]  ∈  × be the input data set of training samples, y = [ 1 , . . ., y  ]  ∈   be the corresponding output, and e = [1, . . ., 1]  be the ones vector with appropriate dimensions.Assume (., .)denotes a nonlinear kernel function.Let (A, A  ) be the kernel matrix with order  and its (, )-th element (,  = 1, 2, . . ., ) be defined by Here, the kernel function (x, x  ) represents the inner product of the nonlinear mapping functions Φ(x  ) and Φ(x  ) in the high dimensional feature space.Because there are various kernel functions, the performance comparisons of kernel functions will be discussed later.In this paper, the radial basis kernel function (RBF) is chosen as follows: where  is the width of the kernel function.Let (x  , A  ) = ((x, x 1 ), (x, x 2 ), . . ., (x, x  )) be a row vector in   .Then, two -insensitive bound functions  1 (x) = (x  , A  ) Nonlinear TSVR can be obtained by solving two QPPs as follows: By introducing the Lagrangian function and Karush-Kuhn-Tucker conditions, the dual formulations of (4) and ( 5) can be derived as follows: where G = [(A, A  ), e], g = y −  1 e, and h = y +  2 e.To solve the above QPPs ( 6) and ( 7), the vectors  1 ,  1 and  2 ,  2 can be obtained: and By substituting the above results into (3), the final regression function can be obtained.

Nonlinear Wavelet Transform Based Weighted Twin
Support Vector Regression

Model Description of Nonlinear WTWTSVR.
In 2017, Xu el.al. [20] proposed the asymmetric V-TSVR algorithm based on pinball loss functions.This new algorithm can enhance the generalization ability by tuning new model parameters.The QPPs of nonlinear asymmetric V-TSVR were proposed as follows: min and min where  1 ,  2 , V 1 , V 2 ≥ 0 are regulating parameters and the parameter  ∈ (0,1) is used to apply a slightly different penalty for the outliers.The contributions of this method are that  1 and  2 are introduced into the objective functions of QPPs, and the parameters V 1 and V 2 are used to regulate the width of  1 and  2 tubes.Also, the parameter  is designed to give an unbalance weight for the slack vectors  and  * .Based on the concept of asymmetric V-TSVR, a nonlinear wavelets transform based weighted TSVR is firstly proposed in this paper.By comparing with the traditional TSVR algorithm, the proposed algorithm introduces a wavelet transform based weighted matrix D 1 and a coefficient vector D 2 into the objective function of TSVR.Also, the parameters  1 and  2 are not adjustable by user anymore, which means they are both introduced into the objective functions.Simultaneously, the regularization is also considered to obtain the optimal solutions.Therefore, the QPPs of nonlinear WTWTSVR are proposed as follows: min and min where  1 ,  2 ,  3 ,  4 , V 1 , V 2 ≥ 0 are regulating parameters, D 1 is a diagonal matrix with the order of  × , and D 2 is a coefficient vector with the length of  × 1.The determination of D 1 and D 2 will be discussed later.
The first term in the objective function of ( 12) or ( 13) is used to minimize the sums of squared Euclidean distances from the estimated function  1 (x) = (x  , A  ) 1 +  1 or  2 (x) = (x  , A  ) 2 +  2 to the training points.The matrix D 1 gives different weights for each Euclidean distance.The second term is the regularization term to avoid the overfitting problem.The third term minimizes the slack vector  or  * and the width of  1 -tube or  2 -tube.The coefficient vector D 2 is a penalty vector for the slack vector.To solve the problem in (12), the Lagrangian function can be introduced, that is, where , , and  are the positive Lagrangian multipliers.By differentiating  with respect to the variables  1 ,  1 , ,  1 , we have The K.K.T. conditions are given by Combining ( 15) and ( 16), we obtain Let H = [(A, A  )e] and u 1 = [  1  1 ]  , and ( 20) can be rewritten as where I is an identity matrix with appropriate dimension.From ( 21), we get From ( 15), ( 16), and ( 19), the following constraints can be obtained: Substituting (22) into Lagrangian function , we can get the dual formulation of ( 12) Similarly, the dual formulation of ( 13) can be derived as follows: (25) To solve the above QPPs, the vectors  1 ,  1 and  2 ,  2 can be expressed by and By substituting the above results into (3), the final regression function can be obtained.

Determination of Wavelets Transform Based Weighted
Matrix D 1 and a Coefficient Vector D 2 .The wavelet transform can be used to denoise the time series signal.Based on the work of Ingrid Daubechies, the Daubechies wavelets are a family of orthogonal wavelets for a discrete wavelet transform.There is a scaling function (called father wavelet) for each wavelet type, which generates an orthogonal multiresolution analysis.Daubechies orthogonal wavelets 1 − 10 are commonly used.
The wavelet transform process includes three stages: decomposition, signal processing, and reconstruction.In each step of decomposition, the signal can be decomposed into two sets of signals: high frequency signal and low frequency signal.Suppose that there is a time series signal S.After the first step of decomposition, it generates two signals: one is high frequency part Sh 1 and another is low frequency part Sl 1 .Then, in the second step of decomposition, the low frequency signal Sl 1 can be decomposed further into Sh 2 and Sl 2 .After  steps of decomposition,  + 1 groups of decomposed sequence (Sh 1 , Sh 2 , . . ., Sh  , Sl  ) are obtained, where Sl  represents the contour of the original signal S and Sh 1 , Sh 2 , . . ., Sh  represent the subtle fluctuations.The decomposition process of signal S is shown in Figure 1 and defined as follows: where Sl −1 with the length  −1 is the signal to be decomposed, Sl  and Sh  are the results in the -th step.( − 2), and ( − 2) are called scaling sequence (low pass filter) and wavelets sequence, respectively [24].In this paper, 2 wavelet with the length of 4 is adopted.After wavelet transforming, appropriate signal processing can be carried out.
In the stage of reconstruction, the processed high frequency signal Sh *  and low frequency signal Sl *  are reconstructed to generate the target signal S * .Let P and Q be the matrix with the order of   ×  −1 , P , = (−2), and Q , = (−2).The reconstruction process is shown in Figure 2. Therefore, (28) and (29) can be rewritten as follows: The signal Sl * −1 can be generated by reconstructing Sl *  and Sh *  : -th sample, which can be defined as the following Gaussian function: where   denotes the weight coefficient and  * is the width of the Gaussian function.Therefore, the wavelets transform based weighted matrix D 1 and the coefficient vector D 2 can be determined by D 1 = diag( ), which implies that the training speed of SVR is about four times the proposed algorithm.Also, a pair of inverse matrices with the size ( + 1) × ( + 1) in QPPs have the same computational cost ( 3 ).During the training process, it is a good way to cache the pair of inverse matrices with some memory cost in order to avoid repeated computations.
In addition, the proposed algorithm contains the wavelet transform weighted matrix and db2 wavelet with length of 4 is used in this paper.Then, the complexity of wavelet transform is less than 8l.By comparing with the computations of QPPs and inverse matrix, the complexity of computing the wavelet matrix can be ignored.Therefore, the computation complexity of the proposed algorithm is about (3 3 ).

Establishment of Static Control Model
The end-point carbon content (denoted as C) and the temperature (denoted as T) are main factors to test the quality of steelmaking.The ultimate goal of steelmaking is to control C and T to a satisfied region.BOF steelmaking is a complex physicochemical process, so its mathematical model is very difficult to establish.Therefore, the intelligent method such as TSVR can be used to approximate the BOF model.From the collected samples, it is easy to see that the samples are listed as a time sequence, which means they can be seen as a time sequence signal.Hence, the proposed WTWTSVR algorithm is appropriate to establish a model for BOF steelmaking.
In this section, a novel static control model for BOF is established based on WTWTSVR algorithm.According to the initial conditions of the hot metal and the desired endpoint carbon content (denoted as   ) and the temperature (denoted as   ), the relative oxygen blowing volume (denoted as V) and the weight of auxiliary raw material (denoted as W) can be calculated by the proposed model.Figure 3 shows the structure of the proposed BOF control model, which is composed of a prediction model for carbon content and temperature (C/T prediction model), a control model for oxygen blowing volume and the weight of auxiliary raw materials (V/W control model), two parameter regulating units (R 1 and R 2 ), and a controller and a basic oxygen furnace in any plant.Firstly, the WTWTSVR C/T prediction model should be established by using the historic BOF samples, which consists of two individual models (C model and T model).C model represents the prediction model of the end-point carbon content and T model represents the prediction of the end-point temperature.The inputs of two models both include the initial conditions of the hot metal, and the outputs of them are C and T, respectively.The parameters of the prediction models can be regulated by R 1 to obtain the optimized models.Secondly, the WTWTSVR V/W control model should be established based on the proposed prediction models, and the oxygen blowing volume can be calculated by V model and the weight of auxiliary raw materials can be determined by W model.The inputs of the control models both include the initial conditions of the hot metal, the desired end-point carbon content   ), and the end-point temperature   .The outputs of them are V and W, respectively.The parameters of the control models can be regulated by R 2 .After the regulation of R 1 and R 2 , the static control model can be established.For any future hot metal, the static control model can be used to calculate V and W by the collected initial conditions and   and   .Then, the calculated values of V and W will be sent to the controller.Finally, the relative BOF system is controlled by the controller to reach the satisfactory end-point region.

Establishment of WTWTSVR C/T Prediction Model.
To realize the static BOF control, an accurate prediction model of BOF should be designed firstly.The end-point prediction model is the foundation of the control model.By collecting the previous BOF samples, the abnormal samples must be discarded, which may contain the wrong information of BOF.Through the mechanism analysis of BOF, the influence factors on the end-point information are mainly determined by the initial conditions of hot metal, which means the influence factors are taken as the independent input variables of the prediction models.The relative input variables are listed in Table 1.Note that the input variable x 9 denotes the sum of the weight of all types of auxiliary materials, which are including the light burned dolomite, the dolomite stone, the lime ore and scrap, etc.It has a guiding significance for real production, and the proportion of the components can be determined by the experience of the user.The output variable of the model is the end-point carbon content C or the endpoint temperature T.
According to the prepared samples of BOF and WTWTSVR algorithm, the regression function  / () can be given by Out of the historical data collected from one BOF of 260 tons in some steel plant in China, 220 samples have been selected and prepared.In order to establish the WTWTSVR prediction models, the first 170 samples are taken as the training data and 50 other samples are taken as the test data to verify the accuracy of the proposed models.In the regulating unit R 1 , the appropriate model parameters ( 1 ,  2 ,  3 ,  4 , V 1 , V 2 , and  1 ,  * 1 ) are regulated manually, and then the C model and T model can be established.In summary, the process of the modelling can be described as follows.
Step 1. Initialize the parameters of the WTWTSVR prediction model, and normalize the prepared 170 training samples from its original range to the range [-1 1] by mapminmax function in Matlab.Step 3. By selecting the appropriate parameter  * 1 in (33), determine the wavelets transform based weighted matrix D 1 and the coefficient vector D 2 .
Step 5. Solve the optimization problems in (24) and (25) by  function in Matlab and return the optimal vector  or .
Step 7. Substitute the parameters in Step 5 into (3) to obtain the function  / (x).
Step 8. Substitute the training samples into (34) to calculate the relative criteria of the model, which will be described in details later.
Step 9.If the relative criteria are satisfactory, then the C model or T model is established.Otherwise, return to Steps from 4 to 8.

Establishment of WTWTSVR V/W Control Model.
Once the WTWTSVR C/T prediction models are established, the WTWTSVR V/W control models are ready to build.In order to control the end-point carbon content and temperature to the satisfactory region for any future hot metal, the relative oxygen blowing volume V and the total weight of auxiliary raw materials W should be calculated based on V/W control models, respectively.Through the mechanism analysis, the influence factors on V and W are mainly determined by the initial conditions of hot metal, and the desired conditions of the steel are also considered as the the influence factors.Then, the independent input variables of the control models are listed in Table 2, where the input variables x 1 -x 7 are the same as the C/T prediction model and other two input variables are the desired carbon content   and desired temperature   , respectively.The output variable of the control model is V or W defined above.
Similar to (34), the regression function  / (x) can be written as where  / (x) denotes the estimation function of V model or W model and x = [ 1 ,  2 , . . .,  9 ]  .In order to establish the WTWTSVR control models, the training samples and the test samples are the same as that of the prediction models.The regulating unit  2 is used to regulate the appropriate model parameters ( 5 ,  6 ,  7 ,  8 , V 3 , V 4 and  2 ,  * 2 ) manually; then the V model and W model can be established.In summary, the process of the modelling can be described as follows.
Steps 1-7.Similar to those in the section of establishing the prediction models except for the input variables listed in Table 2 and the output variable being the oxygen blowing volume V = [ 1 ,  2 , . . .,   ]  or the total weight of auxiliary materials W = [ 1 ,  2 , . . .,   ]  .Also, select appropriate values of  5 ,  6 ,  7 ,  8 , V 3 , V 4 and  2 in the regulating unit  2 ; then the function  / (x) is obtained.
Step 8. Substitute the training samples into (35) to calculate the relative predicted values V and Ŵ of V and W respectively.
Step 9. V and Ŵ are combined with the input variables  1 − 7 in Table 1 to obtain 50 new input samples.Then, substituting them into the C model and T model, 50 predicted values Ĉ and T can be obtained.Finally, calculate the differences between the desired values   ,   and the predicted values Ĉ, T. Also, other relative criteria of the model should be determined.

Results and Discussion
In order to verify the performances of WTWTSVR algorithm and proposed models, the artificial functions and practical datasets are adopted, respectively.All experiments are carried out in Matlab R2011b on Windows 7 running on a PC with Intel (R) Core (TM) i7-4510U CPU 2.60GHz with 8GB of RAM.
The evaluation criteria are specified to evaluate the performance of the proposed method.Assume that the total number of testing samples is ,   is the actual value at the sample point   , ŷ is the estimated value of   , and   = (∑  =1   )/ is the mean value of  1 ,  2 , . . .,   .Therefore, the following criteria can be defined: In the above equations, RMSE denotes the root mean squared error, MAE denotes the mean absolute error, and SSE, SST, and SSR represent the sum of the squared deviation between any two of   , ŷ ,   , respectively.Normally, a smaller value of SSE/SST reflects that the model has a better performance.The decrease in SSE/SST reflects the increase in SSR/SST.However, if the value of SSE/SST is extremely small, it will cause the overfitting problem of the regressor.An important criteria for evaluating the performance of the BOF model is hit rate () calculated by (40), which is defined as the ratio of the number of the satisfactory samples over the total number of samples.For any sample in the dataset, if the absolute error between the estimated value and actual value is smaller than a certain error bound, then the results of the sample hit the end point.A large number of hit rate indicate a better performance of the BOF model.Generally, a hit rate of 90% for C or T is a satisfactory value in the real steel plants.

Performance Test of WTWTSVR.
In this section, the artificial function named Sinc function is used to test the regression performance of the proposed WTWTSVR method, which can be defined as  = sin /,  ∈ [−4, 4].
In order to evaluate the proposed method effectively, the training samples are polluted by four types of noises, which include the Gaussian noises with zero means and the uniformly distributed noises.For the train samples (  ,   ),  = 1, 2, . . ., , we have where [, ] and (,  2 ) denote the uniformly random variable in [, ] and the Gaussian variable with the mean  and variance  2 , respectively.For all regressions of the above four Sinc functions, 252 training samples and 500 test samples are selected.Note that the test samples are uniformly sampled from the Sinc functions without any noise.For all algorithms in this paper, the following sets of parameters are explored: the penalty coefficient c is searched from the set {/1000, /100, /10 |  = 1, 2, . . ., 10}.The tube regulation coefficient v is selected from the set {1, 2, . . ., 10}.The wavelet coefficient  and kernel parameter  * are both searched over the range {/1000, /100, /10,  |  = 1, 2, . . ., 10}.In order to reduce the number of combinations in the parameter search, the following relations are chosen:  Firstly, the choice of kernel function should be considered, RBF kernel is an effective and frequently used kernel function in TSVR research papers.Also, the performance of RBF has been evaluated by comparing with other three existing kernel functions (listed in Table 3), and the results are shown in Table 4.It is easy to see that the RBF kernel achieves the optimal results.Then, the average results of the proposed methods and other four existing methods (TSVR [15], V-TSVR [17], KNNWTSVR [19] and Asy V-TSVR [20]) with 10 independent runs are shown in Table 5, where Type A, B, C and D denote four different types of noises (41)-(44).Obviously, the results of Table 5 show that the proposed method achieves the optimal result of SSE.Also, the proposed method achieves the smallest SSE/SST results in four regressions of Sinc functions, which are 0.0029, 0.0124, 0.0231 and 0.0903.Especially, the obvious performances are enhanced in Type A and B. The SSR/SST of the proposed method takes the first position in Type B, the second position in Type A and C, and the third position in Type D. From the aspect of training time, it can be seen that the proposed method achieves the optimal result in Type B. In Type A, C and D, the training time of the proposed method is faster than that of TSVR and KNNWTSVR, and close to v-Type and Asy v-Type.It verifies that the computational complexity of wavelet transform weighting scheme is lower than KNN weighting scheme.For the effect of outliers, we have verified the performance of the proposed method on the Sinc function with Type A noise. Figure 4 shows that the prediction performance of the proposed method is satisfied against the outliers, as shown in the blue line, and the results of SSE, SSE/SST and SSR/SST achieve 0.1911, 0.0036 and 1.0372, respectively.By comparing with the results in Table 5, it can be concluded that the overall performance of the proposed method with outliers is still better than TSVR, v-TSVR and KNNWTSVR.Therefore, it can be concluded that the overall regression performance of the proposed method is optimal.4.2.Verification of the Proposed BOF Models.In order to verify the effectiveness of the proposed model, 240 heats samples for low carbon steel of 260 BOF were collected from some steel plant in China.Firstly, the preprocessing of the samples should be carried out.The abnormal samples with the unexpected information must be removed, which exceeds the actual range of the signals.It may be caused by the wrong sampling operation.Then, 220 qualified samples are obtained.The next step is to analyze the information of the qualified samples.The end-point carbon content and temperature are mainly determined by the initial conditions of the iron melt, the total blowing oxygen volume and the weight of material additions.Therefore, the unrelated information should be deleted, such as the heat number, the date of the steelmaking and so on.According to the information of Tables 1 and 2, the datasets for the relative prediction and control models can be well prepared.After that, the proposed models are ready to be established.More details of the model have been described in Section 3. The proposed control model consists of two prediction models (C model and T model) and two control models (V model and W model).For each individual model, there are 8 parameters that need to be regulated to achieve the satisfactory results.The regulation processes are related to the units R 1 and R 2 in Figure 3. First of all, the prediction models need to be established.In order to meet the requirements of the real production, the prediction errors bound with 0.005% for C model and 10 ∘ C for T model are selected.Similarly, the control errors bound with 800 Nm 3 for V model and 5.5 tons for W model are selected.The accuracy of each model can be reflected by the hit rate with its relative error bound, and a hit rate of 90% within each individual error bound is a satisfied result.the collected samples, the parameters of WTWTSVR models are specified and shown in Table 6.
By using these parameters, the proposed BOF control model has been established.In order to evaluate the performance of the proposed method, more simulations of other existing regression methods with the samples are carried out, which are TSVR, V-TSVR, Asy V-TSVR, and KNNWTSVR, respectively.The comparison results of the carbon content prediction are listed in Table 7. From the results, it can be seen that the results of RMSE, MAE, and SSE/SST in the proposed method are 0.0023, 0.0026, and 1.1977, respectively.They are all smaller than those of other three existing methods, and the SSR/SST of 0.6930 is the highest result.The error performance and distribution of end-point carbon content between the predicted values and actual values are shown in Figures 5 and 6.It is clear that the proposed C model can achieve the hit rate of 92%, which is better than other four methods.From above analysis, it illustrates that the proposed C model has the best fitting behaviour for carbon content prediction.
Similarly, the performance comparisons of the temperature prediction are also listed in Table 7. From the results of Table 7, the best results of RMSE and SSE/SST and second best result of MAE of the proposed method are obtained.The results of SSR/SST is in the third position.Figures 5  and 6 show that the proposed T model can achieve the hit rate of 96%, which is the optimal results by comparing with other methods.In addition, the double hit rate is also the key criterion in the real BOF applications; the proposed method can achieve a double hit rate of 90%, which is the best result, although the temperature hit rate of V-TSVR and KNNWTSVR method can also achieve 96%.However, their double hit rates only achieve 80% and 84%, respectively.In the real applications, a double hit rate of 90% is satisfactory.Therefore, the proposed model is more efficient to provide a reference for the real applications.Also, it meets the requirement of establishing the static control model.
Based on the prediction models, the control models (V model and W model) can be established by using the relative parameters in Table 6.The performance of the proposed V model is shown in Figure 7.By comparing the proposed method with the existing methods, the comparison results of the oxygen blowing volume calculation are listed in Table 8, which shows that the results of RMSE, MAE, and SSE/SST in the proposed method are 371.3953,411.7855, and 1.2713, respectively.They are all smaller than those of other four existing methods, and the / of 1.0868 is in the fourth position.Figures 5 and 6 show that the predicted values of the proposed model agree well with the actual values of oxygen volume, and the proposed V model has a best hit rate of 90%.It verifies that the proposed V model has the best fitting behaviour for the calculation of oxygen blowing volume.
Similarly, the performance of the proposed W model is shown in Figure 8, and the performance comparisons of the weight of the auxiliary materials are also listed in Table 8.Table 6: Specified parameters of prediction and control models.
Prediction Model  The proposed model achieves the best results of RMSE, MAE, and SSE/SST.Figures 5 and 6 show that the proposed method achieves the best hit rate of 88%.In addition, the training time of the proposed models is faster than that of KNNWTSVR method and slower than that of other three methods.It illustrates that the weighting scheme takes more time to obtain higher accuracy, and the performance of proposed weighting scheme is better than that of KNN weighting scheme for time sequence samples.For 50 test samples, there are 50 calculated values of V and W by V model and W model.Then, they are taken as the input variables into the proposed C model and T model to verify the end-point hit rate.From the results in Table 8, the proposed models can achieve a hit rate of 86% in C, 92% in T, and 82% in  is in the optimal result and the same as that of V-TSVR, Asy V-TSVR, and KNNWTSVR.In the real productions,  is paid more attention rather than the individual HR in C or T. Therefore, the proposed control models are verified to be able to guide the real production.
Based on above analysis, it can be concluded that the proposed static control model is effective and feasible; the hit rate can meet the requirements of the real productions for low carbon steel.For other types of steels, the proposed model is still suitable for use.Firstly, the specific samples of the heats should be obtained and preprocessed, and the analysis of the influence factors on the specific type of steel should be carried out to obtain the input variables of the model.The output variables are the same as the relative proposed models.Secondly, the end-point criteria should be specified, which is determined by the type of steel.Then, the parameters of the relative models can be regulated and determined to achieve the best criteria.Finally, the BOF control model for the specific type of steel is established to guide the production in the plant.
BOF steelmaking is a complex physicochemical process; the proposed static control model can be established based on the real samples collected from the plant.However, there must be some undetected factors during the steelmaking process, which will affect the accuracy of the calculations of V and W. To solve this problem, the following strategies can be introduced: at the early stage of the oxygen blowing, the proposed control model is used to calculate the relative V and W and guide the BOF production.Then, the sublance technology is adopted at the late stage of oxygen blowing, because the physical and chemical reactions tend to be stable in this stage.Hence, the information of the melt liquid can be collected by the sublance.Therefore, another dynamic control model can be established with the sublance samples to achieve a higher end-point hit rate.For medium and small steel plants, the proposed static model is a suitable choice to reduce the consumption and save the cost.
Remark 3.Although this paper is mainly based on static control model, the prediction scheme is also compatible for other datasets over the globe.Especially, the proposed algorithm is competitive for the regression of time sequence datasets, such as the prediction of the blast furnace process and continuous casting process in the metallurgical industry.

Conclusion
In this paper, a WTWTSVR control model has been proposed.The new weighted matrix and the coefficient vector have been determined by the wavelet transform theory and added into the objective function of TSVR to improve the performance of the algorithm.The simulation results have shown that the proposed models are effective and feasible.The prediction error bound with 0.005% in C and 10 ∘ C in T can achieve a hit rate of 92% and 96%, respectively.In addition, the double hit rate of 90% is the best result by comparing with other three existing methods.The control error bound with 800 Nm 3 in V and 5.5 tons in W can achieve the hit rate of 90% and 88%, respectively.Therefore, the proposed method can provide a significant reference for real BOF applications.For the further work, on the basis of the proposed control model, a dynamic control model could be established to improve the end-point double hit rate of BOF up to 90% or higher.

Figure 3 :
Figure 3: Structure of proposed BOF control model.

Figure 4 :
Figure 4: Performance of WTWTSVR on Sinc function with and without outliers.

Figure 5 :
Figure 5: Performance of errors between predicted values and actual values with proposed static control model.

3 Figure 6 :
Figure 6: Performance of error distributions with proposed static control model.

Figure 8 :
Figure 8: Performance of proposed W model.

Table 1 :
Independent input variables of prediction models.

Table 2 :
Independent input variables of control models.Step 10.If the obtained criteria are satisfactory, then the V model or W model is established.Otherwise, return to Steps from 4 to 9.

Table 3 :
Type and functions of four existing kernel functions.

Table 4 :
. Comparisons of WTWTSVR on Sinc functions with different kernel functions.

Table 5 :
Comparisons of four regression methods on Sinc functions with different noises.hit for the same sample.Hence, 170 samples are used to train the prediction and control models, and 50 samples are adopted to test the accuracy of the models.For each run of the relative modelling, the results are returned to the user with the information of the evaluation criteria.It shows the accuracy and fitness of the models with the specific parameters.Smaller values of RMSE, MAE, and SSE/SST and larger values of SSR/SST and hit rate are preferred, which means the model has a better generalization and higher accuracy.The regulation units R 1 and R 2 are used to balance the above criteria by selecting the appropriate values of the parameters.Note that the principle of the parameter selection is satisfactory if the following results are obtained: / > 0.5 and  > 85% with the smallest SSE/SST.Especially, the double hit rate DHR should be greater than 80 or higher.For

Table 7 :
Performance comparisons of C/T prediction models with four methods.

Table 8 :
Performance comparisons of V/W control models with four methods.