Abstract

This paper develops a type of data-driven networked optimal iterative learning control strategy for a class of discrete linear time-varying systems with one-operation Bernoulli-type communication delays. In terms of the stochastic Bernoulli-type one-operation communication delayed inputs and outputs, the previous-iteration synchronous compensations are adopted. By means of deriving gradients of two types of objective functions that express the optimal approximation of the system matrix and the minimal tracking error, the strategy approximates the system matrix and upgrades the control inputs in an interact mode as the iteration evolves. By taking advantage of matrix theory and statistical technique, it is derived that the approximation discrepancy of the system matrix is bounded and the mathematical expectation of the tracking error vanishes as the iteration goes on. Numerical simulations manifest the validity and effectiveness.

1. Introduction

Iterative learning control (ILC) has been acknowledged as one of effectively intelligent strategies, which performs a high-precision trajectory tracking repetitively over a fixed time interval, as surveyed in [13]. Since its invention, numerous ILC strategies have been developed for theoretical analyses and practical applications over the past three decades [411]. Convergence analyses as one of key ILC theoretical issues have been discussed from all aspects of the norm of tracking errors for guaranteeing the ILC implement, such as infinite-norm [4], lambda-norm [5], sup-norm [6], Lebesgue-p norm [7], 2D technique [8], and Lyapunov method [9]. Form practical executions, kinds of system perturbations and uncertainties are avoidable, such as iteration-varying disturbances, time-varying uncertainties, and system uncertainty. For that, robust ILCs have been involved in [1013]. It is reminded that, basically, the strategic feature of the ILC is that its formulation is irrelevant to the system dynamics but the multioperation inputs and outputs. However, in industry applications, model-based ILC can perform better than without any system information ILC, where at least an approximate model is needed. This implies that the ILC may be regarded as a data-driven scheme which utilizes the historical inputs, outputs, and model-approximation to formulate a sequence of updating control inputs. Thus, the terminology of the data-driven ILC has been emerged, such as [1416].

A data-driven terminal ILC approach has been proposed for a kind of linear discrete-time-varying system in [14], where the convergence is derived under the assumption that the product of system matrix and approximated matrix is positively definite. In addition, [15] has presented a data-driven constrained norm-optimal ILC and then approximated the impulse response of the system by measurements of inputs and outputs for linear time-invariant systems. Further, [16] has presented a data-driven predictive ILC scheme based on a dynamic linearization technique for a class of discrete-time nonlinear systems, whose convergence is ensured under the requirement that the approximated matrix is diagonally dominant. The data-driven ILCs mentioned-above work well. However, the requirements that approximation matrix is positively definite or diagonally dominant, however, are quite rigorous. This may confine the feasibility of the schemes.

On the other hand, with the advancement of internet technology, networked control systems (NCSs) have been mushroomed ranging from industrial manufacturing to modern medical technology and so on owing to its lower cost, simple installation, easy maintenance, high reliability, and convenient source sharing [17]. However, in NCSs, the communication delay or data dropout is inevitable due to communication constraints, network congestions, or other factors. These may influence the stability and performance of the controlled system. In the field of NCSs, the major focus is to compensate the communication delayed or dropped data by some appropriate techniques so as to maintain the performance of systems [18]. In particular, for the case when the ILC scheme is implemented through the network to compensate the data, the mode is regarded as a networked ILC scheme, such as [1922]. For the facet, [19] has first introduced a networked ILC scheme to deal with the random data delays and dropouts, where the convergence and stability were analyzed in the mean-square sense for discrete linear time-invariant systems. Thereafter, [20] has proposed a networked ILC approach to a class of nonlinear systems for the network-communicated input and output signals with constant time delays and stochastic packet loss. Recently, in [21], a compensated ILC has been provided for a class of nonlinear systems with random one-step communication delays. Further, two types of compensation schemes are employed for linear discrete-time stochastic systems with one-step communication delays in [22].

However, the existing networked ILC schemes for handing communication delays are almost all required accurate system model within one-step time internal. The requirements of the system dynamics and one-step time delay are rigorous. It is worth minding that the data-driven ILC utilizes the multi-iteration inputs and outputs to construct the updating law in a recursive mode. This implies that the so-called open-loop ILC makes it possible to relax the communication delay within one-operation period. These motivate the paper to develop a data-driven optimal ILC scheme for a class of discrete linear time-varying systems with one-operation communication delays. Differing from the data-driven ILCs in [1416], this paper compensates for the delayed Bernoulli-type inputs and outputs by its previous-iteration synchronous data and analyzes the convergence of the approximation benefiting from matrix theory with no requirement of positively definite or diagonal dominance.

The paper is organized as follows. Section 2 firstly gives the description of networked control systems and provides compensations for one-operation communication delayed data in the form of super vectors and then develops the data-driven networked optimal ILC with the compensation strategy. In Section 3, the convergences of approximation discrepancy and tracking error are derived, respectively. Numerical simulations are illustrated in Section 4 to exhibit the validity and the effectiveness and the last Section 5 concludes the paper.

2. Data-Driven Networked Optimal ILC Scheme

2.1. Networked Control Systems and Data Compensations

Throughout the paper, the 2-norm for a vector is defined as and the induced 2-norm for a matrix is expressed as , where the superscript “” refers to the transpose operation and ,  , are eigenvalues of the matrix .

Consider a class of repetitive discrete linear time-varying SISO systems described as follows:where represents the sampling instant, is the total of instant numbers, and is the operation index. , , and are -dimensional state vector, scalar input, and scalar output at the th operation, respectively. , , and are unknown time-varying but bounded matrices with appropriate dimensions.

Given that ,  , is a predetermined desired trajectory, the data-driven networked optimal ILC scheme is constructed under the assumptions listed as follows.(A1)The desired trajectory ,  , is realizable; that is, there exist such desired state and desired input that(A2)For repetitive system (1), the initial state is resettable. Namely, the resetting condition holds for all operation indices , where is the initial desired state. Without loss of generality, we set .(A3)The stochastic communication data delays are subject to Bernoulli-type distributions but confined within one-operation period.

It is known that in a closed-loop networked control system, the output signal is transmitted from the sensor to the controller through output communication channel, and simultaneously, the input signal is delivered from the controller to the actuator through input communication channel, respectively. Likewise, the data-driven networked optimal iterative learning control (DDNOILC) scheme for system (1) works in the same mode. Its schematic paradigm is exhibited in Figure 1.

In Figure 1, denote as the current-iteration th-instant control input generated from the ILC controller which is transmitted to the actuator via input communication channel, whilst let be the current-iteration tth-instant control input of the actuator for plant stimulation, which is either equal to in the case when the data is timely transmitted in success with the probability or equal to the previous-iteration synchronous in the case when the transmission of data is delayed within one-operation period with the probability . Mathematically, a compensation for is expressed aswhere is independent stochastic variable subject to 0-1 Bernoulli-type distribution withwhere and are probability and expectation of , respectively. Parameter is a priori probability satisfying . The equation means that the signal is transmitted timely to the plant, whilst the equation implies that the signal is delayed and the previous-iteration signal is used to compensate for the delayed data.

Simultaneously, the employed data for the ILC updating is equal to either or in the Bernoulli-type switch. Mathematically, a compensation for is expressed aswhere is independent stochastic variable subject to 0-1 Bernoulli-type distribution with where parameter is a priori probability satisfying .

The implication of is similar with . Moreover, the variables and are independent of each other; that is,

DenoteThen, system (1) is reformulated in a super vector form as where is a lower triangular matrix specified as follows:

From (A1) and (A2), it is easy to obtain

Further, expressions (3) and (5) can be rewritten aswhere and is identity matrix.

Note that the lower triangular matrix is unknown but bounded due to the fact that matrices , , and are unknown but bounded.

2.2. Data-Driven Networked Optimal ILC

In terms of constructing an optimal ILC updating law of the control command in a recursive form for system (9), the ordinary way is to raise an objective function and to solve it by an approximate manner. This implies that the system matrix must be available. But, in usual, the system matrix is hardly to be available due to the complexity of the system modeling. Nevertheless, since the fundamental ILC scheme is to make use of the tracking error to upgrade its control input as the operation repeats, it is thus possible that, in composing the optimal ILC updating rule, the system matrix may be substituted by an appropriated one. Meanwhile, the approximation of the system matrix may be updated benefiting from the upgraded control input as well. This composes an interacted updating fashion of the system matrix and the control inputs detailed in the following.

Let be the th approximation of .

Denotewhere and are the th row of matrices and , respectively.

It is evident that is the th approximation of for all . There is no doubt that the postulation , for all , is equivalent to that of . Besides, description (9) is no other but .

For the purpose of generating an updating law to update the approximated vector in an optimal sense, consider an optimization problem asThe gradient of objective function (15) with respect to is expressed asThus, the gradient-type updating formula for is given bywhere is the updating step. Substituting (17) into (15), it hasObviously, the optimal step is . For ensuring the convergence and strengthening the practicability of the gradient-type algorithm (17), select aswhere is a relaxing factor satisfying , which is adopted for guaranteeing the denominator of being nonzero as .

Equivalently, by (17), is updated by the following:It is hopeful that updating law (20) may improve the approximation as the iteration number increases. The approximation discrepancy is defined as .

Mind that the purpose of the data-driven optimal ILC scheme is to construct an updating law of the control command in a recursive form in order to optimize an objective function. By taking the above-mentioned approximation into account, consider an optimization problem as follows:where is the tracking error defined as , meanwhile in is substituted by .

The gradient of (21) with respect to is derived asThus, the gradient-type algorithm for the control input is derived aswhere is learning gain. Substituting (23) into (21), it obtainsAs the matrix is symmetric and nonnegative definite, all its eigenvalues are nonnegative. Thus, it is easy to inducewhere is the th eigenvalues of and and are the maximal and minimal eigenvalues of , respectively.

Obviously, for ensuring the convergence of the gradient-type algorithm (23), is selected aswhere is a weighing factor.

Combining the above-mentioned updating laws (20) and (23) together, a data-driven optimal ILC (DDOILC) is constructed for system (9) as follows:

Remark 1. The control input updating law (28) is derived for the special case in Figure 1 when the communication delays do not occur as formulated as . This implies that the plant inputs are equal to the ILC updated inputs and, meanwhile, the plant outputs are equal to the employed outputs; that is, and , respectively.

Remark 2. It should be pointed that, for system (1), the system matrices , , and are assumed to be time-varying. For the circumstance, the parameter is lower triangular matrix. As such, for an ideal approximation, it is desirable to guarantee the matrix ,  , to be lower triangular. But this is not easily realizable. However, updating law (27) can insure that the discrepancy of the approximation from the system matrix is monotonously convergent to a bounded constant.

By considering (A3) for the cases and when the stochastic communication data delays are subject to Bernoulli-type distributions but confined within one-operation period, the ILC controller for system (9) with one-operation communication delays is designed aswhere is the tracking error defined as .

Consequently, a data-driven networked optimal ILC (DDNOILC) algorithm is developed for system (9) with one-operation communication delays as follows: and : given test signals;: arbitrarily given and nonsingular; and : set appropriately.Notice that the DDNOILC scheme (30)–(36) is only related to the data , , , and .

Remark 3. It is noticed that the derivations of updating laws (27) and (28) are from the gradient-type mechanism, which are prominently distinct from the existing data-driven ILCs in [1416] which searches Kuhn-Tucker point. The method avoids the complex computation of matrix inversion. Significantly, the derivations of the paper do not rigorously require that the approximation matrices must be diagonally dominant or the product of system matrix and approximated matrix is positively definite. Moreover, the developed DDNOILC scheme (30)–(36) may deal with the system model with unknown parameters and one-operation communication delays, but the accurate system information is required in [1922].

3. Convergence Analysis

In this section, the convergence of approximation discrepancy is discussed in Theorem 4 and the convergences of tracking errors of algorithm DDOILC (27) plus (28) and DDNOILC (30)–(36) are analyzed in Theorems 6 and 7, respectively.

Theorem 4. Assume that the approximation matrix is updated by (27) or (32). Then the system matrix approximation discrepancy matrix is convergent to a bounded constant as the iteration increases.

Proof. Since the approximation discrepancy is defined as , then by (27) or (32), it follows thatTaking 2-norm on both sides of (37) yieldsIn addition, since is a real symmetric matrix, it yieldsSince and , then According to (38), (39), and (40), it is easy to haveHence, the approximation discrepancy consequence is convergent to a bounded constant; that is,where is a bounded constant.
This completes the proof.

Remark 5. It can be seen from (41) that the approximation does not go farther from the system matrix than along the operation. Inequality (40) implies that if is not equal to zero, then the approximation discrepancy is convergent to zero and the boundary is equal to zero. Further, if is equal to zero, then the approximation discrepancy is no longer declining. Thus the boundary is greater than zero. It is worth noting that if is very small but not equal to zero, then declines very slowly.

Theorem 6. Assume that the system matrix is nonsingular and communication delays do not occur in NCSs. Then the tracking errors of DDOILC scheme (27) plus (28) are monotonically convergent to zero if the parameter is appropriately chosen.

Proof. From (26) and (28), the tracking error is derived asIt is noticed that yields . Then,According to the concept of limit, expression (42) implieswhere is a bounded constant satisfying .
Further, from (45), we haveBy (46), choose appropriate parameter such thatwhere is a positive constant and .
Since is nonsingular, then the matrix is symmetrical and positively definite. Therefore, there exist an orthogonal matrix such that where with being positive eigenvalues of matrix . Thus, there exists an appropriate parameter such thatwhere is a positive constant which is less than unity.
Therefore, it follows from (47) and (49) thatConsequently, inequality (50) results inThus,This completes the proof.

Theorem 7. Assume that the system matrix is nonsingular and the DDNOILC algorithm (30)–(36) is applied to system (9). Then the expectation of tracking error is convergent to zero if the parameter is properly selected and the prior probabilities and are appropriately given.

Proof. Denote ,  , and .
From (9), (11), and (30), it yieldsMeanwhile, by (33), it is easy to getFrom (35), (36), (53), and (54), we haveSince , thenwhere and .
Calculating mathematical expectation on both sides of (55) obtainsTaking 2-norm in both sides of (57) reduceswhereThenThus, it can be derived asAccording to (45) and (46), choosing appropriate parameter satisfieswhere is a positive constant satisfying .
Since all the eigenvalues of matrix are positive, there exists an appropriate parameter such thatwhere .
Thus, from (61), (62), and (63), we haveBy (58) and (64), it haswhere .
Then,According to the concept of limit, we haveFrom (53), (54), and (55), it is easy to deduce thatTaking 2-norm on both sides of (68), we haveSince the matrix is bounded. By (68) and (69), we obtain the conclusionThis completes the proof.

Remark 8. Notice that Theorem 7 displays that the tracking errors of DDNOILC algorithm converge to zero when the one-operation compensation approaches (3) and (5) are applied into system (9). In [21, 22], one-step compensation approach has been involved and the tracking errors have been proved to be bounded. Thus, the presented DDNOILC in this paper is effective. The detailed comparisons between one operation and one step are demonstrated in the next section.

4. Numerical Simulations

In microelectronics manufacturing, the rapid thermal processing is regarded as a repetitive batch process [23]. The ILC scheme is adequately to be utilized so that the transient temperature of the reactor to follow a desired trajectory. Suppose that the transfer function of the reactor is identified as , where is the process gain, denotes the heating time constant of the crystal, and denotes the heating time constant of the crystal light. Conventionally, the power ratio of the crystal light is tuned by a proportional-derivative-integral (PID) controller. Given that the transfer function of the PID controller is , where , , and are proportional, integral, and derivative gains, respectively. By converting the dynamics of frequency domain into that of time domain and then discretizing the PID-controller-tuned closed-loop control system with the sampling step , the discrete-time system is described as follows:where

Set the parameters as , , , , , and which is time-varying.

The desired trajectory is defined as , . Set the initial states as ,  . The beginning control input is chosen as .

Select a group parameters as and such that convergent conditions (50) and (64) are satisfied under the given .

In terms of mathematical expectation, the simulations are made for 100 runs, where the notion “one run” means that the proposed ILC scheme is in processing until the perfect tracking is achieved. Thus, and are computed aswhere the superscript stands for the index of the experiment runs.

In [16], for ensuring the convergence of the presented ILC algorithm, the approximation matrix is required to be diagonally dominant such thatwhere and are positive constants. For comparison with DDOILC and DDNOILC, the algorithms in [16] are denoted as DD-DDOILC and DD-DDNOILC, respectively.

In [21, 22], one-step communication delays are discussed as follows:For comparison with DDNOILC, the algorithm in [21, 22] is abbreviated as OS-DDNOILC.

In this section, the approximation behavior and tracking performance of the DDOILC and DDNOILC are simulated by the following cases.

Case 1 (set and ). Let

This case means that no communication delays occur in Figure 1; namely, the OS-DDNOILC is no other than the DDOILC. Thus, it is testified that convergent condition (50) of the proposed DDOILC is guaranteed by setting . It is observed that, for the given matrix , the parameters in (74) may be chosen as , , and . But they break the inequality . This implies that the given matrix does not satisfy the convergent condition of the DD-DDOILC in [16].

Figure 2 displays that the convergence of the approximation discrepancy of DDOILC and OS-DDNOILC is descending but bounded, whilst Figure 3 exhibits that the tracking errors of both the DDOILC and the OS-DDNOILC are monotonously convergent to a bounded constant. This conveys that the convergent condition (74) for the existing DD-DDOILC algorithm is sufficient but not necessary.

Figure 4 demonstrates the outputs of DDOILC at the 5th and 10th iterations, respectively, where the dash curve refers to the desired trajectory, the solid curve plots the output at the 5th iteration, and the dotted one shows the output at the 10th iteration, respectively.

Case 2 (set and ). Let

This case implies that the one-operation communication delays occur in Figure 1. It is computed that convergent condition (64) of the proposed DDNOILC is satisfied by setting . Meanwhile, all the inequalities in formulation (74) are ensured by selecting , , and . This implies that the convergent conditions for both the proposed DDNOILC and the DD-DDNOILC are guaranteed.

Figure 5 manifests the convergences of the approximation discrepancy of DDNOILC, DD-DDNOILC, and OS-DDNOILC, respectively, which conveys that the approximation discrepancies are bounded. However, it is seen that the approximation discrepancies of DDNOILC and OS-DDOILC are monotonously convergent to a bounded constant but that of the DD-DDNOILC is oscillatory with higher magnitude.

Figure 6 gives the convergences of the tracking errors of DD-DDNOILC, OS-DDNOILC, and DDNOILC measured in the forms of and log10, respectively, where is the base-10 logarithm of . It is observed that the tracking error of the proposed DDNOILC is slightly less than that of the DD-DDNOILC, but the tracking error of the OS-DDNOILC is bounded.

Figure 7 exhibits the outputs of DDNOILC at the 5th and 10th iterations, respectively.

5. Conclusion

In this paper, a kind of data-driven optimal ILC (DDOILC) scheme is constructed for a class of discrete linear tine-varying systems. The scheme generates two sequences of system matrix approximations and upgraded control inputs in an interacted mode by solving sequential iteration-varying minimization problems. The paper achieves not only the faster convergence of the tracking error, but also the perfect approximation of the system matrix. The approximation processing may be to some extent regarded as a kind of system identification. This owes to the efficacious harness of the multioperation inputs and outputs. Further, the paper presents a data-driven networked optimal ILC (DDNOILC) strategy for the system with one-operation communication delays. Benefited from the multioperation feature of ILC-driven systems, the strategy relaxes the communication delays from one-step sampling time interval to one-iteration period. As the mending scheme for the delayed data is synchronous for two adjacent iterations, the zero-error convergence is guaranteed. This turns to be the significance of the ILC mechanism. However, how to investigate the tracking behavior of DDOILC and/or DDNOILC for discrete stochastic nonlinear systems with one-operation communication delays is challenging.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this article.

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grants F010114-60974140 and 61273135 and the National Research Foundation Korea Grant NRF-2013R1A2A2A01067449.