Accelerated Iterative Learning Control for Linear Discrete Systems with Parametric Perturbation and Measurement Noise

2022-07-30 01:31

Xiaoxin Yang and Saleem Riaz

1 School of Energy and Architecture, Xi’an Aeronautical University, Xi’an, 710077, China

2 School of Automation, Northwestern Polytechnical University, Xi’an, 170072, China

ABSTRACT An iterative learning control algorithm based on error backward association and control parameter correction is proposed for a class of linear discrete time-invariant systems with repetitive operation, parameter perturbation, and measurement noise, taking the PD type as an example. First, the concrete form of the accelerated learning law is presented, together with a detailed description of how the control correction is obtained in the algorithm. Second, the convergence of the algorithm is proved rigorously using the vector method combined with spectral radius theory, and sufficient conditions for convergence are given for four scenarios: determined parameters without measurement noise, uncertain parameters without measurement noise, determined parameters with measurement noise, and uncertain parameters with measurement noise. Finally, the theoretical results show that the convergence rate depends mainly on the size of the controlled object, the learning parameters of the control law, the correction coefficient, the association factor and the learning interval. Simulation results show that the proposed algorithm converges faster than the traditional PD algorithm under the same conditions.

KEYWORDS Iterative learning control; monotone convergence; convergence rate; gain adjustment

1 Introduction

Switched systems have gradually become one of the most actively researched topics in the field of control in recent years [1]. A switched system is an important type of hybrid system that consists of a set of differential or difference equations and switching rules that change with actual environmental factors, enabling the whole system to switch between different subsystems to adapt to different operating conditions and improve system performance [2]. Therefore, switched systems are widely used in practical engineering, such as traffic control systems [3], power systems, circuit systems [4], and network control systems [5]. At present, many research results on switched systems focus on stability [6], whereas results on output tracking control are very limited [7]. The reason is that tracking control of such a system is much more difficult to achieve than stabilization.

Iterative learning control [8] has a simple structure, does not require specific model parameters, and can make the behavior of the controlled object meet the expected requirements after enough iterations over a finite interval. Owing to these characteristics, this learning algorithm has been widely employed in the control of rigid robot arms [9], batch processing in the process industry [10], aerodynamic systems [11], traffic control systems [12], electrical and power systems, and other areas [13]. However, most scholars focus on control problems for non-switched systems, and research on iterative learning control of switched systems is limited [14].

In industrial applications, the parameters of the controlled system are usually time-varying, so classical PID and combined PID-like control schemes are particularly inflexible when dealing with systems with uncertain factors [15–17]. In addition, the analysis and design process of some existing modern control schemes [18] is complex and difficult. To solve these problems, the designed control algorithm and structure should be simple and easy to implement, and the control scheme should offer nonlinearity, robustness, flexibility and learning ability. With the rapid development of intelligent control technology for handling the uncertainty and complexity of the controlled object, some neural network models and training schemes have been applied to the design of system controllers [19]. For example, Plett [20] discussed how a neural network used as a feedforward controller learns to imitate the inverse of the controlled object. However, neural networks suffer from slow learning speed and weak generalization ability, and there is no systematic method for determining their topology. Without timely and stable control and compensation, system noise and random interference appear at the input of the controller, which greatly reduces the stability of the adaptive process and seriously affects the control accuracy. Adaptive filtering has been widely developed [21,22], and neural networks are the most commonly used tool in nonlinear filtering. However, they are highly nonlinear in their parameters [23–25].

The scholars mentioned above study model uncertainty in different fields such as model prediction, system identification [26], fault detection [27], motor control [28], and nonlinear control [29]. Still, there is no specific control algorithm that achieves satisfactorily fast error convergence while explicitly considering system coupling, uncertainty, time-varying characteristics, measurement noise and other factors. Adaptive control strategies proposed in the literature [30–32] can compensate for these factors to some extent. Adaptive control is mainly used to deal with complex nonlinear systems with unknown parameters. Based on Lyapunov stability theory, a parameter update law is designed to achieve system stabilization and asymptotic tracking of the target trajectory [33,34]. Remarkable progress has been made both for special nonlinear systems that are linear in their parameters [35,36] and for nonlinear systems with general structures [37]. For systems that cannot be modeled or contain unmodeled states, the literature [38,39] proposed model-free adaptive control theory. However, these adaptive control methods cannot solve the problem of complete tracking over a finite time interval [40].

Based on the above analysis, this paper focuses on a class of discrete time-invariant arbitrary systems that perform repeated tracking tasks along an expected trajectory over a finite time interval. Taking the PD type as an example, under the condition that the switching sequence is randomly determined and does not change with the iteration, the characteristics of iterative learning control are used to construct a discrete iterative learning control algorithm with error backward association that corrects the control quantity of the next iteration. Using hypervector and spectral radius theory, the convergence of the algorithm is discussed, and sufficient conditions for convergence are given.

The remainder of the article is organized as follows. The problem formulation is described in Section 2. The convergence analysis, including the hypervector and spectral radius theory and the sufficient conditions for error convergence, is elaborated in Section 3. Section 4 presents numerical examples that validate the proposed algorithm. Finally, the results of this paper are summarized in Section 5.

2 Problem Formulation

Consider the following class of linear discrete time-invariant single-input single-output systems with repetitive parameter perturbation and measurement noise over a finite period:

x_k(t+1) = (A + ΔA(t)) x_k(t) + (B + ΔB(t)) u_k(t)

y_k(t+1) = C x_k(t+1) + n_k(t+1)    (1)

where t ∈ {0,1,...,N−1}, N ∈ Z+, the subscript k is the iteration number, and x_k(t) ∈ R^n, u_k(t) ∈ R, y_k(t) ∈ R are the state, input and output of the system, respectively. A, B and C are constant matrices of the corresponding dimensions satisfying the condition. n_k(t+1) ∈ R is the measurement noise of the system, and ΔA(t) and ΔB(t) are the uncertainties of the system matrix and of the input matrix at time t, such that

ΔA(t) = E_1 P(t) F_1

ΔB(t) = E_2 Q(t) F_2

Here, (E_1, F_1) and (E_2, F_2) are constant matrices of corresponding dimensions that define the structure of the uncertain state matrix and the uncertain input matrix; P(t) and Q(t) are unknown matrices satisfying P^T(t)P(t) ≤ I and Q^T(t)Q(t) ≤ I.
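As a minimal sketch of this uncertainty structure, the snippet below draws one admissible P(t) and forms ΔA(t) = E_1 P(t) F_1. The matrices `E1` and `F1`, the helper names and the random scaling are hypothetical choices made for illustration; they are not values from the paper.

```python
import numpy as np

def sample_contraction(rows, cols, rng):
    """Draw a random matrix P with P^T P <= I (spectral norm at most 1)."""
    M = rng.standard_normal((rows, cols))
    sigma_max = np.linalg.norm(M, 2)          # largest singular value
    if sigma_max == 0.0:
        return M                              # the zero matrix is admissible
    return rng.uniform(0.0, 1.0) * M / sigma_max

def structured_perturbation(E, F, rng):
    """Return (E @ P @ F, P) for one admissible unknown matrix P."""
    P = sample_contraction(E.shape[1], F.shape[0], rng)
    return E @ P @ F, P

rng = np.random.default_rng(0)
E1 = np.array([[0.1], [0.0]])    # hypothetical structure matrices
F1 = np.array([[0.0, 0.2]])
dA, P = structured_perturbation(E1, F1, rng)
```

Because the spectral norm of P(t) is at most 1, the norm of ΔA(t) is bounded by the product of the norms of E_1 and F_1, so these two matrices fix both the shape and the maximum size of the perturbation.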

In the iterative learning process of system (1), the expected trajectory is y_d(t+1), which does not change from iteration to iteration; the corresponding expected state is x_d(t), and the corresponding expected control input is u_d(t). The following assumptions are made:

Assumption 1. In each iteration, the initial state equals the ideal initial state, i.e., x_k(0) = x_d(0).

Assumption 2. The expected trajectory y_d(t+1), t ∈ {0,1,...,N−1}, is given in advance and is independent of the iteration number.

Assumption 3. For any given expected trajectory y_d(t+1), there exist an expected state x_d(t) and an expected control signal u_d(t) such that

For ease of description, the above expression is written in hypervector form by introducing the hypervectors:

The above equation can then be described as

where

* denotes an uncertain value determined by the dynamics and the uncertain parameters of system (1).
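For the nominal case (ΔA = ΔB = 0, no noise), the hypervector form reduces to the relation Y_k = G U_k + V x_k(0) used in the convergence analysis. The sketch below builds G and V from the Markov parameters C A^i B and checks them against a step-by-step simulation. The function names and the plant matrices are hypothetical and serve only as an illustration.

```python
import numpy as np

def lifted_matrices(A, B, C, N):
    """Return G (N x N) and V (N x n) with Y = G U + V x(0) for the nominal plant."""
    n = A.shape[0]
    G = np.zeros((N, N))
    V = np.zeros((N, n))
    A_pow = np.eye(n)                       # holds A^i at the start of step i
    markov = np.zeros(N)
    for i in range(N):
        markov[i] = (C @ A_pow @ B).item()  # Markov parameter C A^i B
        A_pow = A_pow @ A
        V[i, :] = (C @ A_pow).ravel()       # row i is C A^(i+1)
    for i in range(N):
        G[i, : i + 1] = markov[i::-1]       # lower-triangular Toeplitz rows
    return G, V

def simulate(A, B, C, u, x0):
    """Step the nominal plant and return [y(1), ..., y(N)]."""
    x = np.asarray(x0, dtype=float).copy()
    y = np.zeros(len(u))
    for t in range(len(u)):
        x = A @ x + B.ravel() * u[t]
        y[t] = (C @ x).item()
    return y

# Hypothetical second-order plant, used only for illustration.
A = np.array([[0.3, 0.1], [0.0, 0.2]])
B = np.array([[0.0], [1.0]])
C = np.array([[0.0, 1.0]])
N = 5
G, V = lifted_matrices(A, B, C, N)
u = np.linspace(-1.0, 1.0, N)
x0 = np.array([0.5, -0.2])
y_lifted = G @ u + V @ x0                   # same values as simulate(A, B, C, u, x0)
```

G is lower triangular because the output at time t+1 depends only on inputs up to time t, and its diagonal entry is CB, which is the quantity that later appears in the convergence condition.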

For system (1), under Assumptions 1–3, consider a control rule with error backward association and subsequent control quantity correction:

The correction applied by the errors before time t to the control quantity at the current time t is

The PD-type iterative learning control rule is

where u_{k+1}(t) is the control quantity at time t of the (k+1)th iteration, u_k(t) is the control quantity at time t of the kth iteration, and the term defined in (5a) is the correction of the control quantity at time t in the kth iteration. β is the proportional gain of the PD-type learning rule, and γ is its differential gain. The tracking error is defined as e_k(t+1) = y_d(t+1) − y_k(t+1). The goal of iterative learning control is to find, through a learning algorithm, a control signal sequence {u_k(t)} such that the output trajectory y_k(t+1) of the controlled system (1) converges asymptotically to the expected trajectory y_d(t+1) as the number of iterations increases, namely y_k(t+1) → y_d(t+1) as k → ∞ for t ∈ {0,1,...,N−1}.
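A minimal sketch of a baseline PD-type update without the association correction is given below. It assumes the common form u_{k+1}(t) = u_k(t) + β e_k(t) + γ [e_k(t+1) − e_k(t)], chosen because it yields the factor |1 − γCB| quoted for the traditional algorithm in Section 4. The function name and the array layout are illustrative assumptions, and the correction term of (5a) is deliberately left out.

```python
import numpy as np

def pd_ilc_update(u, e, beta, gamma):
    """One PD-type ILC update (assumed baseline form, no association correction).

    u : inputs u_k(0..N-1), shape (N,)
    e : errors e_k(0..N),   shape (N + 1,)
    """
    u = np.asarray(u, dtype=float)
    e = np.asarray(e, dtype=float)
    # Proportional term on e_k(t), difference term on e_k(t+1) - e_k(t).
    return u + beta * e[:-1] + gamma * (e[1:] - e[:-1])

# Three time steps, zero initial input, gains taken from the numerical example.
u_next = pd_ilc_update(u=[0.0, 0.0, 0.0], e=[0.0, 1.0, 0.5, 0.25],
                       beta=0.15, gamma=0.25)
# u_next is [0.25, 0.025, 0.0125]
```

In this form the coefficient of e_k(t+1) is γ, so the input at time t reacts to the error it directly causes one step later.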

The correction of the control quantity (5a) is explained in detail below, as shown in Fig. 1. In the learning process of the kth iteration, the error e_k(1) at point 1 corrects the control quantity at N moments in the (k+1)th iteration, and the correction amounts are shown in Table 1.

Figure 1: e_k(1) corrects the control quantity at a certain time in the (k+1)th iteration

Table 1: Association correction of e_k(1) to the control quantity at each moment in the next iteration

The error e_k(2) at point 2 corrects the control quantity at N−1 moments in the (k+1)th iteration, as shown in Fig. 2. The correction quantities are shown in Table 2.

Table 2: Association correction of e_k(2) to the control quantity at subsequent moments

Figure 2: e_k(2) at point 2 corrects the control quantity at N−1 moments in the (k+1)th iteration

Continuing in this way up to point N, the error e_k(N) corrects only the control quantity at moment N in the (k+1)th iteration, as shown in Fig. 3. The correction quantity is.

Figure 3: e_k(N) corrects the control quantity at moment N in the (k+1)th iteration

According to the above analysis, the correction of each error to the control quantity at the following moments can be tabulated. The correction is the accumulation of the corrections from all previous moments (see Table 3), as

Table 3: Correction of the control quantity by the error at each moment

which is consistent with Eq. (5a).
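The accumulation described above can be sketched as a lower-triangular weighting of the error hypervector. The kernel w(d) = L·κ^(−d) used here, with `L` standing in for the correction factor and `kappa` for the association factor, is only an assumed monotonically decreasing example. The weights actually used by the paper are those of Eq. (5a) and Tables 1–3.

```python
import numpy as np

def association_matrix(N, L, kappa):
    """Lower-triangular weights: W[i, j] is the weight of e_k(j+1) on moment i+1.

    The decaying kernel L * kappa**(-(i - j)) is a hypothetical choice.
    """
    W = np.zeros((N, N))
    for i in range(N):
        for j in range(i + 1):
            W[i, j] = L * kappa ** (-(i - j))
    return W

N = 5
W = association_matrix(N, L=0.25, kappa=2.0)
e = np.arange(1.0, N + 1)        # stand-in errors e_k(1..N)
correction = W @ e               # accumulated correction at each moment

# Column j has N - j nonzero entries: e_k(1) reaches N moments, e_k(N) only one.
reach = [int(np.count_nonzero(W[:, j])) for j in range(N)]
```

Writing the correction as a matrix product makes the structure of Figs. 1–3 visible: each column is the footprint of one error, and each row is the accumulation that one moment receives.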

3 Convergence Analysis

3.1 Case of a Determined Model Without Measurement Noise

Theorem 1 Consider the linear discrete time-invariant single-input single-output system (1). Suppose that Assumptions 1–3 are satisfied, the system model is determined, and there is no measurement noise, i.e., ΔA(t) = 0, ΔB(t) = 0, n_k = 0. When the PD-type accelerated iterative learning control algorithm (5) with association correction is adopted, if the selected learning parameter matrix satisfies

then the output trajectory converges uniformly to the expected trajectory, that is, y_k(t+1) → y_d(t+1) as k → ∞, t ∈ {0,1,...,N−1}.

Proof According to the iterative learning control algorithm (5), in the (k+1)th iteration the control quantity at each moment in the interval [0, N−1] can be represented as

If we introduce the following hypervector

E_k = [e_k(1), e_k(2), ..., e_k(N)]^T

then we have

where

Since the model is determined and has no measurement noise, i.e., ΔA(t) = 0, ΔB(t) = 0, n_k = 0, Eq. (4) can be written as Y_k = G U_k + V x_k(0). Combining x_{k+1}(0) = x_k(0) = x_d(0) (Assumption 1) with Eq. (8), the error sequence can be derived as

It follows that the necessary and sufficient condition for the convergence of the system is

The theorem is proved. The convergence here concerns the output over the finite time interval t ∈ [0, T] across iterations: as the number of iterations k → ∞, the tracking error of the system tends to zero. After each input update, the output follows the expected trajectory more closely within the finite time interval t ∈ [0, T] specified for the system, and ultimately the expected trajectory y_d is tracked perfectly. The simulation section shows the system output for different iterations. Once the tracking error has converged, after 15 or more iterations, the output follows the expected trajectory y_d precisely, which indicates that the proposed algorithm is robust and accurate.

3.2 Case of the Undetermined Model without Measurement Noise

Theorem 2 Consider the linear discrete time-invariant single-input single-output system (1). Suppose that Assumptions 1–3 are satisfied and the system model is uncertain but there is no measurement noise, i.e., ΔA(t) ≠ 0, ΔB(t) ≠ 0, n_k = 0. When the PD-type accelerated iterative learning control algorithm (5) with association correction is adopted, if the selected learning parameter matrix satisfies

The theorem is proved.

3.3 Case of the Determined Model with Measurement Noise

If the model is determined and has measurement noise, i.e., ΔA(t) = 0, ΔB(t) = 0, n_k ≠ 0, Eq. (4) can be written as Y_k = G U_k + V x_k(0) + n_k. Combining x_{k+1}(0) = x_k(0) = x_d(0) (Assumption 1) with Eq. (8), the error sequence can be derived as

3.4 Case of Undetermined Model with Measurement Noise

Drawing on human associative thinking, this paper proposes a new type of association iterative learning control algorithm which, with the help of a kernel function (a monotonically decreasing function), uses the information at the present time to predict and correct the future control input in the current iteration. The information at the current time corrects the subsequent, not yet learned times: the closer a time is to the current time, the greater the influence, and the farther away, the smaller the influence. The kernel function thus makes the association iterative learning algorithm more reasonable. In the theoretical convergence proof, the kernel function is eliminated, so it is not reflected in the convergence condition. It is proved that the association algorithm and the traditional iterative learning control have the same convergence conditions, but the simulation results in Section 4 show that the algorithm converges much faster than the traditional iterative learning algorithm.

4 Numerical Examples

In order to verify the validity of the associative correction learning rule proposed in this paper, a class of linear discrete time-invariant single-input single-output systems with repetitive parameter perturbation and measurement noise over a finite time period is considered:

4.1 Case of Determined Model without Measurement Noise

If the system model is determined and there is no measurement noise, i.e., ΔA(t) = 0, ΔB(t) = 0, n_k = 0, then according to Theorem 1 the necessary and sufficient condition for system convergence is

Let the iterative proportional gain β = 0.15, the differential gain γ = 0.25, the association factor k = 2, the correction factor L = 0.25, and the number of discrete time steps N = 40. The calculation results show that

satisfies the convergence condition.

If L = 0, the above algorithm degenerates to the traditional PD-type iterative learning control algorithm, whose convergence condition is ρ′ = |1 − γCB| = 0.75 < 1, and ρ < ρ′. According to spectral radius theory, the smaller the spectral radius, the faster the iterative learning algorithm converges.
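The value ρ′ = |1 − γCB| can be reproduced from the lifted error map E_{k+1} = (I − GΓ) E_k. The sketch below does so under two assumptions: the baseline update u_{k+1}(t) = u_k(t) + β e_k(t) + γ [e_k(t+1) − e_k(t)], and a hypothetical plant with Markov parameters 0.2^i, so that CB = 1. Neither is taken from the paper.

```python
import numpy as np

def baseline_iteration_matrix(markov, beta, gamma):
    """Return I - G @ Gamma for the assumed baseline PD-type update."""
    N = len(markov)
    G = np.zeros((N, N))
    for i in range(N):
        G[i, : i + 1] = markov[i::-1]            # G[i, j] = C A^(i-j) B
    # gamma acts on e_k(t+1) (diagonal), beta - gamma on e_k(t) (subdiagonal).
    Gamma = gamma * np.eye(N) + (beta - gamma) * np.eye(N, k=-1)
    return np.eye(N) - G @ Gamma

def triangular_spectral_radius(M):
    """Spectral radius of a lower-triangular matrix, read from its diagonal."""
    if not np.allclose(M, np.tril(M)):
        raise ValueError("matrix is not lower triangular")
    return float(np.max(np.abs(np.diag(M))))

N = 40
markov = 0.2 ** np.arange(N)                     # hypothetical plant, CB = 1
M = baseline_iteration_matrix(markov, beta=0.15, gamma=0.25)
rho_prime = triangular_spectral_radius(M)        # equals |1 - gamma * CB|
```

The radius is read from the diagonal rather than from a general eigenvalue routine because the matrix is triangular with a single repeated eigenvalue, a case in which numerically computed eigenvalues can be badly perturbed.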

The expected trajectory is y_d(t+1) = sin(8t/25), t ∈ {0,1,...,39}, the initial condition is x_k(0) = 0, ∀k ∈ Z+, and the initial control input is u_1(1) = 0. When the accelerated PD-type learning rule proposed in this paper is applied, the trend of ‖E_k‖∞ from the first to the 50th learning iteration is shown in Fig. 4. The algorithm ensures that ‖E_k‖∞ converges to 0. Fig. 5 shows the system output after the first, fourth, seventh and 11th iterations, where the convergence of the algorithm can be seen in more detail.
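A minimal simulation sketch of this setup is shown below for the baseline (L = 0) PD-type rule only. The plant matrices are hypothetical, chosen so that CB = 1, and the update form u_{k+1}(t) = u_k(t) + β e_k(t) + γ [e_k(t+1) − e_k(t)] is an assumption. The association correction is not included, so the resulting error values are not expected to match the paper's figures.

```python
import numpy as np

def run_baseline_pd_ilc(A, B, C, y_d, beta, gamma, iterations):
    """Return the sup-norm tracking error of each trial (nominal plant, x_k(0) = 0)."""
    N = len(y_d)
    u = np.zeros(N)                              # zero input for the first trial
    history = []
    for _ in range(iterations):
        x = np.zeros(A.shape[0])                 # identical initial state each trial
        y = np.zeros(N)
        for t in range(N):                       # one trial over t = 0..N-1
            x = A @ x + B.ravel() * u[t]
            y[t] = (C @ x).item()                # y_k(t+1)
        e = np.concatenate(([0.0], y_d - y))     # e_k(0) = 0, then e_k(1..N)
        history.append(float(np.max(np.abs(e))))
        u = u + beta * e[:-1] + gamma * (e[1:] - e[:-1])
    return history

t = np.arange(40)
y_d = np.sin(8 * t / 25)                         # y_d(t+1) = sin(8t/25), t = 0..39
A = np.array([[0.3, 0.1], [0.0, 0.2]])           # hypothetical plant with CB = 1
B = np.array([[0.0], [1.0]])
C = np.array([[0.0, 1.0]])
errors = run_baseline_pd_ilc(A, B, C, y_d, beta=0.15, gamma=0.25, iterations=50)
```

Plotting `errors` against the iteration index gives a curve of the same kind as the traditional PD-type curve in the error comparison figure, although for a different plant.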

Figure 4:The variation trend of the norm of error of accelerated ILC algorithm with the increasing number of iterations

Figure 5: Output trajectory and expected trajectory (a) After the first iteration (b) After the fifth iteration (c) After the seventh iteration (d) After the 11th iteration

When the traditional PD-type learning rule is applied, the iterative proportional gain β = 0.15 and the differential gain γ = 0.25 remain unchanged during the learning process. The trend of ‖E_k‖∞ from the first to the 50th learning iteration is shown in Fig. 6, which also includes the trend of ‖E_k‖∞ for the accelerated algorithm proposed in this paper. For an allowable error ε = 0.01, the traditional PD-type algorithm needs 13 iterations, whereas the accelerated PD-type iterative learning algorithm needs 6. For an allowable error ε = 0.001, the traditional PD-type algorithm needs 25 iterations, whereas the accelerated algorithm needs 11. The convergence of the system is thus significantly faster with the PD-type accelerated ILC algorithm proposed in this paper.

Figure 6:Error comparison between the traditional algorithm and the accelerated algorithm

4.2 Case of the Undetermined Model with Measurement Noise

If the system model is not determined and contains measurement noise, that is, ΔA(t) ≠ 0, ΔB(t) ≠ 0, n_k ≠ 0, then according to Theorem 4 the sufficient condition for the system output to converge to a neighbourhood of the expected trajectory is

where |Φ_i(t)| < 1, t ∈ {0,1,...,N−1}, i = 1,2. In the simulation, Φ_1(t) and Φ_2(t) are generated by the random function rand(*), and the measurement noise n_k = 0.08 cos(rand(*)) is randomly generated. The parameters of algorithm (5) are as follows: iterative proportional gain β = 0.15, differential gain γ = 0.25, association factor k = 2, correction factor L = 0.25, and number of discrete time steps N = 40. The simulation results indicate that

meet the convergence condition,whileΔB(t)=(0<0.0013)T.
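The random quantities of this example can be generated as in the sketch below, which assumes that rand(*) denotes a uniform draw on [0, 1), as in MATLAB. The seed and variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)      # seed chosen only for reproducibility
N = 40

# Perturbation coefficients with |Phi_i(t)| < 1, one value per time step.
phi1 = rng.random(N)
phi2 = rng.random(N)

# Measurement noise n_k = 0.08 * cos(rand(*)) for one trial.
noise = 0.08 * np.cos(rng.random(N))

lower_bound = 0.08 * np.cos(1.0)    # about 0.0432
upper_bound = 0.08
```

Under that assumption the argument of the cosine lies in [0, 1), so every noise sample falls between roughly 0.0432 and 0.08. The noise is therefore bounded and strictly positive rather than zero-mean, which fits the observation that the tracking error settles in a neighbourhood of zero instead of vanishing.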

The expected trajectory is y_d(t+1) = sin(8t/25), t ∈ {0,1,...,39}. The initial state is x_k(0) = 0, ∀k ∈ Z+, and the initial control input is u_1(1) = 0. When the accelerated PD-type learning law proposed in this paper is applied, the trend of ‖E_k‖∞ from the first to the 50th iteration is shown in Fig. 7. The algorithm ensures that ‖E_k‖∞ converges to a small neighbourhood of 0.

Figure 7:The variation trend of the norm of error of accelerated ILC algorithm with increasing number of iterations

To observe the convergence of the output trajectory, Fig. 8 compares the system output with the expected trajectory after the first, fourth, seventh and 11th iterations.

Figure 8: Comparison of system output and desired trajectory (a) After the first iteration (b) After the fourth iteration (c) After the 7th iteration (d) After the 11th iteration

If we take L = 0, the above algorithm reduces to the traditional PD-type iterative learning control algorithm with its corresponding convergence condition, and ρ < ρ′. According to spectral radius theory, the smaller the spectral radius, the faster the iterative learning algorithm converges. Therefore, the association correction iterative learning control algorithm proposed in this paper converges faster. The trend of ‖E_k‖∞ from the first to the 50th iteration for the two algorithms is shown in Fig. 9.

Figure 9:Comparison of error convergence rate between traditional PD-type learning rule and accelerated learning rule

It can be seen from Fig. 9 that the system tracking error does not converge to 0 but to a bound. By Theorem 4, ‖E_{k+1}‖ → pε, t ∈ {0,1,...,N−1}. Fig. 9 also shows that the convergence of the system is significantly faster with the PD-type accelerated ILC algorithm proposed in this paper.

Table 4 shows that the tracking error of the P, D, PD and proposed accelerated PD-type ILC laws in the first iteration is 1.1217316. After 15 iterations, the error is 0.062823 for the P-type law, 0.07538 for the D-type law, 0.024335 for the PD-type law, and 0.003683 for the proposed accelerated PD-type law. Reading down the columns of Table 4, the tracking error of every ILC law decreases steadily as the iteration number increases. Reading across the rows, the tracking error of the proposed accelerated PD law is the smallest among the ILC laws (P, D, PD) at the same iteration number. Therefore, Table 4 indicates that the proposed accelerated PD law converges significantly faster than the other traditional laws.

Table 4:Comparison of ILC laws with proposed accelerated PD-type ILC law

The auto-associative ILC proposed in this paper builds on the traditional ILC by using the current information to estimate the future input. Compared with traditional ILC, the new algorithm is characterized as follows: in each trial, the not yet learned times are pre-corrected with the information at the current time. The algorithm can reduce the number of iterations and accelerate learning convergence. It differs from the traditional discrete closed-loop algorithm and the higher-order algorithm as follows:

(1) Although the algorithm proposed in this paper is similar in form to the traditional closed-loop iterative learning algorithm, its principle is completely different from that of the traditional discrete closed-loop PD-type algorithm (feedback algorithm). The traditional discrete closed-loop ILC algorithm corrects the control input at the current time directly with the error at the previous time in the same trial. The algorithm proposed in this paper uses the error at the current time to pre-estimate the control input at all subsequent times that have not yet occurred, and thus plays the role of pre-correction.

(2) Although the associative iterative learning algorithm proposed in this paper is similar in form to the traditional higher-order discrete learning algorithm, its learning process is completely different. The traditional higher-order ILC is an algebraic superposition of the control information of the previous two or more trials at the corresponding time. The new iterative learning algorithm proposed in this paper pre-corrects the subsequent times that have not yet occurred with the error value at the current time in the same trial.

5 Conclusions

This paper investigates discrete linear time-invariant systems with parameter perturbation and measurement noise. It gives sufficient conditions for the convergence of a PD-type accelerated iterative learning algorithm with association correction in four cases: determined parameters without measurement noise, undetermined parameters without measurement noise, determined parameters with measurement noise, and undetermined parameters with measurement noise. Under the same simulation conditions, the spectral radius of the proposed algorithm is smaller than that of the traditional PD-type ILC algorithm. Convergence is proven theoretically with the help of hypervector and spectral radius theory, and numerical simulation shows the effectiveness of the proposed algorithm. The results show that the algorithm can fully track the expected trajectory over a finite interval when the system parameters are uncertain. When measurement noise is present, the system output converges to a neighborhood of the expected trajectory. In future studies, we will consider the stability and convergence of nonlinear discrete systems with parameter perturbations and measurement noise, as well as convergence under arbitrary bounded changes of the initial conditions.

Acknowledgement: On behalf of my co-authors, I declare that the work described is original research that has not been published previously and is not under consideration for publication elsewhere, in whole or in part. I confirm that no conflict of interest exists in submitting this manuscript, and that it is approved by all authors for publication in your journal.

Funding Statement: The authors received no specific funding for this study.

Conflicts of Interest: The authors declare that they have no conflicts of interest to report regarding the present study.