Next Article in Journal
Visual-Inertial Fusion-Based Five-Degree-of-Freedom Motion Measurement System for Vessel-Mounted Cranes
Previous Article in Journal
Experimental Study of an Industrial Data Transmission Network in the Automatic Control System of a Wind Turbine
Previous Article in Special Issue
Nonlinear Passive Observer for Motion Estimation in Multi-Axis Precision Motion Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

LSTM-Inversion-Based Feedforward–Feedback Nanopositioning Control

Department of Mechanical Engineering, Iowa State University, Ames, IA 50014, USA
*
Author to whom correspondence should be addressed.
Machines 2024, 12(11), 747; https://doi.org/10.3390/machines12110747
Submission received: 30 September 2024 / Revised: 19 October 2024 / Accepted: 21 October 2024 / Published: 22 October 2024
(This article belongs to the Special Issue Advances in Applied Mechatronics, Volume II)

Abstract

:
This work proposes a two-degree of freedom (2DOF) controller for motion tracking of nanopositioning devices, such as piezoelectric actuators (PEAs), with a broad bandwidth and high precision. The proposed 2DOF controller consists of an inversion feedforward controller and a real-time feedback controller. The feedforward controller, a sequence-to-sequence LSTM-based inversion model (invLSTMs2s), is used to compensate for the nonlinearity of the PEA, especially at high frequencies, and is collaboratively integrated with a linear MPC feedback controller, which ensures the PEA position tracking performance at low frequencies. Therefore, the proposed 2DOF controller, namely, invLSTMs2s+MPC, is able to achieve high precision over a broad bandwidth. To validate the proposed controller, the uncertainty of invLSTMs2s is checked such that the integration of an inversion model-based feedforward controller has a positive impact on the trajectory tracking performance compared to feedback control only. Experimental validation on a commercial PEA and comparison with existing approaches demonstrate that high tracking accuracies can be achieved by invLSTMs2s+MPC for various reference trajectories. Moreover, invLSTMs2s+MPC is further demonstrated on a multi-dimensional PEA platform for simultaneous multi-direction positioning control.

1. Introduction

Nanopositioning devices, such as piezoelectric actuators (PEAs), are broadly used in many high-precision industries, products, and systems due to their advantages of fast response and high motion resolution. As one of the most popular nanopositioning devices, PEAs have been implemented in various applications, such as atomic force microscopes (AFMs) [1,2], micro forming [3], and adaptive optics [4]. However, the operation of such a system at high frequency and/or large motion range still remains challenging because of the PEA’s nonlinear dynamics (e.g., hysteresis and creep), which is more pronounced when operations are performed under these conditions. As a result, the system operation bandwidth and motion range are rather limited for commercial applications, especially for real-time motion control applications.
To address these issues, efforts in PEA position tracking control have been reported. Feedforward controllers have been proposed to increase the control bandwidth by compensating for the PEA nonlinear dynamics. For example, model-based feedforward control designs have been implemented to improve control performance by compensating for PEA hysteresis, through inverse dynamics approximation [5,6]. To capture the frequency-dependent PEA hysteresis dynamics, rate-independent hysteresis models, such as the improved Preisach model [7], the modified Prandtl–Ishlinskii model [8], and the Bouc–Wen hysteresis model [9], have been implemented in the feedforward controller designs. The feedforward controller can then be combined with a real-time feedback controller to form a two-degree-of-freedom (2DOF) controller to increase the overall tracking accuracy and robustness [8,10]. In these 2DOF controllers, the feedforward part accounts for the nonlinear and high-frequency PEA dynamics, and the feedback controller takes care of the remaining errors to improve the overall control precision. Theoretically, the 2DOF controller should be able to deliver satisfactory trajectory tracking and positioning performance if both feedforward and feedback parts are well designed. However, due to the complicated PEA nonlinear dynamics, model parameter identification is quite challenging, and the trajectory tracking improvements obtained are rather limited [11], especially when system uncertainties exist.
With the recent development of deep learning algorithms, neural networks (NNs) have been widely proposed for system identification and controller design. With adequate training data, NNs can capture the entire PEA dynamics presented in the data quite well, not just limited to hysteresis. Previous studies [12] show a vanilla recurrent neural network (RNN) performs well at handling the time series input and is increasingly popular in controller design. However, a vanilla RNN still has its limits. For example, an RNN can potentially suffer from gradient vanishing or exploding issues [13,14,15] and is prone to overfitting. Furthermore, compared to the long short-term memory (LSTM), it cannot consider and understand the system’s long-term dependencies subject to sequential input, so the accuracy of a vanilla RNN in system dynamic identification can be limited [16,17]. The advantages of LSTM come from the memory cells in network neurons. The memory cells inherit the time-relevant information from long-term and short-term memory together and help LSTM make future predictions based on important historical data [16,18]. This feature enables LSTM to perform better in handling the complexity of nonlinear systems. But LSTM still can possibly encounter overfitting issues [19,20,21]. The reason can be originated from the conventional training mechanism. To train an accurate LSTM, it conventionally uses a very long single time series that concatenates many training data samples to cover a broad frequency range [16,18]. As a result, the conventional LSTM training progress could be extremely time-consuming. To address this issue, sequence-to-sequence LSTM learning (namely, LSTMs2s) has been recently implemented for PEA system identification [22] with high accuracy. This provides potential for controller design in high accuracy PEA position tracking.
Leveraging LSTMs2s in system identification, this work proposes a 2DOF controller (invLSTMs2s+MPC) consisting of a feedforward controller based on an LSTMs2s model of the system inverse dynamics (namely, invLSTMs2s) and a model predictive controller (MPC) as a feedback controller. The MPC was chosen primarily owing to its ability to handle constraints and capability for controlling multivariable plants, among other advantages. Considering the frequency and range dependent PEA nonlinear dynamics, the invLSTMs2s training set was generated using sinusoidal and triangle references with various frequency and amplitude components. To reduce the training data size and training time cost, invLSTMs2s was primarily designed to capture the PEA dynamics at mid- and high-frequency ranges (e.g., a range of tens to hundreds of hertz). In this work, the invLSTMs2s model was trained using a parallel training method, in which groups of training datasets were divided into mini-batches. After training, the modeling accuracy of invLSTMs2s was validated by comparing the model hysteresis prediction with the actual PEA response. With the accuracy of invLSTMs2s in nonlinear dynamics identification confirmed, an MPC based on a linear model was integrated as a feedback controller to take care of the low-frequency control and improve the overall tracking accuracy. Weighting factors were incorporated in the feedforward and feedback control paths to generalize the proposed 2DOF controller for tracking various references in different frequency regions. For validation and demonstration, the proposed 2DOF controller was implemented to control the displacement of a commercial PEA stage and compared with other approaches. The experiment results clearly demonstrate the superiority of invLSTMs2s+MPC for PEA trajectory tracking.

2. Feedforward–Feedback invLSTMs2s+MPC Controller Design

2.1. Feedforward Controller

2.1.1. System Inversion Model

In general, a dynamic system can be modeled in state-space as Equation (1), where U = [ u k , u k + 1 , ] and Y = [ y k , y k + 1 , ] are the input and output, respectively. The corresponding inversion model can be modeled as Equation (2). The inversion model’s (Equation (2)) output becomes U when the Y is fed as the inversion model’s input. If an accurate inversion model can be obtained for a time-invariant system and disturbances/uncertainties are not considered, precise control of the system can be achieved by connecting the inversion model and the system in series where the output of the former is the input of the latter and directly feeding the desired system reference as the input of the combination.
x k + 1 = f ( x k , u k ) y k = h ( x k , u k )
z k + 1 = f i n v ( z k , y k ) u k = h i n v ( z k , y k ) .
For a linear system, its inversion model can be directly determined through linear system theories. However, for a nonlinear system, such as PEA positioning systems in this work, obtaining an accurate inversion model is not straightforward. One way to solve this problem is to train and regress an inversion machine learning model using the data pair { Y , U } from the original system.

2.1.2. LSTMs2s Inversion Model (invLSTMs2s)

The goal of designing an inversion model is to map the output time series of the original system back to the input time series of the original system. Previous studies have shown that LSTM networks are more potent for this task compared to other forms of RNNs [17,22].
In this work, an advanced development of LSTM, sequence-to-sequence LSTM (LSTMs2s), was selected to generate the nonlinear inversion model of PEA for its efficiency and accuracy [22,23,24]. As shown in Figure 1, the LSTMs2s contains two LSTM layers expanded timewise: an LSTM encoder and an LSTM decoder. Both LSTM layers have similar structures.
In LSTMs2s (see Figure 1), the encoder LSTM layer reads the entire input time series u ( r ) and its final hidden states x ( r ) are saved as context vector C v . After the encoder processes the entire input time series, the decoder is coupled with a fully connected network to predict the output time series y ( r ) based on C v . The entire LSTMs2s model can be formulated as follows:
Encoder:
x e , ( k ) = f ( x e , ( k 1 ) , u ( r ) , k ) .
Decoder:
y ( r ) , k 1 = g ( x d , ( k 1 ) ) x d , ( k ) = f ( x d , ( k 1 ) , y ( r ) , k 1 , C v ) y ( r ) , k = g ( x d , ( k ) ) ,
where x d , ( k ) and x e , ( k ) are the hidden states in the encoder and decoder at the sampling instant k, respectively. f ( · ) denotes the LSTM unit function. g ( · ) represents the output layer that is implemented by a fully connected network. In this encoder–decoder model, it is clear that the decoder generates output data points y ( r ) , k at each sampling instant k and uses it as an input parameter to the decoder for the next output prediction. Note that the context vector C v serves as an input parameter to the decoder for each step.

2.1.3. invLSTMs2s Training

invLSTMs2s aims to map the relation from the PEA output to the PEA input, hence the inverse PEA dynamics. Suppose the measured PEA system output is Y ( t s ) subject to the drive input U ( t s ) , and Y ( r t s ) is the inversion model output subject to Y ( t s ) ; a perfect inversion model should generate Y ( r t s ) exactly the same as U ( t s ) . In other words, such a perfect inversion model satisfies that the error | | U ( t s ) Y ( r t s ) | | < ϵ for any ϵ > 0 . Therefore, the invLSTMs2s can be trained using the training set constructed with ( Y t s , U t s ) by solving the following optimization problem:
min W f , b f , W i , b i , W o , b o , W C , b C , W ( r ) , b ( r ) J ( r ) = | | U ( t s ) Y ( r t s ) | | subject to : x e , ( k ) = f ( x e , ( k 1 ) , y ( t s ) , k ) y ( r t s ) , k 1 = g ( x d , ( k 1 ) ) x d , ( k ) = f ( x d , ( k 1 ) , y ( r ) , k 1 , s ) y ( r t s ) , k = g ( x d , ( k ) ) x d , 0 = x e , L = s , k = 1 , 2 , 3 , , L ,
where f ( · ) and g ( · ) are the LSTM unit function and the fully connected output layer, respectively. L is the length of the time series.
We utilized parallel training to train this structure with multiple input–output time series samples for highly efficient system identification. Considering the frequency- and amplitude-dependent behavior of PEAs, each time series U ( t s ) , i is generated using the multi-period sinusoidal function S ( f i , A i ) and triangular function T ( f i , A i ) , with f i as the signal frequency and A i as the amplitude. Mathematically, the collection of k samples of U ( t s ) , i , U ( t s ) is formulated as
U ( t s ) = { U ( t s ) , 1 , U ( t s ) , 2 , , U ( t s ) , k } with U ( t s ) , i = S i , i = k + 1 2 if k is odd number T i , i = k 2 if k is even number S i = S ( f i , A i ) , T i = T ( f i , A i ) , where ( f i , A i ) Ω S ( f i , A i ) = A i [ sin ( 2 π f i t + 3 π 2 ) + 1 ] , t [ 0 , P f ] T ( f i , A i ) = 4 · A i | f i t f i t + 1 2 | , t [ 0 , P f ] ,
where Ω is the set of selected ( f i , A i ) pairs, which can be determined using k-means clustering [16,25]. P is the number of periods the signal sample contains. The PEA system output time series set Y ( t s ) = { Y ( t s ) , 1 , Y ( t s ) , 2 , , Y ( t s ) , k } can be then measured subject to the input set U ( t s ) = { U ( t s ) , 1 , U ( t s ) , 2 , , U ( t s ) , k } . Then, the entire training set can be constructed as { Y ( t s ) , U ( t s ) } . The samples of the entire training set are then sorted based on their length and divided into mini-batches using a pre-defined batch size n (n samples per batch). Then, the inversion model is trained batch by batch, and the total training iteration needed is λ K s / n , where λ is the epoch number, and K s is the total sample size.
Note that the sample length can be very big for small f i s . This can dramatically increase the training time. Given the PEA’s nonlinear dynamics mostly occur for high driving frequencies [22,25], the invLSTMs2s was primarily designed to capture the high-frequency and nonlinear dynamics of the PEA system [22].

2.2. Feedback Controller Design

Given the high-frequency dynamics of the PEA can be accurately compensated by the invLSTMs2s, what is left for the PEA positioning control is to design a controller for the PEA’s low-frequency dynamics, which is mostly linear. Therefore, a linear MPC is proposed for its efficiency and flexibility in handling control constraints and system disturbances. The MPC works in parallel with the invLSTMs2s feedforward controller. The block diagram of the entire controller design is shown in Figure 2.
The MPC utilizes a state observer and an estimated linear model for the PEA system state estimation and output prediction. Such a state observer, like the Kalman filter, can be expressed as,
x ( c ) , k + 1 = A x ( c ) , k + B u k y k ^ = C x ( c ) , k ,
where x ( c ) , k is the estimated state of the state observer, and u k is the system input. y k ^ is the predicted system output. A , B , C are the estimated state-space matrices that can be obtained using a linear system identification algorithm based on system frequency response fitting, and their dimensions depend on the order of the estimated system chosen by the algorithm and user preference.
State estimation: A steady-state Kalman filter [26] was implemented as the state estimator for the MPC. To design the Kalman filter, the process noise covariance and measurement noise covariance were estimated based on the system’s linear model identification accuracy, the PEA stage’s measurement noise level, and the manufacturer’s provided system’s noise-to-signal ratio.
For each control interval k, the system state was estimated as follows:
First, compute the innovation using the current system output measurement y k
e k = y k [ C x ( c ) , k | k 1 ] .
Second, update the current state
x ( c ) , k | k = x ( c ) , k | k 1 + M e k ,
where x ( c ) , k | k 1 is the k instant controller state estimation at the previous instant, k 1 . M is the Kalman innovation gain matrix. Then, the MPC uses the updated state x ( c ) , k | k to predict the system output and to solve for the optimal drive input to the system, u k o p t , with quadratic programming (QP).
Finally, the state estimator predicts the state for the next control interval k + 1
x ( c ) , k + 1 | k = A x ( c ) , k | k 1 + B u k o p t + L e k ,
where L is the Kalman filter gain matrix.
Output prediction: The MPC computes the future system state within the pre-selected prediction horizon p as follows:
x ( c ) , k + i | k = A x ( c ) , k + i 1 | k + B u k + i 1 | k , i = 1 , 2 , 3 , , p .
For each step i, the PEA’s output is:
y ^ k + i = C x ( c ) , k + i | k .
QP matrices: Let U = [ u k + 1 , u k + 1 , , u k + m ] , Δ U = [ u k + 1 u k , u k + 2 u k + 1 , , u k + m u k + m 1 ] , and 1 n = [ 1 , 1 , , 1 ] T , where m is the control horizon. The aforementioned system output prediction can be rewritten in an alternative form:
Y ^ = α x ( c ) , k | k + β U + F u k = α x ( c ) , k | k + β ( S 1 Δ U + 1 n u k ) + F u k = α x ( c ) , k | k + β S 1 Δ U + ( β 1 n + F ) u k ,
where
Y ^ = y ^ k + 1 y ^ k + 2 y ^ k + p p × 1 , U = u k + 1 u k + 2 u k + m m × 1 .
The terms α and β are defined as
α = C A C A 2 C A p p × 1 , β = 0 0 0 0 C B 0 0 0 C A B C B 0 · 0 C A p 2 B C A p 3 B C A p 4 B · 0 p × m , F = C B C A B C A 2 B C A p 1 B , S 1 = 1 0 0 1 1 0 1 1 1 .
Cost function: With the given QP matrices, the cost function can be mathematically expressed as
J = ( Y ^ R ) T ( Y ^ R ) + ρ Δ U T Δ U ,
where R = [ r k + 1 , r k + 2 , , r k + p ] is the reference of desired system output, ρ is the weighting coefficient for the reference tracking error and MV movement suppression, respectively. Let E = α x ( c ) , k | k + ( β 1 n + F ) u k R ; the cost function can be rewritten as
J = ( β S 1 Δ U + E ) T ( β S 1 Δ U + E ) + Δ U T ( ρ I ) Δ U = Δ U T S 1 T β T β S 1 Δ U + E T β S 1 Δ U + ( E T β S 1 Δ U ) T + E T E + Δ U T ( ρ I ) Δ U = Δ U T ( ρ I + S 1 T β T β S 1 ) Δ U + 2 E T β S 1 Δ U + E T E .
Now, it becomes a convex optimization problem for the QP progress to compute the optimal Δ U based on the known value u k and x ( c ) , k | k , and add the first element of Δ U with u k to obtain u k o p t . The convex optimization problem can be formulated as follows:
arg min Δ U J = Δ U T ( ρ I + S 1 T β T β S 1 ) Δ U + 2 E T β S 1 Δ U + E T E .
Solution and stability analysis: The solution for Δ U that can minimize the cost function J should be
Δ U = arg min Δ U J = ( ρ I + S i T β T β S i ) 1 E T β S i .
According to the MPC algorithm, only the first step of the manipulated variable would be taken for system input. Setting G = [ 1 , 0 , 0 , , 0 ] , we can compute
u k + 1 = u k G ( ρ I + S i T β T β S i ) 1 E T β S i = M 1 x c ( k ) + M 2 u k + M 3 R ,
where M 1 = G ( ρ I + S i T β T β S i ) 1 α T β S i , M 2 = 1 G ( ρ I + S i T β T β S i ) 1 ( β 1 n ) T β S i , and M 3 = G ( ρ I + S i T β T β S i ) 1 β S i . In this case, M 1 , M 2 , and M 3 are all constant matrices and can be regarded as the coefficient to x ( c ) , k | k , u k 1 , and R, so that the feedback system can be rewritten as
x ( c ) , k + 1 | k u k + 1 = A B M 1 M 2 x ( c ) , k | k u k + 0 M 3 R .
Based on the alternative form above, it is obvious that the feedback system stability condition needs to satisfy the following:
λ A B M 1 M 2 max < 1 ,
where λ ( · ) denotes the eigenvalues of the matrix.

2.3. Impact of the invLSTMs2s Uncertainty

Previous studies have shown that the inversion’s modeling uncertainty must be small such that the inversion-based feedforward input improves the output tracking performance when compared to the case with feedback control only (regardless of the type of feedback controller used) [27]. Thus, it is necessary to evaluate the uncertainty of invLSTMs2s in this work. According to [27], for the SISO PEA system, the criteria that need to be satisfied such that integrating the inversion feedforward with the MPC improves the tracking performance can be formulated as
| Δ ( j ω ) | 2 | G o ( j ω ) | 2 ,
where Δ ( j ω ) is the modeling uncertainty at different frequencies ω , and it is defined as Δ : = G o G , with G and G o denote the PEA’s true dynamics and the captured dynamics, respectively. Therefore, the inverse model G o 1 ( j ω ) is exactly the trained inversion model, invLSTMs2s. Then, (21) can be rewritten as
| Δ ( j ω ) | 2 | G 0 ( j ω ) | 2 1 | Δ ( j ω ) | 2 | G o 1 ( j ω ) | 2 1 | G 0 ( j ω ) G ( j ω ) | 2 | G o 1 ( j ω ) | 2 1 | 1 G ( j ω ) G o 1 ( j ω ) | 2 1 .
Note that G ( j ω ) G 0 1 ( j ω ) represents the model of the cascaded invLSTMs2s+PEA system, i.e., when the invLSTMs2s feedforward controller is applied to the PEA alone. Thus, the tracking error subject to reference R in this case is defined as E f f o n l y : = R Y = [ 1 G ( j ω ) G o 1 ( j ω ) ] R , and (22) becomes
max | R ( j ω ) | 2 0 | E f f o n l y ( j ω ) | 2 | R ( j ω ) | 2 1 ,
which matches the E r m s in (24) as discussed in next subsection.
Note that | E f f o n l y ( j ω ) | 2 | R ( j ω ) | 2 is the RMS tracking error (see (24)). Therefore, as long as the RMS tracking error satisfies the above (23) when only the invLSTMs2s feedforward controller is used, it is guaranteed that the proposed 2DOF controller performs better than the MPC alone.
Due to the training data limit in practice, such as data length, frequency, and amplitude limits, invLSTMs2s may not capture the PEA dynamics outside of the training frequency and/or amplitude ranges. In this case, the modeling uncertainty Δ may not satisfy condition (21). To solve this issue, we introduced frequency-dependent weighting factors κ f f ( ω ) and κ f b ( ω ) , such that the contribution of the inversion feedforward part could be flexibly adjusted based on the frequency range of the reference (see Figure 2). For example, for low-frequency (lower than the invLSTMs2s lower training frequency bounds) tracking, κ f f can be set small so the MPC feedback control is dominant to minimize the uncertainty effect. Also, a low-pass filter (LPF) was used in the MPC part to avoid high-frequency data that could be fed into the feedback loop.

3. Experiment Results and Discussion

The proposed invLSTMs2s+MPC 2DOF controller was implemented on a PEA stage (Nano-OP30, Mad City Labs, Madison, WI, USA) for the experimental validation. The performance was compared with that of feedforward invLSTMs2s only, an MPC only, and a traditional PI controller. All the signals were acquired using a data acquisition (DAQ) system (NI PCIe-6353, National Instruments, Austin, TX, USA) installed on a desktop workstation (Intel Xeon W-2125, RAM 32 GB), as shown in Figure 3. All the models and controllers were trained/designed using MATLAB (R2023a) and Simulink (10.7) (MathWorks, Inc., Natick, MA, USA). For all the experiments, the sampling frequency was set at 20 kHz.

3.1. invLSTMs2s Training Set Construction and Training Process

The prerequisite step for generating the invLSTMs2s training set { Y ( t s ) , U ( t s ) } is finding sufficient fA pairs. To do so, 10,000 points (blue dots in Figure 4) in the f range of 25–325 Hz and amplitude range of 0–4.5 V were first randomly generated in the fA plane, and the k-means algorithm [22] picked 1000 points (red points in Figure 4) as Ω ’s for the training set generation. Then, (6) was used to generate U ( t s ) with P = 3 . Using U ( t s ) as the drive input of the PEA, the corresponding PEA output Y ( t s ) was collected by the DAQ device. Thus, the entire training set ( Y ( t s ) , U ( t s ) ) contained 2000 samples.
The invLSTMs2s model with 40 hidden states was used to model the PEA inverse dynamics and trained with λ = 32 epochs using parallel training. A batch size of n = 20 was selected. Therefore, the entire training process took λ K s / n = 3200 iterations. Figure 5 shows the invLSTMs2s root-mean-square error (RMSE) learning curve as the epoch number increases. It clearly shows the convergence of the invLSTMs2s training process with very low training error.

3.2. Accuracy of invLSTMs2s

The model accuracy was tested by checking how well invLSTMs2s could compensate for the PEA hysteresis, i.e., whether the hysteresis could be mostly removed when invLSTMs2s was cascaded with the PEA (connected in series). As shown in Figure 6, the original PEA hysteresis curves were measured using different sinusoidal driving voltages with the frequencies 30, 120, and 240 Hz and amplitudes of 2 and 4 V, respectively. Please note that these frequencies were intentionally selected to avoid overlapping with those in the training set to minimize the effect of data discrepancy. It can be seen that the PEA hysteresis tended to be more pronounced with increasing frequency and/or amplitude. However, the corresponding hysteresis curves of the cascaded invLSTMs2s+PEA system were barely notable for all measured frequencies and amplitudes (see Figure 6). Therefore, the invLSTMs2s inversion model was indeed accurate in accounting for the PEA’s nonlinear dynamics. This accuracy ensured that the modeling uncertainty stayed small (i.e., (21) was satisfied) and guaranteed that the output tracking performance of the proposed 2DOF invLSTMs2s+MPC controller was better than that of the MPC alone. More detailed comparisons are presented next.

3.3. Tracking Performance Comparison

The tracking performance of the proposed invLSTMs2s+MPC was demonstrated as follows. For the MPC, the prediction horizon was set to 60, and the stability condition (20) was satisfied. To design the Kalman state estimator, the covariance for the process noise and measurement noise were selected as Q = 10 4 and R = 10 6 , respectively, based on the system’s linear model identification accuracy, the PEA stage’s measurement noise level, and the system’s noise-to-signal ratio provided by the manufacturer. Also, to avoid high-frequency noise being fed into the feedback loop, a third-order Butterworth LPF with the cut-off frequency of 600 Hz was used to avoid high-frequency noise/disturbance being fed into the feedback loop.
Although the invLSTMs2s was trained up to 300 Hz, the control bandwidth was expected to reach 350 Hz due to the generalizability of LSTM. Based on the PEA behavior observed, whose nonlinear dynamics was highly nonlinear at frequencies close or above 200 Hz, frequencies ≥200 Hz were considered as high frequencies. To comprehensively test the controller performance over broad frequency/amplitude ranges, the reference trajectories were designed to have low (≤50 Hz), mid (∼100 Hz), and high frequency (≥200 Hz) profiles: sinusoidal signals (at 30, 120, and 360 Hz), triangle signals (at 25, 100, and 300 Hz), and a multi-frequency reference Γ , where Γ ( t ) = 0.8 sin ( 2 π 5 t + 1.5 π ) + 0.43 sin ( 2 π 50 t ) + 0.12 sin ( 2 π 120 t + 1.2 π ) + 0.3 sin ( 2 π 180 t + π ) .
First, the accuracy of invLSTMs2s as the feedforward controller only was evaluated. Then, the proposed invLSTMs2s+MPC 2DOF controller was compared with the cases of using invLSTMs2s, the MPC, and PI feedback. The PI controller parameters were tuned to P = 0.312 and I = 1465 to achieve the best performance possible. The tracking errors were computed as [28]:
E r m s = | r ( · ) y ( · ) | 2 | r ( · ) | 2 × 100 % , E m a x = | r ( · ) y ( · ) | | r ( · ) | × 100 % ,
where r ( · ) and y ( · ) are the complex vectors of reference and output obtained through discrete Fourier transforms, respectively. Table 1 shows the tracking errors for all the controllers. Figure 7 shows the detailed tracking performance in the time domain for the selected references of (120 Hz sinusoidal signal, 100 Hz triangular signal, and Γ ).
invLSTMs2s+MPC vs. invLSTMs2s: First of all, condition (23) was satisfied for the case when only invLSTMs2s was used when tracking all the designed references: the RMS tracking errors were all much smaller than one. Hence, invLSTMs2s could accurately capture the PEA nonlinear dynamics and produced very small errors for tracking high-frequency references; however, relatively higher errors at the low-frequency range were observed. For example, in Table 1, the tracking error of the invLSTMs2s for both sinusoidal and triangular references at 30 and 25 Hz were a few percent higher than those for 360 and 300 Hz, respectively. This was mainly because these frequencies were very close to the lower bound of the training frequency range. Thus, limited knowledge of the system dynamics at these frequencies was available when training invLSTMs2s. This shortage was compensated for by the MPC, which was designed using a linear model that primarily captured the PEA dynamics at low frequencies. The invLSTMs2s+MPC outperformed invLSTMs2s at the low-frequency range. The tracking errors of the invLSTMs2s+MPC for both sinusoidal and triangular signals at 30 and 25 Hz were reduced by more than 50% compared to those of invLSTms2s. As shown in Figure 7, the low-frequency tracking error was greatly reduced by invLSTMs2s+MPC compared to invLSTMs2s. As the PEA nonlinear dynamics became more pronounced at higher frequencies, the advantage of invLSTMs2s became more significant: invLSTMs2s achieved the highest accuracy for 360 Hz sinusoidal and 300 Hz triangular trajectories. Meanwhile, as the linear MPC was not designed for the PEA’s high-frequency control, it negatively impacted the performance of invLSTMs2s. Therefore, bigger errors were observed (see Table 1). However, compared to the errors of MPC, the performance was still greatly improved, and the achieved accuracy was satisfactory.
invLSTMs2s+MPC vs. MPC: The proposed invLSTMs2s+MPC was compared with the use of the MPC alone to clearly demonstrate the necessity of integrating the invLSTMs2s feedforward control. As the performance improvement condition (23) was satisfied for all the tracking cases ( E r m s < < 1 for the use of invLSTMs2s alone in Table 1), it was expected that the proposed 2DOF invLSTMs2s+MPC controller would outperform the MPC. Table 1 shows that the invLSTMs2s+MPC had less tracking error than the MPC, especially for tracking mid-, high-frequency, and the multi-frequency Γ trajectories. However, it made sense that the tracking errors of invLSTMs2s+MPC at 30 and 25 Hz were a little bit higher than those of the MPC, since the MPC is more accurate than invLSTMs2s at low frequencies, as aforementioned. Note that the slight error increase can be eliminated by setting the feedforward weighting coefficient κ f f ( ω ) to a very small value to make sure the 2DOF controller takes full advantage of the MPC at low frequencies.
invLSTMs2s+MPC vs. PI: To demonstrate the significance of this work, invLSTMs2s+MPC was compared to the case of using a PI controller alone. In Table 1, it can be observed that the tracking error of the PI controller was more pronounced with increasing frequency. For the mid and high frequencies, the PI presented the highest tracking error among all controllers. This is mainly because the PI feedback controller was driven by the errors between references and PEA outputs, and the control bandwidth was low due to the sluggish response of this controller subject to fast error changes.
In summary, invLSTMs2s+MPC is capable of achieving high tracking accuracy for various reference trajectories over a broad bandwidth. The superiority of this 2DOF controller comes from taking the advantages of feedforward invLSTMs2s for the PEA’s high-frequency nonlinear dynamics compensation and MPC for low-frequency position tracking precision.

3.4. Application Demonstration of the invLSTMs2s+MPC

As an application demonstration, invLSTMs2s+MPC was further adopted to control a two-dimensional (XY lateral plane motion) PEA nanopositioning platform to track the pattern of character string “ISU” along the lateral XY plane. The tracked patterns achieved by different controllers are compared in Figure 8. It was designed to complete the motion within 0.11 s, with the stage average traveling speed of 320 μ m/sec. The designed X and Y direction trajectories and tracking results for different controllers are shown in Figure 9a,b, respectively. According to the results, it is clear that the proposed invLSTMs2s+MPC performed better and could achieve very high accuracy. This clearly demonstrated the capability of invLSTMs2s+MPC in two-dimensional positioning tracking control in nanofabrication applications, such as nano/micromanufacturing systems in which the PEA stages are required to track the desired fabrication pattern [29,30].
As invLSTMs2s does not assume any specific forms of nonlinearity, the outstanding performance of invLSTMs2s+MPC directly implies that it can be employed in various nonlinear systems for applications where position tracking is essential, such as precise control of friction force generated by deformable actuators in human–robot reaction [31], AFM applications for material mechanical characterization [32], as well as adaptive optics for high-resolution imaging [33], just to name a few.

4. Conclusions

In this work, we proposed a 2DOF controller, invLSTMs2s+MPC, for nanopositioning control of PEA devices with a broad bandwidth, high precision, and fast response. The key novelty of the proposed controller includes (1) the idea of implementing an LSTMs2s structure to model the PEA inverse dynamics for system nonlinearity compensation and (2) integrating invLSTMs2s and a linear MPC to improve the overall tracking performance. As the invLSTMs2s model can be trained offline and a linear MPC is used during real-time control, the proposed controller can achieve broad-bandwidth output tracking with its low computation requirement.
In the future, the work will focus on designing a dynamics separation mechanism to efficiently optimize the collaboration between the feedforward and feedback components. Also, we will explore other modeling network structures to improve the training speed further and increase the modeling bandwidth, robustness, and accuracy.

Author Contributions

Conceptualization, R.Y. and J.R.; methodology, R.Y. and J.R.; software, R.Y.; validation, R.Y.; formal analysis, R.Y. and J.R.; investigation, R.Y.; resources, J.R.; data curation, R.Y. and J.R.; writing—original draft preparation, R.Y.; writing—review and editing, J.R.; visualization, R.Y.; supervision, J.R.; project administration, J.R.; funding acquisition, R.Y. and J.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Science Foundation (NSF) (CMMI-1751503 and CNS-2409359) and Iowa State University.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
LSTMLong short-term memory
LSTMs2sSequence-to-sequence long short-term memory
invLSTMs2sInversive sequence-to-sequence long short-term memory
NNNeural network
RNNRecurrent neural network
2DOFTwo-degree of freedom
MPCModel predictive controller
PEAPiezoelectric actuator
AFMAtomic force microscopy
QPQuadratic programming

References

  1. Mollaeian, K.; Liu, Y.; Bi, S.; Ren, J. Atomic force microscopy study revealed velocity-dependence and nonlinearity of nanoscale poroelasticity of eukaryotic cells. J. Mech. Behav. Biomed. Mater. 2018, 78, 65–73. [Google Scholar] [CrossRef] [PubMed]
  2. Xie, S.; Ren, J. Iterative Learning-based Model Predictive Control for Precise Trajectory Tracking of Piezo Nanopositioning Stage. In Proceedings of the 2018 Annual American Control Conference (ACC), Milwaukee, WI, USA, 27–29 June 2018; pp. 2922–2927. [Google Scholar]
  3. Tian, Y.; Zhang, D.; Shirinzadeh, B. Dynamic modelling of a flexure-based mechanism for ultra-precision grinding operation. Precis. Eng. 2011, 35, 554–565. [Google Scholar] [CrossRef]
  4. Qin, F.; Zhang, D.; Xing, D.; Xu, D.; Li, J. Laser beam pointing control with piezoelectric actuator model learning. IEEE Trans. Syst. Man Cybern. Syst. 2017, 50, 1024–1034. [Google Scholar] [CrossRef]
  5. Fan, Y.; Tan, U.X. Design of a feedforward-feedback controller for a piezoelectric-driven mechanism to achieve high-frequency nonperiodic motion tracking. IEEE/ASME Trans. Mechatron. 2019, 24, 853–862. [Google Scholar] [CrossRef]
  6. Mao, X.; Wang, Y.; Liu, X.; Guo, Y. A hybrid feedforward-feedback hysteresis compensator in piezoelectric actuators based on least-squares support vector machine. IEEE Trans. Ind. Electron. 2017, 65, 5704–5711. [Google Scholar] [CrossRef]
  7. Baziyad, A.G.; Ahmad, I.; Salamah, Y.B. Precision Motion Control of a Piezoelectric Actuator via a Modified Preisach Hysteresis Model and Two-Degree-of-Freedom H-Infinity Robust Control. Micromachines 2023, 14, 1208. [Google Scholar] [CrossRef]
  8. Cheng, L.; Chen, W.; Tian, L.; Xie, Y. A feedforward-feedback controller based on modified Prandtl–Ishlinskii model and submodel for tracking control the trajectories of piezoelectric actuator. Microsyst. Technol. 2024, 30, 1007–1024. [Google Scholar] [CrossRef]
  9. Ahmad, I. Two degree-of-freedom robust digital controller design with Bouc-Wen hysteresis compensator for piezoelectric positioning stage. IEEE Access 2018, 6, 17275–17283. [Google Scholar] [CrossRef]
  10. Yuan, J.; Wu, H.; Qin, Y.; Han, J. High-Bandwidth Repetitive Trajectory Tracking Control of Piezoelectric Actuators via Phase–Hysteresis Hybrid Compensation and Feedforward–Feedback Combined Control. Micromachines 2023, 14, 2009. [Google Scholar] [CrossRef]
  11. Gan, J.; Zhang, X. A review of nonlinear hysteresis modeling and control of piezoelectric actuators. AIP Adv. 2019, 9, 040702. [Google Scholar] [CrossRef]
  12. Xie, S.; Ren, J. Tracking control using recurrent-neural-network-based inversion model: A case study on a piezo actuator. IEEE Trans. Ind. Electron. 2020, 68, 11409–11419. [Google Scholar] [CrossRef]
  13. Pascanu, R.; Mikolov, T.; Bengio, Y. On the difficulty of training recurrent neural networks. In Proceedings of the International Conference on Machine Learning, PMLR, Atlanta, GA, USA, 17–19 June 2013; pp. 1310–1318. [Google Scholar]
  14. Hochreiter, S. Untersuchungen zu dynamischen neuronalen Netzen. Diploma Tech. Univ. München 1991, 91, 31. [Google Scholar]
  15. Rehmer, A.; Kroll, A. On the vanishing and exploding gradient problem in Gated Recurrent Units. IFAC-PapersOnLine 2020, 53, 1243–1248. [Google Scholar] [CrossRef]
  16. Patil, M.S.; Charuku, B.; Ren, J. Long Short-term Memory Neural Network-based System Identification and Augmented Predictive Control of Piezoelectric Actuators for Precise Trajectory Tracking. IFAC-PapersOnLine 2021, 54, 38–45. [Google Scholar] [CrossRef]
  17. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
  18. Sherstinsky, A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Phys. D Nonlinear Phenom. 2020, 404, 132306. [Google Scholar] [CrossRef]
  19. Wang, Y. A new concept using lstm neural networks for dynamic system identification. In Proceedings of the 2017 American Control Conference (ACC), Seattle, WA, USA, 24–26 May 2017; pp. 5324–5329. [Google Scholar]
  20. Ookura, S.; Mori, H. An efficient method for wind power generation forecasting by LSTM in consideration of overfitting prevention. IFAC-PapersOnLine 2020, 53, 12169–12174. [Google Scholar] [CrossRef]
  21. Gonzalez, J.; Yu, W. Non-linear system modeling using LSTM neural networks. IFAC-PapersOnLine 2018, 51, 485–489. [Google Scholar] [CrossRef]
  22. Yin, R.; Ren, J. Sequence-to-sequence LSTM-based Dynamic System Identification of Piezo-electric Actuators. In Proceedings of the 2023 American Control Conference (ACC), San Diego, CA, USA, 31 May–2 June 2023; pp. 673–678. [Google Scholar]
  23. Cho, K.; Van Merriënboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv 2014, arXiv:1406.1078. [Google Scholar]
  24. Sutskever, I.; Vinyals, O.; Le, Q.V. Sequence to sequence learning with neural networks. arXiv 2014, arXiv:1409.3215. [Google Scholar]
  25. Xie, S.; Ren, J. Recurrent-neural-network-based predictive control of piezo actuators for trajectory tracking. IEEE/ASME Trans. Mechatron. 2019, 24, 2885–2896. [Google Scholar] [CrossRef]
  26. Franklin, G.F.; Powell, J.D.; Workman, M.L. Digital Control of Dynamic Systems; Addison-Wesley: Menlo Park, CA, USA, 1998; Volume 3. [Google Scholar]
  27. Devasia, S. Should model-based inverse inputs be used as feedforward under plant uncertainty? IEEE Trans. Autom. Control 2002, 47, 1865–1871. [Google Scholar] [CrossRef]
  28. Kim, K.S.; Zou, Q. A modeling-free inversion-based iterative feedforward control for precision output tracking of linear time-invariant systems. IEEE/ASME Trans. Mechatron. 2012, 18, 1767–1777. [Google Scholar] [CrossRef]
  29. Yan, Y.; Sun, T.; Liang, Y.; Dong, S. Investigation on AFM-based micro/nano-CNC machining system. Int. J. Mach. Tools Manuf. 2007, 47, 1651–1659. [Google Scholar] [CrossRef]
  30. Du, P.; Han, L.; Qiu, X.; Chen, W.; Deng, J.; Liu, Y.; Zhang, J. Development of a high-precision piezoelectric ultrasonic milling tool using longitudinal-bending hybrid transducer. Int. J. Mech. Sci. 2022, 222, 107239. [Google Scholar] [CrossRef]
  31. Xiong, J.; Chen, J.; Lee, P.S. Functional fibers and fabrics for soft robotics, wearables, and human–robot interface. Adv. Mater. 2021, 33, 2002640. [Google Scholar] [CrossRef]
  32. Kathavate, V.; Prasad, K.E.; Kiran, M.S.; Zhu, Y. Mechanical characterization of piezoelectric materials: A perspective on deformation behavior across different microstructural length scales. J. Appl. Phys. 2022, 132, 121103. [Google Scholar] [CrossRef]
  33. Ren, K.; Gao, X.; Jin, H.; Qiao, L.; Xia, S.; Li, F. Conformal Ordered Solid–Liquid Coupled Piezoelectric Units for Programmable Adaptive Optics. Adv. Funct. Mater. 2024, 2410173. [Google Scholar] [CrossRef]
Figure 1. LSTMs2s structure.
Figure 1. LSTMs2s structure.
Machines 12 00747 g001
Figure 2. Schematic block diagram of the feedforward–feedback invLSTMs2s+MPC controller.
Figure 2. Schematic block diagram of the feedforward–feedback invLSTMs2s+MPC controller.
Machines 12 00747 g002
Figure 3. PEA control system setup.
Figure 3. PEA control system setup.
Machines 12 00747 g003
Figure 4. Ω (set of ( f i , A i ) pairs) selected for generating the training input U ( t s ) . Red dots represent the selected ( f i , A i ) pairs by the k-means algorithm.
Figure 4. Ω (set of ( f i , A i ) pairs) selected for generating the training input U ( t s ) . Red dots represent the selected ( f i , A i ) pairs by the k-means algorithm.
Machines 12 00747 g004
Figure 5. The RMSE learning curve of invLSTMs2s. With 32 epochs and 100 samples in each mini-batch, the total training iteration number was 3200.
Figure 5. The RMSE learning curve of invLSTMs2s. With 32 epochs and 100 samples in each mini-batch, the total training iteration number was 3200.
Machines 12 00747 g005
Figure 6. PEA displacement vs. input voltage at the frequencies of (a) 30 Hz, (b) 120 Hz, (c) 240 Hz for the comparison between the PEA and the PEA cascaded with inversion model (invLSTMs2s+PEA).
Figure 6. PEA displacement vs. input voltage at the frequencies of (a) 30 Hz, (b) 120 Hz, (c) 240 Hz for the comparison between the PEA and the PEA cascaded with inversion model (invLSTMs2s+PEA).
Machines 12 00747 g006
Figure 7. Comparison of the tracking results of invLSTMs2s, invLSTMs2s+MPC, the MPC, and the PI controller for (a) 100 Hz triangular trajectory, (b) 120 Hz sinusoidal trajectory, and (c) Γ . (df) are the corresponding tracking errors, respectively.
Figure 7. Comparison of the tracking results of invLSTMs2s, invLSTMs2s+MPC, the MPC, and the PI controller for (a) 100 Hz triangular trajectory, (b) 120 Hz sinusoidal trajectory, and (c) Γ . (df) are the corresponding tracking errors, respectively.
Machines 12 00747 g007
Figure 8. Comparison of the stage motion pattern achieved using controllers invLSTMs2s, invLSTMs2s+MPC, and MPC.
Figure 8. Comparison of the stage motion pattern achieved using controllers invLSTMs2s, invLSTMs2s+MPC, and MPC.
Machines 12 00747 g008
Figure 9. Tracking results of invLSTMs2s, invLSTMs2s+MPC, and MPC for (a) X–direction trajectory and (b) Y–direction trajectory. (c,d) are the corresponding tracking errors, respectively.
Figure 9. Tracking results of invLSTMs2s, invLSTMs2s+MPC, and MPC for (a) X–direction trajectory and (b) Y–direction trajectory. (c,d) are the corresponding tracking errors, respectively.
Machines 12 00747 g009
Table 1. Performance comparison.
Table 1. Performance comparison.
ControllerRefs.SinusoidalTriangular Γ
30 Hz120 Hz360 Hz25 Hz100 Hz300 Hz
invLSTMs2s E r m s 9.35%7.25%3.10%9.04%6.00%4.29%5.64%
E m a x 7.91%6.14%2.30%7.13%4.74%2.51%3.49%
invLSTMs2s+MPC E r m s 3.19%2.97%7.53%4.05%1.94%6.32%3.63%
E m a x 2.09%2.07%6.25%2.85%1.05%4.31%2.39%
MPC E r m s 1.66%6.57%27.18%3.28%6.80%21.66%10.06%
E m a x 1.12%4.82%23.26%2.63%3.77%12.68%7.57%
PI E r m s 6.37%24.34%45.89%5.08%18.73%38.01%8.56%
E m a x 5.50%21.04%39.47%3.77%14.88%30.60%5.81%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yin, R.; Ren, J. LSTM-Inversion-Based Feedforward–Feedback Nanopositioning Control. Machines 2024, 12, 747. https://doi.org/10.3390/machines12110747

AMA Style

Yin R, Ren J. LSTM-Inversion-Based Feedforward–Feedback Nanopositioning Control. Machines. 2024; 12(11):747. https://doi.org/10.3390/machines12110747

Chicago/Turabian Style

Yin, Ruocheng, and Juan Ren. 2024. "LSTM-Inversion-Based Feedforward–Feedback Nanopositioning Control" Machines 12, no. 11: 747. https://doi.org/10.3390/machines12110747

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop