Locally supervised neural networks for approximating terramechanics models
Introduction
Recently, research involving the development of motion control systems for planetary rovers has generated increased interest in wheel–terrain interactional systems [1]. Research in this field typically involves the identification of soil parameters and the building of physical models [2]. These proposed terramechanics models are both highly nonlinear and include multi-coupling equations that use many soil parameters with multivariable integration. Thus, they are difficult to apply under unknown soil parameters. Therefore, developing an effective and intelligent modeling algorithm is desirable. It is generally accepted that neural network (NN), as a popular machine learning method, exhibits potent learning abilities for approximating nonlinear functions and predicting outputs [3], [4]. Although there may be no prior knowledge of a system׳s dependences, a special neural network model can still effectively approximate the response of the system from input to output by updating the weights [5], [6]. In particular, when a sufficient number of training samples are obtained, an NN using supervised learning generally produces better prediction results, compared to NNs that use unsupervised learning [7]. To accurately approximate a highly complex nonlinear system, large amounts of experimental data are necessary. In that case, building an appropriate NN structure for a global model is quite difficult. This is due to the fact that a substantial number of neurons and weights must be learned, and iteratively tuning of all NN model parameters produces delays due to slow learning speeds [8]. This limitation prevents the use of NNs in real-time applications for learning complex nonlinear systems, which require large amounts of training data to be rapidly processed. For example, an online approximation of a nonlinear dynamics model for the purpose of robot control requires highly efficient online regression technology.
In recent years, various local learning methods have been proposed to resolve the aforementioned problems [9], [10]. The objective of local learning methods is to express complex global data as simple subsets, based on clustering techniques. Local learning that is considered to be effective locally, would not necessarily be effective globally [11]. The main advantage of local learning is the ability to process training data locally on the multiple individual components, which significantly reduces computation time and provides local robustness [12]. Fast local learning is more effective than a globally optimized model in real-time applications. To partition a full set of training data into smaller local subsets, many clustering algorithms have been proposed. The k-means clustering algorithm has been widely implemented in many fields to partition complex data sets into simpler data subsets [13], [14]. This method requires the number of local models k to be given, and randomly locates the initial k centers. Mixture of Experts (ME) method has been implemented to divide global training models into much simpler local models [15], [16]. However, to facilitate training data clustering, this method depends on the selection of an optimal number of local models k for the particular data set. Therefore, an effective method which can rapidly cluster a large amount of data, and conveniently adjusting the number of local models based on the special requirement.
Although training speed and robustness can be improved by local learning methods, this mixture of local models has remaining drawbacks. In particular, this supervised NN does not utilize an uncertainty model during the learning phase. For each local NN model configuration, an input variable is given, and the corresponding target of the network is an output variable; no other data are considered. However, a real dynamic system is often dominated by uncertainty, and real model predictions that do not include uncertainty may be of limited value and undesirable [17]. A Gaussian process (GP) with the nonparametric Bayesian model, which provides an explicit uncertainty measure, has developed into a useful machine learning tool [18], [19], [20]. It has shown a powerful learning capability, which enables it to accurately approximate complex nonlinear models in high-dimensional space [21], and to identify complex dynamic systems [22]. GP provides a fully generative model without significant formal requirements for training data distribution. As its main advantage, it considers both the noise in the system and uncertainty in the model. When sufficient training data are provided, the GP model can approximate the parametric models accurately.
The contribution of this paper lies in that a novel method is proposed by merging the intersected ε-region, which can conveniently adjusting the parameter ε based on the requirement; a locally supervised NN model is considered to train with uncertainty like GP, and then uses the locally weighted regression [11] to obtain the final predictive distribution.
This article is organized as follows. In Section 2, a novel cluster algorithm is proposed to partition the global NN model into simpler local models. Then, a locally supervised NN model is presented in Section 3. The advantage of a Gaussian process model is introduced, and each local NN is performed with a Gaussian process in Section 4. Predictions are performed by locally weighting each local NN model in Section 5. Finally, the performance of this algorithm is demonstrated by learning the model of a wheel–terrain interaction system in Section 6; the paper is concluded in Section 7.
Section snippets
The task of local learning
An observed set of n data from a specially nonlinear system is used as input; the corresponding target output set is , with p-dimension input and scalar output . In more realistic modeling situations, the outputs are the noisy observations of a latent function . The objective of using a neural network model is to build a mapping function from input data to output data, in order to predict the distribution of the new test data.
Once the training samples
Forward NN model
Here, the observed input data are considered as the centers of hidden neurons. For the kth local model, the requested nonlinear mapping relationship from the training subset to can be expressed by a simple forward NN model:where is the input point i of the kth model; mk is the number of nodes in the hidden layer, and the corresponding output ;
Training NN model with uncertainty like GP
According to the training phase of supervised NNs, it is evident that the expression of predictive models is similar to the model-predicting mean of the Gaussian process. However, no uncertainty model was included in previously described NN implemented during the training phase. Since an actual dynamic system is dominated by uncertainty, a prediction without uncertainty is of limited value and undesirable. A Gaussian process for a regression (GPR) model provides an explicit uncertainty measure
Locally weighted NN models for prediction
Local learning typically depends on the notion of a “neighborhood,” which is always based on the prior measure of locality. Here, we introduce the weight measure to ensure smooth transitions between the local models and the test point. It is a function of the Mahalanobis distance from the test point to a local model [27]. This distance is often selected from the test point to the center of a local model. However, this study utilizes the nearest Mahalanobis distance
Wheel–terrain interaction test platform
Research involving wheel–terrain interaction systems has an important role in the design of motion control systems for planetary rovers. However, most physical modeling performed during this research is highly nonlinear and includes multi-coupling equations. In this case, the locally supervised NN is applied to identify the optimal model for wheel–terrain interaction systems. The platform used for testing is the wheel–terrain interaction test system developed in the Chinese State Key Laboratory
Conclusion
Neural network based on supervised learning is capable of identifying nonlinear models and predicting the distribution of systems more accurately than models based on unsupervised learning. However, for highly complex nonlinear systems, building a global robust network model is quite difficult, since a large amount of experimental data must be rapidly processed, especially in real-time applications. To improve learning ability, this paper has proposed an effective clustering method to build a
Acknowledgments
This study was supported in part by the National Natural Science Foundation of China (Grant no. 61370033/51275106); National Basic Research Program of China (Grant no. 2013CB035502); Harbin Talent Programme for Distinguished Young Scholars (No. 2014RFYXJ001); Fundamental Research Funds for the Central Universities (Grant no. HIT.BRETIII.201411); Research Project of State Key Laboratory of Mechanical System and Vibration (No. MSV201610); and the “111 Project” (Grant no. B07018).
Xingguo Song is a PhD candidate in Mechanical Engineering at Harbin Institute of Technology, State Key Laboratory of Robotics and System, Harbin, China. His current research interests include stability of dynamical systems, neural networks, robotics and automatic control.
References (28)
- et al.
A comparison of supervised and unsupervised neural networks in predicting bankruptcy of Korean firms
Expert Syst. Appl.
(2005) - et al.
Extreme learning machine: theory and applications
Neurocomputing
(2006) - et al.
Real-time model learning using incremental sparse spectrum Gaussian process regression
Neural Netw.
(2013) - et al.
Composite adaptive control with locally weighted statistical learning
Neural Netw.
(2005) - et al.
Experimental study and analysis on driving wheels׳ performance for planetary exploration rovers moving in deformable soil
J. Terramech.
(2011) Theory of Ground Vehicles
(2001)- et al.
Interaction mechanics model for rigid driving wheels of planetary rovers moving on sandy terrain with consideration of multiple physical effects
J. Field Robot.
(2015) - et al.
Real-time learning capability of neural networks
IEEE Trans. Neural Netw.
(2006) - et al.
Real-time model predictive control using a self-organizing neural network
IEEE Trans. Neural Netw.
(2013) - et al.
Global convergence of online BP training with dynamic learning rate
IEEE Trans. Neural Netw. Learn. Syst.
(2012)
Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks
IEEE Trans. Neural Netw. Learn. Syst.
Local learning algorithms
Neural Comput.
Cited by (8)
Terramechanics models augmented by machine learning representations
2023, Journal of TerramechanicsMachine learning in planetary rovers: A survey of learning versus classical estimation methods in terramechanics for in situ exploration
2021, Journal of TerramechanicsCitation Excerpt :However, due to the limited computational resources of planetary rovers, measurements of VO can only occur e.g. once every 90 s (Toupet et al., 2019), even proprioceptive sensor data attains a frequency of 8 Hz (Sullivan et al., 2011) typically. In traditional literature, classical (Bekker-derived) DT methods are both highly nonlinear and include multi-coupling equations, that use many soil parameters with multivariable integration (Song et al., 2016). Solving this problematic requires adopting simplification models to obtain closed-form analytical solutions for the forces and torques, that can be computable online (real-time) at the cost of increasing the modeling error.
Towards in-situ characterization of regolith strength by inverse terramechanics and machine learning: A survey and applications to planetary rovers
2021, Planetary and Space ScienceCitation Excerpt :In the following, we try to clarify most of these aspects defining different learning paradigms. Neural networks (NNs) are one of the most popular machine learning methods (Chen, 1995; Song et al., 2016), and typically characterized according their architecture, node or learning rule characteristics. Although not all terramechanics applications of machine learning use them (e.g. (Gonzalez et al., 2018a; Bouguelia et al., 2017)), their generality will be useful to introduce certain concepts easier.
Environmental contact modeling for the earthworm-like robot via the novel elementary mechanical network
2022, Science China Technological SciencesMarkov Chain Monte Carlo Parameter Estimation for Nonzero Slip Models of Wheeled Mobile Robots: A Skid-Steer Case Study
2021, Journal of Mechanisms and RoboticsLearning-Based Terrain Identification with Proprioceptive Sensors for Mobile Robots
2021, IEEE Transactions on Industrial Electronics
Xingguo Song is a PhD candidate in Mechanical Engineering at Harbin Institute of Technology, State Key Laboratory of Robotics and System, Harbin, China. His current research interests include stability of dynamical systems, neural networks, robotics and automatic control.
Haibo Gao received his Ph.D. degree in Mechanical Engineering from Harbin Institute of Technology, Harbin, China, in 2003. His current research interests include aerospace mechanism and control. He is a Professor and Ph.D. advisor at the State Key Laboratory of Robotics and System, Harbin Institute of Technology. He won the second prize of National Award for Technological Invention.
Liang Ding received his Ph.D. degree in Mechanical Engineering from Harbin Institute of Technology, Harbin, China in 2010. His current research interests include control, simulation, and mechanics for mobile robots. He is an associate Professor and Ph.D. advisor at the State Key Laboratory of Robotics and System, Harbin Institute of Technology. He also is an IEEE member. He won the second prize of National Award for Technological Invention, and 2011 Excellence Award of Hiwin Doctoral Dissertation Award.
Pol D. Spanos is L.B. Ryon Chair in Engineering, Professor of Mechanical Engineering and Materials Science, and of Civil and Environmental Engineering, Rice University. His current research interests include digital signal processing, parameters identification, response determination, dynamical systems, performance and safety assessment for statics and dynamics problems.
Zongquan Deng is a vice-president, Professor and Ph.D. advisor at the State Key Laboratory of Robotics and System, Harbin Institute of Technology. His current research interests include robotics, planetary exploration rovers, aerospace mechanism and control.