Ensemble of surrogates with hybrid method using global and local measures for engineering design

Chen, Liming; Qiu, Haobo; Jiang, Chen; Cai, Xiwen; Gao, Liang

doi:10.1007/s00158-017-1841-y

Ensemble of surrogates with hybrid method using global and local measures for engineering design

RESEARCH PAPER
Published: 06 November 2017

Volume 57, pages 1711–1729, (2018)
Cite this article

Structural and Multidisciplinary Optimization Aims and scope Submit manuscript

Liming Chen¹,
Haobo Qiu¹,
Chen Jiang¹,
Xiwen Cai¹ &
…
Liang Gao¹

859 Accesses
28 Citations
Explore all metrics

Abstract

Surrogate models are usually used as a time-saving approach to reduce the computational burden of expensive computer simulations for engineering design. However, it is difficult to choose an appropriate model for an unknown design space. To tackle this problem, an effective method is forming an ensemble model that combines several surrogate models. Many efforts were made to determine the weight factors of ensemble, which include global and local measures. This article investigates the characteristics of global and local measures, and presents a new ensemble model which combines the advantages of these two measures. In the proposed method, the design space is divided into two parts, and different strategies are introduced to evaluate the weight factors in these two parts respectively. The results from numerical and engineering design cases show that the proposed ensemble model has satisfactory robustness and accuracy (it performs best for most cases tested in this article), while spending almost the equivalent modeling time (the additional cost is not more than 6.7% for any case tested in this article) compared with the combined global and local ensemble models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dung beetle optimizer: a new meta-heuristic algorithm for global optimization

Article 27 November 2022

Topology optimization of multi-scale structures: a review

Article Open access 08 March 2021

A Review of Multi-objective Optimization: Methods and Algorithms in Mechanical Engineering Problems

Article 21 October 2021

Abbreviations

d :: Number of design variables.
E _i :: Root generalized mean square cross-validation error of the i ^th surrogate.
e _ik :: Cross-validation error of the i ^th surrogate at the k ^th sample point.
$ {\widehat{f}}^{ens} $ :: Predictor of the ensemble.
$ {\widehat{f}}_i $ :: Predictor of the i ^th surrogate.
N :: Number of test points.
N _s :: Number of surrogates used in the ensemble.
n :: Number of sample points.
P _k :: Ratio of the global cross-validation error to the local cross-validation error at the k ^th sample point.
R ^o :: Outer region.
R ⁱ :: Inner region.
r _k :: Radius of the k ^th point’s inner region.
$ {r}_k^{\mathrm{max}} $ :: Euclidean distance between the k ^th sample point and the closest sample point.
S :: Sample points set.
WCVE :: Weighted cross-validation error.
w _i :: Normalized weight of the i ^th surrogate.
$ {w}_i^{\ast } $ :: Unnormalized weight of the i ^th surrogate.
w _ik :: Pointwise weight of the i ^th surrogate at the k ^th sample point.
x ^nearest :: Sample point which is nearest to the prediction point.
$ {\widehat{y}}_{ik} $ :: Response predicted by the i ^th surrogate at the k ^th point, the surrogate is constructed by using leave-one-out cross-validation.
y _k :: True response at the k ^th sample/test point.
$ {\widehat{y}}_k $ :: Prediction response at the k ^th sample/test point.
ρ :: Impact metric of local measure.

References

Acar E, Rais-Rohani M (2009) Ensemble of metamodels with optimized weight factors. Struct Multidiscip Optim 37(3):279–294
Article Google Scholar
Acar E (2010) Various approaches for constructing an ensemble of metamodels using local measures. Struct Multidiscip Optim 42(6):879–896
Article Google Scholar
Acar E (2015) Effect of error metrics on optimum weight factor selection for ensemble of metamodels. Expert Syst Appl 42(5):2703–2709
Article Google Scholar
Bishop CM (1995) Neural networks for pattern recognition. Oxford university press, Oxford
MATH Google Scholar
Box GE, Draper NR (1987) Empirical model-building and response surfaces, vol 424. Wiley, New York
MATH Google Scholar
Buckland ST, Burnham KP, Augustin NH (1997) Model selection: an integral part of inference. Biometrics 53:603–618
Article MATH Google Scholar
Cherkassky V, Shao X, Mulier FM, Vapnik VN (1999) Model complexity control for regression using VC generalization bounds. IEEE Trans Neural Netw 10(5):1075–1089
Article Google Scholar
Dixon LCW, Szegö GP (eds) (1978) Towards global optimisation. North-Holland, Amsterdam
Google Scholar
Forrester AI, Keane AJ (2009) Recent advances in surrogate-based optimization. Prog Aerosp Sci 45(1):50–79
Article Google Scholar
Forrester A, Sobester A, Keane A (2008) Engineering design via surrogate modelling: a practical guide. John Wiley & Sons, Chichester
Book Google Scholar
Goel T, Haftka RT, Shyy W, Queipo NV (2007) Ensemble of surrogates. Struct Multidiscip Optim 33(3):199–216
Article Google Scholar
Hardy RL (1971) Multiquadric equations of topography and other irregular surfaces. J Geophys Res 76(8):1905–1915
Article Google Scholar
Hoeting JA, Madigan D, Raftery AE, Volinsky CT (1999) Bayesian model averaging: a tutorial. Stat Sci:382–401
Jones DR (2001) A taxonomy of global optimization methods based on response surfaces. J Glob Optim 21(4):345–383
Article MathSciNet MATH Google Scholar
Kass RE, Raftery AE (1995) Bayes factors. J Am Stat Assoc 90(430):773–795
Article MathSciNet MATH Google Scholar
Madigan D, Raftery AE (1994) Model selection and accounting for model uncertainty in graphical models using Occam's window. J Am Stat Assoc 89(428):1535–1546
Article MATH Google Scholar
McKay MD, Beckman RJ, Conover WJ (1979) Comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21(2):239–245
MathSciNet MATH Google Scholar
Penrose R (1955) A generalized inverse for matrices. Math Proc Camb Philos Soc 51(03):406–413 Cambridge University Press
Article MATH Google Scholar
Powell MJD (1987) Radial basis functions for multivariable interpolation: a review. In: Mason JC, Cox MG (eds) Proceedings of the IMA conference on algorithms for the approximation of functions and data, Oxford University Press, London, pp 143-167
Queipo NV, Haftka RT, Shyy W, Goel T, Vaidyanathan R, Tucker PK (2005) Surrogate-based analysis and optimization. Prog Aerosp Sci 41(1):1–28
Article Google Scholar
Rennie JDM (2005) Volume of the n-sphere. Retrieved in April 2017, from http://people.csail.mit.edu/jrennie/writing/sphereVolume.pdf
Sacks J, Welch WJ, Mitchell TJ, Wynn HP (1989) Design and analysis of computer experiments. Stat Sci 4:409–423
Article MathSciNet MATH Google Scholar
Sanchez E, Pintos S, Queipo NV (2008) Toward an optimal ensemble of kernel-based approximations with engineering applications. Struct Multidiscip Optim 36(3):247–261
Article Google Scholar
Smola AJ, Schölkopf B (2004) A tutorial on support vector regression. Stat Comput 14(3):199–222
Article MathSciNet Google Scholar
Viana FA, Haftka RT, Steffen V (2009) Multiple surrogates: how cross-validation errors can help us to obtain the best predictor. Struct Multidiscip Optim 39(4):439–457
Article Google Scholar
Zerpa LE, Queipo NV, Pintos S, Salager JL (2005) An optimization methodology of alkaline–surfactant–polymer flooding processes using field scale numerical simulation and multiple surrogates. J Pet Sci Eng 47(3):197–208
Article Google Scholar
Zhou XJ, Ma YZ, Li XF (2011) Ensemble of surrogates with recursive arithmetic average. Struct Multidiscip Optim 44(5):651–671
Article Google Scholar

Download references

Acknowledgments

Financial support from the National Natural Science Foundation of China under Grant No. 51675198, 973 National Basic Research Program of China under Grant No. 2014CB046705 and National Natural Science Foundation of China under Grant No. 51421062 are gratefully acknowledged.

Author information

Authors and Affiliations

State Key Laboratory of Digital Manufacturing Equipment and Technology, School of Mechanical Science and Engineering, Huazhong University of Science & Technology, Wuhan, 430074, People’s Republic of China
Liming Chen, Haobo Qiu, Chen Jiang, Xiwen Cai & Liang Gao

Authors

Liming Chen
View author publications
You can also search for this author in PubMed Google Scholar
Haobo Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Chen Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Xiwen Cai
View author publications
You can also search for this author in PubMed Google Scholar
Liang Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Haobo Qiu.

Appendices

Appendix A: Description of the selected surrogate models

In this appendix, a brief overview of the mathematical formulations of PRS, RBF and KRG surrogate models is provided.

1.1 A.1 Polynomial response surface (PRS)

The PRS approximation is one of the most well-established surrogate models. The most commonly used PRS model is the following second-order form

$$ \widehat{f}(x)={\beta}_0+\sum \limits_{i=1}^d{\beta}_i{x}_i+\sum \limits_{i=1}^d\sum \limits_{j\ge i}^d{\beta}_{ij}{x}_i{x}_j $$

(27)

where d is the number of design variables, β ₀, β _i and β _ij are the unknown coefficients to be determined by the least squares technique. Here we run the MATLAB® routine “pinv” to obtain the Moore-Penrose generalized inverse matrix of the unknown coefficients, which was proved to be the optimal least square solution by Penrose (1955).

1.2 A.2 Radial basis function (RBF)

RBF models were originally developed to approximate multivariate functions. The general form of the RBF approximation can be expressed as

$$ \widehat{f}(x)=\sum \limits_{i=1}^n{w}_i\varphi \left(\left\Vert x-{x}_i\right\Vert \right) $$

(28)

where n denotes the number of sample points, w _i are the unknown coefficients to be determined, ‖·‖ represents the Euclidean norm and φ(·) is the so-called basis function. Powell (1987) suggested several forms of the basis function φ(·):

Gaussian $ \varphi (r)={e}^{-\frac{r^2}{2{\sigma}^2}} $
ultiquadric $ \varphi (r)=\sqrt{r^2+{\sigma}^2} $
nverse Multiquadric $ \varphi (r)=1/\sqrt{r^2+{\sigma}^2} $
Thin-Plate Spline φ(r) = r ² ln(r)

where σ ≥ 0. In this article we use the multiquadric basis function with σ = 1 (suggested by Acar and Rais-Rohani (2009)), for its prediction accuracy and convergence ability with increased sample points. In order to obtain the unknown coefficients w _i, we substitute the n sample points into the (28) to form an equation as

$$ y=\varPhi \cdot w $$

(29)

where y is the vector of sample responses and Φ is an n × n matrix of basis functions. The coefficients vector w is obtained by solving (29).

1.3 A.3 Kriging (KRG)

The basic assumption of KRG is the estimation of the response in the form

$$ Y(x)=\mu (x)+Z(x) $$

(30)

where the response Y consists of a known polynomial μ(x) which globally approximates the trend of the function and a stochastic component Z(x) which generates deviations, so that the Kriging model interpolates the sample points. The correlation between the random variables Y(x⁽ⁱ⁾) and Y(x^(j)) is given by

$$ Corr\left[Y\left({\mathrm{x}}^{(i)}\right),Y\left({\mathrm{x}}^{(j)}\right)\right]=\exp \left(-\sum \limits_l^d{\theta}_l{\left|{x}_l^{(i)}-{x}_l^{(j)}\right|}^{p_l}\right) $$

(31)

where d is the number of design variables, θ _l and P _l(l = 1, ⋯, d) are unknown parameters to be estimated. Here we only consider a constant term to represent the mean of the overall surface (the Ordinary Kriging) and fix the parameters P _l = 2(l = 1, ⋯, d) (the stationary Gaussian correlation function case). Then we search the optimal θ _l in the range of [10⁻³, 10²] (suggested by Forrester et al. (2008)) with the GA (Genetic Algorithm) toolbox of MATLAB®.

Once the correlation function has been selected, the response is predicted as

$$ \widehat{y}=\widehat{\mu}+{r}^T{R}^{-1}\left(y-\mathbf{1}\widehat{\mu}\right) $$

(32)

where the matrix R ⁻¹ is the inverse of the correlation matrix R whose element R _ij is equal to the (31), y is the vector of sample responses and 1 represents an n × 1 vector of ones. The estimated value of $ \widehat{\mu} $ and the expressions of r ^T are

$$ \widehat{\mu}=\frac{{\mathbf{1}}^T{R}^{-1}y}{{\mathbf{1}}^T{R}^{-1}\mathbf{1}} $$

(33)

$$ {r}^T=\left( Corr\left[Y\left({x}^{(1)}\right),Y(x)\right]\kern0.5em \cdots \kern0.5em Corr\left[Y\left({x}^{(n)}\right),Y(x)\right]\right) $$

(34)

Detailed derivation of Kriging can be found in Jones (2001) and Forrester et al. (2008).

Appendix B: Description of the numerical test functions

In this appendix, the description of six numerical test functions is provided. The landscapes of two-variable functions are depicted in Figs. 6 and 7.

1.1 B.1 Branin-Hoo function

$$ f(x)={\left({x}_2-\frac{5.1{x}_1^2}{4{\pi}^2}+\frac{5{x}_1}{\pi }-6\right)}^2+10\left(1-\frac{1}{8\pi}\right)\cos \left({x}_1\right)+10 $$

(35)

where x ₁ ∈ [−5, 10] and x ₂ ∈ [0, 15]_.

1.2 B.2 Camelback function

$$ f(x)=\left(4-2.1{x}_1^2+\frac{x_1^4}{3}\right){x}_1^2+{x}_1{x}_2+\left(-4+4{x}_2^2\right){x}_2^2 $$

(36)

where x ₁ ∈ [−2, 2] and x ₂ ∈ [−2, 2]_.

1.3 B.3 and B.4 Hartman functions

$$ f(x)=-\sum \limits_{i=1}^4{c}_i\exp \left[-\sum \limits_{j=1}^n{a}_{ij}{\left({x}_j-{p}_{ij}\right)}^2\right] $$

(37)

where x _i ∈ [0, 1]. Two types of Hartman functions are given based on different number of input variables: (1) Hartman-3 with three input variables (test function 3), and (2) Hartman-6 with six input variables (test function 4). While the parameter c in each function is the same vector $ {\left[1\kern0.5em 1.2\kern0.5em 3\kern0.5em 3.2\right]}^T $, the other two parameters a and p are shown in Table 5 and 6.

Table 5 Parameters used in Hartman-3 function, j = 1, 2, 3

Full size table

Table 6 Parameters used in Hartman-6 function, j = 1, 2, ⋯, 6

Full size table

1.4 B.5 Extended-Rosenbrock function

$$ f(x)=\sum \limits_{i=1}^{m-1}\left[{\left(1-{x}_i\right)}^2+100{\left({x}_{i+1}-{x}_i^2\right)}^2\right] $$

(38)

where x _i ∈ [−5, 10], i = 1, 2, ⋯, m = 9.

1.5 B.6 Dixon-Price function

$$ f(x)={\left({x}_1-1\right)}^2+\sum \limits_{i=2}^mi{\left[2{x}_i^2-{x}_{i-1}\right]}^2 $$

(39)

$$ {x}_i\in \left[-10,10\right],i=1,2,\cdots, m=12. $$

(where)

Appendix C: Test results for determining the form of ES-HGL

In this appendix, the test results referenced in Section 3.3 are provided in Table 7, 8 and 9.

Table 7 Test results of different region radius forms

Full size table

Table 8 Test results of different HWF forms

Full size table

Table 9 Test result for evaluating the effect of the hybrid method

Full size table

1.1 C.1 Test result for determining the form of the region radius

1.2 C.2 Test result for determining the form of the HybridWeight Factor (HWF)

1.3 C.3 Test result for evaluating the effect of the hybrid method

Appendix D: Boxplots for six numerical examples (Figures 8, 9, and 10)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, L., Qiu, H., Jiang, C. et al. Ensemble of surrogates with hybrid method using global and local measures for engineering design. Struct Multidisc Optim 57, 1711–1729 (2018). https://doi.org/10.1007/s00158-017-1841-y

Download citation

Received: 27 April 2017
Revised: 05 September 2017
Accepted: 19 October 2017
Published: 06 November 2017
Issue Date: April 2018
DOI: https://doi.org/10.1007/s00158-017-1841-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Ensemble of surrogates with hybrid method using global and local measures for engineering design

Abstract

Access this article

Similar content being viewed by others

Dung beetle optimizer: a new meta-heuristic algorithm for global optimization

Topology optimization of multi-scale structures: a review

A Review of Multi-objective Optimization: Methods and Algorithms in Mechanical Engineering Problems

Abbreviations

References

Acknowledgments