A transformation-based discretization method for solving general semi-infinite optimization problems

Discretization methods are commonly used for solving standard semi-infinite optimization (SIP) problems. The transfer of these methods to the case of general semi-infinite optimization (GSIP) problems is difficult due to the x\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf {x}$$\end{document}-dependence of the infinite index set. On the other hand, under suitable conditions, a GSIP problem can be transformed into a SIP problem. In this paper we assume that such a transformation exists globally. However, this approach may destroy convexity in the lower level, which is very important for numerical methods. We present in this paper a solution approach for GSIP problems, which cleverly combines the above mentioned two techniques. It is shown that the convergence results for discretization methods in the case of SIP problems can be transferred to our transformation-based discretization method under suitable assumptions on the transformation. Finally, we illustrate the operation of our approach as well as its performance on several examples, including a problem of volume-maximal inscription of multiple variable bodies into a larger fixed body, which has never before been considered as a GSIP test problem.


Introduction
In the present paper we consider general semi-infinite optimization problems of the following form: where Y : R m ⇒ R n is a set-valued mapping, |Y (x)| = ∞ for at least some x ∈ X , I := {1, ..., p} is a finite index set and f , g i , i ∈ I , are real-valued and at least continuous functions.By X we summarize the finite restrictions on the decision variables.Moreover, we assume that X is non-empty and closed and for every x ∈ X the so-called infinite index set Y (x) is closed and compact.
The infinite index set can, e.g., be given as the solution set of finitely many inequalities: where J := {1, . . ., q} is a finite index set and u j , j ∈ J , are again real-valued and at least continuous functions.For the special case, where Y (x) does not depend on x, the problem above is called a standard or ordinary semi-infinite optimization problem and is abbreviated by SIP.
One of the keys for both the theoretical and numerical treatment of semi-infinite optimization problems lies in their bi-level structure.The parametric lower level problems of a semi-infinite problem are given by Q i (x) : max y∈R n g i (x, y) s.t.y ∈ Y (x). (1) The decision variables x of the semi-infinite problem are the parameters of the lower level problems and the index variables y of the semi-infinite problem are the decision variables of these problems.By we denote the so-called optimal value functions of the semi-infinite optimization problem.Obviously, but the functions ϕ i , i ∈ I , are only given implicitly and may be non-differentiable even if the functions g i , i ∈ I , and u j , j ∈ J , are linear.A natural and widespread solution technique for standard semi-infinite optimization problems is discretizing the infinite index set Y , solving the induced finite optimization problem, and refining the discretization (see e.g.Blankenship and Falk 1976;Reemtsen 1991;Reemtsen and Goerner 1998;López and Still 2007).The direct transfer of this approach to the case of a GSIP problem is possible in principle, but problematic: One problem in this case is the x-dependency of the infinite index set.Thus, the discretization has to depend on x, too.To ensure the closedness of the feasible set of the discretization-induced optimization problems, the discretizations must even depend at least continuously on x.How such a discretization can be constructed is shown in Still (2001b).Due to additional conditions (see Still 2001b), which must be met for convergence of these methods in the case of GSIP problems, they are difficult to implement.And, to our knowledge, there are no numerical experiences with these discretization methods so far.
On the other hand, under suitable assumptions, any general semi-infinite optimization problem can be at least locally converted into a standard one (see Weber 1999;Still 1999).Such a transformation is of practical value only in cases where it is defined globally.We assume in the following that such a global transformation is given.For problems which have a geometric background, i.e. the infinite index set is a geometrically simple body or arises from geometrically simple bodies via intersection or union, such globally defined transformations can often be specified (see Floudas and Stein 2007;Steuermann 2011).Furthermore, these examples often have convex lower level problems.Unfortunately, the transfer to a SIP can destroy convexity of these problems.Then, heavy-weighted methods of global optimization must be applied to solve the resulting SIP (see Steuermann 2011).
We present in this paper a numerical method for solving general semi-infinite optimization problems, which can maintain convex lower-level problems.It combines the transformation into a standard semi-infinite problem with discretization techniques for such problems.In the algorithm we switch between two problems: the original GSIP problem and the SIP problem induced by the transformation.Starting from a given initial point, an initial discretization of the infinite index set of the GSIP problem is calculated and transferred into a discretization of the infinite index set of the induced SIP problem.Subsequently, a sequence of discretized SIP problems is solved, in which the discretization is refined from step to step.However, the lower level problems of the induced SIP problem are not solved for refinement, rather the lower level problems of the GSIP problem are solved.Then, the global solutions of these lower level problems are transferred into points of the index set of the induced SIP problem.It can be shown that the latter ones are global solutions, too.This is necessary for the correctness of the discretization method.
Numerical examples show that one only needs to add relatively few points to achieve feasibility within a given tolerance.Therefore this method is also suited for complex problems like the inscription of multiple bodies into a fixed design.
Early convergence results for discretization methods solving SIP problems are for example given by Reemtsen (1991) and Reemtsen and Goerner (1998).These results focus on the convergence to feasible points and the convergence of global solutions.Results regarding convergence rates of local solutions were presented by Still (2001a).We will show in Sect. 3 how these results carry over to the transformation-based discretization method.Moreover, we add an assumption to the main statement in Still (2001a) and present a counterexample for the case that the assumption is not fulfilled.This assumption is missing in Still (2001a).
A different reformulation approach to a SIP problem and subsequent discretization is followed by Mitsos and Tsoukalas (2015).There, a GSIP is reformulated using disjunctive semi-infinite constraints.The problem is then used to construct better upper and lower bounds for the original problem iteratively.This is done by a discretization strategy which adaptively adds discretization points.The disjunctive programs resulting from the discretization are solved globally using a solver for mixed integer nonlinear problems.In Djelassi et al. (2019) the strategy is extended to problems with equality constraints in the lower level.Instead of using a reformulation to a mixed integer program, in Kirst and Stein (2019) the authors develop tailored strategies to solve the disjunctive problems.An early attempt to globally solve GSIP problems was followed by Lemonidis (2008).In this work an inner and an outer approximation of the variable index set is calculated which doesn't depend on the optimization variables anymore.Then, the same techniques as presented in Bhattacharjee et al. (2005) are applied to the standard semi-infinite optimization problems.These techniques provide lower and upper bounds, which can be used for global optimization.Furthermore, in Lemonidis (2008) it is shown for the SIP application kinetic model reduction (KMR) how GSIP problems with box-shaped infinite index sets can be transferred into an equivalent SIP problem by a linear coordinate transformation (convex combination of the lower and upper bounds).
Our focus is different from these approaches.We try to avoid a reformulation using disjunctive constraints.Our approach allows to maintain helpful structure, e.g.convexity of the lower level problems, which comes with the GSIP problem.And, we try to avoid solving problems by global solvers.
The paper is organized as follows.In Sect. 2 we show how a GSIP problem can be reformulated into a SIP one.We point out why it is beneficial to solve the lower level problem of the given GSIP.Then, we introduce the algorithm.Section 3 contains results about convergence to a feasible point or global solutions, along with rates of convergence.Section 4 discusses the conditions necessary to retain convexity of the induced SIP lower level problems and of the discretized SIP problems.In Sect. 5 we illustrate the application of our method and its performance on three classes of examples: some native GSIP problems, a design centering problem, and a problem of inscribing two (or more) variable bodies volume-maximal into a larger fixed container.Section 6 concludes the paper and points out future directions of research.

The transformation-based discretization method (TDM)
Transformation: In principle, under suitable assumptions, each GSIP problem can be transformed, at least locally, into an equivalent SIP problem (see Still 1999;Weber 1999 for details).However, such a transformation is only of practical value in cases where this transformation is defined globally.The ideal situation is the following: Assumption 1 (Existence of transformation) There exist a nonempty, compact set Z ⊆ R ñ and a continuous function t : Given Assumption 1, for every i ∈ I , the feasibility condition g i (x, y) ≤ 0 for all y ∈ Y (x), can be equivalently written as which is of ordinary semi-infinite type.Thus, the feasible sets of the GSIP problem and the induced SIP problem coincide.For the rest of the paper we assume that Assumption 1 is fulfilled.
For an interval, or more general box-shaped, index set , such a transformation can be easily constructed by convex combination of the interval ends resp.box vertices.In the case of higher dimensional index sets such a transformation exists for example, if they are star-shaped (see Still 1999 for details).But often the application itself already suggests such a transformation [see for example the design centering problems in Steuermann (2011)].

Remark 1
In general, it won't be possible to parametrize the set Y (x) by one function.However, it is sufficient for the following considerations, that the set Y (x) can be partitioned such that for every subset Y l (x), l ∈ {1, ..., s}, a set Z l and a function t l according to Assumption 1 exist.The results and the algorithm presented in the following carry over to this case directly.
We denote the transformation-induced standard semi-infinite optimization problem by SIP : where the functions gi are defined as in (2). By we denote the lower level problems and the optimal value functions of the (transformation-)induced SIP problem, respectively.We show in the following that the optimal value functions φi and ϕ i coincide for every i ∈ I .
Discretization: A widespread approach for solving SIPs is discretizing the infinite index set.Therefore, for a subset Ż of Z , we introduce the optimization problem SIP( Ż ) : Discretization methods which adaptively add points to the discretization are typically very successful.An algorithm for SIP problems using this methodology was first introduced by Blankenship and Falk (1976).The points which are added are the global solutions of the lower level problem.This is important for guaranteeing feasibility.Unfortunately, the transformation may destroy given convexity in the lower level problem (see Kaplan and Tichatschke 1997;Still 2001b).More precisely, if g i (x, y) is convex in y, gi (x, z) can be non-convex in z.We illustrate this effect by the following example.
Example 1 We consider a so-called squircle as infinite index set, i.e.
Furthermore, let g 1 (x, y) := y 2 − y 2 1 ≤ 0 be the semiinfinite constraint.Obviously, Y (x) is a convex set for all x and g 1 (x, y) is concave in y (for all x).Hence, for all x the (lower level) problem Q 1 (x) : max y∈Y (x) g 1 (x, y) is a convex one (see Fig. 1a for a graphical visualization).The representation of a circle as image of the set [0, 1] × [0, π] can be generalized to this case (see for example Jaklic et al. 2000): is not concave in z for all x, e.g.not in x = (0.8, 0.5, 1) T as one can see in Fig. 1b.There are two different local solutions for the transformed lower level problem Q1 (x) : max z∈Z g(x, z).Hence, the lower level problems of SIP may be non-convex in general.
The loss of convexity can cause numerical difficulties in solving the lower level problems.In the worst case time-consuming global optimization routines have to be used to solve these problems.That's why we consider the original convex lower level problems in the algorithm.Then, we can map these solutions to solutions of the The following lemma states that this is a feasible approach.From it also follows that the transformation does not change the feasible set defining functions ϕ i , i ∈ I .
Lemma 1 Let x ∈ X and i ∈ I .The point z * ∈ Z is a global solution of Qi (x) if and only if y * = t(x, z * ) is a global solution of Q i (x).
Proof Let y * be a global solution of Q i (x), i.e. g i (x, y * ) ≥ g i (x, y) for all y ∈ Y (x).This relation is under for all z ∈ Z .Thus, z * is a global solution of Qi (x).The reverse applies under analogous considerations and the surjectivity of t.
The above lemma states that one can calculate a global solution of the generally non-convex problem Qi (x) by computing a global solution of the convex problem Q i (x) and mapping this one via the transformation t(x, •) into Z .Using an adaptive discretization as in Blankenship and Falk (1976), these considerations lead to the following algorithm: Algorithm 1 Transformation-based discretization method (TDM) for GSIP problems 1: Choose a starting point x 0 ∈ X and a tolerance α ≥ 0. 2: Choose/Compute an initial discretization Ẏ0 x 0 ⊆ Y x 0 and determine Ż0 such that t x 0 , Ż0 = Ẏ0 x 0 .
3: Set k = 0. 4: repeat 5: Compute a solution x k+1 of the discretized SIP problem SIP Żk using x k as starting point.6: Let Z new = ∅.7: for i ∈ I do 8: Compute a global solution y k+1 i of Q i x k+1 .9: end if 12: end for 13: Set Żk+1 = Żk ∪ Żnew and replace k by k + 1.
If, in Step 2, an initial discretization Ẏ0 x 0 of Y x 0 is not at hand, one can be obtained by solving the lower level problems and transferring the solutions (see Steps 8 and 10 ).A feasible starting point for Step 8 can be calculated using the transformation t(x, •) from any feasible point of Z .
While we demand to calculate a global solution in Step 8, we can either calculate a global or a local solution in Step 5.As we will see in the proof of Theorem 1, the global solution of the lower level problem is important to guarantee feasibility.The type of solution of the discretized problems determines the type of solution calculated for the GSIP problem.
If the feasible set of SIP Żk becomes empty, also the feasible set M of the GSIP problem is empty and Algorithm 1 can be stopped.
With regard to one of the main applications of semi-infinite optimization, the inscribing of one or multiple designs, the infinite index sets are often simple geometric objects, such as (hyper)ellipsoids or (hyper)boxes, but also objects composed of geometrically simple objects.For these, (re-)parameterization, e.g. in the form of polar, cylindrical or spherical coordinates, are often well known.On the other hand, infinite index sets, especially in this context, can often be written as a translation, rotation, or scaling of fixed objects.Hence, the transformation is known, too.Finally, under suitable conditions, there are "generic" transformations.For example, any starshaped set can be described by the star center and a radius (function) depending on the distance to the center and the direction.In Still (1999) conditions are stated under which such a constructive transformation exists for infinite index sets.

Convergence results
There is a huge variety of convergence results for adaptive discretization methods solving ordinary semi-infinite optimization problems.The goal of this section is to show that the results can be carried over to the transformation-based discretization method under suitable assumptions on the transformation.
We start by assuming that we can guarantee finding global solutions in Step 5 of Algorithm 1. Similar results regarding the convergence in the case of ordinary semi-infinite optimization problems can be found for example in Blankenship and Falk (1976), Reemtsen (1991) and Reemtsen and Goerner (1998).The next theorem summarizes the convergence properties for GSIP problems.
Theorem 1 Let the (initial) feasible set M( Ż0 ) be compact and the feasible set M of GSIP be non-empty.Let the tolerance α = 0.Then, either Algorithm 1 terminates after a finite number of steps, or it holds: Proof For the discretizations Żk , k ∈ N 0 , of Z calculated by Algorithm 1, the following relation is valid: Hence, it holds for the feasible sets of the discretized SIP-problems SIP( Żk ), k ∈ N 0 , and the feasible set of SIP: The functions gi , i ∈ I , are continuous as compositions of continuous functions and, hence, the sets M( Żk ), k ∈ N 0 , and M are closed sets.Because of (3) they are bounded and, thus, are compact.Because of its continuity the function f takes its minimum on the sets M = M and M( Żk ), k ∈ N 0 .Hence, assertion (i) is shown.For (ii) note that x k and z k i , i ∈ I , are sequences in M( Ż0 ) and Z respectively.Both sets are compact by assumption.Hence, there is at least one accumulation point.For every converging subsequence {x k l } l∈N we can choose a subsequence such that z k l i , i ∈ I is convergent as well.Using continuity of gi and optimality of z k l i for Qi (x k l ), we have for i ∈ I and z ∈ Z Thus, every accumulation point is feasible.Moreover, because x ∈ M ⇒ x ∈ M( Żk l ) by ( 3), we also have for x ∈ M, which shows optimality.
The remaining assertions (iii) and (iv) follow easily by (ii) and relation (3).
If the SIP-defining functions f and gi , i ∈ I , are non-convex in x, it can be costly to guarantee the calculation of global solutions.In the literature there also exist results regarding the case, that only local solutions can be calculated (see, e.g., Reemtsen 1994).The same techniques can be used here.Therefore, assume that for every k ∈ N 0 , the current iterate x k is a local minimizer of SIP( Żk ) with radius r k > 0, i.e. for all x in M( Żk ) with x − x k < r k the following holds: If the radii do not converge towards zero, i.e. r := inf k∈N 0 r k > 0, then every accumulation point x * of the sequence {x k } k∈N 0 is a local minimizer of GSIP.This can be proven by restricting the analysis to a ball of radius r /2 around x * .For large k, the current iterates are global solutions within this ball.
These results and the assertions of Theorem 1 hold only for a limit point.We have introduced a parameter α as a finite termination criterion.The next Lemma shows that this is a valid approach.
Lemma 2 Again let the (initial) feasible set M( Ż0 ) be compact.For a tolerance α > 0 Algorithm 1 terminates after finitely many steps.
Proof Assume that the contrary is the case.In this case in every iteration the algorithm has to add at least one point to the discretization.This means that there is an i ∈ I and sequences and for all k > k: As Z and M( Ż0 ) are both compact there are convergent subsequences with: Because t is continuous, also gi is continuous.Together with (4) and ( 5) we have which is a contradiction.Hence, no such sequences can exist and the algorithm terminates after a finite number of steps.
The following question arises: After k steps, what is the distance between the obtained solution and a local solution of the GSIP problem.In Still (2001a) the distance is bounded for a sequence of solutions for finer discretizations.The transfer of these results will be the topic of the remainder of this section.
To do so, we have to strengthen the initial assumptions.First, in Assumption 2 we assume that the iterates converge towards a strict local minimizer.Second, in Assumption 3 we assume some differentiability of the functions g i , i ∈ I , and t as well as an extension of the Mangasarian-Fromovitz Constraint Qualification (MFCQ).Finally, we assume in Theorem 2 that the objective function is locally Lipschitz continuous.
A point x * ∈ M is called a strict local minimizer of order ρ = 1 or ρ = 2, if there is a q > 0 and a neighborhood U such that for every We now assume that the sequence generated by Algorithm 1 converges to a local solution: Assumption 2 Let x * be a strict local minimizer of order ρ = 1 or ρ = 2 of GSIP in the neighborhood U and {x k } k∈N 0 a sequence of local minimizers with radius r k calculated by the transformation-based discretization method (TDM).Furthermore, assume that The assumption presented here is very similar to the main assumption made in Still (2001a).The difference is the assumption of non-vanishing radii (6) of the local minimizers.As we will show in Sect.3.1 this is a necessary assumption.
For the construction of a feasible point a MFCQ-like condition for SIP is needed.Therefore, we ask under which conditions, the functions D 1 gi , i ∈ I , are continuous on U x * × Z , where U x * is a neighborhood of x * .
Lemma 3 Let the functions g i (x, y), i ∈ I , be differentiable w.r.t.(x, y) and t(x, z) be differentiable with respect to its first argument x.Let U x * be a neighborhood of x * .We assume that Then, the functions gi , i ∈ I , are differentiable w.r.t.x around x * and this derivative is given by: Before introducing Theorem 2, we collect the required conditions in an assumption.
Assumption 3 Let the functions g i (x, y), i ∈ I , be differentiable w.r.t.(x, y) and t(x, z) be differentiable in its first argument x.Assume, that there exists a neighborhood U x * of x * such that: (iii There exists a vector ξ ∈ R m such that for all i ∈ I : where For a sequence {x k } k∈N 0 calculated by Algorithm 1 we denote the current violation for k ∈ N 0 by: Now, we can bound the difference of the current iterate to the limit point.
Theorem 2 Let Assumptions 2 and 3 hold and the objective function f of GSIP be Lipschitz-continuous near x * .
(i) There is a constant c 1 > 0 and a k ∈ N 0 such that (ii) There is a constant c 2 > 0 and a k ∈ N 0 such that We have added Eq. 6 to the assumptions.In Sect.3.1 we will show that this additional assumption is needed and the statement is wrong without this assumption.That's why we give a complete proof and don't reduce the statement to the statements in Still (2001a) Proof Construction of a feasible point: In a first step we construct a feasible point xk .To do so, we move the current iterate towards feasibility: For k ∈ N 0 and t ∈ [0, 1] let where ξ is chosen according to Assumption 3.iii.We show next that the point xk (1) is feasible.Firstly, we consider indices z ∈ Z which are close to an active index.
For every i ∈ I the function D 1 gi (•, •) is by Assumption 3 continuous and Z is compact.Thus, there is some ε such that for every i ∈ I it holds Choose k 1 large enough such that By the mean value theorem there is for every i It remains to consider the indices which have a given distance to the active indices.
As for i ∈ I , the set Z \Z i,ε 0 (x * ) is compact and the functions gi (•, •) are continuous, there is a ε 2 such that gi (x, z) < 0, for z ∈ Z \Z i,ε 0 (x * ) and x − x * 2 < ε 2 .Choose k 2 large enough such that Now the two claims follow easily: Proof of i): By Assumption 2 the radii r k of the local minimizers x k don't converge to 0. Thus for large k the distance of the local solution x * to x k is smaller than r k and we have Again, for large k, the feasible point xk is contained in U .By the Lipschitz-continuity of f , there is a constant L such that for large k which shows the first assertion.Proof of ii): By Assumption 2 the limit point x * is a local solution of order ρ.Thus for large k: where we've used Eqs.( 7) and ( 8) for the third and fourth inequality.

Counterexample for rate of convergence
We have added the assumptions of non-vanishing radii (6) for the above convergence result.The following example shows that without this assumption the convergence rate may be arbitrarily bad.
We construct the example within two steps.First, we present a Lipschitz continuous function on X = [−1, 1] 2 with an infinite number of local solutions.
Therefore, let {c (i) } i∈N with Around every c (i) we choose a circle U i with radius r i := 1 10 i+1 : Note that the circles U i are disjoint.We introduce the objective function f : X → R by: We summarize the properties of the objective function in the following lemma: Lemma 4 For the above construction, the following is true: For every i ∈ N the center point c i is a strict local minimum of f with radius r i .
Proof (i) It is easy to see that we can write f for every x ∈ X as As every single function is Lipschitz continuous with Lipschitz constant 3, the minimum is again Lipschitz continuous.(ii) Consider for i ∈ N a point x ∈ U i with x = c (i) .The triangle inequality yields We consider the following standard semi-infinite problem: As transformation we choose the identity.Then, we have Z = Y .We investigate the structure of the problem in the next lemma.

Lemma 5 Problem SIP ex fulfills the following properties:
(i) The feasible set is given by (ii) The origin x * = (0, 0) is a local minimizer of order 1. (iii) There exists a vector ξ ∈ R 2 such that D 1 g(x * , y 0 ) • ξ ≤ −1 for every y 0 ∈ Y with g(x * , y 0 ) = 0. Proof (i) Let x ∈ X .The solution of the lower level is given by y = x 2 .Thus, max y∈Y g(x, y) = x 1 .
(ii) On the feasible set M the objective function coincides with the norm, which clearly has a local minimum of order 1 in x * = (0, 0).(iii) As by part (i) the only solution of the lower level for x * is given by y 0 = 0, we have with ξ := (−1, 0) If we take the strict local solution c (1) as initial point and no point as initial discretization for Algorithm 1, the first point added to the discretization will be 1 1 .The next element of the sequence of strict local minimizers is still in M( Ż1 ) thus we can choose c (2) as next iterate.Proceeding this way yields in the k-th iteration the local minima c (k+1)  as iterate x (k+1) and the discretization The constraint violation after the k-th step is max y∈Y g(x (k+1) , y) = 1 10 k+1 , but the distance to limiting solution is greater than 1 k+1 , which violates the convergence rate given in Theorem 2.

Remark 2 (i) The second component of the local solutions 1
i can be replaced by an arbitrarily slow converging sequence.For this reason, there can be an arbitrarily bad rate of convergence.(ii) We presented this example in the context of an adaptively chosen discretization.
The same example can also be used to construct a counterexample for uniformly chosen discretization points.Thus, this example shows that the assumption of non-vanishing radii ( 6) is missing in Still (2001a).

Convexity preserving transformations
Of course not all transformations destroy the convexity in the lower level problems.The topic of this section outlines under which conditions convexity properties can be maintained.We begin with the lower level problem.
The lower level problems Qi (x), i ∈ I , of the transformation-induced SIP problem are convex for all x ∈ X , if the transformation-induced functions gi (x, z), i ∈ I , are concave in z for all x and the set Z is convex.
A class of mappings, which map convex sets onto convex sets and preserve concavity, are the affine-linear ones.Therefore, it holds: Lemma 6 For all x ∈ X let the functions g i (x, •), i ∈ I , be concave and the set Z convex.Furthermore, let the transformation t be affine-linear in its second argument, i.e.

t(x, z)
where the mappings A : X → R n× ñ and b : X → R n are at least continuous for all x ∈ X .Then, the problems Qi (x), i ∈ I , are convex for all x ∈ X.
Proof As the set Z is by definition convex, it only remains to prove the concavity of the objective function of the lower level problem.We show the concavity by checking the definition.Therefore fix x ∈ X and consider After the lower level problems, we now consider the discretized problems solved in Step 5.For the computation of global solutions, it is of great value to have convex discretized problems, i. e., the functions gi (x, z), i ∈ I , (and the function f ) are convex in x (for all z).The following example illustrates that the transformation can destroy this property as well.
Example 2 Let the infinite index set be a semi-circle in arbitrary position with variable radius, which can be modeled as follows: The mapping, by which the semi-circle can be represented as image of the set with atan2 being the bivariate arctangent atan2 : R 2 \{0} → R, Obviously, the sets Y (x), x ∈ X , as well as the set Z are convex.We consider the function g(x, y) = x 2 6 − y 1 + y 2 , which is convex in x and y.However, the function g(x, z) := g(x, t(x, z)) is not convex in x for all z ∈ Z .For example with x 1 = (0, 0, 1, 1, −1, 0) T , x 2 = (0, 0, 1, 1, 0, 0) T and z = (1, 0) T one gets: The next lemma states, under which conditions on the functions g i , i ∈ I , and the transformation t, convexity in x is preserved.It follows by the fact that the composition of a convex and a linear mapping is convex again.
Lemma 7 Let the functions g i (x, y), i ∈ I , be convex in (x, y).Furthermore, let the transformation t be affine-linear in its first argument, i.e.

t(x, z)
where A : R ñ → R n×m and b : R ñ → R n .Then the functions gi (x, z), i ∈ I , are convex in x for any z ∈ Z.
Obviously, the rotation used in Example 2 does not satisfy condition (10).However, for translation, scaling, and shearing condition (10) is satisfied.

Numerical examples
In this section we apply Algorithm 1 to some native GSIP problems and two GSIP problems stemming from an important application of semi-infinite optimization, namely design centering.We illustrate how the arising problems are performantly solved by the proposed algorithm.A more detailed numerical analysis of the transformationbased discretization method as well as its application to a real world problem, the volume-maximal utilization of gemstones, can be found in Schwientek (2013).Note that the following numerical examples shouldn't be a full comparison to previously developed algorithms.Instead the examples illustrate that even complicated problems like the inscribing of two designs in a container can be solved in an easy way.
We implemented Algorithm 1 in Matlab R2016a.In Step 1 we chose α = 10 −6 as the termination tolerance.For the solution of the discretized SIP problems SIP( Żk ), k ∈ N 0 , (Step 5) as well as for the solution of the convex lower level problems Q i (x k ), i ∈ I , k ∈ N 0 , (Step 8) we used the SQP algorithm of the routine fmincon of the Optimization Toolbox V7.4 (R2016a) with default settings and using first derivatives.The computations have been performed on a 64-bit Windows machine with a Intel ® Core™i7 5600U processor and 8 GB RAM.The time needed for the computation have been measured using the Matlab function timeit.

Some native examples
In the first numerical example we consider the general semi-infinite optimization problem outlined in Lemonidis (2008), Example 1 (see also Jongen et al. 1998, Example 4-2).We added the constraint x 1 ≥ 0, otherwise the infinite index set Y (x) could be empty.
The infinite index set Y (x) can be explicitly calculated and is equal to is convex (in y), but the corresponding optimal value function ϕ and its optimal solution x * = (0, 0).By the simple transformation GSIP 1 can be transferred into a standard semi-infinite optimization problem, whose discretized problems are convex.We chose x 0 = (1, 1) as starting point ( f 0 = 25/16) and the point 0.5 as initial discretization.After 2 iterations (0.12 s) Algorithm 1 stopped with x * = (0, 0) and f * = 0.0625.The number of finite constraints induced by discretization grew from 1 to 2.
The global solution of the lower level problem is given by the lower bound of the interval In the first iteration the discretization point z = 1 is added.The chosen transformation maps this discretization point to the global solution of the lower level.This means that after the first iteration the discretized problem SIP( Żk ) and the GSIP problem have the same feasible set.
We tested our new algorithm on the other 15 examples collected by Lemonidis (2008), too.This test set was also used in the recent publications (Mitsos and Tsoukalas 2015;Kirst and Stein 2019).However, to apply Algorithm 1 we need to ensure that the central Assumption 1 is satisfied.This means in particular that the infinite index set Y (x) must be non-empty for every x ∈ X .Unfortunately, most of the examples collected in Lemonidis (2008) do not satisfy this condition, namely Examples 1,2,[4][5][6][7][8][9]11,13,and 16. However,in most of these examples (1,2,[6][7][8]11,and 13) it is possible to meet this condition by adding a single constraint.For the remaining examples (4, 5, 9, and 16) it is as follows.As shown by Lemonidis (2008), the infinite index set in Example 4 is either empty or the semi-infinite constraint can not be satisfied.For the Examples 5, 9, and 16 we could not easily derive a transformation.That's why we excluded these four examples from our numerical evaluations.The considered original and modified examples together with their transformation are listed in "Appendix 1".
In most of the taken examples the infinite index set is given by an interval.Then, a transformation can easily be constructed by convex combination of the interval ends.In Example 3 the infinite index set can be obtained by scaling the unit ball by the radius.A possible transformation for Example 6 is given by In Example 14 the infinite index set is a two-dimensional box.Thus, the Cartesian product of the convex combination of the lower and upper bounds is a suitable transformation.
In the 12 examples for which the transformation-based discretization method is applicable the algorithm stopped after two iterations.The time needed to solve the problems ranges from 0.08 to 0.14 s.The reasons for the small number of iterations are the same as described for the first example above.
In all examples, except Example 13, Algorithm 1 found the optimal solution reported in the literature.In Example 13 the solution can not be attained, because we removed it by adding the mentioned constraint to avoid an empty index set.The transformation-based discretization method then finds the point x * = (0, 0.25, 0.25) with an objective value f * = 3.5681.
To further demonstrate our new method, we turn to more complex examples with multiple semi-infinite constraints now.

Design centering
One important application of (general) semi-infinite programming is design centering (DC).There, the task is the following: given a set C ⊆ R n -the so-called container -and a second, parametrized set D(x) ⊆ R n , x ∈ X ⊆ R m -the so-called designinscribe D(x) into C such that some functional, commonly the volume, of D(x) is maximized: Under the assumption that the sets C and D(x) are given as solutions sets of some systems of inequalities Problem DC can be rewritten as a general semi-infinite optimization problem For a detailed discussion of the reformulation of design centering problems as semiinfinite ones we refer to Stein (2006) and the references therein.Different solution techniques are discussed in Harwood and Barton (2017).
One interesting application of design centering in the context of semi-infinite optimization is the maximal material usage in gemstone cutting.There, the task is to produce a set of precious gems from an irregularly shaped raw one in such a way that their total value is as high as possible.In the case of producing only one precious gem, the problem corresponds to a design centering, where the precious gem corresponds to the design, the raw one to the container and the volume of the precious shall be maximized.For a detailed introduction into the modeling and solution of this problem we refer to Winterfeld (2008); Küfer et al. (2008Küfer et al. ( , 2015)).
In a concrete case we consider a two-dimensional design centering problem, i.e. n = 2. Firstly, we introduce the container, called concavified unit square (see Fig. 3a for a graphical illustration), with

Fig. 2 Functional description and parametrization of the boat design
The last two constraints aren't visible in Fig. 3, but are necessary to avoid unboundedness of the container.Now, we present the design.Remember that in addition to a functional description of the design (12) and a calculation rule for their area, we need for the transformationbased discretization method a representation of the design as an image of a compact set Z under a continuously differentiable mapping t : As design we consider the intersection of two variable, but equally sized circles.The center of each circle is a boundary point of the other one (see Fig. 2).We call this design boat.Thus, The intersection points connecting line divides the boat into two equally sized circle segments of angle 2π/3 and radius x 3 (see Fig. 2).Consequently, Moreover, we have, and, where atan2 is the bivariate arctangent.We chose x 0 = (0.25, 0.5, 0.5, 1, 0) as starting point with f 0 = −0.3071(see Fig. 3a for a graphical illustration of this configuration).The initial discretization of the design consists of the four points {(1, −0.5), (1, 0.5), (1, 0), (1, 1)} yielding 24 constraints (4 points for each of the 6 container constraints) in the initial discretized SIP problem.After 5 iterations/refinements (0.69 s) Algorithm 1 stopped with x * = (0.5367, 0.0325, 0.4685, 0.4997, 1.2712) and f * = −0.2696(see Fig. 3c for a visualization of the final solution).The final discretization consisted of 14 points and thus 84 discretization-induced constraints.A extensive numerical study of the solution of DC problems as semi-infinite ones with different design-container-combinations can be found in Schwientek (2013).
The transformation-induced discretized SIP problems as well as the underlying GSIP problem stemming from a design centering problem are non-convex in general.For this reason, Algorithm 1 will terminate with either a local solution or a stationary point.

Inscribing multiple designs
A natural extension of the design centering task is to arrange two or more designs in a container yielding maximal total (design) volume.In practice, such problems arise as cutting or packing problems.In addition to the condition that the designs are completely located in the container, the designs are not allowed to overlap.In the following we will reformulate and evaluate such a problem as GSIP in the case of two designs.While we only consider the case of two designs, the extension of the approach to inscribe more than two designs is straightforward.For details we refer to Schwientek (2013), Küfer et al. (2015).
Let the container C be as given in (11) and D 1 , D 2 two (parametrized) designs as given in (12) with common parameter vector x ∈ R m .Then, the task in multi-body design centering (in the case of two designs) is the following: Under the assumption that the designs D 1 and D 2 are convex sets for each x, a separation theorem can be applied, where η ∈ R n \0 and β are the parameters of the separating hyperplane H (η, β) = {y ∈ R n | η T y = β}, x = (x, η, β), and X = {x | η = 0}.Then, MBDC 2 can be rewritten as a GSIP in the following way: The condition η = 0 can be assured e.g. by requiring η 2 2 = 1.Note that we have two index sets here, but extending Algorithm 1 to this situation is straightforward.
Using this approach, it is also possible to consider forbidden areas in the container when placing the designs and to avoid overlapping these with the designs.If one models the forbidden areas similar to the designs (but independent of x), the separation approach above carries directly over.For more detailed explanations in this regard and further separation techniques we refer to Schwientek (2013).
For a numerical example of problem GSIP MBDC 2 we consider the container from (13) and take as the first design, D 1 , the boat from ( 14).As our second design, D 2 , we consider an ellipse with variable semi-axis lengths in arbitrary position.For such an ellipse, we have m 2 = 5, This constellation results in 13 decision variables, 13 semi-infinite and 2 finite constraints.As starting point we chose the (infeasible) point x0 = (x 0 boat , x 0 ellipse , η 0 , and f * = −0.3487.The final discretization of the boat consists of 37 points and the one of the ellipse of 30 points (see Fig. 4c).
Further numerical evaluations, also with respect to other separation approaches and regarding a minimal distance between the designs, can be found in Schwientek (2013).Concerning the modeling and solution of the gemstone utilization task as semi-infinite optimization problem by means of the transformation-based discretization we refer to Schwientek (2013) and Küfer et al. (2015).

Conclusions and future research
In the present paper we consider general semi-infinite optimization problems (GSIP), which have convex lower level problems and can be globally transformed into a standard semi-infinite optimization problem (SIP).For the numerical solution of such GSIPs we applied a discretization method to its SIP-reformulation.Because the convexity structure in the lower level can be lost through the transformation, but is essential for refining the discretization, we solve in the refinement step the (convex) lower level problems of the underlying GSIP and transfer the global solutions into global solutions of the lower level problems of the induced SIP via the transformation.The convergence results for discretization methods for solving SIPs directly carry over in the case of global and local solutions.An open issue for future research is the convergence of stationary points.Finally, we have demonstrated the operation and performance of our method using three numerical examples.Two interesting aspects for future investigations are the following: (1) As known from Still (2001a), the convergence rates can be improved for discretization methods for solving SIPs, when boundary points of the infinite index set are added in a consistent manner for its discretization and the index set satisfies additional conditions.Since it is required for the transformation only to be surjective, the boundary of the infinite index set Z of the induced SIP is generally not mapped to the boundary of the infinite index set Y (x) and vice versa.This leads to the question whether it is sufficient to add boundary points of Y (x) in the mentioned consistent manner to get improved convergence rates for the transformation-based discretization method.
(2) To keep the dimensions of discretized SIP problems moderate, strategies have been developed to retain only α-active points in the discretization and remove the others during the process.Such algorithms are called exchange methods instead of discretization methods.An exploration of these deletion strategies applied to the transformation-based discretization method could prove interesting.Of special interest would be the impact this may have on the convergence of the method.
Acknowledgements Open Access funding provided by Projekt DEAL.We would like to thank the two anonymous reviewers and the Associate Editor for their many valuable comments, which have contributed to a significant improvement of the article.

Considered examples from Lemonidis' collection
The following examples were collected by Lemonidis (2008).We modified some of the examples in such a way that there exists a transformation as demanded in Assumption 1.We use the original numbering of the examples as in Lemonidis (2008).The constraint ( 18) is added to the original example.Then, a transformation of a fixed index set Z to the variable index set Y (x) is Note that in the collection in Lemonidis (2008) the semi-infinite constraint is different.
We use the version presented in the original source (Jongen et al. 1998).
−y 3 + x 2 ≤ 0 for all y ∈ Y (x), Note that the semi-infinite constraint in the collection in Lemonidis (2008) contains a typing error.We use the version presented in the original source (Rückmann and Shapiro 2001).Note that the constraint describing the infinite index set is different in Lemonidis (2008).We use the version given in the original source (Guerra Vázquez and Rückmann 2005).

Fig. 1
Fig. 1 Transformation destroys convexity in the lower level [dark gray -points with g 1 (x, y) ≤ 0 resp.g1 (x, y) ≤ 0, light gray-feasible set Y (x) resp.Z , circles -local solutions]: a original problem Q 1 (x) with unique optimal solution.b Transformed problem Q1 (x) with two local solutions

Fig. 3
Fig. 3 Area-maximal inscribing of the design D boat in the container C cus [dark gray-container, light gray-design, circles-points of current discretization, which do not violate the container restrictions, diamonds-points of current discretization, which violate container restrictions and will be added in the next iteration]: a start solution with given discretization, b after solution of the first discretized SIP problem SIP( Ż0 ), and c final solution (after 5 refinements) with final discretization

Fig. 4
Fig. 4 Area-maximal inscribing of the design D boat and D ell in the container C cus [dark gray-container, light gray-designs, circles-points of current discretization, which do not violate the container and separation restrictions, diamonds-points of current discretization, which violate container or separation restrictions and will be added in the next iteration, black straight line-line that separates both designs]: a start solution with given discretization, b after solution of the first discretized SIP problem SIP( Ż0 ), and c final solution (after 12 refinements) with final discretization ) = y ∈ [−1, 1] | y 2 − x 1 ≤ 0 .
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material.If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.