Improving the Approximated Projected Perspective Reformulation by dual information

doi:10.1016/j.orl.2017.08.001

Operations Research Letters

Volume 45, Issue 5, September 2017, Pages 519-524

https://doi.org/10.1016/j.orl.2017.08.001

Abstract

We propose an improvement of the Approximated Projected Perspective Reformulation (AP $^{2}$ R) for dealing with constraints linking the binary variables. The new approach solves the Perspective Reformulation (PR) once, and then uses the corresponding dual information to reformulate the problem prior to applying AP $^{2}$ R, thereby combining the root bound quality of the PR with the reduced relaxation computing time of AP $^{2}$ R. Computational results for the cardinality-constrained Mean–Variance portfolio optimization problem show that the new approach is competitive with state-of-the-art ones.

Introduction

We study solution techniques for convex separable Mixed-Integer Non-Linear Programs (MINLP) with $n$ semi-continuous variables $x_{i} \in R$ for $i \in N = \{1, \dots, n\}$, each of which either assumes the value $0$ or lies in the interval $X_{i} = [\underline{x}_{i}, \bar{x}_{i}]$ (with $-\infty < \underline{x}_{i} < \bar{x}_{i} < \infty$). Introducing $y_{i} \in \{0, 1\}$ for $i \in N$, this can be expressed as

$$(P) \quad \min \; h(z) + \sum_{i \in N} f_{i}(x_{i}) + c_{i} y_{i} \qquad (1)$$
$$A x + B y + C z = b \qquad (2)$$
$$(x, z) \in O \qquad (3)$$
$$\underline{x}_{i} y_{i} \leq x_{i} \leq \bar{x}_{i} y_{i}, \quad i \in N, \qquad y \in [0, 1]^{n}, \quad x \in R^{n} \qquad (4)$$
$$y_{i} \in Z, \quad i \in N. \qquad (5)$$

We assume the functions $f_{i}$ to be closed convex, once continuously differentiable and finite in the interval $(\underline{x}_{i}, \bar{x}_{i})$; w.l.o.g. we also assume $f_{i}(0) = 0$. In (P) we single out the linking constraints (2), which contain all the relationships linking the $y_{i}$ variables among themselves and with the other variables of the problem, except those (4) that “define” the semi-continuous nature of the $x_{i}$. The reformulation technique developed in [7] requires (2) to be empty; the extension developed in [1] overcomes this limitation, but potentially at the cost of a worse bound quality. The aim of this paper is to deal with constraints (2) in a cost-effective way. For our approach to work, (2) must have a structure compatible with that of (1); we initially assume equality constraints, with extensions discussed in Section 3. Because our approach hinges on the availability of dual information for the continuous relaxation, we assume that the function $h(\cdot)$ in the “other variables $z$” and the “other constraints (3)” are convex, i.e., (P) is a convex MINLP. Actually, in many applications everything but (1) is linear.
It will sometimes be expedient to refer to (3)–(4) as “$(x, y, z) \in P$”, and to $\underline{P}$ as the set obtained from $P$ by relaxing integrality constraints on $z$ and $x$, if any.
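As a small illustration of constraints (4)–(5) and of one term of the objective (1), the following Python sketch checks whether a pair $(x_{i}, y_{i})$ encodes a valid semi-continuous value. The function names, the tolerance and the numerical data are assumptions made for this example only; they do not come from the paper.

```python
def semicontinuous_ok(x, y, lo, hi, tol=1e-9):
    """Return True if y is binary and lo*y <= x <= hi*y (constraints (4)-(5))."""
    if y not in (0, 1):
        return False
    return lo * y - tol <= x <= hi * y + tol


def objective_term(f, c, x, y):
    """Contribution f(x) + c*y of one semi-continuous variable to (1)."""
    return f(x) + c * y


# Hypothetical data: x must be 0 or lie in [1, 5]; the fixed cost is c = 3.
f = lambda x: 2.0 * x * x  # satisfies the normalization f(0) = 0

print(semicontinuous_ok(0.0, 0, 1.0, 5.0))  # x = 0 with y = 0
print(semicontinuous_ok(0.5, 1, 1.0, 5.0))  # x below the lower bound
print(objective_term(f, 3.0, 2.0, 1))       # cost of x = 2 with y = 1
```

With $y_{i} = 0$ the bounds collapse to $0 \leq x_{i} \leq 0$, which is how the single pair of inequalities in (4) encodes the "either zero or in $X_{i}$" behaviour.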

Often, the most pressing issue in solving (P) is to derive tight lower bounds on its optimal value $\nu(P)$, which is typically done by solving its (convex) continuous relaxation $(\underline{P})$ (we denote by $\nu(X)$ and $(\underline{X})$, respectively, the optimal value and the continuous relaxation of any problem (X)). However, often $\nu(\underline{P}) \ll \nu(P)$, making the solution approaches inefficient. The presence of semi-continuous variables has been exploited to propose reformulations $(P')$ of (P) such that $\nu(P) = \nu(P') \geq \nu(\underline{P}') \gg \nu(\underline{P})$. This starts from considering (1) as $h(z) + \sum_{i \in N} f_{i}(x_{i}, y_{i})$, where $f_{i}(x_{i}, y_{i}) = f_{i}(x_{i}) + c_{i}$ if $y_{i} = 1$ and $\underline{x}_{i} \leq x_{i} \leq \bar{x}_{i}$, $f_{i}(0, 0) = 0$, and $f_{i}(x_{i}, y_{i}) = \infty$ otherwise. The convex envelope of $f_{i}(x_{i}, y_{i})$ is known [4] to be $\tilde{f}_{i}(x_{i}, y_{i}) = y_{i} f_{i}(x_{i} / y_{i}) + c_{i} y_{i}$, which uses the perspective function of $f_{i}$ and yields the Perspective Reformulation of (P)

$$(PR) \quad \min \left\{ h(z) + \sum_{i \in N} \tilde{f}_{i}(x_{i}, y_{i}) \; : \; (2), \; (x, y, z) \in P, \; (5) \right\}.$$

As $f_{i}$ is convex, $\tilde{f}_{i}$ is convex for $y_{i} \geq 0$; since $x_{i} = 0$ if $y_{i} = 0$, $\tilde{f}_{i}$ can be extended by continuity assuming $0 f_{i}(0 / 0) = 0$. Hence, (PR) is a convex MINLP if (P) is. Its continuous relaxation $(\underline{PR})$, the Perspective Relaxation of (P), usually has $\nu(\underline{PR}) \gg \nu(\underline{P})$, making (PR) a more convenient formulation [8], [9]. If $f_{i}$ is SOCP-representable then so is $\tilde{f}_{i}$, hence the PR of a Mixed-Integer Second-Order Cone Program (MI-SOCP) is still a MI-SOCP. Thus, (PR) is not necessarily more complex to solve than (P), and is sometimes even less so [2]. Alternatively, one can consider a Semi-Infinite MINLP reformulation of (PR) where Perspective Cuts [4] (linear outer approximations of the epigraph of $\tilde{f}_{i}$) are dynamically added.
This is often the best approach [6], in particular for “general” (P) where no other structure is available. It is appropriate to remark that the (PR) approach also applies if the $x_{i}$ are vectors such that $y_{i} = 0 \Rightarrow x_{i} = 0$ and $y_{i} = 1 \Rightarrow x_{i} \in X_{i}$, with $X_{i}$ a polytope; yet, here, as in [1], [7], each $x_{i}$ must be a single variable.
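To make the perspective function concrete, here is a minimal Python sketch that evaluates $y_{i} f_{i}(x_{i}/y_{i}) + c_{i} y_{i}$ with the continuity extension at $y_{i} = 0$. The function name and the quadratic test data are illustrative assumptions, not taken from the paper.

```python
def perspective(f, c, x, y):
    """Evaluate y*f(x/y) + c*y, extended by continuity with 0*f(0/0) = 0."""
    if y == 0.0:
        if x != 0.0:
            raise ValueError("x must be 0 when y is 0")
        return 0.0
    return y * f(x / y) + c * y


# Hypothetical data: f(x) = 2 x^2 (so f(0) = 0) and fixed cost c = 3.
f = lambda t: 2.0 * t * t
c = 3.0

at_integer = perspective(f, c, 2.0, 1.0)      # coincides with f(2) + c
at_fraction = perspective(f, c, 1.0, 0.5)     # perspective term at y = 0.5
plain_fraction = f(1.0) + c * 0.5             # original objective term at y = 0.5
print(at_integer, at_fraction, plain_fraction)
```

At $y_{i} = 1$ the two objective terms agree, while at fractional $y_{i}$ the perspective term is never smaller: for convex $f_{i}$ with $f_{i}(0) = 0$, convexity gives $f_{i}(x_{i}) \leq y_{i} f_{i}(x_{i}/y_{i})$ for $y_{i} \in (0, 1]$, which is the source of the stronger relaxation bound.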

While $(\underline{PR})$ provides a better bound, it is also usually more time consuming to solve than $(\underline{P})$ because $\tilde{f}_{i}$ is “more complex” than $f_{i}$. This trade-off is nontrivial, in particular if $f_{i}$ is “simple”. For instance, if $f_{i}$ is quadratic and everything else is linear, (P) is a Mixed-Integer Quadratic Program (MIQP) whereas (PR) is a MI-SOCP; hence, $(\underline{P})$, a QP, can be significantly cheaper to solve than $(\underline{PR})$, a SOCP. The Projected PR (P $^{2}$ R) idea underpinning the approach studied here was indeed proposed in [7] for the quadratic case with $\underline{x}_{i} \geq 0$. It was then extended in [1] to a more general class of functions, also allowing $\underline{x}_{i} < 0$. However, $\underline{x}_{i} < 0 < \bar{x}_{i}$ renders some of the arguments significantly more complex, hence for the sake of simplicity we will only present here the case where $\underline{x}_{i} \geq 0$; it will be plain to see that the arguments immediately extend to the more general one. The P $^{2}$ R idea is to analyze $\tilde{f}_{i}$ as a function of $x_{i}$ only, i.e., projecting away $y_{i}$: under appropriate assumptions, and if there are no linking constraints (2), this turns out to be a piecewise-convex function with a “small” number of pieces that can be characterized by just looking at the data of (P) (cf. (7)). Hence, $(\underline{PR})$ can be reformulated in terms of piecewise-convex objective functions, which makes it easier to solve, especially when $O$ has some valuable structure (e.g., flow or knapsack) [7]. However, in several applications constraints (2) are indeed present [1], [3], [4], [8], [10]. Furthermore, since the binary variables $y_{i}$ are removed from the formulation, branching has to be done “indirectly”, which rules out using off-the-shelf solvers.
To overcome these two limitations, the Approximated P $^{2}$ R (AP $^{2}$ R) reformulation was proposed in [1], whereby the $y_{i}$, after having been eliminated, are re-introduced in the formulation in order to encode the piecewise nature of $\tilde{f}_{i}$. This is possible even if constraints (2) are present, and it has the advantage that (AP $^{2}$ R) is still a MIQP if (P) is. However, $\nu(\underline{\mathrm{AP^{2}R}}) < \nu(\underline{PR})$ may, and does, happen when linking constraints (2) are present, whence the “Approximated” moniker. This is still advantageous in some cases, but it may happen that the weaker bounds outweigh the faster solution time, making the approach not competitive with more straightforward implementations of the PR [1].

The aim of this paper is to improve the AP $^{2}$ R by presenting a simple and effective way to ensure that $\nu(\underline{\mathrm{AP^{2}R}}) = \nu(\underline{PR})$ even if constraints (2) are present, while keeping the shape of the formulation (and therefore, hopefully, the cost of solving $(\underline{\mathrm{AP^{2}R}})$) exactly the same. Since bound equivalence only holds at the root node of the B&C, it is not obvious that the approach, despite the quicker solution times of $(\underline{\mathrm{AP^{2}R}})$, is competitive. However, this is shown to be true in at least one relevant application, the Mean–Variance problem (with min buy-in and cardinality constraints) in portfolio optimization.

Section snippets

A quick overview of AP $^{2}$ R

We now quickly summarize the analysis in [1], albeit limited to the case $\underline{x}_{i} \geq 0$, in order to prepare the ground for the new extension. We focus on the basic problem corresponding to one pair $(x_{i}, y_{i})$

$$(P_{i}) \quad \min \left\{ f_{i}(x_{i}) + c_{i} y_{i} \; : \; \underline{x}_{i} y_{i} \leq x_{i} \leq \bar{x}_{i} y_{i}, \; y_{i} \in \{0, 1\} \right\}.$$

The analysis hinges on considering the (PR) of $(P_{i})$ rewritten as

$$(\underline{PR}_{i}) \quad \min \left\{ p_{i}(x_{i}) = \min \left\{ \tilde{f}_{i}(x_{i}, y_{i}) \; : \; \underline{x}_{i} y_{i} \leq x_{i} \leq \bar{x}_{i} y_{i}, \; y_{i} \in [0, 1] \right\} \; : \; x_{i} \in [0, \bar{x}_{i}] \right\},$$

i.e., first minimizing $\tilde{f}_{i}(x_{i}, y_{i})$ with respect to $y_{i}$, and then minimizing the resulting function $p_{i}(x_{i})$
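The inner minimization over $y_{i}$ can be illustrated numerically. The Python sketch below assumes a quadratic $f_{i}(x) = a x^{2} + b x$ with $a > 0$, a fixed cost $c > 0$ and $\underline{x}_{i} \geq 0$; these restrictions, the function name and the data are assumptions of this example, which is not a transcription of the paper's formulas. Since $a x^{2}/y + c y$ is convex in $y > 0$ with stationary point $y = x \sqrt{a/c}$, the constrained minimizer is that point clipped to the feasible interval for $y$.

```python
import math


def projected_value(a, b, c, lo, hi, x):
    """Return (p, y): the minimum over y of a*x^2/y + b*x + c*y and its argmin.

    Assumes a > 0, c > 0, 0 <= lo < hi and 0 <= x <= hi.
    """
    if x == 0.0:
        return 0.0, 0.0                 # continuity extension at (0, 0)
    y_min = x / hi                      # from x <= hi * y
    y_max = min(1.0, x / lo) if lo > 0.0 else 1.0   # from lo * y <= x, y <= 1
    y_star = x * math.sqrt(a / c)       # unconstrained stationary point
    y = min(max(y_star, y_min), y_max)  # clip to the feasible interval
    return a * x * x / y + b * x + c * y, y


# Hypothetical data: f(x) = x^2, c = 4, x in {0} or [0, 10].
print(projected_value(1.0, 0.0, 4.0, 0.0, 10.0, 1.0))  # interior minimizer
print(projected_value(1.0, 0.0, 4.0, 0.0, 10.0, 4.0))  # minimizer clipped at y = 1
```

In this sketch the clipping produces different expressions for $p_{i}$ depending on which bound on $y_{i}$ is active, which is one way to see why the projected function has a small number of pieces.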

Improving AP $^{2}$ R using dual information

The idea is to reformulate (P) to include information about the linking constraints (2) in the objective function (1), so that it can be “processed” by the AP $^{2}$ R. This hinges on the availability of dual information, and hence mainly concerns the continuous relaxations. The Lagrangian relaxation of (P) w.r.t. (2)

$$(\underline{P}^{\lambda}) \quad \min \left\{ h(z) + \sum_{i \in N} f_{i}(x_{i}) + c_{i} y_{i} + \lambda (A x + B y + C z - b) \; : \; (x, y, z) \in \underline{P} \right\}$$

has an objective function that is still separable in the $x_{i}$:

$$h(z) + \lambda C z + \sum_{i \in N} \left( f_{i}(x_{i}) + \lambda A^{i} x_{i} + (c_{i} + \lambda B^{i}) y_{i} \right) - \lambda b.$$

Hence one can
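The separable form of the Lagrangian objective can be checked numerically. The following Python sketch evaluates both expressions on small made-up data and confirms that they coincide; here $A^{i}$ and $B^{i}$ are taken to be the $i$-th columns of $A$ and $B$, and all helper names and numbers are assumptions of this example.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(M, v):
    return [dot(row, v) for row in M]


def column(M, j):
    return [row[j] for row in M]


def lagrangian_direct(h, f, c, A, B, C, b, lam, x, y, z):
    """h(z) + sum_i (f_i(x_i) + c_i y_i) + lam * (A x + B y + C z - b)."""
    residual = [ax + by + cz - bi for ax, by, cz, bi
                in zip(matvec(A, x), matvec(B, y), matvec(C, z), b)]
    base = h(z) + sum(f[i](x[i]) + c[i] * y[i] for i in range(len(x)))
    return base + dot(lam, residual)


def lagrangian_separable(h, f, c, A, B, C, b, lam, x, y, z):
    """Same value, grouped per variable: linear term lam*A^i on x_i, cost c_i + lam*B^i on y_i."""
    total = h(z) + dot(lam, matvec(C, z)) - dot(lam, b)
    for i in range(len(x)):
        total += f[i](x[i]) + dot(lam, column(A, i)) * x[i]
        total += (c[i] + dot(lam, column(B, i))) * y[i]
    return total


# Hypothetical data: two semi-continuous variables, one linking constraint.
h = lambda z: z[0] ** 2
f = [lambda t: t * t, lambda t: 2.0 * t * t]
c = [1.0, 3.0]
A, B, C, b = [[1.0, 1.0]], [[0.5, -1.0]], [[2.0]], [1.0]
lam, x, y, z = [0.3], [0.4, 0.6], [1.0, 0.5], [0.2]

direct = lagrangian_direct(h, f, c, A, B, C, b, lam, x, y, z)
separable = lagrangian_separable(h, f, c, A, B, C, b, lam, x, y, z)
print(direct, separable)
```

Grouping the terms this way shows that, for fixed multipliers $\lambda$, each pair $(x_{i}, y_{i})$ again sees a univariate convex function plus a fixed cost, only with modified data.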

Computational results

In this section we report results of computational tests of the proposed approach for the Mean–Variance cardinality-constrained portfolio optimization problem on $n$ risky assets

$$(MV) \quad \min \left\{ x^{T} Q x \; : \; \sum_{i \in N} x_{i} = 1, \; \sum_{i \in N} \mu_{i} x_{i} \geq \rho, \; \sum_{i \in N} y_{i} \leq k, \; (4), \; (5) \right\},$$

where $\mu$ is the vector of expected unitary returns, $\rho$ is the prescribed total return, $Q$ is the variance–covariance matrix, and $k \leq n$ is the maximum number of purchasable assets. Without the cardinality constraint ( $k = n$ ), (MV) is well suited for AP $^{2}$ R: the bound is the same as
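For readers who want to experiment with (MV), the Python sketch below evaluates the objective $x^{T} Q x$ and checks the budget, return, cardinality and semi-continuity constraints for a candidate portfolio. The three-asset data, the function names and the tolerance are invented for illustration and are unrelated to the instances used in the paper.

```python
def mv_objective(Q, x):
    """Portfolio variance x^T Q x."""
    n = len(x)
    return sum(x[i] * Q[i][j] * x[j] for i in range(n) for j in range(n))


def mv_feasible(x, y, mu, rho, k, lo, hi, tol=1e-9):
    """Check the constraints of (MV), including (4)-(5), for a candidate (x, y)."""
    n = len(x)
    budget = abs(sum(x) - 1.0) <= tol
    ret = sum(mu[i] * x[i] for i in range(n)) >= rho - tol
    card = sum(y) <= k
    semi = all(y[i] in (0, 1) and
               lo[i] * y[i] - tol <= x[i] <= hi[i] * y[i] + tol
               for i in range(n))
    return budget and ret and card and semi


# Hypothetical 3-asset instance with a 10% minimum buy-in.
Q = [[0.04, 0.01, 0.00],
     [0.01, 0.09, 0.00],
     [0.00, 0.00, 0.01]]
mu, rho = [0.10, 0.20, 0.05], 0.12
lo, hi = [0.1, 0.1, 0.1], [1.0, 1.0, 1.0]
x, y = [0.5, 0.5, 0.0], [1, 1, 0]

print(mv_feasible(x, y, mu, rho, 2, lo, hi), mv_objective(Q, x))
print(mv_feasible(x, y, mu, rho, 1, lo, hi))  # same portfolio with k = 1
```

Tightening $k$ from 2 to 1 makes the same portfolio infeasible, which is the kind of coupling among the $y_{i}$ that the linking constraints (2) represent.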

Conclusions

The main advantage of the proposed AP $^{2}$ R $+$ technique is its simplicity: just solving (PR) – possibly even approximately with a dual approach – produces the dual solution $λ^{*}$ which can be used to first construct (P+) and then its (AP $^{2}$ R). Yet, this improves performance many-fold over plain AP $^{2}$ R, and even more so over P/C. Notably, AP $^{2}$ R $+$ is quite general and applies to a much larger class than MIQP. It may be worth contrasting 175843 s (P/C in Table 1) with 58 s (AP $^{2}$ R $+ +$ in Table 3) for $40 0^{-}$

Acknowledgments

The first and third authors acknowledge the contribution of the Italian Ministry for University and Research under the PRIN 2012 Project 2012JXB3YF “Mixed-Integer Nonlinear Optimization: Approaches and Applications”. All the authors acknowledge networking support by the COST Action TD1207. We are grateful to the anonymous referee and the Associate Editor of the Journal for constructive comments that helped us significantly improve the manuscript.

References (10)

Frangioni A. et al. SDP diagonalizations and perspective cuts for a class of nonseparable MIQP. Oper. Res. Lett. (2007)
Frangioni A. et al. A computational comparison of reformulations of the perspective relaxation: SOCP vs. cutting planes. Oper. Res. Lett. (2009)
Frangioni A. et al. Approximated perspective relaxations: a project&lift approach. Comput. Optim. Appl. (2016)
Frangioni A. et al. Delay-constrained shortest paths: Approximation algorithms and second-order cone models. J. Optim. Theory Appl. (2015)
Frangioni A. et al.

There are more references available in the full text version of this article.
