Near-optimal control for stochastic recursive problems☆
Introduction
Herein, we are primarily interested in the so-called “stochastic recursive optimal control problems” which can be illustrated by the following example arising from mathematical finance. Consider a financial market with two assets: one bond and one stock, and a “larger” investor who can decide at time , the amount to invest in the stock with some initial endowment and whose behavior can significantly influence the market price of the asset. It follows that his/her wealth process, denoted by , should satisfy some nonlinear stochastic differential equation (SDE) Here is an -valued standard Brownian motion on a certain probability space; is a suitable deterministic function. One interesting application of such nonlinear wealth equation can be found in Cvitanić and Ma [1] where the hedging option problem of a “larger” investor was studied. The case of constraints such as taxes can also be modeled as a special case of this setting (see El Karoui, Peng and Quenez [2]). We further suppose the investor can consume between 0 and and denote the corresponding consumption rate process. Due to El Karoui et al. [2], the utility of the investor is a solution of where and are two appropriate deterministic functions. In the setup, the investor aims to choose a suitable pair such that This formulates a (stochastic) recursive optimal control problem. In fact, Eq. (1) turns out to be a BSDE. The BSDE was originally proposed by Pardoux and Peng [3] in the general nonlinear case. The BSDE has been extensively applied in considerable different areas, especially in finance and economics. For instance, the celebrated Black–Scholes option pricing formula can be recovered by solving a special BSDE. For a comprehensive survey of BSDE theory, the interested reader may refer to the book by Ma and Yong [4] and the references therein.
In the literature [5], Duffie and Epstein independently established a stochastic differential recursive utility, which is a generalization of a standard additive utility with an instantaneous utility depending not only on an instantaneous consumption rate , but also on a future utility. El Karoui et al. [2] pointed out that this utility process can be characterized by a solution of some BSDE.
Due to its importance, the stochastic recursive control problems have been extensively studied in the literature like Peng [6], etc. It is remarkable that all these works typically focus on the study of the “exact” optimality whereas fairly little attention has been paid to the investigation of their near-optimal controls, to the best of our knowledge. However, it is really necessary and imminent to investigate the near-optimal controls of recursive optimal control problems due to their attractive merits from both theoretical and practical viewpoints. For example, the near-optimal controls always exist while the optimal controls only exist in relatively few cases; there are many near-optimal controls which can be selected appropriately for analysis and implementation.
Motivated by these points, this paper studies near-optimality for stochastic recursive control problems. However, it is necessary to point out that our study on the near-optimal controls is not a trivial and straightforward analog to that of the exact optimal controls. For instance, the classical maximum principle is essentially based on the fact that a minimum point of a function implies zero derivative at this point, while this is no longer the case for near-optimality, as a near-minimum of a function does not need to come out with a small derivative. Keep this in mind, we introduce some different techniques to solve these recursive optimal control problems. Our methods are mainly based on Ekeland’s principle. As the main result, a necessary condition as well as a sufficient condition of near-optimality to the aforementioned recursive controls is derived.
The structure of this paper is as follows. In Section 2, we formulate a recursive optimal control problem and introduce the definition of near-optimality. Sections 3 Necessary condition of near-optimality, 4 Sufficient condition of near-optimality are devoted to develop a necessary condition and a sufficient condition of near-optimality for the recursive optimal control problem. In Section 5, some concluding remarks are given.
Section snippets
Problem formulation and preliminaries
In this section, we outline the basic setup. We begin with a finite time horizon . Let be a complete filtered probability space on which an -valued standard Brownian motion is defined. It is also assumed that is the natural filtration generated by and .
Throughout this paper, we make use of the notations:
Some prior estimates
This subsection is mainly devoted to investigate some prior estimates which play an important role in establishing our theoretical result.
Lemma 3.1 If Hypothesis (H1) holds, then there is a constant such that for any
Proof The first estimate is a well-known result, thus we focus on others. Squaring both sides of and using the fact that we get
Sufficient condition of near-optimality
In this section, we will show that, under certain convex conditions, the near-maximum condition of the Hamiltonian function in the integral form is sufficient for near-optimality. To start, we first introduce some definition and hypothesis.
Definition 4.1 Let be a region and be a locally Lipschitz continuous function. The generalized gradient of at , denoted by , is a set defined by Yong and Zhou [10]
Hypothesis (H6) For any , , and , the functions
Concluding remarks
The subject of near-optimality has been studied by only a few researchers, such as Zhou [8], Liu, Yin and Zhou [11], etc. Here, we particularly investigate the near-optimal controls to stochastic recursive optimization problems. This subsumes the study of near-optimality to the forward SDEs as its special cases. Meanwhile, as the classical methodologies fail to work in our setup, some new techniques are needed here to overcome this shortcoming. In this sense, our study is essentially different
Acknowledgements
The authors are grateful to the editor and an anonymous referee for their comments for improving the quality of this work.
References (11)
- et al.
Adapted solution of backward stochastic differential equation
Systems Control Lett.
(1990) - et al.
Near-optimal control problems for linear forward-backward stochastic systems
Automatica
(2010) - et al.
Near-optimal controls of random-switching LQ problems with indefinite control weight costs
Automatica
(2005) - et al.
Hedging options for a large investor and forward-backward sde’s
Ann. Appl. Prob.
(1996) - et al.
Backward stochastic differential equations in finance
Math. Finance
(1997)
Cited by (23)
A revisit to stochastic near-optimal controls: The critical case
2015, Systems and Control LettersCitation Excerpt :Current research focuses on near-optimal controls under various systems. The readers are directed to Ref. [6] for regime-switching systems; Refs. [7,8], and [9] for forward–backward systems; Refs. [10] and [11] for jump-diffusion systems; [12] for recursive systems; and references therein. The rest of this paper is structured as follows.
Necessary and sufficient conditions for near-optimal harvesting control problem of stochastic age-dependent system
2013, Applied Mathematics and ComputationCitation Excerpt :ε-optimal [2] The proof of (2.2) is similar to Lemma 3.1 in [3]. Ekeland’s principle [17]
A MODIFIED METHOD OF SUCCESSIVE APPROXIMATIONS FOR STOCHASTIC RECURSIVE OPTIMAL CONTROL PROBLEMS
2022, SIAM Journal on Control and OptimizationStochastic maximum principle for optimal control problem with a stopping time cost functional
2022, International Journal of ControlMaximum Principle for Near-Optimality of Mean-Field FBSDEs
2020, Mathematical Problems in Engineering
- ☆
Hui acknowledges the financial support from PolyU research account No. 1-ZV1X. Huang acknowledges the financial support from the Departmental General Research Foundation of Hong Kong Polytechnic University (No. G-YH42) and the RGC Earmarked Grant 500909. Li acknowledges the financial support from the RGC Earmarked Grants 524109 and 521610. Wang acknowledges the financial support from the National Science Foundation of China (10926098, 11001156), the Natural Science Foundation of Shandong Province, China (ZR2009AQ017), the Independent Innovation Foundation of Shandong University, China (2010GN063), and the startup foundation of doctoral research program of Shandong Normal University, China.