Article

A Modified Hestenes-Stiefel-Type Derivative-Free Method for Large-Scale Nonlinear Monotone Equations

College of Mathematics and Computational Science, Changsha University of Science and Technology, Changsha 410114, China
* Author to whom correspondence should be addressed.
Mathematics 2020, 8(2), 168; https://doi.org/10.3390/math8020168
Submission received: 26 December 2019 / Revised: 16 January 2020 / Accepted: 21 January 2020 / Published: 30 January 2020
(This article belongs to the Special Issue Iterative Methods for Solving Nonlinear Equations and Systems 2020)

Abstract

The goal of this paper is to extend the modified Hestenes-Stiefel method to solve large-scale nonlinear monotone equations. The method is constructed by combining the hyperplane projection method (Solodov, M.V.; Svaiter, B.F. A globally convergent inexact Newton method for systems of monotone equations, in: M. Fukushima, L. Qi (Eds.), Reformulation: Nonsmooth, Piecewise Smooth, Semismooth and Smoothing Methods, Kluwer Academic Publishers, 1998, 355–369) with the modified Hestenes-Stiefel method in Dai and Wen (Dai, Z.; Wen, F. Global convergence of a modified Hestenes-Stiefel nonlinear conjugate gradient method with Armijo line search. Numer. Algor. 2012, 59, 79–93). In addition, we propose a new line search for the derivative-free method. Global convergence of the proposed method is established provided the system of nonlinear equations is Lipschitz continuous and monotone. Preliminary numerical results are given to demonstrate the effectiveness of the proposed method.

1. Introduction

In this paper, we consider the problem of finding numerical solutions of the following large-scale nonlinear equations
$F(x) = 0,$  (1)
where the function $F : \mathbb{R}^n \to \mathbb{R}^n$ is monotone and continuous. Monotonicity of $F$ means that the following inequality holds:
$\langle F(x) - F(y),\, x - y \rangle \ge 0, \quad \forall\, x, y \in \mathbb{R}^n.$  (2)
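As a quick illustration (ours, not from the paper), the mapping used later in Problem 1 of Section 4, $F_i(x) = 2x_i - \sin(|x_i|)$, is componentwise increasing and therefore satisfies (2); the small NumPy check below verifies the inequality on random pairs.

```python
import numpy as np

# Illustrative check of the monotonicity inequality (2) for a sample mapping
# (Problem 1 of Section 4): F_i(x) = 2*x_i - sin(|x_i|).
F = lambda x: 2.0 * x - np.sin(np.abs(x))

rng = np.random.default_rng(0)
for _ in range(1000):
    x, y = rng.standard_normal(50), rng.standard_normal(50)
    # <F(x) - F(y), x - y> >= 0 for all x, y
    assert np.dot(F(x) - F(y), x - y) >= -1e-12
```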
Nonlinear monotone equations arise in different fields; for example, they appear as subproblems in generalized proximal algorithms with Bregman distances [1]. Some monotone variational inequality problems can be converted into nonlinear monotone equations [2]. Monotone systems of equations also appear in $L_1$-norm regularized sparse optimization problems (see [3,4]) and in discrete mathematics such as graph theory (see [5,6]).
Because of these important applications, many scholars have in recent years devoted attention to proposing efficient algorithms for solving problem (1). These algorithms fall mainly into the following categories.
The Newton-type, Levenberg-Marquardt, and quasi-Newton methods each enjoy fast local convergence and are therefore attractive (see [7,8,9,10,11,12,13]). However, for large-scale problems, a drawback of these methods is that at each iteration they must solve a large-scale linear system of equations involving the Jacobian matrix or an approximation of it. The resulting matrix storage requirements make them ill-suited to large-scale nonlinear monotone systems.
In recent years, gradient-type algorithms have attracted the attention of many scholars. The main reasons are their low storage requirements, easy implementation, and global convergence under mild conditions. For example, the spectral gradient method [14] uses only gradient information, yet is simple and effective for optimization problems. The spectral gradient method [14] has also been extended to solve nonlinear monotone equations by combining it with the projection method (see [15,16]).
In addition, for large-scale unconstrained optimization problems, the conjugate gradient (CG) method is another simple but effective method, thanks to two attractive features: low memory requirements and strong global convergence properties. In recent years, the conjugate gradient method has produced rich results (see [17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36]) from the perspectives of the sufficient descent property, the quasi-Newton direction, and the conjugacy condition. Inspired by the extension of the spectral gradient method to nonlinear monotone equations, CG methods have also been applied to nonlinear monotone equations, for example, PRP-type methods [37,38,39,40,41,42,43], the Perry conjugate gradient method [44], and Liu-Storey-type methods [45], among others.
In this paper, we focus on extending the Hestenes-Stiefel (HS) CG method to solve large-scale nonlinear monotone equations. To the best of our knowledge, the HS CG method [46] is generally regarded as one of the most computationally efficient CG methods. However, the HS CG method does not enjoy the sufficient descent property. Based on the modified secant equation [10], Dai and Wen [47] proposed a modified HS conjugate gradient method that generates sufficient descent directions (i.e., there exists $c > 0$ such that $g_k^T d_k \le -c \|g_k\|^2$). Global convergence under the Armijo line search was obtained in Dai and Wen [47]. Hence, we aim to present a derivative-free method for solving the nonlinear monotone equations (1). The proposed method can be seen as a further development of the modified HS CG method of Dai and Wen [47] for unconstrained optimization problems.
Our paper makes two contributions to large-scale nonlinear monotone equations. Firstly, a new line search is proposed for the derivative-free method. A significant advantage of this line search is that it makes the search stepsize easier to obtain. Secondly, we propose a derivative-free method for solving large-scale nonlinear monotone equations which combines the modified Hestenes-Stiefel method of Dai and Wen [47] for unconstrained optimization problems with the hyperplane projection method [13]. A good property of the proposed method is that it is suitable for solving large-scale nonlinear monotone equations because of its low storage requirements.
The rest of the article is organized as follows. In Section 2, we give the algorithm and prove the sufficient descent property. The global convergence is proved in Section 3. We report numerical results in Section 4. The last section gives the conclusion.

2. Algorithm and the Sufficient Descent Property

In this section, we present the derivative-free method for solving problem (1), which is a combination of the modified Hestenes-Stiefel method [47] and the hyperplane projection method [13]. Different from the traditional conjugate gradient method, each new iterate $x_{k+1}$ is obtained in two steps.
In the first step, the algorithm produces a trial point $z_k = x_k + \alpha_k d_k$, where $d_k$ is the search direction and $\alpha_k > 0$ is the steplength obtained by a suitable line search. For most iterative optimization algorithms, the line search plays an important role in both convergence analysis and numerical performance. Zhang and Zhou [15] obtained the steplength $\alpha_k > 0$ by the following Armijo-type line search: compute $\alpha_k = \max\{\beta \rho^i : i = 0, 1, \ldots\}$ such that
$-F(x_k + \alpha_k d_k)^T d_k \ge \sigma \alpha_k \|d_k\|^2,$  (3)
where $\beta > 0$ is an initial trial steplength and $\rho \in (0, 1)$.
In addition, Li and Li [39] introduced an alternative line search, that is, computing the steplength $\alpha_k = \max\{\beta \rho^i : i = 0, 1, \ldots\}$ such that
$-F(x_k + \alpha_k d_k)^T d_k \ge \sigma \alpha_k \|F(x_k + \alpha_k d_k)\| \|d_k\|^2.$  (4)
In both of the above rules, the steplength $\alpha_k$ is obtained by computing $\alpha_k = \max\{\beta \rho^i : i = 0, 1, \ldots\}$ such that (3) or (4) is satisfied. If the point $x_k$ is far from the solution, the obtained steplength $\alpha_k$ may be very small. Taking this into account, we present the following line search rule, in which the steplength $\alpha_k$ is obtained by computing $\alpha_k = \max\{\beta \rho^i : i = 0, 1, \ldots\}$ such that
$-F(x_k + \alpha_k d_k)^T d_k \ge \sigma \alpha_k \min\{\|d_k\|^2,\ \|F(x_k + \alpha_k d_k)\| \|d_k\|^2,\ -F(x_k)^T d_k\}.$  (5)
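A minimal NumPy sketch of the backtracking rule (5) follows (ours, for illustration only; the function name, the default values and the cap on the number of backtracks are our choices, with $\rho = 0.5$ and $\sigma = 2$ mirroring the settings reported in Section 4).

```python
import numpy as np

def line_search(F, x, d, beta=1.0, rho=0.5, sigma=2.0, max_backtracks=60):
    """Backtracking rule (5): find alpha = beta*rho**i such that
    -F(x+alpha*d)^T d >= sigma*alpha*min(||d||^2, ||F(x+alpha*d)||*||d||^2, -F(x)^T d)."""
    Fx_d = float(np.dot(F(x), d))          # F(x_k)^T d_k
    alpha = beta
    for _ in range(max_backtracks):
        z = x + alpha * d
        Fz = F(z)
        rhs = sigma * alpha * min(np.dot(d, d),
                                  np.linalg.norm(Fz) * np.dot(d, d),
                                  -Fx_d)
        if -float(np.dot(Fz, d)) >= rhs:
            return alpha, z, Fz
        alpha *= rho
    return alpha, z, Fz                    # safeguard exit; Lemma 1 below rules this out in theory
```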
In the second step, $x_{k+1}$ is determined from $x_k$, $z_k$, and $F(z_k)$ via the hyperplane projection method [13]. Let us recall how $x_{k+1}$ is generated by this method. Along the search direction $d_k$, we generate a point $z_k = x_k + \alpha_k d_k$ by a suitable line search such that
$F(z_k)^T (x_k - z_k) > 0.$  (6)
On the other hand, the monotonicity of $F$ implies that, for any solution $x^*$ (i.e., $F(x^*) = 0$), the following inequality holds:
$F(z_k)^T (x^* - z_k) = -(F(z_k) - F(x^*))^T (z_k - x^*) \le 0.$  (7)
From (6) and (7), we see that $F(z_k)^T (x_k - z_k) > 0$ holds at the current point $x_k$, while $F(z_k)^T (x^* - z_k) \le 0$ holds at the solution $x^*$. Therefore, the hyperplane
$H_k = \{x \in \mathbb{R}^n \mid F(z_k)^T (x - z_k) = 0\}$  (8)
strictly separates the current point $x_k$ from the solution $x^*$ (a zero point) of Equation (1).
Following Solodov and Svaiter [13] and Zhang and Zhou [15], taking the projection of $x_k$ onto the hyperplane (8) as the next iterate $x_{k+1}$ is a reasonable choice. In detail, the next iterate $x_{k+1}$ is computed by
$x_{k+1} = x_k - \dfrac{F(z_k)^T (x_k - z_k)}{\|F(z_k)\|^2}\, F(z_k).$  (9)
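For illustration, the projection step (9) is a one-liner (sketch ours):

```python
import numpy as np

def project_onto_hyperplane(xk, zk, Fzk):
    """Projection step (9): x_{k+1} is the orthogonal projection of x_k onto H_k."""
    return xk - (np.dot(Fzk, xk - zk) / np.dot(Fzk, Fzk)) * Fzk
```

By construction, the returned point satisfies $F(z_k)^T (x_{k+1} - z_k) = 0$, i.e., it lies on the hyperplane (8).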
In what follows, we turn our attention to the search direction, which plays a crucial role in an iterative algorithm. Our main starting point is to extend the search direction of Dai and Wen [47] to the nonlinear monotone equations problem (1). Similar to Dai and Wen [47] for unconstrained optimization, we define the search direction as
$d_k = \begin{cases} -F_0, & \text{if } k = 0, \\ -F_k + \beta_k^{NHZ} d_{k-1}, & \text{if } k \ge 1, \end{cases}$  (10)
where
$\beta_k^{NHZ} = \dfrac{F_k^T y_{k-1}}{d_{k-1}^T w_{k-1}} - \mu \dfrac{\|y_{k-1}\|^2}{(d_{k-1}^T w_{k-1})^2}\, F_k^T d_{k-1}, \quad \mu > \dfrac{1}{4},$  (11)
$w_{k-1} = y_{k-1} + \gamma \bar{s}_{k-1}, \ \gamma > 0, \qquad y_{k-1} = F(z_{k-1}) - F(x_{k-1}), \qquad \bar{s}_{k-1} = z_{k-1} - x_{k-1} = \alpha_{k-1} d_{k-1}.$  (12)
For simplicity, we refer to (10) and (11) as the NHZ method hereafter.
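A small sketch (ours) of the direction update (10)-(12); the argument names and the sample values $\mu = 0.3 > 1/4$ and $\gamma = 1$ are illustrative only.

```python
import numpy as np

def nhz_direction(Fk, Fz_prev, Fx_prev, d_prev, alpha_prev, mu=0.3, gamma=1.0):
    """NHZ direction (10)-(12): d_k = -F_k + beta_k^{NHZ} * d_{k-1}."""
    s = alpha_prev * d_prev            # s_{k-1} = z_{k-1} - x_{k-1} = alpha_{k-1} d_{k-1}
    y = Fz_prev - Fx_prev              # y_{k-1} = F(z_{k-1}) - F(x_{k-1})
    w = y + gamma * s                  # w_{k-1} = y_{k-1} + gamma * s_{k-1}
    dw = float(np.dot(d_prev, w))      # d_{k-1}^T w_{k-1}
    beta = np.dot(Fk, y) / dw - mu * (np.dot(y, y) / dw**2) * np.dot(Fk, d_prev)
    return -Fk + beta * d_prev
```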
Further, in this paper, the function F is assumed to satisfy the following assumptions, which are often utilized in convergence analysis for nonlinear monotone equations (see [37,38,39,40,41,42,43,44,45,48]).
Assumption A1.
(A1) F is a monotone function:
$(F(x) - F(y))^T (x - y) \ge 0, \quad \forall\, x, y \in \mathbb{R}^n.$  (13)
(A2) F is a Lipschitz continuous function, namely, there exists a constant $L > 0$ such that
$\|F(x) - F(y)\| \le L \|x - y\|, \quad \forall\, x, y \in \mathbb{R}^n.$  (14)
We now describe the proposed Algorithm 1.
Algorithm 1: NHZ derivative-free method.
Step 0: Given an initial point $x_0 \in \mathbb{R}^n$ and constants $\varepsilon > 0$, $\beta > 0$, $\sigma > 0$, $\rho \in (0, 1)$, set k := 0.
Step 1: Calculate $F(x_k)$. If $\|F(x_k)\| \le \varepsilon$, stop the algorithm. Otherwise, go to Step 2.
Step 2: Determine the search direction $d_k$ by (10), (11) and (12).
Step 3: Calculate the steplength $\alpha_k$ by (5). Let $z_k = x_k + \alpha_k d_k$.
Step 4: Calculate $F(z_k)$. If $\|F(z_k)\| \le \varepsilon$, stop the algorithm. Otherwise, compute $x_{k+1}$ by the projection (9). Set k := k + 1 and go to Step 1.
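Putting the pieces together, the following NumPy sketch of Algorithm 1 is ours and is intended only to make the steps concrete; the parameter defaults and the safeguards (maximum iteration count, lower cap on $\alpha$) are assumptions, not part of the paper.

```python
import numpy as np

def nhz_solve(F, x0, eps=1e-4, beta0=1.0, sigma=2.0, rho=0.5, mu=0.3, gamma=1.0,
              max_iter=100000):
    """Sketch of Algorithm 1 (NHZ derivative-free method) for monotone F."""
    x = np.asarray(x0, dtype=float)
    Fx = F(x)
    d = -Fx                                          # Step 2 with k = 0: d_0 = -F_0
    Fx_prev = Fz_prev = None
    alpha_prev = None
    for k in range(max_iter):
        if np.linalg.norm(Fx) <= eps:                # Step 1
            return x, k
        if k > 0:                                    # Step 2: direction (10)-(12)
            s = alpha_prev * d                       # here d still holds d_{k-1}
            y = Fz_prev - Fx_prev
            w = y + gamma * s
            dw = float(np.dot(d, w))
            beta = np.dot(Fx, y) / dw - mu * (np.dot(y, y) / dw**2) * np.dot(Fx, d)
            d = -Fx + beta * d
        alpha = beta0                                # Step 3: line search (5)
        while True:
            z = x + alpha * d
            Fz = F(z)
            rhs = sigma * alpha * min(np.dot(d, d),
                                      np.linalg.norm(Fz) * np.dot(d, d),
                                      -float(np.dot(Fx, d)))
            if -float(np.dot(Fz, d)) >= rhs or alpha < 1e-16:
                break
            alpha *= rho
        if np.linalg.norm(Fz) <= eps:                # Step 4: stop at z_k ...
            return z, k
        Fx_prev, Fz_prev, alpha_prev = Fx, Fz, alpha
        x = x - (np.dot(Fz, x - z) / np.dot(Fz, Fz)) * Fz   # ... or project via (9)
        Fx = F(x)
    return x, max_iter
```

For example, `nhz_solve(lambda x: 2.0 * x - np.sin(np.abs(x)), 0.1 * np.ones(1000))` runs the sketch on Problem 1 of Section 4 from the first initial point.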
In what follows, we show that the proposed NHZ derivative-free method enjoys the sufficient descent property, which plays an important role in the proof of convergence. From now on, we use $F_k$ to denote $F(x_k)$.
Theorem 1.
The search direction $d_k$ generated by (10), (11) and (12) is a sufficient descent direction. That is, if $d_{k-1}^T w_{k-1} \ne 0$ for all $k \ge 1$, then we have
$F_k^T d_k \le -\left(1 - \dfrac{1}{4\mu}\right) \|F_k\|^2, \quad \mu > \dfrac{1}{4}.$  (15)
Proof. 
When k = 0, we have
$F_0^T d_0 = -\|F_0\|^2 \le -\left(1 - \dfrac{1}{4\mu}\right) \|F_0\|^2.$
It is obvious that (15) is satisfied for k = 0.
Now we show that the sufficient descent condition (15) holds for $k \ge 1$. From (10) and (11) we obtain
$F_k^T d_k = -\|F_k\|^2 + \beta_k^{NHZ} F_k^T d_{k-1} = -\|F_k\|^2 + \left( \dfrac{F_k^T y_{k-1}}{d_{k-1}^T w_{k-1}} - \mu \dfrac{\|y_{k-1}\|^2}{(d_{k-1}^T w_{k-1})^2} F_k^T d_{k-1} \right) F_k^T d_{k-1} = \dfrac{F_k^T y_{k-1}\, (d_{k-1}^T w_{k-1})\, (F_k^T d_{k-1}) - \|F_k\|^2 (d_{k-1}^T w_{k-1})^2 - \mu \|y_{k-1}\|^2 (F_k^T d_{k-1})^2}{(d_{k-1}^T w_{k-1})^2}.$  (16)
Define
$u_k = \dfrac{1}{\sqrt{2\mu}} (d_{k-1}^T w_{k-1})\, F_k, \qquad v_k = \sqrt{2\mu}\, (F_k^T d_{k-1})\, y_{k-1}.$  (17)
Using (16), (17) and the inequality $u_k^T v_k \le \tfrac{1}{2}(\|u_k\|^2 + \|v_k\|^2)$, we have
$F_k^T d_k = \dfrac{u_k^T v_k - \tfrac{1}{2}(\|u_k\|^2 + \|v_k\|^2)}{(d_{k-1}^T w_{k-1})^2} - \left(1 - \dfrac{1}{4\mu}\right) \dfrac{(d_{k-1}^T w_{k-1})^2}{(d_{k-1}^T w_{k-1})^2} \|F_k\|^2 \le -\left(1 - \dfrac{1}{4\mu}\right) \|F_k\|^2.$  (18)
Thus (15) holds for $k \ge 1$. □
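The sufficient descent bound (15) depends only on the algebraic form of (10) and (11), so it can be spot-checked numerically on random data; the snippet below (ours, illustrative only) does exactly that with $\mu = 0.3$.

```python
import numpy as np

# Numerical spot-check of the sufficient descent property (15):
# F_k^T d_k <= -(1 - 1/(4*mu)) * ||F_k||^2 whenever d_{k-1}^T w_{k-1} != 0.
rng = np.random.default_rng(1)
mu = 0.3
for _ in range(10000):
    Fk, d_prev, y, w = (rng.standard_normal(20) for _ in range(4))
    dw = float(np.dot(d_prev, w))
    beta = np.dot(Fk, y) / dw - mu * (np.dot(y, y) / dw**2) * np.dot(Fk, d_prev)
    dk = -Fk + beta * d_prev
    assert np.dot(Fk, dk) <= -(1.0 - 1.0 / (4.0 * mu)) * np.dot(Fk, Fk) + 1e-6
```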

3. Global Convergence Analysis

Now, we investigate the global convergence of Algorithm 1. Firstly, we give the following lemma, which shows that the line search strategy (5) is well defined whenever the search directions $\{d_k\}$ satisfy the sufficient descent property.
Lemma 1.
Let the iterative sequences $\{x_k\}$ and $\{z_k\}$ be generated by Algorithm 1. Then there always exists a steplength $\alpha_k$ satisfying the line search (5).
Proof. 
Assume that, in the $k_0$-th iterate, the line search (5) fails for every nonnegative integer i, that is, for every trial steplength $\beta \rho^i$ we have
$-F(x_{k_0} + \beta \rho^i d_{k_0})^T d_{k_0} < \sigma \beta \rho^i \min\{\|d_{k_0}\|^2,\ \|F(x_{k_0} + \beta \rho^i d_{k_0})\| \|d_{k_0}\|^2,\ -F(x_{k_0})^T d_{k_0}\}.$
Letting $i \to \infty$, we obtain from the continuity of F and $\rho \in (0, 1)$ that
$-F(x_{k_0})^T d_{k_0} \le 0,$
which contradicts (15). The proof is completed. □
The next lemma indicates that the line search strategy (5) provides a lower bound for the steplength $\alpha_k$.
Lemma 2.
Let the iterative sequences $\{x_k\}$ and $\{z_k\}$ be generated by Algorithm 1. Then
$\alpha_k \ge \min\left\{ \beta,\ \dfrac{\delta \rho}{L + \sigma} \dfrac{\|F_k\|^2}{\|d_k\|^2} \right\},$  (19)
where $\delta = 1 - \dfrac{1}{4\mu}$.
Proof. 
If $\alpha_k = \beta$, then (19) obviously holds. Suppose that $\alpha_k \ne \beta$. Then $\alpha_k' = \rho^{-1} \alpha_k$ does not satisfy the line search (5), that is,
$-F(z_k')^T d_k < \sigma \alpha_k' \min\{\|d_k\|^2,\ \|F(z_k')\| \|d_k\|^2,\ -F(x_k)^T d_k\} \le \sigma \alpha_k' \|d_k\|^2,$  (20)
where $z_k' = x_k + \alpha_k' d_k$.
From the sufficient descent condition (15), we have
$\left(1 - \dfrac{1}{4\mu}\right) \|F_k\|^2 = \delta \|F_k\|^2 \le -F_k^T d_k.$  (21)
From the Lipschitz continuity of F in (14), together with (20) and (21), we obtain
$-F_k^T d_k = (F(z_k') - F(x_k))^T d_k - F(z_k')^T d_k \le \|F(z_k') - F(x_k)\| \|d_k\| + \sigma \alpha_k' \|d_k\|^2 \le L \|z_k' - x_k\| \|d_k\| + \sigma \alpha_k' \|d_k\|^2 = (L + \sigma) \alpha_k' \|d_k\|^2 = \rho^{-1} \alpha_k (L + \sigma) \|d_k\|^2.$
Therefore, the above inequality and (21) imply
$\alpha_k \ge \dfrac{\delta \rho}{L + \sigma} \dfrac{\|F_k\|^2}{\|d_k\|^2}.$
This establishes the lower bound (19) on the steplength $\alpha_k$. □
The next lemma was proved by Solodov and Svaiter (see Lemma 2.1 in [13]) and also holds for Algorithm 1. We state it without proof, because the argument is the same as in Solodov and Svaiter [13].
Lemma 3.
Assume the function F is monotone and the Lipschitz condition (14) holds. If the iterative sequence $\{x_k\}$ is generated by Algorithm 1, then for any $x^*$ such that $F(x^*) = 0$, we have
$\|x_{k+1} - x^*\|^2 \le \|x_k - x^*\|^2 - \|x_{k+1} - x_k\|^2.$  (22)
In particular, the iterative sequence $\{x_k\}$ is bounded and
$\sum_{k=0}^{\infty} \|x_{k+1} - x_k\|^2 < \infty.$  (23)
Remark 1.
The above Lemma 3 confirms that the sequence $\{\|x_k - x^*\|\}$ decreases with k. In addition, (23) implies that
$\lim_{k \to \infty} \|x_{k+1} - x_k\| = 0.$  (24)
Theorem 2.
If the iterative sequence $\{x_k\}$ is generated by Algorithm 1, then
$\lim_{k \to \infty} \alpha_k \|d_k\| = 0.$
Proof. 
From (5) and (9) we obtain, for any k,
$\|x_{k+1} - x_k\| = \dfrac{|F(z_k)^T (x_k - z_k)|}{\|F(z_k)\|} = \dfrac{\alpha_k |F(z_k)^T d_k|}{\|F(z_k)\|} \ge \sigma \alpha_k^2 \|d_k\|^2.$  (25)
In particular, it follows from (23) and (25) that
$\lim_{k \to \infty} \alpha_k \|d_k\| = 0.$  (26)
 □
Lemma 4.
If the iterative sequences { x k } is generated by the Algorithm 1, and x satisfies F ( x ) = 0 , z k = x k + α k d k , α k = ρ 1 α k . Then, { F ( z k ) } and { F k } are bounded, i.e, there is a constant M 0 , such that
F ( z k ) M , F k M .
Proof. 
By Lemma 3, we have
$\|x_k - x^*\| \le \|x_0 - x^*\|.$  (28)
From (26), there is a constant $M_1 > 0$ such that $\alpha_k \|d_k\| \le M_1$. Hence
$\|z_k' - x^*\| \le \|x_k - x^*\| + \alpha_k' \|d_k\| \le \|x_0 - x^*\| + \rho^{-1} \alpha_k \|d_k\| \le \|x_0 - x^*\| + \rho^{-1} M_1.$
Since the function F is Lipschitz continuous, we easily obtain the following two inequalities:
$\|F(z_k')\| = \|F(z_k') - F(x^*)\| \le L \|z_k' - x^*\| \le L (\|x_0 - x^*\| + \rho^{-1} M_1),$
and
$\|F_k\| = \|F(x_k) - F(x^*)\| \le L \|x_k - x^*\| \le L \|x_0 - x^*\|.$
Let $M = \max\{ L \|x_0 - x^*\|,\ L (\|x_0 - x^*\| + \rho^{-1} M_1) \}$. Then (27) follows. □
We now give the global convergence theorem for the proposed method.
Theorem 3.
If the iterative sequence $\{x_k\}$ is generated by Algorithm 1, then
$\liminf_{k \to \infty} \|F_k\| = 0.$  (29)
Proof. 
We prove this theorem by contradiction. Assume that (29) is not true. Then there is a constant $\varepsilon > 0$ such that $\|F_k\| \ge \varepsilon$ for all k.
Since $\|F_k\| \ne 0$, we have from (15) that $d_k \ne 0$. Hence, the monotonicity of F together with (12) implies
$\bar{s}_{k-1}^T w_{k-1} = \langle F(z_{k-1}) - F(x_{k-1}),\ z_{k-1} - x_{k-1} \rangle + \gamma\, \bar{s}_{k-1}^T \bar{s}_{k-1} \ge \gamma\, \bar{s}_{k-1}^T \bar{s}_{k-1}.$
This together with the definition of $\bar{s}_{k-1}$ implies
$d_{k-1}^T w_{k-1} \ge \gamma \alpha_{k-1} \|d_{k-1}\|^2.$  (30)
We have from (11), (12), the Lipschitz condition (14) and (30) that
$|\beta_k^{NHZ}| = \left| \dfrac{F_k^T y_{k-1}}{d_{k-1}^T w_{k-1}} - \mu \dfrac{\|y_{k-1}\|^2}{(d_{k-1}^T w_{k-1})^2} F_k^T d_{k-1} \right| \le \dfrac{L \alpha_{k-1} \|d_{k-1}\| \|F_k\|}{\gamma \alpha_{k-1} \|d_{k-1}\|^2} + \mu \dfrac{L^2 \alpha_{k-1}^2 \|d_{k-1}\|^2}{\gamma^2 \alpha_{k-1}^2 \|d_{k-1}\|^4} \|F_k\| \|d_{k-1}\| = \left( \dfrac{L}{\gamma} + \dfrac{\mu L^2}{\gamma^2} \right) \dfrac{\|F_k\|}{\|d_{k-1}\|}.$
Therefore, from (10), (27) and the above bound, we obtain
$\|d_k\| \le \|F_k\| + |\beta_k^{NHZ}| \|d_{k-1}\| \le \|F_k\| + \left( \dfrac{L}{\gamma} + \dfrac{\mu L^2}{\gamma^2} \right) \|F_k\| \le \left( 1 + \dfrac{L}{\gamma} + \dfrac{\mu L^2}{\gamma^2} \right) M.$
Define $C = \left( 1 + \dfrac{L}{\gamma} + \dfrac{\mu L^2}{\gamma^2} \right) M$. Then $\|d_k\| \le C$.
Moreover, (15) and the Cauchy-Schwarz inequality give $\|d_k\| \ge \delta \|F_k\| \ge \delta \varepsilon$. It then follows from Lemma 2, $\|F_k\| \ge \varepsilon$, $\|d_k\| \ge \delta \varepsilon$ and $\|d_k\| \le C$ that, for all k,
$\alpha_k \|d_k\| \ge \min\left\{ \beta,\ \dfrac{\delta \rho}{L + \sigma} \dfrac{\|F_k\|^2}{\|d_k\|^2} \right\} \|d_k\| \ge \min\left\{ \beta \delta \varepsilon,\ \dfrac{\delta \rho \varepsilon^2}{(L + \sigma) C} \right\} > 0,$
which contradicts (26). That is, (29) holds, and the proof is complete. □

4. Numerical Experiments

Now, we give some numerical experiments to test the performance of the proposed method. We test the NHZ method (Algorithm 1) and compare its performance with the spectral gradient (SG) method [15] and the MPRP method in [39]. All codes were written in MATLAB R2018a and run on a Lenovo PC with 4 GB of RAM.
To obtain better numerical performance, we select the following initial steplength, as in [39] and [43]:
$\beta = \left| \dfrac{F(x_k)^T d_k}{d_k^T \left( F(x_k + \epsilon d_k) - F(x_k) \right) / \epsilon} \right| \approx \left| \dfrac{F(x_k)^T d_k}{d_k^T F'(x_k) d_k} \right|, \qquad \epsilon = 10^{-8}.$
We set $\rho = 0.5$ and $\sigma = 2$. Further, we let $\beta = 1$ if the computed $\beta < 10^{-4}$.
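A sketch (ours) of this initial-steplength rule; the function name, the tiny-denominator guard, and the reset threshold follow the description above rather than any code published with the paper.

```python
import numpy as np

def initial_steplength(F, x, d, Fx, eps_fd=1e-8, floor=1e-4):
    """Initial trial steplength: |F(x)^T d| / |d^T (F(x + eps*d) - F(x)) / eps|,
    a finite-difference estimate of |F(x)^T d / (d^T F'(x) d)|; reset to 1 if below 1e-4."""
    denom = float(np.dot(d, (F(x + eps_fd * d) - Fx) / eps_fd))
    beta = abs(float(np.dot(Fx, d))) / max(abs(denom), np.finfo(float).tiny)
    return 1.0 if beta < floor else beta
```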
Following the MPRP method in [39], we terminate the iterative process when the condition
$\min\{\|F(x_k)\|,\ \|F(z_k)\|\} \le atol + rtol\, \|F(x_0)\|$
is satisfied, with $rtol = atol = 10^{-4}$.
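The corresponding termination test is a one-liner (sketch ours):

```python
import numpy as np

def converged(Fxk, Fzk, F0_norm, atol=1e-4, rtol=1e-4):
    """Stopping rule used in the experiments: min(||F(x_k)||, ||F(z_k)||) <= atol + rtol*||F(x_0)||."""
    return min(np.linalg.norm(Fxk), np.linalg.norm(Fzk)) <= atol + rtol * F0_norm
```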
The numerical performance of the SG, MPRP, and NHZ methods is tested on the following five nonlinear monotone equations problems with various sizes and initial points.
Problem 1
([49]). The specific expression of the function F ( x ) is defined as
$F_i(x) = 2x_i - \sin(|x_i|), \quad i = 1, \ldots, n.$
Problem 2
([49]). The specific expression of the function F ( x ) is defined as
$F_1(x) = 2x_1 + \sin(x_1) - 1,$
$F_i(x) = -2x_{i-1} + 2x_i + \sin(x_i) - 1, \quad i = 2, \ldots, n-1,$
$F_n(x) = 2x_n + \sin(x_n) - 1.$
Problem 3
([50]). The specific expression of the function F ( x ) is defined as
$F_i(x) = x_i - \sin(x_i), \quad i = 1, \ldots, n.$
Problem 4
([50]). The specific expression of the function F ( x ) is defined as
$F(x) = Ax + g(x),$
where $g(x) = (e^{x_1} - 1, e^{x_2} - 1, \ldots, e^{x_n} - 1)^T$ and
$A = \begin{pmatrix} 2 & -1 & & \\ -1 & 2 & -1 & \\ & \ddots & \ddots & \ddots \\ & & -1 & 2 \end{pmatrix}.$
Problem 5.
The specific expression of the function F ( x ) is defined as
$F(x) = Ax + |X| - B,$
where $|X| = (|x_1|, |x_2|, \ldots, |x_n|)^T$, $B = (1, 1, \ldots, 1)^T$, and A is the same tridiagonal matrix as in Problem 4.
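For reference, the five test mappings can be coded in a few lines of NumPy (sketch ours); the minus signs in the tridiagonal matrix A and in Problems 2 and 5 follow the reconstructions given above and should be read with that caveat.

```python
import numpy as np

def Ax(x):                             # product with A = tridiag(-1, 2, -1), without forming A
    y = 2.0 * x
    y[:-1] -= x[1:]
    y[1:] -= x[:-1]
    return y

def problem1(x):                       # F_i(x) = 2 x_i - sin(|x_i|)
    return 2.0 * x - np.sin(np.abs(x))

def problem2(x):                       # tridiagonal system from [49]
    F = 2.0 * x + np.sin(x) - 1.0
    F[1:-1] -= 2.0 * x[:-2]            # adds the -2 x_{i-1} term for i = 2, ..., n-1
    return F

def problem3(x):                       # F_i(x) = x_i - sin(x_i)
    return x - np.sin(x)

def problem4(x):                       # F(x) = A x + (e^{x_i} - 1)_i
    return Ax(x) + np.exp(x) - 1.0

def problem5(x):                       # F(x) = A x + |x| - B, with B = (1, ..., 1)^T
    return Ax(x) + np.abs(x) - 1.0
```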
The numerical results for the five test problems are reported in Table 1, Table 2, Table 3, Table 4 and Table 5, respectively, where the initial points are $x_1 = (0.1, \ldots, 0.1)^T$, $x_2 = (1, \ldots, 1)^T$, $x_3 = (1, \frac{1}{2}, \ldots, \frac{1}{n})^T$, $x_4 = (10, \ldots, 10)^T$, $x_5 = (-0.1, \ldots, -0.1)^T$, $x_6 = (-1, \ldots, -1)^T$. In the tables, Time is the CPU time (in seconds), Iter is the number of iterations, and Feval is the number of function evaluations.
Table 1, Table 2, Table 3, Table 4 and Table 5 report the numerical results of the proposed algorithm, the spectral gradient (SG) method [15] and the MPRP method in [39] on the five test problems, using Time, Iter and Feval as performance indicators.
Comparing CPU time across the tested algorithms, we note that the proposed algorithm requires less time than both the spectral gradient (SG) method [15] and the MPRP method in [39] for all test problems, and the difference is substantial, especially for large-scale problems. In addition, the MPRP method in [39] requires less CPU time than the spectral gradient (SG) method [15]. Comparing the number of iterations, we find that the NHZ method requires fewer iterations than the spectral gradient (SG) method [15] and the MPRP method in [39] for all test problems. We also note that the proposed algorithm requires fewer function evaluations for all test problems, and the difference is again substantial.
In summary, the numerical results in Table 1, Table 2, Table 3, Table 4 and Table 5 show that the proposed algorithm performs better than the spectral gradient (SG) method [15] and the MPRP method in [39] on all three indicators, which implies that the modified Hestenes-Stiefel-based derivative-free method is computationally efficient for nonlinear monotone equations.

5. Conclusions

This paper presents a modified Hestenes-Stiefel method for solving nonlinear monotone equations, which combines the hyperplane projection method [13] with the modified Hestenes-Stiefel method of Dai and Wen [47]. In the proposed method, the search direction satisfies the sufficient descent condition. A new line search is proposed for the derivative-free method. Under appropriate conditions, the proposed method converges globally. The reported numerical results show that the presented method is more efficient than the spectral gradient method of Zhang and Zhou [15] and the MPRP method of Li and Li [39].
In addition, we also expect that our proposed method and its further modifications could produce new applications for problems in relevant areas of symmetric equations [51], image processing [52], and finance [53,54,55].

Author Contributions

Conceptualization, Z.D.; methodology, Z.D. and H.Z.; software, Z.D. and H.Z.; validation, Z.D.; formal analysis, Z.D.; investigation, Z.D.; resources, Z.D. and H.Z.; data curation, Z.D.; writing-original draft preparation, Z.D. and H.Z.; writing-review and editing, Z.D. and H.Z.; visualization, Z.D. and H.Z.; supervision, Z.D. and H.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China, grants 71771030 and 11301041, and by the Scientific Research Fund of the Hunan Provincial Education Department, grant 19A007.

Acknowledgments

This work was supported by the National Natural Science Foundation of China, grants 71771030, and 11301041; and the Scientific Research Fund of Hunan Provincial Education Department, grant number 19A007.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Iusem, A.N.; Solodov, M.V. Newton-type methods with generalized distances for constrained optimization. Optimization 1997, 41, 257–278.
2. Zhao, Y.B.; Li, D.H. Monotonicity of fixed point and normal mapping associated with variational inequality and its application. SIAM J. Optim. 2001, 4, 962–973.
3. Figueiredo, M.; Nowak, R.; Wright, S.J. Gradient projection for sparse reconstruction, application to compressed sensing and other inverse problems. IEEE J-STSP 2007, 1, 586–597.
4. Xiao, Y.; Zhu, H. A conjugate gradient method to solve convex constrained monotone equations with applications in compressive sensing. J. Math. Anal. Appl. 2013, 405, 310–319.
5. Shang, Y. Vulnerability of networks: Fractional percolation on random graphs. Phys. Rev. E 2014, 89, 012813.
6. Shang, Y. Super Connectivity of Erdos-Renyi Graphs. Mathematics 2019, 7, 267.
7. Brown, P.N.; Saad, Y. Convergence theory of nonlinear Newton-Krylov algorithms. SIAM J. Optim. 1994, 4, 297–330.
8. Gasparo, M. A nonmonotone hybrid method for nonlinear systems. Optim. Methods Softw. 2000, 13, 79–94.
9. Griewank, A. The global convergence of Broyden-like methods with suitable line search. J. Austral. Math. Soc. Ser. B 1996, 28, 75–92.
10. Li, D.H.; Fukushima, M. A modified BFGS method and its global convergence in nonconvex minimization. J. Comput. Appl. Math. 2001, 129, 15–35.
11. Li, D.H.; Fukushima, M. A derivative-free line search and global convergence of Broyden-like method for nonlinear equations. Optim. Methods Softw. 2000, 13, 583–599.
12. Martínez, J.M. A family of quasi-Newton methods for nonlinear equations with direct secant updates of matrix factorizations. SIAM J. Numer. Anal. 1990, 27, 1034–1049.
13. Solodov, M.V.; Svaiter, B.F. A globally convergent inexact Newton method for systems of monotone equations. In Reformulation: Nonsmooth, Piecewise Smooth, Semismooth and Smoothing Methods; Fukushima, M., Qi, L., Eds.; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1998; pp. 355–369.
14. Barzilai, J.; Borwein, J.M. Two-point step size gradient methods. IMA J. Numer. Anal. 1998, 8, 141–148.
15. Zhang, L.; Zhou, W.J. Spectral gradient projection method for solving nonlinear monotone equations. J. Comput. Appl. Math. 2006, 196, 478–484.
16. Yu, Z.S.; Ji, L.N.; Sun, J.; Xiao, Y.H. Spectral gradient projection method for monotone nonlinear equations with convex constraints. Appl. Numer. Math. 2009, 59, 2416–2423.
17. Hager, W.W.; Zhang, H. A new conjugate gradient method with guaranteed descent and an efficient line search. SIAM J. Optim. 2005, 16, 170–192.
18. Zhang, L.; Zhou, W.J.; Li, D.H. A descent modified Polak-Ribière-Polyak conjugate gradient method and its global convergence. IMA J. Numer. Anal. 2006, 26, 629–640.
19. Cheng, W.Y. A two-term PRP-based descent method. Numer. Funct. Anal. Optim. 2007, 28, 1217–1230.
20. Zhang, L.; Zhou, W.J.; Li, D.H. Global convergence of a modified Fletcher-Reeves conjugate gradient method with Armijo-type line search. Numer. Math. 2006, 104, 561–572.
21. Dai, Z.; Zhu, H. Stock return predictability from a mixed model perspective. Pac.-Basin Financ. J. 2020, 60, 101267.
22. Narushima, Y.; Yabe, H.; Ford, J.A. A three-term conjugate gradient method with sufficient descent property for unconstrained optimization. SIAM J. Optim. 2011, 21, 212–230.
23. Andrei, N. A simple three-term conjugate gradient algorithm for unconstrained optimization. J. Comput. Appl. Math. 2013, 241, 19–29.
24. Andrei, N. On three-term conjugate gradient algorithms for unconstrained optimization. Appl. Math. Comput. 2013, 219, 6316–6327.
25. Liu, Z.X.; Liu, H.W.; Dong, X.L. A new adaptive Barzilai and Borwein method for unconstrained optimization. Optim. Lett. 2018, 12, 845–873.
26. Babaie-Kafaki, S.; Reza, G. The Dai-Liao nonlinear conjugate gradient method with optimal parameter choices. Eur. J. Oper. Res. 2014, 234, 625–630.
27. Babaie-Kafaki, S. On optimality of the parameters of self-scaling memoryless quasi-Newton updating formulae. J. Optim. Theory Appl. 2015, 167, 91–101.
28. Yuan, G.; Zhang, M. A three-terms Polak-Ribiére-Polyak conjugate gradient algorithm for large-scale nonlinear equations. J. Comput. Appl. Math. 2015, 286, 186–195.
29. Yuan, G.; Meng, Z.H.; Li, Y. A modified Hestenes and Stiefel conjugate gradient algorithm for large-scale nonsmooth minimizations and nonlinear equations. J. Optim. Theory Appl. 2016, 168, 129–152.
30. Dong, X.; Han, D.; Reza, G.; Li, X.; Dai, Z. Some new three-term Hestenes-Stiefel conjugate gradient methods with affine combination. Optimization 2017, 66, 759–776.
31. Dong, X.; Han, D.; Dai, Z.; Li, L.; Zhu, J. An accelerated three-term conjugate gradient method with sufficient descent condition and conjugacy condition. J. Optim. Theory Appl. 2018, 179, 944–961.
32. Li, M.; Feng, H. A sufficient descent Liu-Storey conjugate gradient method for unconstrained optimization problems. Appl. Math. Comput. 2011, 218, 1577–1586.
33. Dai, Z.; Wen, F. Another improved Wei-Yao-Liu nonlinear conjugate gradient method with sufficient descent property. Appl. Math. Comput. 2012, 218, 4721–4730.
34. Dai, Z.; Tian, B. Global convergence of some modified PRP nonlinear conjugate gradient methods. Optim. Lett. 2011, 5, 615–630.
35. Dai, Z. Comments on a new class of nonlinear conjugate gradient coefficients with global convergence properties. Appl. Math. Comput. 2016, 276, 297–300.
36. Dai, Z.; Zhou, H.; Wen, F.; He, S. Efficient predictability of stock return volatility: The role of stock market implied volatility. N. Am. J. Econ. Finance 2020, forthcoming.
37. Cheng, W.Y. A PRP type method for systems of monotone equations. Math. Comput. Model. 2009, 50, 15–20.
38. Yu, G. A derivative-free method for solving large-scale nonlinear systems of equations. J. Ind. Manag. Optim. 2010, 6, 149–160.
39. Li, Q.; Li, D.H. A class of derivative-free methods for large-scale nonlinear monotone equations. IMA J. Numer. Anal. 2011, 31, 1625–1635.
40. Yu, G. Nonmonotone spectral gradient-type methods for large-scale unconstrained optimization and nonlinear systems of equations. Pac. J. Optim. 2011, 7, 387–404.
41. Zhou, W.; Shen, D. An inexact PRP conjugate gradient method for symmetric nonlinear equations. Numer. Funct. Anal. Optim. 2014, 35, 370–388.
42. Sun, M.; Wang, X.; Feng, D. A family of conjugate gradient methods for large-scale nonlinear equations. J. Inequal. Appl. 2017, 236, 1–8.
43. Zhou, W.; Wang, F. A PRP-based residual method for large-scale monotone nonlinear equations. Appl. Math. Comput. 2015, 261, 1–7.
44. Dai, Z.; Chen, X.; Wen, F. A modified Perry's conjugate gradient method-based derivative-free method for solving large-scale nonlinear monotone equations. Appl. Math. Comput. 2015, 270, 378–386.
45. Li, M. A Liu-Storey type method for solving large-scale nonlinear monotone equations. Numer. Funct. Anal. Optim. 2014, 35, 310–322.
46. Hestenes, M.R.; Stiefel, E. Methods of conjugate gradients for solving linear systems. J. Res. Natl. Bur. Stand. 1952, 49, 409–436.
47. Dai, Z.; Wen, F. Global convergence of a modified Hestenes-Stiefel nonlinear conjugate gradient method with Armijo line search. Numer. Algor. 2012, 59, 79–93.
48. Yan, Q.R.; Peng, X.Z.; Li, D.H. A globally convergent derivative-free method for solving large-scale nonlinear monotone equations. J. Comput. Appl. Math. 2010, 234, 649–657.
49. Zhou, W.J.; Li, D.H. Limited memory BFGS method for nonlinear monotone equations. J. Comput. Math. 2007, 25, 89–96.
50. Zhou, W.J.; Li, D.H. A globally convergent BFGS method for nonlinear monotone equations without any merit functions. Math. Comput. 2008, 77, 2231–2240.
51. Zhou, W.J. A modified BFGS type quasi-Newton method with line search for symmetric nonlinear equations problems. J. Comput. Appl. Math. 2020, 357, 454.
52. Gao, P.T.; He, C.; Liu, Y. An adaptive family of projection methods for constrained monotone nonlinear equations with applications. Appl. Math. Comput. 2019, 359, 1–16.
53. Dai, Z.; Zhu, H. Forecasting stock market returns by combining sum-of-the-parts and ensemble empirical mode decomposition. Appl. Econ. 2019.
54. Dai, Z.; Zhu, H.; Wen, F. Two nonparametric approaches to mean absolute deviation portfolio selection model. J. Ind. Manag. Optim. 2019.
55. Dai, Z.; Zhou, H. Prediction of stock returns: Sum-of-the-parts method and economic constraint method. Sustainability 2020, 12, 541.
Table 1. Numerical results for the tested Problem 1 with various sizes and given initial points.

Initial | Dim.   | SG Time/Iter/Feval | MPRP Time/Iter/Feval | NHZ Time/Iter/Feval
x1      | 1000   | 1.16/16/37         | 0.81/12/39           | 0.65/10/29
x2      | 1000   | 1.16/16/37         | 0.81/12/39           | 0.65/10/29
x3      | 1000   | 0.77/13/26         | 0.55/8/27            | 0.48/7/22
x4      | 1000   | 1.24/15/43         | 0.85/9/42            | 0.78/7/35
x5      | 1000   | 1.15/14/44         | 0.53/6/28            | 0.48/5/24
x6      | 1000   | 1.15/14/44         | 0.53/6/28            | 0.48/5/24
x1      | 5000   | 6.96/18/37         | 5.32/13/42           | 4.55/11/32
x2      | 5000   | 6.96/17/37         | 5.01/12/39           | 3.85/11/29
x3      | 5000   | 4.16/11/22         | 3.01/7/24            | 2.43/6/20
x4      | 5000   | 10.45/18/62        | 7.38/12/60           | 5.99/11/48
x5      | 5000   | 6.88/14/44         | 3.40/6/28            | 2.84/5/24
x6      | 5000   | 6.96/15/45         | 3.40/6/28            | 2.84/5/24
x1      | 10,000 | 27.77/18/38        | 21.15/13/42          | 15.72/11/32
x2      | 10,000 | 27.62/17/36        | 19.65/12/39          | 14.36/11/27
x3      | 10,000 | 14.90/10/20        | 11.92/7/24           | 8.96/6/20
x4      | 10,000 | 44.65/20/69        | 35.15/14/72          | 30.98/12/68
x5      | 10,000 | 29.55/15/45        | 13.86/6/28           | 11.22/5/24
x6      | 10,000 | 29.55/15/45        | 13.86/6/28           | 11.22/5/24
Table 2. Numerical results for the tested Problem 2 with various sizes and given initial points.

Initial | Dim.   | SG Time/Iter/Feval | MPRP Time/Iter/Feval | NHZ Time/Iter/Feval
x1      | 1000   | 0.67/326/646       | 0.17/196/591         | 0.12/155/568
x2      | 1000   | 0.64/364/714       | 0.11/203/612         | 0.10/180/584
x3      | 1000   | 0.64/343/691       | 0.19/195/588         | 0.17/174/555
x4      | 1000   | 1.18/468/941       | 0.15/243/741         | 0.14/220/709
x5      | 1000   | 0.67/479/988       | 0.17/194/585         | 0.15/162/549
x6      | 1000   | 0.65/431/857       | 0.16/189/571         | 0.14/165/529
x1      | 5000   | 10.38/723/2047     | 8.19/487/1465        | 8.10/468/1448
x2      | 5000   | 16.87/753/2622     | 12.85/769/2310       | 12.63/744/2205
x3      | 5000   | 9.01/824/2849      | 7.98/469/1411        | 7.65/442/1324
x4      | 5000   | 18.74/1023/3261    | 15.96/929/2840       | 15.51/901/2781
x5      | 5000   | 10.05/1226/3053    | 7.59/453/1363        | 7.50/442/1320
x6      | 5000   | 11.32/836/2476     | 6.32/371/1122        | 6.20/338/1103
x1      | 10,000 | 47.90/960/2002     | 34.59/539/1622       | 10.01/460/1508
x2      | 10,000 | 82.62/1334/4066    | 65.06/1023/3072      | 60.79/1001/3003
x3      | 10,000 | 50.99/833/2469     | 34.30/516/1553       | 32.32/501/1502
x4      | 10,000 | 56.82/2042/6668    | 39.65/1668/5075      | 36.31/1602/5003
x5      | 10,000 | 49.63/832/2268     | 31.57/497/1497       | 30.25/436/1405
x6      | 10,000 | 45.70/850/1706     | 25.36/396/1202       | 22.24/375/1106
Table 3. Numerical results for the tested Problem 3 with various sizes and given initial points.

Initial | Dim.   | SG Time/Iter/Feval | MPRP Time/Iter/Feval | NHZ Time/Iter/Feval
x1      | 1000   | 1.16/16/37         | 0.81/12/39           | 0.62/10/28
x2      | 1000   | 1.17/17/36         | 0.83/12/39           | 0.71/11/28
x3      | 1000   | 0.77/11/24         | 0.57/8/27            | 0.49/7/28
x4      | 1000   | 1.25/14/44         | 0.88/9/42            | 0.75/7/32
x5      | 1000   | 1.16/13/42         | 0.56/6/28            | 0.48/5/22
x6      | 1000   | 1.16/13/42         | 0.57/6/28            | 0.48/5/22
x1      | 5000   | 6.98/17/36         | 5.42/13/42           | 4.63/11/32
x2      | 5000   | 6.98/17/36         | 5.11/12/39           | 3.95/11/30
x3      | 5000   | 4.29/10/22         | 3.12/7/24            | 2.34/6/20
x4      | 5000   | 10.57/19/64        | 7.46/12/60           | 6.25/11/52
x5      | 5000   | 6.99/13/42         | 3.52/6/28            | 3.92/5/24
x6      | 5000   | 6.99/13/42         | 3.52/6/28            | 3.92/5/24
x1      | 10,000 | 27.78/17/36        | 21.35/13/42          | 15.97/11/32
x2      | 10,000 | 27.79/17/36        | 19.75/12/39          | 15.86/11/30
x3      | 10,000 | 15.65/9/26         | 11.99/7/24           | 9.98/6/19
x4      | 10,000 | 44.85/20/69        | 35.36/14/72          | 29.98/12/60
x5      | 10,000 | 29.89/14/45        | 13.98/6/28           | 12.56/5/24
x6      | 10,000 | 29.89/14/45        | 13.98/6/28           | 13.59/6/24
Table 4. Numerical results for the tested Problem 4 with various sizes and given initial points.

Initial | Dim.   | SG Time/Iter/Feval | MPRP Time/Iter/Feval | NHZ Time/Iter/Feval
x1      | 1000   | 0.20/219/431       | 0.06/50/216          | 0.05/38/168
x2      | 1000   | 0.28/261/463       | 0.06/56/252          | 0.05/46/185
x3      | 1000   | 0.28/224/329       | 0.05/34/152          | 0.03/32/137
x4      | 1000   | 0.22/263/529       | 0.07/100/421         | 0.06/96/399
x5      | 1000   | 0.28/183/403       | 0.06/42/187          | 0.05/40/177
x6      | 1000   | 0.28/212/424       | 0.06/60/261          | 0.05/48/218
x1      | 5000   | 2.15/263/456       | 1.11/48/209          | 1.05/47/183
x2      | 5000   | 2.45/225/378       | 1.19/46/224          | 0.92/38/169
x3      | 5000   | 1.65/122/267       | 0.62/27/117          | 0.65/29/128
x4      | 5000   | 3.41/265/558       | 2.59/109/483         | 2.49/104/455
x5      | 5000   | 2.86/290/467       | 1.21/53/231          | 1.07/44/189
x6      | 5000   | 2.97/231/477       | 1.20/54/234          | 1.16/48/213
x1      | 10,000 | 5.30/278/502       | 3.96/45/195          | 3.83/42/185
x2      | 10,000 | 6.26/237/574       | 4.27/41/210          | 3.84/38/158
x3      | 10,000 | 5.62/275/585       | 1.93/42/96           | 2.62/35/142
x4      | 10,000 | 18.15/333/596      | 10.86/117/533        | 9.45/109/498
x5      | 10,000 | 13.52/341/595      | 4.34/49/212          | 3.85/44/186
x6      | 10,000 | 13.55/336/553      | 4.89/56/246          | 3.78/48/195
Table 5. Numerical results for the tested Problem 5 with various sizes and given initial points.

Initial | Dim.   | SG Time/Iter/Feval | MPRP Time/Iter/Feval | NHZ Time/Iter/Feval
x1      | 1000   | 0.89/119/289       | 0.66/47/199          | 0.44/38/168
x2      | 1000   | 0.78/122/263       | 0.45/22/105          | 0.44/24/98
x3      | 1000   | 0.69/130/235       | 0.35/48/209          | 0.28/38/120
x4      | 1000   | 0.85/190/249       | 0.47/37/165          | 0.34/35/98
x5      | 1000   | 0.75/194/248       | 0.55/94/237          | 0.45/66/192
x6      | 1000   | 1.22/225/462       | 0.79/174/396         | 0.75/142/372
x1      | 5000   | 2.32/113/260       | 1.22/51/221          | 0.98/42/172
x2      | 5000   | 2.92/128/270       | 0.56/22/105          | 0.58/28/96
x3      | 5000   | 3.80/228/412       | 1.11/47/200          | 0.79/44/144
x4      | 5000   | 3.50/216/424       | 1.20/48/206          | 0.79/44/142
x5      | 5000   | 3.00/226/443       | 1.17/47/206          | 0.81/44/122
x6      | 5000   | 6.57/461/881       | 5.06/308/707         | 4.25/262/628
x1      | 10,000 | 5.92/66/209        | 3.90/44/191          | 3.42/38/184
x 2 10,0006.866821851.12210549.12198
x3      | 10,000 | 5.76/60/181        | 4.24/47/204          | 3.23/38/132
x4      | 10,000 | 11.84/69/227       | 10.54/8/209          | 8.52/44/148
x5      | 10,000 | 10.55/68/221       | 4.02/45/196          | 3.82/42/168
x 6 10,00012.468932610.1742627.8368232
