Article

A New Adaptive Accelerated Levenberg–Marquardt Method for Solving Nonlinear Equations and Its Applications in Supply Chain Problems

1 School of Mathematics and Statistics, Beihua University, Jilin 132013, China
2 School of Information Engineering, Hainan Vocational University of Science and Technology, Hainan 571126, China
* Author to whom correspondence should be addressed.
Symmetry 2023, 15(3), 588; https://doi.org/10.3390/sym15030588
Submission received: 10 February 2023 / Revised: 21 February 2023 / Accepted: 22 February 2023 / Published: 24 February 2023

Abstract: In this paper, a new adaptive Levenberg–Marquardt method is proposed for solving nonlinear equations, including those arising in supply chain optimization problems. We present a new adaptive update rule, a piecewise function of the ratio between the actual and predicted reductions of the objective function, which avoids long runs of unsuccessful iterations and prevents the iterates from jumping around in local areas. The global convergence and quadratic convergence of the proposed method are proved by using the trust region technique and the local error bound condition, respectively. In addition, we test the proposed algorithm on symmetric and nonsymmetric nonlinear equations. Numerical results show that the proposed method has good numerical performance and development prospects. Furthermore, we apply the algorithm to solve fresh agricultural products supply chain optimization problems.

1. Introduction

With the development of science and technology, nonlinear equations arise in more and more fields, such as chemistry, mechanics, economics and product management [1,2,3,4]. For example, decentralized decision models in supply chain management and gas pressure–volume models in physics can be converted into the following nonlinear equations
$$F(x) = 0,$$
where $F(x): \mathbb{R}^n \to \mathbb{R}^m$ is a continuously differentiable function. In particular, symmetric nonlinear equations, whose Jacobian matrix is symmetric, also have a wide range of applications, such as the gradient mapping of unconstrained optimization problems and the Karush–Kuhn–Tucker (KKT) systems of equality constrained optimization problems [5,6].
The steepest descent method, the Newton method, quasi-Newton methods and the Gauss–Newton (GN) method are commonly used iterative methods for solving (1) [7,8,9,10]. The GN method is one of the most famous: when the Jacobian matrix is Lipschitz continuous and nonsingular at a solution of (1), the GN method converges quadratically. However, when the Jacobian matrix is singular or nearly singular, the GN method may not be well defined. To overcome this difficulty, the Levenberg–Marquardt (LM) method [11,12] for solving (1) was proposed. At the $k$-th iteration, the trial step is
$$d_k = -\left(J_k^T J_k + \lambda_k I\right)^{-1} J_k^T F_k,$$
where $F_k = F(x_k)$, $J_k = J(x_k)$ is the Jacobian matrix of $F(x)$ at $x_k$ (which may be a symmetric or non-symmetric matrix), $I$ is the identity matrix and the LM parameter $\lambda_k > 0$.
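As a concrete illustration, the trial step can be computed in a few lines of MATLAB. This is a minimal sketch with our own variable names (the paper does not prescribe an implementation), exploiting the fact that the shifted normal matrix is symmetric positive definite for $\lambda_k > 0$:

```matlab
% One Levenberg-Marquardt trial step for F(x) = 0 (illustrative sketch).
% Fk: residual F(x_k); Jk: Jacobian J(x_k); lambda: LM parameter > 0.
function d = lm_step(Fk, Jk, lambda)
    n = size(Jk, 2);
    % (Jk'*Jk + lambda*I) is symmetric positive definite for lambda > 0,
    % so the trial step d is uniquely determined.
    d = -(Jk' * Jk + lambda * eye(n)) \ (Jk' * Fk);
end
```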
The LM method ensures that the trial step is uniquely determined, and it also converges quadratically if $J_k$ is Lipschitz continuous and nonsingular at the solution and $\lambda_k$ is selected appropriately. In this sense, the update of the LM parameter has a great impact on the performance and efficiency of the algorithm, and many effective LM parameters have been proposed. Yamashita and Fukushima [13] chose the LM parameter as $\lambda_k = \|F_k\|^2$, and proved that the LM method converges quadratically under the local error bound condition when $J_k$ is Lipschitz continuous near the solution. However, when $\{x_k\}$ is far away from the solution set, $\lambda_k$ may be very large, which makes $d_k$ very small and reduces the efficiency of the algorithm; when $\{x_k\}$ is sufficiently close to the solution set, $\lambda_k$ may be smaller than the machine epsilon and lose its role.
Based on these observations, Fan and Yuan [14] generalized the LM parameter in [13], and showed numerically that choosing $\lambda_k = \|F_k\|$ performs better than choosing $\lambda_k = \|F_k\|^2$. Fan [15] first introduced the regularization factor $\mu_k$ into the LM method and chose $\lambda_k = \mu_k \|F_k\|$, with numerical results showing that this choice of $\lambda_k$ provides the best performance. However, when $\{x_k\}$ is far away from the solution set, neither of these LM parameters provides good results. Therefore, to avoid this situation, Fan and Pan [16] chose the LM parameter as $\lambda_k = \mu_k \rho(x_k)$, in which $\mu_k$ is updated by a trust region technique. They defined $\rho(x_k)$ as a positive function from $\mathbb{R}^n$ to $\mathbb{R}^{+}$, i.e.,
$$\rho(x_k) = \begin{cases} \tilde{\rho}(x_k), & \text{if } \tilde{\rho}(x_k) \le 1,\\ 1, & \text{otherwise}, \end{cases}$$
where $\tilde{\rho}(x_k) = O(\|F_k\|^{\delta})$. This update strategy yields larger LM trial steps, so that the iterative sequence can converge quickly to the solution set when $\{x_k\}$ is far away from it. Amini et al. [17] chose the LM parameter as
$$\lambda_k = \frac{\mu_k \|F_k\|}{1 + \|F_k\|}.$$
It is clear that when $\{x_k\}$ is far away from the solution set and $\|F_k\|$ is very large, $\frac{\|F_k\|}{1 + \|F_k\|}$ is close to 1, so $\lambda_k$ is close to $\mu_k$. This choice of $\lambda_k$ improves the efficiency of the algorithm over the previous LM parameters.
In addition to these different choices of the LM parameter, the introduction of adaptive techniques also has a great impact on the LM method. As is well known, the ratio $r_k$ between the actual and predicted reductions of the objective function reflects how well the approximate quadratic model agrees with the merit function. To make more use of this ratio information, Fan and Yuan [18] proposed an adaptive LM method by selecting $\lambda_{k+1} = \mu_{k+1}\|F_{k+1}\|^{\delta}$ with $\delta \in (0, 2]$ and $\mu_{k+1} = \mu_k q(r_k)$, where $q(r_k)$ is a continuous non-negative function of $r_k$. The introduction of $q(r_k)$ avoids the jump in $\mu_{k+1}/\mu_k$ when $r_k$ crosses a threshold, and better numerical results can be obtained.
In fact, similar adaptive techniques have been proposed for trust region algorithms. If $r_k$ is sufficiently greater than 1, the iteration is too successful, and reducing $\mu_k$ to a very small value may cause the algorithm to perform a large number of consecutive unsuccessful iterations afterwards. On the other hand, if $r_k$ is negative or close to zero, $d_k$ is a far-from-satisfactory trial step and $\mu_k$ must be increased greatly; the successive iteration points will then be close to each other and the algorithm will converge slowly. Therefore, Hei [19] proposed an R-function as an adaptive strategy to update the trust region radius $\Delta_k$, i.e., $\Delta_{k+1} = R(r_k)\Delta_k$. Furthermore, Walmag and Delhez [20] proposed a $\Lambda$-function to update the trust region radius, i.e., $\Delta_{k+1} = \Lambda(r_k)\Delta_k$, where $\Lambda$ is a non-negative and bounded function of $r_k$. On this basis, Lu et al. [21] argued that the consistency between the model and the objective function is not good enough at too-successful iterations, and proposed an L-function to update the trust region radius. They showed that the L-function retains some favorable features of the R-function and the $\Lambda$-function, and that the resulting method is more efficient at too-successful iterations. In this paper, we borrow the form of the L-function and provide a new adaptive strategy to update the LM parameter. Our innovations mainly include the following:
◊ A new adaptive accelerated LM method is proposed, which improves the consistency between the model and the objective function at too-successful iterations by using the information in the ratio of the actual reduction to the predicted reduction;
◊ The new algorithm copes with the situation in which the iterative sequence is far away from the optimal solution set, avoids long runs of unsuccessful iterations and avoids jumping around in local areas, thus improving the efficiency and stability of the algorithm;
◊ The new adaptive accelerated LM method is globally convergent, and quadratically convergent under the local error bound condition.
The rest of this paper is organized as follows. In Section 2, we describe in detail the new adaptive accelerated LM method, which makes full use of the ratio information, and we prove that it is globally convergent under appropriate conditions and quadratically convergent under the local error bound condition. Numerical results indicating that the new algorithm is efficient are given in Section 3. The conclusion is given in the last section.

2. Methodology

2.1. The Adaptive Accelerated Levenberg–Marquardt Method

In this section, our main aim is to discuss how to update the LM parameter so as to obtain a new adaptive accelerated LM method. It is easy to see from (2) that $d_k$ is the solution of the optimization problem
$$\min_{d \in \mathbb{R}^n} \|F_k + J_k d\|^2 + \lambda_k \|d\|^2 \triangleq \psi_k(d).$$
If
$$\Delta_k = \left\|\left(J_k^T J_k + \lambda_k I\right)^{-1} J_k^T F_k\right\|,$$
then $d_k$ is also the solution of the subproblem
$$\min_{d \in \mathbb{R}^n} \|F_k + J_k d\|^2 \triangleq \varphi_k(d), \quad \text{s.t. } \|d\| \le \Delta_k.$$
Therefore, the LM method can be regarded as a trust region method, which implicitly modifies the trust region radius Δ k . The difference between the general trust region method and the LM method is that the LM method does not directly update the trust region radius, but updates the regularization factor μ k .
We define the actual reduction and predicted reduction of the merit function $\|F(x)\|^2$ at the $k$-th iteration as
$$Ared_k = \|F_k\|^2 - \|F(x_k + d_k)\|^2$$
and
$$Pred_k = \varphi_k(0) - \varphi_k(d_k).$$
The ratio between the actual and predicted reductions of the objective function is defined by
$$r_k = \frac{Ared_k}{Pred_k}.$$
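In code, both reductions and the ratio follow directly from these definitions; a small sketch (the variable names are ours):

```matlab
% Actual and predicted reductions and their ratio r_k (illustrative).
% F: residual function handle; xk: current iterate; Fk = F(xk);
% Jk: Jacobian at xk; d: LM trial step.
Ared = norm(Fk)^2 - norm(F(xk + d))^2;     % actual reduction
Pred = norm(Fk)^2 - norm(Fk + Jk * d)^2;   % predicted reduction
rk   = Ared / Pred;                        % ratio used for acceptance/update
```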
This ratio determines whether the trial step $d_k$ is accepted. Here, we choose the LM parameter as
$$\lambda_{k+1} = \frac{\mu_{k+1} \|F_{k+1}\|}{1 + \|F_{k+1}\|}.$$
The usual empirical rules [22,23,24,25] for updating $\mu_{k+1}$ can be summarized as follows:
$$\mu_{k+1} = \begin{cases} 4\mu_k, & \text{if } r_k < p_1,\\ \mu_k, & \text{if } p_1 \le r_k \le p_2,\\ \max\left\{\frac{\mu_k}{4},\, m\right\}, & \text{if } r_k > p_2, \end{cases}$$
where $m > 0$ and $0 < p_1 < p_2 < 1$ are constants.
Iterations with $r_k$ greater than $p_2$ are very successful iterations. In this case, it is usually assumed that the model function approximates the objective function accurately, and $\mu_k$ should be reduced. However, at too-successful iterations, i.e., when $r_k$ is sufficiently greater than 1, the consistency between the model and the objective function is not good enough. Thus, we use an adaptive strategy to update the factor, $\mu_{k+1} = K(r_k)\mu_k$, where $K(r_k)$ is a function of $r_k$.
We construct $K(r_k)$ as follows:
$$K(r_k) = \begin{cases} \beta_1 + (\beta_2 - \beta_1)\exp\left(-\left(\frac{r_k - p_1}{p_1}\right)^2\right), & \text{if } r_k \le p_1,\\[4pt] \beta_2, & \text{if } p_1 < r_k < p_2,\\[4pt] \left[\dfrac{1 - \beta_3\exp(-p_2)}{1 - \exp(-p_2)} - \dfrac{(1 - \beta_3)\exp(-p_2)}{1 - \exp(-p_2)}\exp(-(r_k - p_2))\right]^{\frac{1}{2}}, & \text{if } r_k \ge p_2, \end{cases}$$
where $0 < \beta_2 < 1 < \beta_1 \le \beta_3$ and $0 < p_1 < p_2 < 1$ are constants. Here, $K(r_k)$ satisfies the following properties:
(1) $\lim_{r_k \to -\infty} K(r_k) = \beta_1$;
(2) $\lim_{r_k \to p_1} K(r_k) = \beta_2$;
(3) $\lim_{r_k \to p_2^{+}} K(r_k) = 1$;
(4) $\lim_{r_k \to +\infty} K(r_k) = \left[\dfrac{1 - \beta_3\exp(-p_2)}{1 - \exp(-p_2)}\right]^{\frac{1}{2}}$.
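The following MATLAB sketch implements the K-function as reconstructed above. Since the piecewise formula was recovered from a poorly rendered source, the third branch in particular should be checked against the published version; the constants must satisfy $0 < \beta_2 < 1 < \beta_1 \le \beta_3$:

```matlab
% K-function for the adaptive update mu_{k+1} = max(m, K(rk)*mu_k).
% As reconstructed here; verify against the published formula.
function K = Kfun(rk, p1, p2, b1, b2, b3)
    if rk <= p1
        K = b1 + (b2 - b1) * exp(-((rk - p1) / p1)^2);
    elseif rk < p2
        K = b2;
    else
        A = (1 - b3 * exp(-p2)) / (1 - exp(-p2));
        B = ((1 - b3) * exp(-p2)) / (1 - exp(-p2));
        K = sqrt(A - B * exp(-(rk - p2)));  % too-successful branch
    end
end
```

With $p_1 = 1/4$, $p_2 = 3/4$, $\beta_1 = 5/4$, $\beta_2 = 1/3$ and $\beta_3 = 6/5$ (the values used in Section 3), this function satisfies properties (1)–(4) above.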
If we obtain a satisfactory trial step $d_k$ and ratio $r_k$, then we accept the trial step and reduce $\mu_k$; otherwise, we reject the trial step and increase $\mu_k$. At too-successful iterations, the actual reduction of the objective function obtained at iteration $k$ is markedly greater than the predicted reduction. Although such an iteration allows the algorithm to progress towards the optimum, the model function approximates the objective function poorly. Therefore, to avoid reducing $\mu_k$ too quickly, we use the K-function to update $\mu_k$.
According to the properties of the K-function, the rate of reduction of $\mu_k$ is fastest when $r_k$ is close to 1, i.e., when the model function provides an accurate local approximation of the objective function. The new idea we propose is to allow $\mu_k$ to be updated at a variable rate according to $r_k$, which improves the efficiency and stability of the algorithm.
Based on the above analysis, we state a description of the new adaptive accelerated LM method (Algorithm 1) as follows.
In Algorithm 1, m is a given lower bound of the parameter μ k . It is introduced to prevent the step from being too large when the sequence is near the solution.
Algorithm 1 NAALM.
Step 0. Given $x_0 \in \mathbb{R}^n$, $\mu_0 > m > 0$, $0 \le p_0 < p_1 < p_2 < 1$, $0 < \beta_2 < 1 < \beta_1 \le \beta_3$, $\varepsilon > 0$. Let $k := 0$.
Step 1. Compute $F_k$ and $J_k$. If $\|J_k^T F_k\| \le \varepsilon$, stop. Otherwise, compute $\lambda_k$ by (9).
Step 2. Solve the linear system
$$\left(J_k^T J_k + \lambda_k I\right)d = -J_k^T F_k$$
to determine $d_k$.
Step 3. Compute $Pred_k$, $Ared_k$ and $r_k$ by (6)–(8), respectively.
Step 4. Set
$$x_{k+1} = \begin{cases} x_k + d_k, & \text{if } r_k \ge p_0,\\ x_k, & \text{if } r_k < p_0. \end{cases}$$
Step 5. Choose
$$\mu_{k+1} = \max\{m,\, K(r_k)\mu_k\},$$
where $K(r_k)$ is given by (11). Set $k := k + 1$ and go to Step 1.
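Put together, one NAALM iteration is only a few lines. The sketch below is our reading of Algorithm 1 (not the authors' code), using Kfun from the previous sketch and assuming the tolerances eps_tol and maxit, the constants p0, p1, p2, b1, b2, b3, m, and the starting values x0, mu0 have been set:

```matlab
% NAALM main loop (illustrative sketch of Algorithm 1).
% F, J: function handles for the residual and its Jacobian.
x = x0; mu = mu0; k = 0;
Fk = F(x); Jk = J(x);
while norm(Jk' * Fk) > eps_tol && k < maxit
    lambda = mu * norm(Fk) / (1 + norm(Fk));          % LM parameter
    d = -(Jk' * Jk + lambda * eye(numel(x))) \ (Jk' * Fk);
    Ared = norm(Fk)^2 - norm(F(x + d))^2;
    Pred = norm(Fk)^2 - norm(Fk + Jk * d)^2;
    rk = Ared / Pred;
    if rk >= p0                                       % accept the trial step
        x = x + d; Fk = F(x); Jk = J(x);
    end
    mu = max(m, Kfun(rk, p1, p2, b1, b2, b3) * mu);   % adaptive update
    k = k + 1;
end
```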

2.2. The Global Convergence

In this section, to obtain the global convergence of the NAALM algorithm, we make the following assumption.
Assumption 1.
$F(x)$ is continuously differentiable, and $F(x)$ and the Jacobian matrix $J(x)$ are Lipschitz continuous, i.e., there exist positive constants $L_1$ and $L_2$ such that
$$\|J(y) - J(x)\| \le L_1\|y - x\|, \quad \forall x, y \in \mathbb{R}^n,$$
and
$$\|F(y) - F(x)\| \le L_2\|y - x\|, \quad \forall x, y \in \mathbb{R}^n.$$
Lemma 1.
Let $d_k$ be computed by (12); then the inequality
$$Pred_k \ge \|J_k^T F_k\| \min\left\{\|d_k\|,\ \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\}$$
holds for all $k \ge 0$.
Proof. 
From (7), for $\alpha \in [0, 1]$, we have
$$Pred_k = \|F_k\|^2 - \|F_k + J_k d_k\|^2 \ge \|F_k\|^2 - \left\|F_k - J_k \frac{\alpha \|d_k\| J_k^T F_k}{\|J_k^T F_k\|}\right\|^2 \ge 2\alpha\|d_k\|\|J_k^T F_k\| - \alpha^2\|d_k\|^2\|J_k^T J_k\|,$$
then
$$Pred_k \ge \max_{0 \le \alpha \le 1}\left\{2\alpha\|d_k\|\|J_k^T F_k\| - \alpha^2\|d_k\|^2\|J_k^T J_k\|\right\} \ge \|J_k^T F_k\| \min\left\{\|d_k\|,\ \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\}.$$
The proof is complete. □
Theorem 1.
Under the conditions of Assumption 1, the sequence $\{x_k\}$ generated by the NAALM algorithm satisfies
$$\lim_{k \to \infty} \|J_k^T F_k\| = 0.$$
Proof. 
If the theorem is not true, then there exist a positive constant $\tau$ and infinitely many $k$ such that
$$\|J_k^T F_k\| \ge \tau.$$
Let $T_1$, $T_2$ be the index sets
$$T_1 = \left\{k \,\middle|\, \|J_k^T F_k\| \ge \tau\right\}$$
and
$$T_2 = \left\{k \,\middle|\, \|J_k^T F_k\| \ge \frac{\tau}{2} \text{ and } x_{k+1} \ne x_k\right\}.$$
Then $T_1$ is an infinite set. In the following, we derive a contradiction in both cases, whether $T_2$ is finite or infinite.
Case (I): $T_2$ is finite.
It follows from the definition of $T_2$ that the set
$$T_3 = \left\{k \,\middle|\, \|J_k^T F_k\| \ge \tau \text{ and } x_{k+1} \ne x_k\right\}$$
is also finite. Let $\tilde{k}$ be the largest index of $T_3$. Then $x_{k+1} = x_k$ holds for all $k \in \{k > \tilde{k} \,|\, k \in T_1\}$. Define the index set
$$T_4 = \left\{k > \tilde{k} \,\middle|\, \|J_k^T F_k\| \ge \tau \text{ and } x_{k+1} = x_k\right\}.$$
Suppose $k \in T_4$. It is easy to see that $\|J_{k+1}^T F_{k+1}\| \ge \tau$. Moreover, we have $x_{k+2} = x_{k+1}$; otherwise, if $x_{k+2} \ne x_{k+1}$, then $k + 1 \in T_3$, which contradicts the fact that $\tilde{k}$ is the largest index of $T_3$. Hence $k + 1 \in T_4$. By induction, we know that $\|J_k^T F_k\| \ge \tau$ and $x_{k+1} = x_k$ hold for all $k > \tilde{k}$.
It now follows from Step 4 of the NAALM algorithm that $r_k < p_0$ for all $k > \tilde{k}$, which implies
$$\mu_k \to +\infty \quad \text{and} \quad \lambda_k \to +\infty,$$
due to (12)–(14) and $x_{k+1} = x_k$ for all $k > \tilde{k}$. Hence, we have
$$\lim_{k \to \infty} \|d_k\| = 0.$$
Furthermore, it follows from (21), (23) and Lemma 1 that
$$|r_k - 1| = \left|\frac{Ared_k}{Pred_k} - 1\right| = \frac{\left|\|F_k + J_k d_k\|^2 - \|F(x_k + d_k)\|^2\right|}{Pred_k} \le \frac{\|F_k + J_k d_k\|\, O(\|d_k\|^2) + O(\|d_k\|^4)}{\|J_k^T F_k\| \min\left\{\|d_k\|,\ \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\}} \le \frac{O(\|d_k\|^2)}{\tau\|d_k\|} \to 0,$$
that is, $r_k \to 1$. In view of the updating rule of $\mu_k$, we know that there exists a positive constant $\tilde{m} > m$ such that $\mu_k < \tilde{m}$ holds for all sufficiently large $k$, which contradicts (22). Hence, the supposition (21) cannot be true while $T_2$ is finite.
Case (II): $T_2$ is infinite.
It follows from Lemma 1 that
$$\|F_1\|^2 \ge \sum_{k \in T_2}\left(\|F_k\|^2 - \|F_{k+1}\|^2\right) \ge \sum_{k \in T_2} p_0\, Pred_k \ge \sum_{k \in T_2} p_0 \|J_k^T F_k\| \min\left\{\|d_k\|,\ \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\} \ge \sum_{k \in T_2} p_0\, \frac{\tau}{2} \min\left\{\|d_k\|,\ \frac{\tau}{2\|J_k^T J_k\|}\right\},$$
which gives
$$\sum_{k \in T_2} \|d_k\| < +\infty.$$
The above inequality, together with the Lipschitz conditions (15) and (16), implies that
$$\sum_{k \in T_2} \left\|J_k^T F_k - J_{k+1}^T F_{k+1}\right\| < +\infty.$$
Relation (27) and the fact that (21) holds for infinitely many $k$ indicate that there exists a $\hat{k}$ with $\|J_{\hat{k}}^T F_{\hat{k}}\| \ge \tau$ such that
$$\sum_{k \in T_2,\, k \ge \hat{k}} \left\|J_k^T F_k - J_{k+1}^T F_{k+1}\right\| < \frac{\tau}{2}.$$
By induction, we obtain that $\|J_k^T F_k\| \ge \frac{\tau}{2}$ for all $k \ge \hat{k}$. This result and (26) mean that
$$\lim_{k \to \infty} \|d_k\| = 0.$$
It follows from (12) and (13) that $\mu_k \to +\infty$. By the same analysis as (24), we know that $r_k \to 1$. Hence, there exists a positive constant $\bar{m} > m$ such that $\mu_k < \bar{m}$ holds for all large $k$, which introduces a contradiction. Therefore, the supposition (21) cannot be true when $T_2$ is infinite. The proof is complete. □

2.3. Local Convergence

In this section, we study the local convergence properties of the NAALM algorithm by using the singular value decomposition (SVD). We assume that the sequence $\{x_k\}$ generated by the NAALM algorithm converges to the nonempty solution set $X^*$ and lies in some neighborhood of $x^* \in X^*$. Firstly, we present some assumptions required by the local convergence theory.
Definition 1.
Let $N \subset \mathbb{R}^n$ be such that $N \cap X^* \ne \emptyset$. We say that $\|F(x)\|$ provides a local error bound on $N$ for (1) if there exists a positive constant $c > 0$ such that
$$c\,\mathrm{dist}(x, X^*) \le \|F(x)\|, \quad \forall x \in N,$$
where $\mathrm{dist}(x, X^*)$ is the distance from $x$ to $X^*$.
Assumption 2.
(i) $F(x)$ is continuously differentiable, and $J(x)$ is Lipschitz continuous on $N(x^*, b_1)$ with $b_1 < 1$, i.e., there exists a positive constant $L_1$ such that
$$\|J(y) - J(x)\| \le L_1\|y - x\|, \quad \forall x, y \in N(x^*, b_1) = \{x \,|\, \|x - x^*\| \le b_1\}.$$
(ii) $\|F(x)\|$ provides a local error bound on some neighborhood of $x^* \in X^*$, i.e., there exists a positive constant $c_1 > 0$ such that
$$\|F(x)\| \ge c_1\,\mathrm{dist}(x, X^*), \quad \forall x \in N(x^*, b_1).$$
By the Lipschitz continuity of the Jacobian matrix assumed in (30), we have
$$\|F(y) - F(x) - J(x)(y - x)\| = \left\|\int_0^1 J(x + t(y - x))(y - x)\,dt - J(x)(y - x)\right\| \le \|y - x\|\int_0^1 \left\|J(x + t(y - x)) - J(x)\right\|dt \le L_1\|y - x\|^2,$$
and
$$\|F(y) - F(x)\| \le L_2\|y - x\|, \quad \forall x, y \in N(x^*, b_1),$$
where $L_2$ is a positive constant.
In the following, we use $\bar{x}_k$ to denote a vector in $X^*$ that satisfies
$$\|\bar{x}_k - x_k\| = \mathrm{dist}(x_k, X^*).$$
To obtain the local convergence rate of $\{x_k\}$, we present some lemmas.
Lemma 2.
Under the conditions of Assumption 2, for all sufficiently large $k$, there exists a constant $c_2 > 0$ such that
$$\|d_k\| \le c_2\|\bar{x}_k - x_k\|.$$
Proof. 
According to (34), we have
$$\|\bar{x}_k - x^*\| \le \|\bar{x}_k - x_k\| + \|x_k - x^*\| \le 2\|x_k - x^*\| \le b_1,$$
which means that $\bar{x}_k \in N(x^*, b_1)$. Following from (13),
$$\lambda_k = \frac{\mu_k\|F_k\|}{1 + \|F_k\|} = \mu_k\left(1 - \frac{1}{1 + \|F_k\|}\right) \ge m\left(1 - \frac{1}{1 + c_1\|\bar{x}_k - x_k\|}\right) = \frac{m c_1\|\bar{x}_k - x_k\|}{1 + c_1\|\bar{x}_k - x_k\|},$$
and we have from (32) that
$$\|F_k + J_k(\bar{x}_k - x_k)\|^2 = \|F(\bar{x}_k) - F_k - J_k(\bar{x}_k - x_k)\|^2 \le L_1^2\|\bar{x}_k - x_k\|^4.$$
As $d_k$ is a minimizer of $\psi_k(d)$, we have
$$\|d_k\|^2 \le \frac{1}{\lambda_k}\psi_k(d_k) \le \frac{1}{\lambda_k}\psi_k(\bar{x}_k - x_k) = \frac{1}{\lambda_k}\|F_k + J_k(\bar{x}_k - x_k)\|^2 + \|\bar{x}_k - x_k\|^2 \le \frac{1 + c_1\|\bar{x}_k - x_k\|}{m c_1\|\bar{x}_k - x_k\|}\, L_1^2\|\bar{x}_k - x_k\|^4 + \|\bar{x}_k - x_k\|^2 = O\left(\|\bar{x}_k - x_k\|^2\right),$$
so there exists a constant $c_2 > 0$ such that $\|d_k\| \le c_2\|\bar{x}_k - x_k\|$. The proof is completed. □
Lemma 3.
Under the conditions of Assumption 2, for all sufficiently large $k$, there exists a positive constant $M > m$ such that
$$\mu_k \le M.$$
Proof. 
First, we show that for sufficiently large $k$, the following inequality holds:
$$Pred_k = \|F_k\|^2 - \|F_k + J_k d_k\|^2 \ge \min\left\{\frac{c_1}{2c_2},\ \frac{c_1}{2}\right\}\|F_k\|\|d_k\|.$$
We consider two cases. In one case, if $\|\bar{x}_k - x_k\| \le \|d_k\|$, then the definition of $d_k$ and Assumption 2 imply that
$$\|F_k\| - \|F_k + J_k d_k\| \ge \|F_k\| - \|F_k + J_k(\bar{x}_k - x_k)\| \ge c_1\|\bar{x}_k - x_k\| - L_1\|\bar{x}_k - x_k\|^2 \ge \frac{c_1}{2c_2}\|d_k\|.$$
In the other case, if $\|\bar{x}_k - x_k\| > \|d_k\|$, then we have
$$\|F_k\| - \|F_k + J_k d_k\| \ge \|F_k\| - \left\|F_k + \frac{\|d_k\|}{\|\bar{x}_k - x_k\|}J_k(\bar{x}_k - x_k)\right\| \ge \frac{\|d_k\|}{\|\bar{x}_k - x_k\|}\left(\|F_k\| - \|F_k + J_k(\bar{x}_k - x_k)\|\right) \ge \frac{\|d_k\|}{\|\bar{x}_k - x_k\|}\left(c_1\|\bar{x}_k - x_k\| - L_1\|\bar{x}_k - x_k\|^2\right) \ge \frac{c_1}{2}\|d_k\|.$$
Inequalities (41) and (42), together with Lemma 2, show that
$$Pred_k = \left(\|F_k\| + \|F_k + J_k d_k\|\right)\left(\|F_k\| - \|F_k + J_k d_k\|\right) \ge \|F_k\|\left(\|F_k\| - \|F_k + J_k d_k\|\right) \ge \min\left\{\frac{c_1}{2c_2},\ \frac{c_1}{2}\right\}\|F_k\|\|d_k\|,$$
which gives (40). Hence, it follows from (40), Assumption 2 and Lemma 2 that
$$|r_k - 1| = \left|\frac{Ared_k}{Pred_k} - 1\right| \le \frac{\|F_k + J_k d_k\|\,O(\|d_k\|^2) + O(\|d_k\|^4)}{Pred_k} \le \frac{\|F_k\|\,O(\|d_k\|^2) + O(\|d_k\|^4)}{O(\|F_k\|\|d_k\|)} = O(\|d_k\|) \to 0.$$
Therefore, $r_k \to 1$; thus, there exists a constant $M > m$ such that $\mu_k \le M$ for all large $k$. The proof is completed. □
Without loss of generality, we assume $\mathrm{rank}(J(\bar{x})) = r$ for all $\bar{x} \in N(x^*, b_1) \cap X^*$. Suppose the SVD of $J(\bar{x})$ is
$$J(\bar{x}) = [\bar{U}_1, \bar{U}_2]\begin{pmatrix}\bar{\Sigma}_1 & 0\\ 0 & 0\end{pmatrix}\begin{pmatrix}\bar{V}_1^T\\ \bar{V}_2^T\end{pmatrix} = \bar{U}_1\bar{\Sigma}_1\bar{V}_1^T,$$
where $\bar{\Sigma}_1 = \mathrm{diag}(\bar{\sigma}_1, \bar{\sigma}_2, \ldots, \bar{\sigma}_r)$ with $\bar{\sigma}_1 \ge \bar{\sigma}_2 \ge \cdots \ge \bar{\sigma}_r > 0$, and $\bar{U} = [\bar{U}_1, \bar{U}_2]$, $\bar{V} = [\bar{V}_1, \bar{V}_2]$ are orthogonal matrices. Correspondingly, we consider the SVD of $J(x_k)$:
$$J(x_k) = [U_1, U_2, U_3]\begin{pmatrix}\Sigma_1 & 0 & 0\\ 0 & \Sigma_2 & 0\\ 0 & 0 & 0\end{pmatrix}\begin{pmatrix}V_1^T\\ V_2^T\\ V_3^T\end{pmatrix} = U_1\Sigma_1V_1^T + U_2\Sigma_2V_2^T,$$
where $U = [U_1, U_2, U_3]$, $V = [V_1, V_2, V_3]$ are orthogonal matrices, $\Sigma_1 = \mathrm{diag}(\sigma_1, \sigma_2, \ldots, \sigma_r)$ with $\sigma_1 \ge \sigma_2 \ge \cdots \ge \sigma_r > 0$, and $\Sigma_2 = \mathrm{diag}(\sigma_{r+1}, \sigma_{r+2}, \ldots, \sigma_{r+q})$ with $\sigma_{r+1} \ge \sigma_{r+2} \ge \cdots \ge \sigma_{r+q} > 0$.
Lemma 4.
Under the conditions of Assumption 2, for all sufficiently large $k$, we have
(a) $\|U_1U_1^T F_k\| \le O(\|\bar{x}_k - x_k\|)$;
(b) $\|U_2U_2^T F_k\| \le O(\|\bar{x}_k - x_k\|^2)$;
(c) $\|U_3U_3^T F_k\| \le O(\|\bar{x}_k - x_k\|^2)$;
(d) $\|F_k + J_k d_k\| \le O(\|\bar{x}_k - x_k\|^2)$.
Proof. 
The result (a) follows immediately from (16). By (15) and the theory of matrix perturbation [26], we have
$$\left\|\mathrm{diag}\left(\Sigma_1 - \bar{\Sigma}_1,\ \Sigma_2,\ 0\right)\right\| \le \|J_k - J(\bar{x}_k)\| \le L_1\|\bar{x}_k - x_k\|,$$
which implies that
$$\|\Sigma_1 - \bar{\Sigma}_1\| \le L_1\|\bar{x}_k - x_k\| \quad \text{and} \quad \|\Sigma_2\| \le L_1\|\bar{x}_k - x_k\|.$$
Let $s_k = -J_k^{+}F_k$, where $J_k^{+}$ is the pseudo-inverse of $J_k$. It is easy to see that $s_k$ is the least-squares solution of $\min_s \|F_k + J_k s\|$, so we obtain from (32) that
$$\|U_3U_3^T F_k\| = \|F_k + J_k s_k\| \le \|F_k + J_k(\bar{x}_k - x_k)\| \le O(\|\bar{x}_k - x_k\|^2).$$
Let $\bar{J}_k = U_1\Sigma_1V_1^T$ and $\bar{s}_k = -\bar{J}_k^{+}F_k$. Since $\bar{s}_k$ is the least-squares solution of $\min_s \|F_k + \bar{J}_k s\|$, it follows from (32) that
$$\left\|\left(U_2U_2^T + U_3U_3^T\right)F_k\right\| = \|F_k + \bar{J}_k\bar{s}_k\| \le \|F_k + \bar{J}_k(\bar{x}_k - x_k)\| \le \|F_k + J_k(\bar{x}_k - x_k)\| + \|(\bar{J}_k - J_k)(\bar{x}_k - x_k)\| \le L_1\|\bar{x}_k - x_k\|^2 + \|U_2\Sigma_2V_2^T(\bar{x}_k - x_k)\| \le L_1\|\bar{x}_k - x_k\|^2 + L_1\|\bar{x}_k - x_k\|\,\|\bar{x}_k - x_k\| = O(\|\bar{x}_k - x_k\|^2).$$
Due to the orthogonality of $U_2$ and $U_3$, we obtain the result (b).
Using (12) and (45), we obtain
$$d_k = -V_1\left(\Sigma_1^2 + \lambda_k I\right)^{-1}\Sigma_1U_1^T F_k - V_2\left(\Sigma_2^2 + \lambda_k I\right)^{-1}\Sigma_2U_2^T F_k,$$
and
$$F_k + J_k d_k = F_k - U_1\Sigma_1\left(\Sigma_1^2 + \lambda_k I\right)^{-1}\Sigma_1U_1^T F_k - U_2\Sigma_2\left(\Sigma_2^2 + \lambda_k I\right)^{-1}\Sigma_2U_2^T F_k = \lambda_kU_1\left(\Sigma_1^2 + \lambda_k I\right)^{-1}U_1^T F_k + \lambda_kU_2\left(\Sigma_2^2 + \lambda_k I\right)^{-1}U_2^T F_k + U_3U_3^T F_k.$$
Following from (13) and (33), the LM parameter satisfies
$$\lambda_k = \frac{\mu_k\|F_k\|}{1 + \|F_k\|} \le \mu_k\|F_k\| \le ML_2\|\bar{x}_k - x_k\|.$$
Since $\{x_k\}$ converges to the solution set $X^*$, we may assume that $L_1\|\bar{x}_k - x_k\| \le \frac{\bar{\sigma}_r}{2}$ holds for all sufficiently large $k$. Then it follows from (46) that
$$\|\Sigma_1^{-1}\| \le \frac{1}{\bar{\sigma}_r - L_1\|\bar{x}_k - x_k\|} \le \frac{2}{\bar{\sigma}_r}.$$
It then follows from Lemma 3 and the results (a)–(c) that
$$\|F_k + J_k d_k\| \le \lambda_k\|\Sigma_1^{-1}\|^2\|U_1^T F_k\| + \|U_2^T F_k\| + \|U_3U_3^T F_k\| \le \frac{4L_2M\|\bar{x}_k - x_k\|^2}{\bar{\sigma}_r^2} + O(\|\bar{x}_k - x_k\|^2) + O(\|\bar{x}_k - x_k\|^2) = O(\|\bar{x}_k - x_k\|^2).$$
The proof is completed. □
We can state the quadratic convergence result of the NAALM algorithm.
Theorem 2.
Let the sequence $\{x_k\}$ be generated by the NAALM algorithm. Under Assumption 2, $\{x_k\}$ converges quadratically to a solution of the nonlinear Equation (1).
Proof. 
It follows from Assumption 2, Lemma 2 and (47) that
$$c_1\|\bar{x}_{k+1} - x_{k+1}\| \le \|F(x_{k+1})\| = \|F(x_k + d_k)\| \le \|F_k + J_k d_k\| + O(\|d_k\|^2) = O(\|\bar{x}_k - x_k\|^2).$$
On the other hand, it is clear that
$$\|\bar{x}_k - x_k\| = \mathrm{dist}(x_k, X^*) \le \|\bar{x}_{k+1} - x_k\| \le \|\bar{x}_{k+1} - x_{k+1}\| + \|d_k\|.$$
It follows from Lemma 2 that, for all sufficiently large $k$, we have
$$\|\bar{x}_k - x_k\| \le 2\|d_k\| \le O(\|\bar{x}_k - x_k\|).$$
Therefore, $\|d_k\| = O(\|\bar{x}_k - x_k\|)$ and $\|\bar{x}_k - x_k\| = O(\|d_k\|)$. This, along with (48), indicates that
$$\|d_{k+1}\| \le O(\|d_k\|^2),$$
which implies that $\{x_k\}$ converges quadratically to a solution in $X^*$. The proof is completed. □

3. Numerical Results

In this section, we report the numerical performance of the NAALM algorithm. All codes were written in MATLAB R2016b on a PC with a 1.19 GHz processor and 8.00 GB RAM, running the Windows 11 operating system. We expand on the following two aspects. On the one hand, the effectiveness of the NAALM algorithm is illustrated by comparing it with another algorithm on some test problems. On the other hand, we show that the NAALM algorithm has good development prospects by applying it to a fresh agricultural products supply chain problem.

3.1. Some Singular Nonlinear Equations Problems

The test problems are constructed by modifying the nonsingular problems given by Moré et al. [27] in the following form [28]:
$$\hat{F}(x) = F(x) - J(x^*)A\left(A^TA\right)^{-1}A^T(x - x^*),$$
where $F(x)$ is the standard test function, $A \in \mathbb{R}^{n \times k}$ has full column rank with $0 \le k \le n$, and $x^*$ is a solution of the equation $F(x) = 0$. According to the definition of $\hat{F}(x)$, we obtain
$$\hat{J}(x^*) = J(x^*)\left(I - A\left(A^TA\right)^{-1}A^T\right),$$
where $\hat{J}(x^*)$ is the Jacobian matrix of $\hat{F}(x)$ at $x^*$, which has rank $n - k$, and $\hat{F}(x^*) = 0$. In our test problems, some of the $\hat{J}(x^*)$ are symmetric matrices and some are non-symmetric. Note that some roots of $\hat{F}(x)$ may not be roots of $F(x)$. Similar to [28], we construct two sets of singular problems, in which $\hat{J}(x^*)$ has rank $n - 1$ or $n - 2$, by choosing
$$A = [1, 1, \ldots, 1]^T \in \mathbb{R}^{n \times 1}$$
and
$$A = \begin{bmatrix} 1 & 1 & 1 & 1 & \cdots & 1\\ 1 & -1 & 1 & -1 & \cdots & \pm 1 \end{bmatrix}^T \in \mathbb{R}^{n \times 2},$$
respectively.
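In code, the modification is mechanical once $F$, $J$, $A$ and $x^*$ are available; a sketch for the rank $n - 1$ case (our variable names):

```matlab
% Singular test problem with rank(J_hat(x*)) = n - 1 (illustrative sketch).
% F, J: handles for the original problem; xstar: a root of F; n: dimension.
A = ones(n, 1);                            % A in R^{n x 1}, full column rank
P = A * ((A' * A) \ A');                   % projector A (A'A)^{-1} A'
Fhat = @(x) F(x) - J(xstar) * P * (x - xstar);
Jhat = @(x) J(x) - J(xstar) * P;           % so J_hat(x*) = J(x*) (I - P)
```

The rank $n - 2$ case is obtained by replacing A with the $n \times 2$ matrix above.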
We test the NAALM algorithm on some singular nonlinear equations and compare it with the self-adaptive Levenberg–Marquardt algorithm (SLM) proposed in [18]. The main difference between the two algorithms lies in the updating rule of $\mu_k$.
We set $p_0 = 10^{-4}$, $p_1 = \frac{1}{4}$, $p_2 = \frac{3}{4}$, $\beta_1 = \frac{5}{4}$, $\beta_2 = \frac{1}{3}$, $\beta_3 = \frac{6}{5}$, $m = 10^{-8}$ and $\mu_0 = 10^{-2}$ for all the tests. All test methods are terminated when $\|J_k^T F_k\| \le 10^{-5}$. An algorithm is considered to fail when the number of iterations exceeds 500. Considering the global convergence of the algorithms, we run each test problem from five starting points, $-10x_0$, $-x_0$, $x_0$, $10x_0$ and $100x_0$, where $x_0$ is given by [28]. For problems with variable dimension $n$, we take $n = 500$ and $n = 1000$, respectively.
The performance profiles of the two algorithms, in terms of the number of iterations (NI), function evaluations (NF), gradient evaluations (NG) and CPU time (CPU), are analyzed using the profiles of Dolan and Moré [29]. Let $Y$ and $W$ be the sets of methods and test problems, and let $n_y$, $n_w$ be the numbers of methods and test problems, respectively. For each $y \in Y$ and $w \in W$, let $a_{w,y} > 0$ be the NI, NF, NG or CPU required to solve problem $w$ by method $y$. The performance profile $\psi_y : \mathbb{R} \to [0, 1]$ is obtained by
$$\psi_y(\tau) = \frac{1}{n_w}\,\mathrm{size}\left\{w \in W : \log_2 r_{w,y} \le \tau\right\},$$
where $\tau > 0$, $\mathrm{size}\{\cdot\}$ is the number of elements in a set, and $r_{w,y}$ is the performance ratio defined as
$$r_{w,y} = \frac{a_{w,y}}{\min\{a_{w,y} : y \in Y\}}.$$
Generally, the method whose performance profile plot stays on top represents the best method.
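For completeness, a short sketch of how such profiles can be computed from a cost matrix (rows: problems, columns: methods), following the Dolan–Moré definition; the variable names are ours, and failures should be recorded as Inf:

```matlab
% Performance profiles (Dolan-More). T(w, y): NI, NF, NG or CPU cost of
% method y on problem w; Inf marks a failure.
ratios = T ./ min(T, [], 2);               % performance ratios r_{w,y}
tau = linspace(0, 5, 200);                 % grid on the log2 scale
psi = zeros(numel(tau), size(T, 2));
for y = 1:size(T, 2)
    for t = 1:numel(tau)
        psi(t, y) = mean(log2(ratios(:, y)) <= tau(t));  % fraction solved
    end
end
plot(tau, psi); legend('NAALM', 'SLM');
```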
As can be seen from Figure 1, the NAALM algorithm outperforms the SLM algorithm in terms of the number of iterations; in particular, for $\tau > 2$ the NAALM curve becomes flat, which indicates that the NAALM algorithm solves the problems with fewer iterations. In terms of function evaluations, as shown in Figure 2, the NAALM curve has already reached a stable state for $\tau > 1.75$, while the SLM curve only reaches a stable state, coinciding with that of NAALM, for $\tau > 2.75$. Figure 3 shows the performance profiles of the two algorithms for gradient evaluations: the NAALM algorithm successfully solves up to 98% of the test problems, while SLM reaches only 94%, which shows that the NAALM algorithm reduces the number of Jacobian evaluations and thus saves computation. Figure 4 shows the CPU time performance of the two algorithms: the curves are similar for $\tau < 4.5$, and both become stable and coincide for $\tau > 4.5$. Therefore, Figure 1, Figure 2, Figure 3 and Figure 4 show that the accelerated LM algorithm proposed in this paper not only converges to the solution quickly, but also reduces the computational cost associated with the Jacobian matrix.

3.2. Supply Chain Optimization Problems

The security and stability of the supply chain have a great impact on promoting high-quality and sustainable development of the economy. Therefore, supply chain models have been applied in many fields, such as low-carbon supply chains, green manufacturing supply chains and food trade supply chains. In recent years, with the improvement of living standards, the quality of fresh agricultural products has attracted widespread attention from consumers. In order to meet consumers' demand for high-quality and low-priced fresh agricultural products, we use the NAALM algorithm to study how the supplier and the retailer should make decisions to maximize both their own profits and the total profit of the fresh agricultural products supply chain under the decentralized policy.
In this supply chain, as the leader of a Stackelberg game, the fresh agricultural products supplier provides the same variety of ordinary fresh agricultural products (ofp) and green fresh agricultural products (gfp) to the retailer, who acts as the follower and sells them to consumers. The supplier needs to choose the optimal wholesale price strategy for the two products, and the retailer needs to choose the optimal retail price strategy and determine the order quantities of the two products according to market demand.
Without considering the impact of emergencies, the market demand for fresh agricultural products is relatively stable and depends only on price and freshness. Because the same varieties of ofp and gfp are substitutes, they compete in the demand market. Based on the demand function theory of substitutable price competition, the demand functions of the two fresh agricultural products are assumed to be
$$q_i = a - b\,\frac{p_i}{\theta} + r\,\frac{p_j}{\theta}, \quad i = 1, 2, \quad j = 3 - i,$$
where $q_1$, $q_2$ represent the market demand for gfp and ofp, respectively, $a$ represents the total potential market capacity of fresh agricultural products, $p_1$, $p_2$ represent the retail prices of gfp and ofp, respectively, $b$ is the price sensitivity coefficient, $r$ is the competitive substitution coefficient of the two products, with $b > r > 0$, and $\theta$ ($0 < \theta \le 1$) is the freshness of the produce when it arrives at the retailer's store.
Under the decentralized policy, we regard the supplier and the retailer as independent entities, each aiming to maximize its own profit. The profit function of the fresh agricultural products retailer is
$$\max_{p_1, p_2}\ \pi_R = (p_1 - w_1)\left(a - b\,\frac{p_1}{\theta} + r\,\frac{p_2}{\theta}\right) + (p_2 - w_2)\left(a - b\,\frac{p_2}{\theta} + r\,\frac{p_1}{\theta}\right),$$
where $w_1$, $w_2$ represent the wholesale prices of gfp and ofp, respectively, and the profit function of the fresh agricultural products supplier is
$$\max_{w_1, w_2}\ \pi_s = \left(w_1 - \frac{c_1}{1 - \beta}\right)\left(a - b\,\frac{p_1}{\theta} + r\,\frac{p_2}{\theta}\right) + \left(w_2 - \frac{c_2}{1 - \beta}\right)\left(a - b\,\frac{p_2}{\theta} + r\,\frac{p_1}{\theta}\right),$$
where $\beta$ ($0 < \beta < 1$) represents the quantity loss of the fresh produce by the time it reaches the retailer's store, and $c_1$, $c_2$ represent the unit production costs of gfp and ofp, respectively. Obviously, $p_1 > p_2 > 0$ and $c_1 > c_2 > 0$. We record the total profit of the fresh agricultural products supply chain as
$$\pi_T = \pi_s + \pi_R.$$
With reference to the parameter settings in the relevant literature [30], we set $a = 50$, $b = 2$, $c_1 = 4$, $c_2 = 2$, $r = 1.5$, $\beta = 0.2$ and $\theta = 0.85$. These values satisfy the theoretical conditions in [30] and guarantee that the optimal values are practically meaningful. We transform the unconstrained optimization problem (51) into a nonlinear equation problem, and then choose different initial points and use the NAALM algorithm to solve it.
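As a quick plausibility check of Table 1, the profit identity $\pi_T = \pi_R + \pi_s$ can be evaluated directly at the reported optimum. The sketch below uses our reconstruction of the demand function $q_i = a - b p_i/\theta + r p_j/\theta$ and reads the first row of Table 1 as wholesale prices $\approx 44$–$45$ and retail prices $\approx 64$–$65$:

```matlab
% Evaluate the reconstructed profit functions at Table 1, row 1 (sketch).
a = 50; b = 2; c = [4 2]; r = 1.5; beta = 0.2; theta = 0.85;
w = [45.0266 43.7205];                           % wholesale prices w1, w2
p = [65.065 64.329];                             % retail prices p1, p2
q = a - b * p / theta + r * fliplr(p) / theta;   % demands q1, q2
piR = sum((p - w) .* q);                         % retailer profit
piS = sum((w - c / (1 - beta)) .* q);            % supplier profit
piT = piR + piS                                  % approx. 1.4587e3
```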
As can be seen from Table 1, with the given parameters, the NAALM algorithm can be used to solve the optimization problem and thus obtain the optimal pricing strategy with maximum profit in the supplier-led supply chain under the decentralized policy. In addition, the global convergence and robustness of the NAALM algorithm are confirmed by the results obtained from different initial values and the corresponding numbers of iterations.

4. Conclusions

We constructed a new function that makes full use of the ratio information to update the LM parameter adaptively. Based on this new LM parameter, we presented an adaptive accelerated Levenberg–Marquardt method for solving nonlinear equations. We proved the global convergence of the proposed algorithm, and quadratic convergence was also obtained under the local error bound condition. Numerical experiments demonstrated that our method has good numerical performance. In addition, the application of the NAALM algorithm to a supply chain problem showed that the new algorithm has good application prospects. We further note that the proposed NAALM algorithm can be used in other settings, such as symmetric systems of nonlinear equations. The method's convergence analysis under the Hölderian local error bound condition will be considered in our future work.

Author Contributions

Conceptualization, R.L., M.C. and G.Z.; methodology, R.L., M.C. and G.Z.; software, R.L., M.C. and G.Z.; validation, R.L., M.C. and G.Z.; formal analysis, R.L., M.C. and G.Z.; investigation, R.L., M.C. and G.Z.; resources, R.L., M.C. and G.Z.; data curation, R.L., M.C. and G.Z.; writing—original draft preparation, R.L., M.C. and G.Z.; writing—review and editing, R.L., M.C. and G.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the key project of natural science foundation joint fund of Jilin Province (YDZJ202101ZYTS167, YDZJ202201ZYTS303); the project of education department of Jilin Province (JJKH20210030KJ, JJKH20230054KJ); the graduate innovation project of Beihua University (2022033, 2021002); the youth science and technology innovation team cultivation program of Beihua University.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the anonymous referees and editor for reading this paper carefully and providing valuable suggestions and comments, which greatly improved the final version.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Musa, Y.B.; Waziri, M.Y.; Noor, M.A. An efficient method for solving systems of nonlinear equations. J. Math. Anal. 2022, 13, 1–10.
2. Ribeiro, S.; Lopes, L.G. Overview and computational analysis of PSO variants for solving systems of nonlinear equations. Commun. Intell. Syst. 2022, 461, 1093–1105.
3. Ji, J.Y.; Wong, M.L. Decomposition-based multiobjective optimization for nonlinear equation systems with many and infinitely many roots. Inf. Sci. 2022, 610, 605–623.
4. Artacho, F.J.A.; Fleming, R.; Vuong, P.T. Accelerating the DC algorithm for smooth functions. Math. Program. 2018, 169B, 95–118.
5. Sabi'u, J.; Muangchoo, K.; Shah, A.; Abubakar, A.B.; Aremu, K.O. An inexact optimal hybrid conjugate gradient method for solving symmetric nonlinear equations. Symmetry 2021, 13, 1829.
6. Sabi'u, J.; Muangchoo, K.; Shah, A.; Abubakar, A.B.; Jolaoso, L.O. A modified PRP-CG type derivative-free algorithm with optimal choices for solving large-scale nonlinear symmetric equations. Symmetry 2021, 13, 234.
7. Niri, T.D.; Heydari, M.; Hosseini, M.M. Correction of trust region method with a new modified Newton method. Int. J. Comput. Math. 2022, 97, 1–15.
8. Bellavia, S.; Morini, B.; Rebegoldi, S. On the convergence properties of a stochastic trust-region method with inexact restoration. Axioms 2023, 12, 38.
9. Zheng, L.; Chen, L.; Ma, Y.F. A variant of the Levenberg–Marquardt method with adaptive parameters for systems of nonlinear equations. AIMS Math. 2021, 7, 1241–1256.
10. Yudin, N.E. Adaptive Gauss–Newton method for solving systems of nonlinear equations. Dokl. Math. 2021, 104, 293–296.
11. Levenberg, K. A method for the solution of certain nonlinear problems in least squares. Quart. Appl. Math. 1944, 2, 164–166.
12. Marquardt, D.W. An algorithm for least-squares estimation of nonlinear parameters. SIAM J. Appl. Math. 1963, 11, 431–441.
13. Yamashita, N.; Fukushima, M. On the rate of convergence of the Levenberg–Marquardt method. Computing 2001, 15, 239–249.
14. Fan, J.Y.; Yuan, Y.X. On the Convergence of a New Levenberg–Marquardt Method; Report No. 005, AMSS; Chinese Academy of Sciences: Beijing, China, 2001.
15. Fan, J.Y. A modified Levenberg–Marquardt algorithm for singular system of nonlinear equations. J. Comput. Math. 2003, 21, 625–636.
16. Fan, J.Y.; Pan, J.Y. A note on the Levenberg–Marquardt parameter. Appl. Math. Comput. 2009, 207, 351–359.
17. Amini, K.; Rostami, F.; Caristi, G. An efficient Levenberg–Marquardt method with a new LM parameter for systems of nonlinear equations. Optimization 2018, 67, 637–650.
18. Fan, J.Y.; Yuan, Y.X. Convergence properties of a self-adaptive Levenberg–Marquardt algorithm under local error bound condition. Comput. Optim. Appl. 2006, 34, 47–62.
19. Hei, L. A self-adaptive trust region algorithm. J. Comput. Math. 2003, 21, 229–236.
20. Walmag, J.M.B.; Delhez, E.J.M. A note on trust-region radius update. SIAM J. Optim. 2005, 16, 548–562.
21. Lu, Y.L.; Li, W.Y.; Cao, M.Y.; Yang, Y.T. A novel self-adaptive trust region algorithm for unconstrained optimization. J. Appl. Math. 2014, 2014, 1–8.
22. Amini, K.; Rostami, F. A modified two steps Levenberg–Marquardt method for nonlinear equations. J. Comput. Appl. Math. 2015, 288, 341–350.
23. He, X.R.; Tang, J.Y. A smooth Levenberg–Marquardt method without nonsingularity condition for wLCP. AIMS Math. 2022, 7, 8914–8932.
24. Fan, J.Y. Accelerating the modified Levenberg–Marquardt method for nonlinear equations. Math. Comput. 2014, 83, 1173–1187.
25. Chen, L. A high-order modified Levenberg–Marquardt method for systems of nonlinear equations with fourth-order convergence. Appl. Math. Comput. 2016, 285, 79–93.
26. Stewart, G.W.; Sun, J.G. Matrix Perturbation Theory; Academic Press: San Diego, CA, USA, 1990.
27. Moré, J.J. Recent developments in algorithms and software for trust region methods. Math. Program. 1983, 85, 258–287.
28. Schnabel, R.B.; Frank, P.D. Tensor methods for nonlinear equations. SIAM J. Numer. Anal. 1984, 21, 815–843.
29. Dolan, E.D.; Moré, J.J. Benchmarking optimization software with performance profiles. Math. Program. 2002, 91, 201–213.
30. Wen, H. Research on Profit Maximization Strategy of Fresh Agricultural Products Supply Chain under Different Dominated Subjects. Ph.D. Thesis, Huazhong Agricultural University, Wuhan, China, 2020.
Figure 1. Performance profiles for the iterations.
Figure 2. Performance profiles for the function evaluations.
Figure 3. Performance profiles for the gradient evaluations.
Figure 4. Performance profiles for the CPU time.
Table 1. The optimal solution corresponding to different initial points by NAALM.

Initial Point | $w_1$ | $w_2$ | $p_1$ | $p_2$ | $q_1$ | $q_2$ | $\pi_T$
(1;1;1;1) | 45.0266 | 43.7205 | 65.065 | 64.329 | 10.5818 | 13.2479 | 1.4587 × 10³
(10;10;10;10) | 44.9648 | 43.7460 | 64.991 | 64.358 | 10.5163 | 13.2607 | 1.4587 × 10³
(30;30;30;30) | 44.9920 | 43.7496 | 64.956 | 64.376 | 10.5825 | 13.3037 | 1.4587 × 10³
(50;50;50;50) | 45.0158 | 43.7525 | 65.062 | 64.372 | 10.5125 | 13.3036 | 1.4587 × 10³
(100;100;100;100) | 44.9751 | 43.7553 | 65.955 | 64.269 | 10.5222 | 13.2679 | 1.4587 × 10³