
Convergence criteria of Newton’s method on Lie groups

Abstract

In the present paper, we study Newton’s method on Lie groups (independent of affine connections) for finding zeros of a mapping f from a Lie group to its Lie algebra. Under a generalized L-average Lipschitz condition on the differential of f, we establish a unified convergence criterion for Newton’s method. As applications, we obtain the convergence criteria under Kantorovich’s condition and the γ-condition, respectively. Moreover, applications to optimization problems are also provided.

MSC: 65H10; 65D99.

1 Introduction

Newton’s method is one of the most important methods for finding approximate solutions of the equation f(x) = 0, where f is an operator from some domain D in a real or complex Banach space X to another Banach space Y. As is well known, one of the most important results on Newton’s method is Kantorovich’s theorem (cf. [1]). Under the mild condition that the second Fréchet derivative of f is bounded (or, more generally, that the first derivative is Lipschitz continuous) on a proper open metric ball around the initial point x_0, Kantorovich’s theorem provides a simple and clear criterion ensuring the quadratic convergence of Newton’s method. Another important result on Newton’s method is Smale’s point estimate theory (i.e., the α-theory and the γ-theory) in [2], where the notion of an approximate zero was introduced and rules were established for judging whether an initial point x_0 is an approximate zero, depending on information about the analytic nonlinear operator at this initial point and at a solution x^*, respectively. There are many works on the weakening and/or extension of the Lipschitz continuity assumed on the mappings; see, for example, [3–7] and references therein. In particular, Zabrejko and Nguen parametrized the classical Lipschitz continuity in [7], and Wang introduced in [6] the notion of Lipschitz conditions with L-average to unify both Kantorovich’s and Smale’s criteria.

In a Riemannian manifold framework, an analogue of the well-known Kantorovich theorem was given in [8] for Newton’s method for vector fields on Riemannian manifolds, while the extensions of Smale’s famous α-theory and γ-theory in [2] to analytic vector fields and analytic mappings on Riemannian manifolds were carried out in [9]. In the recent paper [10], the convergence criteria of [9] were improved by using the notion of the γ-condition for vector fields and mappings on Riemannian manifolds. The radii of uniqueness balls of singular points of vector fields satisfying the γ-conditions were estimated in [11], while the local behavior of Newton’s method on Riemannian manifolds was studied in [12, 13]. Furthermore, in [14], Li and Wang extended the generalized L-average Lipschitz condition (introduced in [6]) to Riemannian manifolds and established a unified convergence criterion for Newton’s method on Riemannian manifolds. Similarly, inspired by previous work of Zabrejko and Nguen in [7] on Kantorovich’s majorant method, Alvarez et al. introduced in [15] a Lipschitz-type radial function for the covariant derivative of vector fields and mappings on Riemannian manifolds and established a unified convergence criterion for Newton’s method on Riemannian manifolds.

Note also that Mahony used one-parameter subgroups of a Lie group to develop a version of Newton’s method on an arbitrary Lie group in [16], where the algorithm presented is independent of affine connections on the Lie group. This means that Newton’s method on Lie groups differs from the one defined on Riemannian manifolds. On the other hand, motivated by the search for approaches to solving ordinary differential equations on Lie groups, Owren and Welfert also studied in [17] Newton’s method, independent of affine connections on the Lie group, and showed local quadratic convergence. Recently, Wang and Li [18] established Kantorovich’s theorem (independent of the connection) for Newton’s method on Lie groups. More precisely, under the assumption that the differential of f satisfies a Lipschitz condition around the initial point (which is expressed in terms of one-parameter subgroups and is independent of the metric), a convergence criterion for Newton’s method was presented. Extensions of Smale’s point estimate theory for Newton’s method on Lie groups were given in [19].

The purpose of the present paper is to establish a unified convergence criterion for Newton’s method (independent of the connection) on Lie groups under a generalized L-average Lipschitz condition. As applications, we obtain the convergence criteria under Kantorovich’s condition and the γ-condition, respectively. Hence, our results extend the corresponding results in [18] and [19], respectively. Moreover, applications to optimization problems are also provided.

The remainder of the paper is organized as follows. Some preliminary results and notions are given in Section 2, while the main results on a unified convergence criterion are presented in Section 3. In Section 4, applications to optimization problems are explored. Theorems under Kantorovich’s condition and the γ-condition are provided in the final section.

2 Notions and preliminaries

Most of the notions and notations used in the present paper are standard; see, for example, [20, 21]. A Lie group (G, ·) is a Hausdorff topological group with countable bases which also has the structure of an analytic manifold such that the group product and the inversion are analytic operations in the differentiable structure given on the manifold. The dimension of a Lie group is that of the underlying manifold, and we shall always assume that it is m-dimensional. The symbol e designates the identity element of G. Let 𝔤 be the Lie algebra of the Lie group G, which is the tangent space T_e G of G at e, equipped with the Lie bracket [·,·] : 𝔤 × 𝔤 → 𝔤.

In the sequel we make use of the left translation of the Lie group G. We define, for each y ∈ G, the left translation L_y : G → G by

L_y(z) = y·z for each z ∈ G.
(2.1)

The differential of L_y at z is denoted by (L_y)'_z, which clearly determines a linear isomorphism from T_z G to the tangent space T_{y·z} G. In particular, the differential (L_y)'_e of L_y at e determines a linear isomorphism from 𝔤 to the tangent space T_y G. The exponential map exp : 𝔤 → G is certainly the most important construction associated to G and 𝔤, and is defined as follows. Given u ∈ 𝔤, let σ_u : ℝ → G be the one-parameter subgroup of G determined by the left invariant vector field X_u : y ↦ (L_y)'_e(u); i.e., σ_u satisfies

σ_u(0) = e and σ'_u(t) = X_u(σ_u(t)) = (L_{σ_u(t)})'_e(u) for each t ∈ ℝ.
(2.2)

The value of the exponential map exp at u is then defined by

exp(u) = σ_u(1).

Moreover, we have

exp(tu) = σ_{tu}(1) = σ_u(t) for each t ∈ ℝ and u ∈ 𝔤
(2.3)

and

exp((t+s)u) = exp(tu)·exp(su) for any t, s ∈ ℝ and u ∈ 𝔤.
(2.4)

Note that the exponential map is not surjective in general. However, the exponential map is a diffeomorphism on an open neighborhood of 0 ∈ 𝔤. In the case when G is Abelian, exp is also a homomorphism from 𝔤 to G, i.e.,

exp(u+v) = exp(u)·exp(v) for all u, v ∈ 𝔤.
(2.5)

In the non-Abelian case, exp is not a homomorphism and, by the Baker-Campbell-Hausdorff (BCH) formula (cf. [[21], p.114]), (2.5) must be replaced by

exp(w) = exp(u)·exp(v)
(2.6)

for all u, v in a neighborhood of 0 ∈ 𝔤, where w is defined by

w := u + v + (1/2)[u,v] + (1/12)([u,[u,v]] + [v,[v,u]]) + ··· .
(2.7)
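For matrix Lie groups the formula can be checked numerically. The sketch below is our own illustration (not from the paper): it uses plain 2×2 matrices as a stand-in for a matrix Lie algebra, computes exp via its Taylor series, truncates the series (2.7) after the displayed terms, and confirms that exp(w) then matches exp(u)·exp(v) up to fourth-order terms in u, v.

```python
def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_add(A, B):
    return [[A[i][j] + B[i][j] for j in range(2)] for i in range(2)]

def mat_scale(c, A):
    return [[c * A[i][j] for j in range(2)] for i in range(2)]

def bracket(A, B):
    # Lie bracket [A, B] = AB - BA
    return mat_add(mat_mul(A, B), mat_scale(-1.0, mat_mul(B, A)))

def expm(A, n_terms=25):
    # matrix exponential via its Taylor series (adequate for small ||A||)
    out = [[1.0, 0.0], [0.0, 1.0]]
    term = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, n_terms):
        term = mat_scale(1.0 / k, mat_mul(term, A))
        out = mat_add(out, term)
    return out

u = [[0.0, 0.1], [0.0, 0.0]]   # two small non-commuting directions
v = [[0.0, 0.0], [0.1, 0.0]]

# w = u + v + (1/2)[u,v] + (1/12)([u,[u,v]] + [v,[v,u]]),
# i.e. the BCH series truncated after the displayed terms.
w = mat_add(mat_add(u, v), mat_add(
    mat_scale(0.5, bracket(u, v)),
    mat_scale(1.0 / 12.0,
              mat_add(bracket(u, bracket(u, v)), bracket(v, bracket(v, u))))))

rhs = mat_mul(expm(u), expm(v))
lhs = expm(w)
naive = expm(mat_add(u, v))    # exp(u+v), i.e. no bracket corrections
err_bch = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
err_naive = max(abs(naive[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
```

The residual err_bch is dominated by the first omitted (degree-four) BCH term, while err_naive shows the second-order error made by pretending exp is a homomorphism.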

Let f : G → 𝔤 be a C¹-map and let x ∈ G. We use f'_x to denote the differential of f at x. Then, by [[22], p.9] (the proof given there for a smooth mapping still works for a C¹-map), for each x' ∈ T_x G and any nontrivial smooth curve c : (−ε, ε) → G with c(0) = x and c'(0) = x', one has

f'_x x' = (d/dt (f ∘ c)(t))|_{t=0}.
(2.8)

In particular,

f'_x x' = (d/dt f(x·exp(t (L_{x^{-1}})'_x x')))|_{t=0} for each x' ∈ T_x G.
(2.9)

Define the linear map df_x : 𝔤 → 𝔤 by

df_x u = (d/dt f(x·exp(tu)))|_{t=0} for each u ∈ 𝔤.
(2.10)

Then, by (2.9),

df_x = f'_x ∘ (L_x)'_e.
(2.11)

Also, in view of the definition, we have that for all t ≥ 0,

(d/dt) f(x·exp(tu)) = df_{x·exp(tu)} u for each u ∈ 𝔤
(2.12)

and

f(x·exp(tu)) − f(x) = ∫_0^t df_{x·exp(su)} u ds for each u ∈ 𝔤.
(2.13)

For the remainder of the present paper, we always assume that ⟨·,·⟩ is an inner product on 𝔤 and ‖·‖ is the associated norm on 𝔤. We now introduce the following distance on G, which plays a key role in our study. Let x, y ∈ G and define

ϱ(x,y) := inf { Σ_{i=1}^k ‖u_i‖ : there exist k ≥ 1 and u_1, ..., u_k ∈ 𝔤 such that y = x·exp(u_1) ··· exp(u_k) },
(2.14)

where we adopt the convention that inf ∅ = +∞. It is easy to verify that ϱ(·,·) is a distance on G and that the topology induced by this distance is equivalent to the original one on G.

Let x ∈ G and r > 0. We denote by C_r(x) the corresponding ball of radius r around x in G, that is,

C_r(x) := { y ∈ G : ϱ(x,y) < r }.

Let L(𝔤) denote the set of all linear operators on 𝔤. Below, we modify the notion of the Lipschitz condition with L-average for mappings on Banach spaces to suit the present setting. Let L be a positive nondecreasing integrable function on [0,R], where R is a positive number large enough that ∫_0^R (R − s) L(s) ds ≥ R. The notion of the Lipschitz condition in the inscribed sphere with L-average for operators between Banach spaces was first introduced in [23] by Wang for the study of Smale’s point estimate theory.

Definition 2.1 Let r > 0, x_0 ∈ G, and let T be a mapping from G to L(𝔤). Then T is said to satisfy the L-average Lipschitz condition on C_r(x_0) if

‖T(x·exp(u)) − T(x)‖ ≤ ∫_{ρ(x,x_0)}^{ρ(x,x_0)+‖u‖} L(s) ds
(2.15)

holds for any u, u_0, ..., u_k ∈ 𝔤 and x ∈ C_r(x_0) such that x = x_0·exp(u_0)·exp(u_1) ··· exp(u_k) and ‖u‖ + ρ(x,x_0) < r, where ρ(x,x_0) := Σ_{i=0}^k ‖u_i‖.

The majorizing function h defined below, which was first introduced and studied by Wang (cf. [23]), is a powerful tool in our study. Let r_0 > 0 and b > 0 be such that

∫_0^{r_0} L(s) ds = 1 and b = ∫_0^{r_0} L(s) s ds.
(2.16)

For β > 0, define the majorizing function h by

h(t) = β − t + ∫_0^t L(s)(t − s) ds for each 0 ≤ t ≤ R.
(2.17)

Some useful properties are described in the following propositions; see [23].

Proposition 2.1 The function h is monotonically decreasing on [0, r_0] and monotonically increasing on [r_0, R]. Moreover, if β ≤ b, then h has a unique zero in each of [0, r_0] and [r_0, R], denoted by r_1 and r_2, respectively.

Let {t_n} denote the sequence generated by Newton’s method with initial value t_0 = 0 for h, that is,

t_{n+1} = t_n − h'(t_n)^{-1} h(t_n) for each n = 0, 1, ... .
(2.18)

Proposition 2.2 Suppose that β ≤ b. Then the sequence {t_n} generated by (2.18) is monotonically increasing and converges to r_1.
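For a quick numerical feel, the iteration (2.18) can be run directly. The sketch below is our own illustration with the constant function L(s) ≡ L treated in Section 5, for which b = 1/(2L) and the smaller zero of h is r_1 = (1 − √(1 − 2βL))/L; the illustrative values β = 1/4, L = 1 satisfy β ≤ b = 1/2.

```python
def majorizing_sequence(beta, L, n_steps=10):
    # Newton iterates (2.18) with t_0 = 0 for the majorant
    # h(t) = beta - t + (L/2) t**2  (the constant-L case of (2.17)),
    # whose derivative is h'(t) = L*t - 1.
    ts = [0.0]
    t = 0.0
    for _ in range(n_steps):
        t -= (beta - t + 0.5 * L * t * t) / (L * t - 1.0)
        ts.append(t)
    return ts

# With beta = 1/4 <= b = 1/2 (L = 1), the sequence increases
# monotonically to r1 = 1 - sqrt(1 - 2*beta) = 1 - sqrt(0.5).
ts = majorizing_sequence(0.25, 1.0)
```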

The following lemma will be useful in the proof of the main theorem.

Lemma 2.1 Let 0 < r ≤ r_0 and let x_0 ∈ G be such that df_{x_0}^{-1} exists. Suppose that df_{x_0}^{-1} df satisfies the L-average Lipschitz condition on C_r(x_0). Let x ∈ C_r(x_0) be such that there exist k ≥ 1 and u_0, ..., u_k ∈ 𝔤 satisfying x = x_0·exp(u_0) ··· exp(u_k) and ρ(x,x_0) := Σ_{i=0}^k ‖u_i‖ < r. Then df_x^{-1} exists and

‖df_x^{-1} df_{x_0}‖ ≤ 1 / (1 − ∫_0^{ρ(x,x_0)} L(s) ds).
(2.19)

Proof Write y_0 = x_0 and y_{i+1} = y_i·exp(u_i) for each i = 0, ..., k. Since (2.15) holds with T = df_{x_0}^{-1} df, one has

‖df_{x_0}^{-1}(df_{y_i·exp(u_i)} − df_{y_i})‖ ≤ ∫_{ρ(y_i,x_0)}^{ρ(y_{i+1},x_0)} L(s) ds for each 0 ≤ i ≤ k.
(2.20)

Noting that y_{k+1} = x, we have

‖df_{x_0}^{-1} df_x − I‖ ≤ Σ_{i=0}^k ‖df_{x_0}^{-1}(df_{y_{i+1}} − df_{y_i})‖ ≤ ∫_0^{ρ(x,x_0)} L(s) ds < ∫_0^{r_0} L(s) ds = 1.

Thus the conclusion follows from the Banach lemma and the proof is complete. □

3 Convergence criteria

Following [17], we define Newton’s method with initial point x_0 for f on a Lie group as follows:

x_{n+1} = x_n·exp(−df_{x_n}^{-1} f(x_n)) for each n = 0, 1, ... .
(3.1)
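As a concrete illustration (ours, not from the paper), iteration (3.1) can be run on the simplest nontrivial setting: the multiplicative group G = (0, ∞), whose Lie algebra is ℝ with exponential map u ↦ e^u. The differential df_x is approximated here by a difference quotient, and the zero-finding problem f(x) = x − 2 is purely illustrative.

```python
import math

def newton_lie_scalar(f, x0, tol=1e-12, max_iter=50, h=1e-6):
    # Newton's method (3.1) on the multiplicative group G = (0, inf):
    # the Lie algebra is R, the exponential map is u -> e**u, and
    # df_x u = d/dt f(x * e**(t*u)) |_{t=0}, approximated by a central
    # difference along the direction u = 1.
    x = x0
    for _ in range(max_iter):
        dfx = (f(x * math.exp(h)) - f(x * math.exp(-h))) / (2 * h)
        v = -f(x) / dfx              # v_n = -df_{x_n}^{-1} f(x_n)
        x = x * math.exp(v)          # x_{n+1} = x_n . exp(v_n)
        if abs(v) < tol:
            break
    return x

# Illustrative problem: f(x) = x - 2 maps G into its Lie algebra R
# and vanishes at x = 2; the iterates converge quadratically.
root = newton_lie_scalar(lambda x: x - 2.0, x0=1.0)
```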

Recall that f : G → 𝔤 is a C¹-mapping. In the remainder of this section, we always assume that x_0 ∈ G is such that df_{x_0}^{-1} exists, and set β := ‖df_{x_0}^{-1} f(x_0)‖. Let r_0 and b be given by (2.16), and let r_1 be given by Proposition 2.1.

Theorem 3.1 Suppose that df_{x_0}^{-1} df satisfies the L-average Lipschitz condition on C_{r_1}(x_0) and that

β = ‖df_{x_0}^{-1} f(x_0)‖ ≤ b.
(3.2)

Then the sequence {x_n} generated by Newton’s method (3.1) with initial point x_0 is well defined and converges to a zero x^* of f. Moreover, the following assertions hold for each n = 0, 1, ... :

ϱ(x_{n+1}, x_n) ≤ ‖df_{x_n}^{-1} f(x_n)‖ ≤ t_{n+1} − t_n;
(3.3)

ϱ(x_n, x^*) ≤ r_1 − t_n.
(3.4)

Proof Write v_n = −df_{x_n}^{-1} f(x_n) for each n = 0, 1, ... . Below we shall show that each v_n is well defined and that

ϱ(x_{n+1}, x_n) ≤ ‖v_n‖ ≤ t_{n+1} − t_n
(3.5)

holds for each n = 0, 1, ... . Granting this, one sees that the sequence {x_n} generated by Newton’s method (3.1) with initial point x_0 is well defined and converges to a zero x^* of f because, by (3.1),

x_{n+1} = x_n·exp(v_n) for each n = 0, 1, ... .

Furthermore, assertions (3.3) and (3.4) hold for each n, and the proof of the theorem is completed.

Note that v_0 is well defined by assumption and x_1 = x_0·exp(v_0). Hence ϱ(x_1, x_0) ≤ ‖v_0‖. Since ‖v_0‖ = ‖df_{x_0}^{-1} f(x_0)‖ = β = t_1 − t_0, it follows that (3.5) is true for n = 0. We now proceed by mathematical induction on n. For this purpose, assume that v_n is well defined and that (3.5) holds for each n ≤ k − 1. Then

Σ_{i=0}^{k−1} ‖v_i‖ ≤ t_k − t_0 = t_k < r_1 and x_k = x_0·exp(v_0) ··· exp(v_{k−1}).
(3.6)

Thus, we can use Lemma 2.1 to conclude that df_{x_k}^{-1} exists and

‖df_{x_k}^{-1} df_{x_0}‖ ≤ 1 / (1 − ∫_0^{t_k} L(s) ds) = −h'(t_k)^{-1}.
(3.7)

Therefore, v_k is well defined. Observe that

f(x_k) = f(x_k) − f(x_{k−1}) − df_{x_{k−1}} v_{k−1}
  = ∫_0^1 df_{x_{k−1}·exp(t v_{k−1})} v_{k−1} dt − df_{x_{k−1}} v_{k−1}
  = ∫_0^1 [df_{x_{k−1}·exp(t v_{k−1})} − df_{x_{k−1}}] v_{k−1} dt,

where the first equality holds because df_{x_{k−1}} v_{k−1} = −f(x_{k−1}) by (3.1), and the second equality is valid because of (2.13). Therefore, applying (2.15), one has

‖df_{x_0}^{-1} f(x_k)‖ ≤ ∫_0^1 ‖df_{x_0}^{-1} [df_{x_{k−1}·exp(t v_{k−1})} − df_{x_{k−1}}] v_{k−1}‖ dt
  ≤ ∫_0^1 ( ∫_{ρ(x_{k−1},x_0)}^{ρ(x_{k−1},x_0)+t‖v_{k−1}‖} L(s) ds ) ‖v_{k−1}‖ dt
  ≤ ∫_0^1 ( ∫_{t_{k−1}}^{t_{k−1}+t(t_k−t_{k−1})} L(s) ds ) (t_k − t_{k−1}) dt
  = ∫_{t_{k−1}}^{t_k} L(s)(t_k − s) ds
  = h(t_k) − h(t_{k−1}) − h'(t_{k−1})(t_k − t_{k−1})
  = h(t_k),
(3.8)

where the last equality holds because h(t_{k−1}) + h'(t_{k−1})(t_k − t_{k−1}) = 0. Combining this with (3.7) yields

‖v_k‖ = ‖df_{x_k}^{-1} f(x_k)‖ ≤ ‖df_{x_k}^{-1} df_{x_0}‖ ‖df_{x_0}^{-1} f(x_k)‖ ≤ −h'(t_k)^{-1} h(t_k) = t_{k+1} − t_k.
(3.9)

Since x_{k+1} = x_k·exp(v_k), we have ϱ(x_{k+1}, x_k) ≤ ‖v_k‖. This together with (3.9) gives that (3.5) holds for n = k, which completes the proof of the theorem. □

4 Applications to optimization problems

Let ϕ : G → ℝ be a C²-map. Consider the following optimization problem:

min_{x∈G} ϕ(x).
(4.1)

Newton’s method for solving (4.1) was presented in [16], where a local quadratic convergence result was established for a smooth function ϕ.

Let X ∈ 𝔤. Following [16], we use X̃ to denote the left invariant vector field associated with X, defined by

X̃(x) = (L_x)'_e X for each x ∈ G,

and X̃ϕ the Lie derivative of ϕ with respect to the left invariant vector field X̃; that is, for each x ∈ G,

(X̃ϕ)(x) = (d/dt)|_{t=0} ϕ(x·exp(tX)).
(4.2)

Let {X_1, ..., X_n} be an orthonormal basis of 𝔤. According to [[24], p.356] (see also [16]), grad ϕ is the vector field on G defined by

grad ϕ(x) = (X̃_1, ..., X̃_n)(X̃_1 ϕ(x), ..., X̃_n ϕ(x))^T = Σ_{j=1}^n X̃_j ϕ(x) X̃_j for each x ∈ G.
(4.3)

Then Newton’s method with initial point x_0 ∈ G considered in [16] can be written in a coordinate-free form as follows.

Algorithm 4.1

Find X_k ∈ 𝔤 such that X̃_k is the left invariant vector field with X̃_k(x) = (L_x)'_e X_k and

grad ϕ(x_k) + grad(X̃_k ϕ)(x_k) = 0;

Set x_{k+1} = x_k·exp(X_k);

Set k ← k + 1 and repeat.
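As a toy instance (ours, not from [16]), Algorithm 4.1 can be run on the multiplicative group G = (0, ∞), whose Lie algebra is ℝ with basis {1}; the gradient and the operator H_xϕ defined below reduce to the first and second Lie derivatives of ϕ along that basis direction, approximated here by difference quotients. The objective ϕ is illustrative.

```python
import math

def lie_newton_minimize(phi, x0, n_iter=20, h=1e-5):
    # Algorithm 4.1 on G = (0, inf) with Lie algebra R and exp(u) = e**u.
    # In this 1-d setting:
    #   grad phi(x)   = (X~ phi)(x)    = d/dt   phi(x * e**t) |_{t=0},
    #   (H_x phi)(X)  = (X~ X~ phi)(x) = d2/dt2 phi(x * e**t) |_{t=0} times X,
    # both approximated by central differences.
    x = x0
    for _ in range(n_iter):
        g = (phi(x * math.exp(h)) - phi(x * math.exp(-h))) / (2 * h)
        H = (phi(x * math.exp(h)) - 2 * phi(x) + phi(x * math.exp(-h))) / (h * h)
        X = -g / H                   # solve grad phi(x_k) + (H_{x_k} phi) X_k = 0
        x = x * math.exp(X)          # x_{k+1} = x_k . exp(X_k)
    return x

# Illustrative objective with minimizer x = 2.
xmin = lie_newton_minimize(lambda x: 0.5 * (x - 2.0) ** 2, x0=1.5)
```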

Let f : G → 𝔤 be the mapping defined by

f(x) = ((L_x)'_e)^{-1} grad ϕ(x) for each x ∈ G.
(4.4)

Define the linear operator H_x ϕ : 𝔤 → 𝔤 for each x ∈ G by

(H_x ϕ) X = ((L_x)'_e)^{-1} grad(X̃ϕ)(x) for each X ∈ 𝔤.
(4.5)

Then H_{(·)} ϕ defines a mapping from G to L(𝔤). The following proposition, which was given in [18], establishes the equivalence between df_x and H_x ϕ.

Proposition 4.1 Let f(·) and H_{(·)} ϕ be defined by (4.4) and (4.5), respectively. Then

df_x = H_x ϕ for each x ∈ G.
(4.6)

Remark 4.1 One can easily see from Proposition 4.1 that, with the same initial point, the sequence generated by Algorithm 4.1 for ϕ coincides with the one generated by Newton’s method (3.1) for f defined by (4.4).

Let x_0 ∈ G be such that (H_{x_0}ϕ)^{-1} exists, and let β_ϕ := ‖(H_{x_0}ϕ)^{-1} ((L_{x_0})'_e)^{-1} grad ϕ(x_0)‖. Recall that r_0 and b are given by (2.16), and r_1 is given by Proposition 2.1. Then the main theorem of this section is as follows.

Theorem 4.1 Suppose that

β_ϕ = ‖(H_{x_0}ϕ)^{-1} ((L_{x_0})'_e)^{-1} grad ϕ(x_0)‖ ≤ b
(4.7)

and that (H_{x_0}ϕ)^{-1}(H_{(·)}ϕ) satisfies the L-average Lipschitz condition on C_{r_1}(x_0). Then the sequence generated by Algorithm 4.1 with initial point x_0 is well defined and converges to a critical point x^* of ϕ: grad ϕ(x^*) = 0.

Furthermore, if H_{x_0}ϕ is additionally positive definite and the following Lipschitz condition is satisfied:

‖(H_{x_0}ϕ)^{-1}(H_{x·exp(u)}ϕ − H_x ϕ)‖ ≤ ∫_{ϱ(x_0,x)}^{ϱ(x_0,x)+‖u‖} L(s) ds for x ∈ G and u ∈ 𝔤 with ϱ(x_0,x) + ‖u‖ < r_1,
(4.8)

then x^* is a local solution of (4.1).

Proof Recall that f is defined by (4.4). Then, by Proposition 4.1, df_x = H_x ϕ for each x ∈ G. Hence, by the assumptions, df_{x_0}^{-1} df satisfies the L-average Lipschitz condition on C_{r_1}(x_0), and condition (3.2) is satisfied because β_ϕ ≤ b. Thus Theorem 3.1 is applicable; hence the sequence generated by Newton’s method for f with initial point x_0 is well defined and converges to a zero x^* of f. Consequently, by Remark 4.1, the first assertion holds.

To prove the second assertion, assume that H_{x_0}ϕ is additionally positive definite and that the Lipschitz condition (4.8) is satisfied. It suffices to prove that H_{x^*}ϕ is positive definite. Let λ^* and λ_0 be the minimum eigenvalues of H_{x^*}ϕ and H_{x_0}ϕ, respectively. Then λ_0 > 0, and we have to show that λ^* > 0. To do this, let {x_n} be the sequence generated by Algorithm 4.1 and write v_n = −df_{x_n}^{-1} f(x_n) for each n = 0, 1, ... . Then, by Remark 4.1,

x_{n+1} = x_n·exp(v_n) for each n = 0, 1, ...,
(4.9)

and, by Theorem 3.1,

‖v_n‖ ≤ t_{n+1} − t_n for each n = 0, 1, ... .
(4.10)

Therefore, for each n = 0, 1, ...,

‖(H_{x_0}ϕ)^{-1}(H_{x_{n+1}}ϕ − H_{x_0}ϕ)‖ ≤ Σ_{j=0}^n ‖(H_{x_0}ϕ)^{-1}(H_{x_j·exp(v_j)}ϕ − H_{x_j}ϕ)‖ ≤ Σ_{j=0}^n ∫_{ϱ(x_j,x_0)}^{ϱ(x_j,x_0)+‖v_j‖} L(s) ds ≤ ∫_0^{t_{n+1}} L(s) ds < ∫_0^{r_0} L(s) ds = 1
(4.11)

thanks to (4.8)-(4.10). Since

|λ^*/λ_0 − 1| = (1/λ_0) |min_{v∈𝔤, ‖v‖=1} ⟨(H_{x^*}ϕ)v, v⟩ − min_{v∈𝔤, ‖v‖=1} ⟨(H_{x_0}ϕ)v, v⟩| ≤ ‖(H_{x_0}ϕ)^{-1}(H_{x^*}ϕ − H_{x_0}ϕ)‖,

it follows that

|λ^*/λ_0 − 1| ≤ lim_{n→∞} ‖(H_{x_0}ϕ)^{-1}(H_{x_{n+1}}ϕ − H_{x_0}ϕ)‖ < 1

thanks to (4.11). This implies that λ^* > 0 and completes the proof. □

5 Theorems under Kantorovich’s condition and the γ-condition

If L(·) is a constant L, then the L-average Lipschitz condition reduces to the classical Lipschitz condition.

Let r > 0, x_0 ∈ G, and let T be a mapping from G to L(𝔤). Then T is said to satisfy the L-Lipschitz condition on C_r(x_0) if

‖T(x·exp(u)) − T(x)‖ ≤ L‖u‖

holds for any u, u_0, ..., u_k ∈ 𝔤 and x ∈ C_r(x_0) such that x = x_0·exp(u_0)·exp(u_1) ··· exp(u_k) and ‖u‖ + ρ(x,x_0) < r, where ρ(x,x_0) = Σ_{i=0}^k ‖u_i‖.

Let β > 0 and L > 0. The majorizing function h reduces to the quadratic

h(t) = (L/2) t² − t + β for each t ≥ 0.

Let {t_n} denote the sequence generated by Newton’s method with initial value t_0 = 0 for h, that is,

t_{n+1} = t_n − h'(t_n)^{-1} h(t_n) for each n = 0, 1, ... .

Assume that λ := Lβ ≤ 1/2. Then h has two zeros r_1 and r_2:

r_1 = (1 − √(1 − 2λ))/L and r_2 = (1 + √(1 − 2λ))/L;
(5.1)

moreover, {t_n} is monotonically increasing and converges to r_1, and satisfies

r_1 − t_n = (q^{2^n − 1} / Σ_{j=0}^{2^n − 1} q^j) r_1 for each n = 0, 1, ...,

where

q = (1 − √(1 − 2λ)) / (1 + √(1 − 2λ)).
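The closed-form error expression r_1 − t_n = q^{2^n − 1} r_1 / Σ_{j=0}^{2^n − 1} q^j can be checked against the iteration itself. The following sketch is our own; function name and sample values are illustrative.

```python
import math

def kantorovich_error_check(beta, L, n):
    # Run the Newton iterates t_{k+1} = t_k - h(t_k)/h'(t_k) for the
    # quadratic majorant h(t) = (L/2)t**2 - t + beta, then compare
    # r1 - t_n with the closed form q**(2**n - 1) * r1 / sum_{j < 2**n} q**j.
    s = math.sqrt(1 - 2 * L * beta)   # requires lambda = L*beta <= 1/2
    r1 = (1 - s) / L
    q = (1 - s) / (1 + s)
    t = 0.0
    for _ in range(n):
        t -= (0.5 * L * t * t - t + beta) / (L * t - 1.0)
    closed = q ** (2 ** n - 1) * r1 / sum(q ** j for j in range(2 ** n))
    return r1 - t, closed

gap, closed = kantorovich_error_check(beta=0.25, L=1.0, n=3)
```

The two quantities agree to machine precision, reflecting the doubling exponent 2^n behind the quadratic convergence.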

Recall that f : G → 𝔤 is a C¹-mapping. As in the previous section, we always assume that x_0 ∈ G is such that df_{x_0}^{-1} exists, and set β := ‖df_{x_0}^{-1} f(x_0)‖. Then, by Theorem 3.1, we obtain the following result, which was given in [18].

Theorem 5.1 Suppose that df_{x_0}^{-1} df satisfies the L-Lipschitz condition on C_{r_1}(x_0) and that λ = Lβ ≤ 1/2. Then the sequence {x_n} generated by Newton’s method (3.1) with initial point x_0 is well defined and converges to a zero x^* of f. Moreover, the following assertions hold for each n = 0, 1, ... :

ϱ(x_{n+1}, x_n) ≤ ‖df_{x_n}^{-1} f(x_n)‖ ≤ t_{n+1} − t_n;  ϱ(x_n, x^*) ≤ (q^{2^n − 1} / Σ_{j=0}^{2^n − 1} q^j) r_1.

Let x_0 ∈ G be such that (H_{x_0}ϕ)^{-1} exists, and let β_ϕ = ‖(H_{x_0}ϕ)^{-1} ((L_{x_0})'_e)^{-1} grad ϕ(x_0)‖. Recall that r_1 is defined by (5.1). Then, by Theorem 4.1, we obtain the following result, which was given in [18].

Theorem 5.2 Suppose that λ = L β_ϕ ≤ 1/2 and that (H_{x_0}ϕ)^{-1}(H_{(·)}ϕ) satisfies the L-Lipschitz condition on C_{r_1}(x_0). Then the sequence generated by Algorithm 4.1 with initial point x_0 is well defined and converges to a critical point x^* of ϕ: grad ϕ(x^*) = 0.

Furthermore, if H_{x_0}ϕ is additionally positive definite and the following Lipschitz condition is satisfied:

‖(H_{x_0}ϕ)^{-1}(H_{x·exp(u)}ϕ − H_x ϕ)‖ ≤ L‖u‖ for x ∈ G and u ∈ 𝔤 with ϱ(x_0,x) + ‖u‖ < r_1,

then x^* is a local solution of (4.1).

Let k be a positive integer and assume further that f : G → 𝔤 is a C^k-map. Define the map d^k f_x : 𝔤^k → 𝔤 by

d^k f_x u_1 ··· u_k = (∂^k/(∂t_k ··· ∂t_1) f(x·exp(t_k u_k) ··· exp(t_1 u_1)))|_{t_k=···=t_1=0}

for each (u_1, ..., u_k) ∈ 𝔤^k. In particular,

d^k f_x u^k = (d^k/dt^k f(x·exp(tu)))|_{t=0} for each u ∈ 𝔤.

Let 1 ≤ i ≤ k. Then, in view of the definition, one has

d^k f_x u_1 ··· u_k = d^{k−i}(d^i f_{(·)} u_1 ··· u_i)_x u_{i+1} ··· u_k for each (u_1, ..., u_k) ∈ 𝔤^k.

In particular, for fixed u_1, ..., u_{i−1}, u_{i+1}, ..., u_k ∈ 𝔤,

d^i f_x u_1 ··· u_{i−1}(·) = d(d^{i−1} f_{(·)} u_1 ··· u_{i−1})_x(·).

This implies that d^i f_x u_1 ··· u_{i−1} u is linear with respect to u ∈ 𝔤, and so is d^k f_x u_1 ··· u_{i−1} u u_{i+1} ··· u_k. Consequently, d^k f_x is a multilinear map from 𝔤^k to 𝔤 because 1 ≤ i ≤ k is arbitrary. Thus we can define the norm of d^k f_x by

‖d^k f_x‖ = sup { ‖d^k f_x u_1 u_2 ··· u_k‖ : (u_1, ..., u_k) ∈ 𝔤^k with each ‖u_j‖ = 1 }.

For the remainder of the paper, we always assume that f is a C²-map from G to 𝔤. Then, taking i = 2, we have

d²f_z v u = d(df_{(·)} v)_z u for any u, v ∈ 𝔤 and each z ∈ G.

Thus, (2.13) can be applied (with df_{(·)} v in place of f(·) for each v ∈ 𝔤) to conclude the following formula:

df_{x·exp(tu)} − df_x = ∫_0^t d²f_{x·exp(su)} u ds for each u ∈ 𝔤 and t ∈ ℝ.
(5.2)

The γ-conditions for nonlinear operators in Banach spaces were first introduced and explored by Wang [25, 26] to study Smale’s point estimate theory; they were extended in [19] to a map f from a Lie group to its Lie algebra in terms of the map d²f, as given in Definition 5.1 below. Let r > 0 and γ > 0 be such that γr ≤ 1.

Definition 5.1 Let x_0 ∈ G be such that df_{x_0}^{-1} exists. f is said to satisfy the γ-condition at x_0 on C_r(x_0) if, for any x ∈ C_r(x_0) with x = x_0·exp(u_0)·exp(u_1) ··· exp(u_k) such that ρ(x,x_0) := Σ_{i=0}^k ‖u_i‖ < r,

‖df_{x_0}^{-1} d²f_x‖ ≤ 2γ / (1 − γ ρ(x,x_0))³.

As shown in Proposition 5.3, if f is analytic at x_0, then f satisfies the γ-condition at x_0.

Let γ > 0 and let L be the function defined by

L(s) = 2γ / (1 − γs)³ for each 0 ≤ s < 1/γ.
(5.3)

The following proposition shows that the γ-condition implies the L-average Lipschitz condition.

Proposition 5.1 Suppose that f satisfies the γ-condition at x_0 on C_r(x_0). Then df_{x_0}^{-1} df satisfies the L-average Lipschitz condition on C_r(x_0) with L defined by (5.3).

Proof Let x ∈ C_r(x_0) and let u, u_0, ..., u_k ∈ 𝔤 be such that x = x_0·exp(u_0)·exp(u_1) ··· exp(u_k) and Σ_{i=0}^k ‖u_i‖ + ‖u‖ < r. Write ρ(x,x_0) := Σ_{i=0}^k ‖u_i‖. Observe from (5.2) that

df_{x·exp(u)} − df_x = ∫_0^1 d²f_{x·exp(su)} u ds.

Combining this with the assumption yields

‖df_{x_0}^{-1}(df_{x·exp(u)} − df_x)‖ ≤ ∫_0^1 ‖df_{x_0}^{-1} d²f_{x·exp(su)}‖ ‖u‖ ds ≤ ∫_0^1 (2γ / (1 − γ(ρ(x,x_0) + s‖u‖))³) ‖u‖ ds = ∫_{ρ(x,x_0)}^{ρ(x,x_0)+‖u‖} (2γ / (1 − γt)³) dt.

Hence, df_{x_0}^{-1} df satisfies the L-average Lipschitz condition on C_r(x_0) with L defined by (5.3). □

Corresponding to the function L defined by (5.3), the constants r_0 and b in (2.16) are r_0 = (1 − √2/2)(1/γ) and b = (3 − 2√2)(1/γ), and the majorizing function given in (2.17) reduces to

h(t) = β − t + γt²/(1 − γt) for each 0 ≤ t ≤ R.

Hence the condition β ≤ b is equivalent to α := γβ ≤ 3 − 2√2. Let {t_n} denote the sequence generated by Newton’s method with initial value t_0 = 0 for h. Then the following proposition was proved in [27]; see also [10] and [6].

Proposition 5.2 Assume that α = γβ ≤ 3 − 2√2. Then the zeros of h are

r_1 = (1 + α − √((1+α)² − 8α)) / (4γ), r_2 = (1 + α + √((1+α)² − 8α)) / (4γ)

and

β ≤ r_1 ≤ (1 + √2/2)β, (1 − √2/2)(1/γ) ≤ r_2 ≤ 1/(2γ).

Moreover, the following assertions hold:

t_{n+1} − t_n = ((1 − μ^{2^n}) √((1+α)² − 8α)) / (2α (1 − ν μ^{2^n − 1})(1 − ν μ^{2^{n+1} − 1})) · ν μ^{2^n − 1} β ≤ μ^{2^n − 1} β for each n = 0, 1, ...,

where

μ = (1 − α − √((1+α)² − 8α)) / (1 − α + √((1+α)² − 8α)) and ν = (1 + α − √((1+α)² − 8α)) / (1 + α + √((1+α)² − 8α)).
(5.4)
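The constants r_0 = (1 − √2/2)/γ and b = (3 − 2√2)/γ quoted before Proposition 5.2 can be confirmed by numerically integrating L(s) = 2γ/(1 − γs)³. The sketch below is our own, with an arbitrary illustrative value γ = 2 and a simple midpoint rule.

```python
import math

def check_gamma_constants(gamma=2.0, n=100000):
    # For L(s) = 2*gamma/(1 - gamma*s)**3, verify numerically that
    #   int_0^{r0} L(s) ds   ~ 1  with r0 = (1 - sqrt(2)/2)/gamma, and
    #   int_0^{r0} L(s)*s ds ~ b  with b  = (3 - 2*sqrt(2))/gamma,
    # using the midpoint rule on n subintervals.
    r0 = (1 - math.sqrt(2) / 2) / gamma
    L = lambda s: 2 * gamma / (1 - gamma * s) ** 3
    ds = r0 / n
    mids = [(i + 0.5) * ds for i in range(n)]
    mass = sum(L(s) for s in mids) * ds        # should be close to 1
    b = sum(L(s) * s for s in mids) * ds       # should be close to (3 - 2*sqrt(2))/gamma
    return mass, b

mass, b = check_gamma_constants()
```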

Recall that x_0 ∈ G is such that df_{x_0}^{-1} exists, and let β := ‖df_{x_0}^{-1} f(x_0)‖. Then, by Theorem 3.1 and Proposition 5.2, we obtain the following result, which was given in [19].

Theorem 5.3 Suppose that

α := βγ ≤ 3 − 2√2

and that f satisfies the γ-condition at x_0 on C_{r_1}(x_0). Then Newton’s method (3.1) with initial point x_0 is well defined, and the generated sequence {x_n} converges to a zero x^* of f. Moreover, if α < 3 − 2√2, then for each n = 0, 1, ...,

ϱ(x_{n+1}, x_n) ≤ ν^{2^n − 1} β,

where ν is given by (5.4).

Below, we always assume that f is analytic on G. For x ∈ G such that df_x^{-1} exists, we define

γ_x := γ(f,x) = sup_{i≥2} ‖df_x^{-1} (d^i f_x / i!)‖^{1/(i−1)}.

Also, we adopt the convention that γ(f,x) = ∞ if df_x is not invertible. Note that this definition is justified and, in the case when df_x is invertible, γ(f,x) is finite by analyticity.
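For a concrete feel (our own illustration), take again f(x) = x − 2 on the multiplicative group (0, ∞): then f(x·e^{tu}) = x e^{tu} − 2, so d^i f_x u^i = x u^i for every i ≥ 1, and the factor x cancels against df_x^{-1} in the supremum defining γ(f,x), which can then be evaluated directly.

```python
import math

# gamma(f, x) = sup_{i >= 2} |df_x^{-1} d^i f_x / i!|**(1/(i-1)).
# With d^i f_x = x for all i (the scalar example f(x) = x - 2),
# gamma(f, x) = sup_{i >= 2} (1/i!)**(1/(i-1)), independent of x.
terms = [(1.0 / math.factorial(i)) ** (1.0 / (i - 1)) for i in range(2, 40)]
gamma = max(terms)   # the terms decrease in i, so the sup is attained at i = 2
```

The supremum equals 1/2, attained at i = 2, so this f has γ_x = 1/2 at every x.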

The following proposition is taken from [19].

Proposition 5.3 Let γ_{x_0} := γ(f, x_0) and let r = (2 − √2)/(2γ_{x_0}). Then f satisfies the γ_{x_0}-condition at x_0 on C_r(x_0).

Thus, by Theorem 5.3 and Proposition 5.3, we get the following corollary, which was given in [19].

Corollary 5.1 Suppose that

α := β γ_{x_0} ≤ 3 − 2√2.

Then Newton’s method (3.1) with initial point x_0 is well defined and the generated sequence {x_n} converges to a zero x^* of f. Moreover, if α < 3 − 2√2, then for each n = 0, 1, ...,

ϱ(x_{n+1}, x_n) ≤ ν^{2^n − 1} β,

where ν is given by (5.4).

References

  1. Kantorovich LV, Akilov GP: Functional Analysis. Pergamon, Oxford; 1982.


  2. Smale S: Newton’s method estimates from data at one point. In The Merging of Disciplines: New Directions in Pure, Applied and Computational Mathematics. Edited by: Ewing R, Gross K, Martin C. Springer, New York; 1986:185–196.


  3. Ezquerro JA, Hernández MA: Generalized differentiability conditions for Newton’s method. IMA J. Numer. Anal. 2002, 22: 187–205. 10.1093/imanum/22.2.187


  4. Ezquerro JA, Hernández MA: On an application of Newton’s method to nonlinear operators with w -conditioned second derivative. BIT Numer. Math. 2002, 42: 519–530.


  5. Gutiérrez JM, Hernández MA: Newton’s method under weak Kantorovich conditions. IMA J. Numer. Anal. 2000, 20: 521–532. 10.1093/imanum/20.4.521


  6. Wang XH: Convergence of Newton’s method and uniqueness of the solution of equations in Banach space. IMA J. Numer. Anal. 2000, 20(1):123–134. 10.1093/imanum/20.1.123


  7. Zabrejko PP, Nguen DF: The majorant method in the theory of Newton-Kantorovich approximates and the Ptak error estimates. Numer. Funct. Anal. Optim. 1987, 9: 671–674. 10.1080/01630568708816254


  8. Ferreira OP, Svaiter BF: Kantorovich’s theorem on Newton’s method in Riemannian manifolds. J. Complex. 2002, 18: 304–329. 10.1006/jcom.2001.0582


  9. Dedieu JP, Priouret P, Malajovich G: Newton’s method on Riemannian manifolds: covariant alpha theory. IMA J. Numer. Anal. 2003, 23: 395–419. 10.1093/imanum/23.3.395


  10. Li C, Wang JH: Newton’s method on Riemannian manifolds: Smale’s point estimate theory under the γ -condition. IMA J. Numer. Anal. 2006, 26: 228–251.


  11. Wang JH, Li C: Uniqueness of the singular points of vector fields on Riemannian manifolds under the γ -condition. J. Complex. 2006, 22: 533–548. 10.1016/j.jco.2005.11.004


  12. Li C, Wang JH: Convergence of Newton’s method and uniqueness of zeros of vector fields on Riemannian manifolds. Sci. China Ser. A 2005, 48: 1465–1478. 10.1360/04ys0147


  13. Wang JH: Convergence of Newton’s method for sections on Riemannian manifolds. J. Optim. Theory Appl. 2011, 148: 125–145. 10.1007/s10957-010-9748-4


  14. Li C, Wang JH: Newton’s method for sections on Riemannian manifolds: generalized covariant α -theory. J. Complex. 2008, 24: 423–451. 10.1016/j.jco.2007.12.003


  15. Alvarez F, Bolte J, Munier J: A unifying local convergence result for Newton’s method in Riemannian manifolds. Found. Comput. Math. 2008, 8: 197–226. 10.1007/s10208-006-0221-6


  16. Mahony RE: The constrained Newton method on a Lie group and the symmetric eigenvalue problem. Linear Algebra Appl. 1996, 248: 67–89.


  17. Owren B, Welfert B: The Newton iteration on Lie groups. BIT Numer. Math. 2000, 40(2):121–145.


  18. Wang JH, Li C: Kantorovich’s theorems for Newton’s method for mappings and optimization problems on Lie groups. IMA J. Numer. Anal. 2011, 31: 322–347. 10.1093/imanum/drp015


  19. Li C, Wang JH, Dedieu JP: Smale’s point estimate theory for Newton’s method on Lie groups. J. Complex. 2009, 25: 128–151. 10.1016/j.jco.2008.11.001


  20. Helgason S: Differential Geometry, Lie Groups, and Symmetric Spaces. Academic Press, New York; 1978.


  21. Varadarajan VS: Lie Groups, Lie Algebras and Their Representations. Graduate Texts in Mathematics 102. Springer, New York; 1984.


  22. DoCarmo MP: Riemannian Geometry. Birkhäuser Boston, Cambridge; 1992.


  23. Wang XH: Convergence of Newton’s method and inverse function theorem in Banach spaces. Math. Comput. 1999, 68: 169–186. 10.1090/S0025-5718-99-00999-0


  24. Helmke U, Moore JB: Optimization and Dynamical Systems. Communications and Control Engineering Series. Springer, London; 1994.


  25. Wang XH, Han DF: Criterion α and Newton’s method in weak condition. Chin. J. Numer. Math. Appl. 1997, 19: 96–105.


  26. Wang XH: Convergence on the iteration of Halley family in weak conditions. Chin. Sci. Bull. 1997, 42: 552–555. 10.1007/BF03182614


  27. Wang XH, Han DF: On the dominating sequence method in the point estimates and Smale’s theorem. Sci. Sin., Ser. A, Math. Phys. Astron. Tech. Sci. 1990, 33: 135–144.



Acknowledgements

The research of the second author was partially supported by the National Natural Science Foundation of China (grant 11001241; 11371325) and by Zhejiang Provincial Natural Science Foundation of China (grant LY13A010011). The research of the third author was partially supported by a grant from NSC of Taiwan (NSC 102-2115-M-037-002-MY3).

Author information

Corresponding author: Correspondence to Jinsu He.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors participated in its construction and drafted the manuscript. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.



Cite this article

He, J., Wang, J. & Yao, JC. Convergence criteria of Newton’s method on Lie groups. Fixed Point Theory Appl 2013, 293 (2013). https://doi.org/10.1186/1687-1812-2013-293
