New complexity analysis for primal-dual interior-point methods for self-scaled optimization problems

Choi, Bo Kyung; Lee, Gue Myung

doi:10.1186/1687-1812-2012-213

Research
Open access
Published: 26 November 2012


Fixed Point Theory and Applications volume 2012, Article number: 213 (2012)


Abstract

A linear optimization problem over a symmetric cone, defined in a Euclidean Jordan algebra and called a self-scaled optimization problem (SOP), is considered. We formulate an algorithm for a large-update primal-dual interior-point method (IPM) for the SOP by using a proximity function defined by a new kernel function, and we obtain the best known complexity results of the large-update IPM for the SOP by using the Euclidean Jordan algebra techniques.

MSC:90C51, 90C25, 65K05.

1 Introduction and preliminaries

Primal-dual interior-point methods (IPMs) are well known to be among the most effective methods for solving wide classes of optimization problems, for example, the linear optimization (LO) problem, the quadratic optimization problem (QOP), the semidefinite optimization (SDO) problem, the second-order cone optimization (SOCO) problem, and the convex optimization problem (CP).

The so-called barrier update parameter θ plays an important role in both the theory and the practice of IPMs. Usually, if θ is a constant independent of the dimension of the problem, then the algorithm is called a large-update method; if θ depends on the dimension, then the algorithm is called a small-update method. Large-update methods are much more efficient than small-update methods in practice [1], but they have a worse worst-case iteration bound. This gap between theory and practice has been referred to as the irony of IPMs [2]. Recently, many authors have tried to reduce the gap between the worst-case iteration bounds of large-update and small-update IPMs.

Using self-regular proximity functions instead of the classical logarithmic barrier function, Peng et al. [3–5] improved the complexity of large-update IPMs for the LO problem, the SDO problem, and the SOCO problem. Bai et al. [6] introduced a new class of eligible kernel functions, defined by some simple conditions on the kernel function and its derivatives. The best iteration bound for the LO problem, which was given by Bai et al. [6], is $O (\sqrt{n} log n log \frac{n}{ϵ})$ . Recently, Wang et al. [7] obtained the complexity result $O (n log \frac{n}{ϵ})$ for the SDO problem based on a simple kernel function. Bai and Wang [8] obtained the best known complexity result for the SOCO problem based on a parametric kernel function including the classical logarithmic function, the prototype regular kernel function, and the non-self-regular kernel function. Very recently, using the kernel function $ϕ (t) = (t^{2} - 1) / 2 + (e^{t^{- q} - 1} - 1) / q$ , Choi and Lee [9, 10] obtained the complexity results $O (\sqrt{n} {(log n)}^{(q + 1) / q} log \frac{n}{ϵ})$ and $O (\sqrt{N} {(log N)}^{(q + 1) / q} log \frac{N}{ϵ})$ of large-update primal-dual IPMs for SDO and SOCO, respectively.

In this paper, we consider a linear optimization problem over a symmetric cone defined in a Euclidean Jordan algebra. Nesterov and Todd [11] first proposed this kind of optimization problem under the name of convex programming for self-scaled cones and established the polynomial complexity of the primal-dual interior-point method using the so-called NT (Nesterov-Todd) direction [12]. We call the linear optimization problem over the symmetric cone the self-scaled optimization problem (SOP).

Faybusovich first studied the SOP in the framework of Euclidean Jordan algebras. He gave a theoretical background for nondegeneracy assumptions and the uniqueness of solutions of Newton systems in IPMs for the SOP [13], presented a short-step path-following algorithm for a quadratic programming problem defined on the intersection of a symmetric cone with an affine subspace [14], and obtained complexity estimates for a long-step primal-dual interior-point algorithm for minimizing a linear function over the intersection of an affine subspace and a symmetric cone [15]. SOPs include linear optimization problems, semidefinite optimization problems, second-order cone optimization problems, and various combinations of these types of problems as special cases. Schmieta and Alizadeh [16] extended primal-dual interior-point algorithms for LO, SDO, and SOCO to SOPs by using logarithmic barrier functions.

Baes raised an open question in his monograph [17] as follows: The theory of self-regular functions has been created for linear programming by Jiming Peng, Cornelius Roos, and Tamás Terlaky [5]. They subsequently extended it to second-order programming and semidefinite programming separately using implicitly the aforementioned construction. However, the unified treatment of this theory using the Jordan algebraic framework is not accomplished yet.

Choi and Lee [18] gave primal-dual interior point algorithms by using a very simple self-regular function $ψ (t) = \frac{1}{2} {(t - \frac{1}{t})}^{2}$ , $t > 0$ for the SOP and gave partial answers for the question of Baes. Very recently, Vieira [19, 20] gave complete answers for the open question of Baes by proving the e-convexity property of eligible kernel functions and, in particular, he presented the iteration complexity results for ten eligible kernel functions. Among ten kernel functions in [19], the best iteration complexity for a large-update method was obtained for $ψ (t) = \frac{t^{2} - 1}{2} + \frac{t^{1 - q} - 1}{q - 1}$ with $q = log r$ , and its iteration complexity is $O (\sqrt{r} log r log \frac{r}{ϵ})$ , which is the best known one.

In this paper, we define a new eligible kernel function $ψ (t) = \frac{t^{2} - 1}{2} + \frac{e^{p (t^{- q} - 1)} - 1}{p q}$ , with $p ≧ 1$ , $q ≧ 1$ and $t > 0$ , which is modified from the one in [9, 10], and obtain the best known iteration complexity result for the large-update IPM for the SOP by using an analysis based on the kernel function and Euclidean Jordan algebra techniques. In our algorithm, we use a well-known lemma for the upper bound after the μ-update (see Lemma 3.1) instead of Theorem 5.4 in [20]; this lemma makes our analysis of the outer while loop easy. For the complexity analysis, we refer to Theorem 4.9 and Proposition 5.6 in [20], but we use Proposition 3.1 in [18], obtained by the technique of Sun and Sun [21], instead of Proposition 5.7 in [20].

This paper is organized as follows. In Section 2, we introduce our kernel function, formulate the Newton system for the SOP, and present a useful inequality for our proximity function. In Section 3, we give an algorithm for the SOP and calculate an upper bound for the proximity function after a μ-update. We then calculate an upper bound for the difference between the proximity functions before and after one step in the inner iterations and determine our default step size for the search directions. Finally, we present a worst-case iteration bound for our large-update primal-dual interior-point method for the SOP.

Now, we give definitions and preliminary properties for a Euclidean Jordan algebra which are found in [22] and will be used in the next sections.

Definition 1.1 ([22])

A finite-dimensional real vector space V is called an algebra if a bilinear mapping $(x, y) \to x \circ y$ from $V \times V$ to V is defined.

An algebra V is called a Jordan algebra if the following hold:

(i)
commutativity: for all $x, y \in V$ , $x \circ y = y \circ x$ ;
(ii)
Jordan’s axiom: for all $x, y \in V$ , $x^{2} \circ (x \circ y) = x \circ (x^{2} \circ y)$ , where $x^{2} = x \circ x$ .

A Jordan algebra V is said to be Euclidean if

(iii)
$x^{2} + y^{2} = 0 \Rightarrow x = y = 0$ , equivalently, there exists an inner product $(\cdot | \cdot)$ on V such that $(x \circ y | z) = (y | x \circ z)$ .

A Jordan algebra V is simple if it does not contain any non-trivial ideal. The Jordan algebra may not be associative, but it is power-associative, i.e., $x^{p} \circ x^{q} = x^{p + q}$ . We assume a Jordan algebra V has an identity element, i.e., there exists e such that $x \circ e = e \circ x = x$ . Since V is finite-dimensional, given $x \in V$ , there exists a minimal positive integer k such that the vectors $e, x, \dots, x^{k}$ are linearly dependent. Denote this integer $m (x)$ . We define the rank of V as

rank (V) = r = max {m (x) ∣ x \in V} .
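As an illustration that is not taken from the paper, the axioms of Definition 1.1 can be checked on a familiar example: real symmetric matrices with the product $x \circ y = (x y + y x) / 2$ . The sketch below uses exact rational arithmetic; the helper names `matmul` and `jordan` and the sample matrices are assumptions made only for this example.

```python
from fractions import Fraction

def matmul(a, b):
    """Ordinary product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def jordan(a, b):
    """Jordan product a o b = (ab + ba) / 2 on symmetric matrices."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    half = Fraction(1, 2)
    return [[(ab[i][j] + ba[i][j]) * half for j in range(n)] for i in range(n)]

# Two sample symmetric matrices (arbitrary choices for this check).
x = [[1, 2], [2, 3]]
y = [[0, 1], [1, 4]]
x2 = jordan(x, x)

# (i) commutativity and (ii) Jordan's axiom.
commutative = jordan(x, y) == jordan(y, x)
jordan_axiom = jordan(x2, jordan(x, y)) == jordan(x, jordan(x2, y))

# The product is not associative in general: (u o w) o w differs from u o (w o w).
u = [[1, 0], [0, 0]]
w = [[0, 1], [1, 0]]
left = jordan(jordan(u, w), w)
right = jordan(u, jordan(w, w))
associative = left == right
```

Here `left` is half of the identity matrix while `right` equals `u`, which shows why only power-associativity can be expected.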

An element $x \in V$ is said to be invertible if there exists an element $y \in R [x]$ such that $x \circ y = e$ , where $R [x]$ is the algebra over ℝ of polynomials in one variable with coefficients in ℝ. This element y is denoted by $x^{- 1}$ . An element $v \in V$ is called idempotent if $v^{2} = v$ . For an element $x \in V$ , let $L (x)$ be a linear map of V defined as $L (x) y = x \circ y$ . The cone of squares

\bar{Ω} : = {x^{2} ∣ x \in V}

is a symmetric cone, that is, the following conditions hold:

(i)
for every pair of $x, y \in int \bar{Ω}$ , there is an invertible linear transformation $L : V \to V$ such that $L (\bar{Ω}) = \bar{Ω}$ and $L (x) = y$ ;
(ii)
${\bar{Ω}}^{*} = \bar{Ω}$ , where ${\bar{Ω}}^{*} : = {y \in V ∣ (x, y) ≧ 0, for any x \in \bar{Ω}}$ .

Let $Ω = int \bar{Ω}$ . Then $Ω = {x^{2} ∣ x \in V is invertible} = {x \in V ∣ L (x) is positive definite}$ .

Definition 1.2 ([22])

Let $c_{1}, \dots, c_{k} \in V$ . Then ${c_{1}, \dots, c_{k}}$ is said to be a Jordan frame if the $c_{i}$ , $i = 1, \dots, k$ , are non-zero, cannot be written as a sum of two other non-zero idempotents, and satisfy the following properties:

\begin{cases} c_{i}^{2} = c_{i}, \\ c_{i} \circ c_{j} = 0 \quad \text{if } i \neq j, \\ \sum_{i = 1}^{k} c_{i} = e . \end{cases}

Theorem 1.1 (Theorem III.1.2 in [22])

For every $x \in V$ , there exist a Jordan frame ${c_{1} (x), \dots, c_{r} (x)}$ and real numbers $λ_{1} (x), \dots, λ_{r} (x)$ such that

x = λ_{1} (x) c_{1} (x) + \dots + λ_{r} (x) c_{r} (x) .

(1)

The numbers $λ_{i} (x)$ , for all $i = 1, \dots, r$ , are said to be the eigenvalues of x, and (1) is called the eigenvalue (or spectral) decomposition of x. Now, it is possible to extend the definition of any real-valued function $ψ (\cdot)$ to elements of the Euclidean Jordan algebra via their eigenvalues:

ψ (x) : = ψ (λ_{1} (x)) c_{1} (x) + \dots + ψ (λ_{r} (x)) c_{r} (x) .

(2)

Particularly, we have some examples as follows:

(i)
Square root: $x^{1 / 2} = λ_{1}^{1 / 2} (x) c_{1} (x) + \dots + λ_{r}^{1 / 2} (x) c_{r} (x)$ if all $λ_{i} (x) ≧ 0$ .
(ii)
Inverse: $x^{- 1} = λ_{1}^{- 1} (x) c_{1} (x) + \dots + λ_{r}^{- 1} (x) c_{r} (x)$ if all $λ_{i} (x) \neq 0$ .
(iii)
Square: $x^{2} = λ_{1}^{2} (x) c_{1} (x) + \dots + λ_{r}^{2} (x) c_{r} (x)$ .

From the above examples, we know that for $x \in \bar{Ω}$ , $λ_{i} (x^{1 / 2}) = λ_{i}^{1 / 2} (x)$ and for $x \in Ω$ , $λ_{i} (x^{- 1}) = λ_{i}^{- 1} (x)$ . We denote by $ψ^{'} (x)$ the element obtained by applying the derivative $ψ^{'}$ of ψ to the eigenvalues of x:

ψ^{'} (x) : = ψ^{'} (λ_{1} (x)) c_{1} (x) + \dots + ψ^{'} (λ_{r} (x)) c_{r} (x) .

(3)

In the Jordan algebra, we define the determinant of x and the trace of x as follows:

det (x) = \prod_{i = 1}^{r} λ_{i} (x), tr (x) = \sum_{i = 1}^{r} λ_{i} (x) .
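To make the spectral decomposition (1) and the spectral functions (2) concrete, the following sketch works in the rank-2 Jordan algebra associated with the second-order cone, where $x = (x_{0}, \bar{x})$ , $x \circ y = (x^{T} y, x_{0} \bar{y} + y_{0} \bar{x})$ , the eigenvalues are $x_{0} ± ∥ \bar{x} ∥$ and the identity is $e = (1, 0)$ . This example is not part of the paper; the function names `jordan`, `spectral`, and `apply_spectral` are assumptions chosen for illustration.

```python
import math

def jordan(x, y):
    """Jordan product on R x R^(n-1): x o y = (x.y, x0 * ybar + y0 * xbar)."""
    head = sum(a * b for a, b in zip(x, y))
    tail = [x[0] * b + y[0] * a for a, b in zip(x[1:], y[1:])]
    return [head] + tail

def spectral(x):
    """Return the eigenvalues (l1, l2) and a Jordan frame (c1, c2) of x."""
    norm = math.sqrt(sum(a * a for a in x[1:]))
    if norm > 0:
        u = [a / norm for a in x[1:]]
    else:
        # Any unit vector gives a valid frame when xbar = 0.
        u = [1.0] + [0.0] * (len(x) - 2)
    c1 = [0.5] + [0.5 * a for a in u]
    c2 = [0.5] + [-0.5 * a for a in u]
    return (x[0] + norm, x[0] - norm), (c1, c2)

def apply_spectral(f, x):
    """Spectral function f(x) = f(l1) c1 + f(l2) c2, following (2)."""
    (l1, l2), (c1, c2) = spectral(x)
    return [f(l1) * a + f(l2) * b for a, b in zip(c1, c2)]

x = [6.0, 3.0, 4.0]                      # interior point: 6 > ||(3, 4)|| = 5
(l1, l2), (c1, c2) = spectral(x)
trace, det = l1 + l2, l1 * l2            # tr(x) and det(x) from the eigenvalues
root = apply_spectral(math.sqrt, x)      # x^(1/2), defined since l1, l2 >= 0
```

For this sample point the eigenvalues are 11 and 1, so `trace` is 12 and `det` is 11, and `jordan(root, root)` reproduces `x` up to rounding.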

Since V is a Euclidean Jordan algebra, $〈 x, y 〉 : = tr (x \circ y)$ is a scalar product on V (see Proposition III.1.5 in [22]). The following lemma, called the second Peirce decomposition theorem, will be used in Section 3.

Lemma 1.1 (Theorem IV.2.1 in [22], Theorem 2.6.6 (Second Peirce decomposition theorem) in [17])

Let ${c_{1}, \dots, c_{r}}$ be a Jordan frame of V. If

V_{i j} : = \begin{cases} \{v_{i j} ∣ c_{i} \circ v_{i j} = v_{i j}\} & \text{if } i = j, \\ \{v_{i j} ∣ c_{i} \circ v_{i j} = \frac{1}{2} v_{i j}\} \cap \{v_{i j} ∣ c_{j} \circ v_{i j} = \frac{1}{2} v_{i j}\} & \text{if } i \neq j, \end{cases}

we have

(i)
$V = ⨁_{1 ≦ i ≦ j ≦ r} V_{i j}$ ;
(ii)
$V_{i j} \circ V_{k l} = {0}$ , if ${i, j} \cap {k, l} = \emptyset$ ;
(iii)
$V_{i j} \circ V_{j k} \subset V_{i k}$ , if $i \neq k$ ;
(iv)
$tr (v_{i k}) = 0$ , for $v_{i k} \in V_{i k}$ if $i \neq k$ .

Consider the following self-scaled optimization problem (SOP):

\begin{array}{lll} (P) & \text{Minimize} & 〈 c, x 〉 \\ & \text{subject to} & 〈 a_{i}, x 〉 = b_{i}, \ i = 1, \dots, m, \\ & & x \in \bar{Ω}, \end{array}

and its dual problem:

\begin{array}{lll} (D) & \text{Maximize} & \sum_{i = 1}^{m} b_{i} y_{i} \\ & \text{subject to} & \sum_{i = 1}^{m} y_{i} a_{i} + s = c, \\ & & s \in \bar{Ω}, \ y \in R^{m}, \end{array}

where $c, a_{1}, \dots, a_{m} \in V$ and $b \in R^{m}$ are given. We call $x \in \bar{Ω}$ primal feasible if $〈 a_{i}, x 〉 = b_{i}$ for $i = 1, \dots, m$ . Similarly, $(y, s) \in R^{m} \times \bar{Ω}$ is called dual feasible if $\sum_{i = 1}^{m} y_{i} a_{i} + s = c$ . Let $A x = {(〈 a_{1}, x 〉, \dots, 〈 a_{m}, x 〉)}^{T}$ for any $x \in V$ . Then $A : V \to R^{m}$ is a linear transformation. Throughout this paper, we assume that A is surjective. Then its adjoint $A^{T}$ is injective and $A^{T} y = \sum_{i = 1}^{m} y_{i} a_{i}$ , where $y = {(y_{1}, \dots, y_{m})}^{T} \in R^{m}$ . So, we can reformulate (P) and (D) as follows:

\begin{array}{lll} (P) & \text{Minimize} & 〈 c, x 〉 \\ & \text{subject to} & A x = b, \\ & & x \in \bar{Ω}, \end{array}

and its dual problem:

\begin{array}{lll} (D) & \text{Maximize} & b^{T} y \\ & \text{subject to} & A^{T} y + s = c, \\ & & s \in \bar{Ω}, \ y \in R^{m} . \end{array}

We can check that weak duality between (P) and (D) holds, that is, $inf (P) ≧ sup (D)$ . From now on, we assume that both (P) and (D) satisfy the interior-point condition (IPC), that is, there exists $(x^{0}, y^{0}, s^{0})$ such that $A x^{0} = b$ , $x^{0} \in Ω$ , $A^{T} y^{0} + s^{0} = c$ , $s^{0} \in Ω$ . Then there exists a pair of optimal solutions $(x, y, s)$ of (P) and (D), and $inf (P) = sup (D)$ [11, 23].

The following lemma is well known [13, 17, 22, 24].

Lemma 1.2 For $x, s \in V$ , the following statements are equivalent:

(i)
$x, s \in \bar{Ω}$ and $〈 x, s 〉 = 0$ ;
(ii)
$x, s \in \bar{Ω}$ and $x \circ s = 0$ .

Using Lemma 1.2, we can check (see Proposition 2.1 in [13]) that finding a pair of optimal solutions $(x, y, s)$ of (P) and (D) is equivalent to solving the following system:

\begin{cases} A x = b, \\ A^{T} y + s = c, \\ x \circ s = 0, \\ x, s \in \bar{Ω}, \ y \in R^{m} . \end{cases}

(4)

The basic idea of primal-dual IPMs is to replace the third equation in (4), the so-called complementarity condition for the SOP, by the parameterized equation $x \circ s = μ e$ with a positive parameter μ, which gives the following system:

\begin{cases} A x = b, \\ A^{T} y + s = c, \\ x \circ s = μ e, \\ x, s \in Ω, \ y \in R^{m} . \end{cases}

(5)

For each $x \in V$ , we define the quadratic representation as follows:

Q_{x} : = 2 L^{2} (x) - L (x^{2}) .

Lemma 1.3 ([16])

Let $x, s \in Ω$ and p be invertible. Then $x \circ s = μ e$ if and only if $Q_{p} x \circ Q_{p^{- 1}} s = μ e$ .

Proposition 1.1 (Proposition 18 in [16])

If $x, s \in Ω$ , then $Q_{x} s \in Ω$ .

Let $x, s \in Ω$ . Then there exists a unique $p \in Ω$ such that $Q_{p^{2}} x = s$ [25, 26], or equivalently, $Q_{p} x = Q_{p^{- 1}} s$ . This choice of the scaling point p leads to the Nesterov-Todd (NT) method.

From Lemma 1.3, the system (5) becomes

\begin{cases} A x = b, \\ A^{T} y + s = c, \\ Q_{p} x \circ Q_{p^{- 1}} s = μ e, \\ x, s \in Ω, \ y \in R^{m} . \end{cases}

(6)

Then, for each $μ > 0$ , the parameterized system (6) has a unique solution $(x (μ), y (μ), s (μ))$ [11, 27], which is called the μ-center of (P) and (D). The set of μ-centers, that is, $C = {(x (μ), y (μ), s (μ)) ∣ μ > 0}$ , is said to be the central path of (P) and (D). Moreover, as μ tends to zero, $(x (μ), y (μ), s (μ))$ converges to a pair of optimal solutions of (P) and (D) [13, 28].

In general, IPMs for the SOP consist of two strategies. The first one, which is called the inner iteration scheme, is to keep the iterative sequence in a certain neighborhood of the central path or to keep the iterative sequence in a certain neighborhood of the μ-center. And the second one, called the outer iteration scheme, is to decrease the parameter μ to $μ_{+} : = (1 - θ) μ$ for some $θ \in (0, 1)$ .

2 Proximity functions and search directions

Newton’s method is a well-known procedure to solve a system of nonlinear equations. Most IPMs for solving the SOP employ different search directions together with suitable strategies for following the central path appropriately.

Assume that a starting point $(x^{0}, s^{0})$ in a certain neighborhood of the central path corresponding to $μ = 1$ is available. We then decrease μ to $μ_{+} : = (1 - θ) μ$ for some fixed $θ \in (0, 1)$ and linearize the system (6), with μ replaced by $μ_{+}$ , by replacing x, y, s with $x_{+} : = x + Δ x$ , $y_{+} : = y + Δ y$ , $s_{+} : = s + Δ s$ , respectively. Then we get the following Newton system (see [16]):

\begin{cases} A Δ x = 0, \\ A^{T} Δ y + Δ s = 0, \\ Q_{p} x \circ Q_{p^{- 1}} Δ s + Q_{p} Δ x \circ Q_{p^{- 1}} s = μ_{+} e - Q_{p} x \circ Q_{p^{- 1}} s . \end{cases}

(7)

To describe our new search direction, we need more notations:

\begin{aligned} \bar{A} : = \frac{1}{\sqrt{μ}} A Q_{p^{- 1}}, v : = \frac{1}{\sqrt{μ}} Q_{p} x = \frac{1}{\sqrt{μ}} Q_{p^{- 1}} s, \\ d x : = \frac{1}{\sqrt{μ}} Q_{p} Δ x, d s : = \frac{1}{\sqrt{μ}} Q_{p^{- 1}} Δ s . \end{aligned}

(8)

In this case,

p = {[Q_{x^{1 / 2}} {(Q_{x^{1 / 2}} s)}^{- 1 / 2}]}^{- 1 / 2} = {[Q_{s^{- 1 / 2}} {(Q_{s^{1 / 2}} x)}^{1 / 2}]}^{- 1 / 2} .

(9)
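As a sanity check of (8) and (9) that is not part of the paper, consider the LO special case $V = R^{n}$ with the componentwise product. There $L (x)$ is the diagonal matrix of x, so $Q_{a} x = a^{2} x$ componentwise, and (9) reduces to $p_{i} = {(s_{i} / x_{i})}^{1 / 4}$ . The sketch below verifies $Q_{p} x = Q_{p^{- 1}} s$ and $v = \sqrt{x s / μ}$ numerically; the function names and the sample data are assumptions for illustration only.

```python
import math

def q_rep(a, x):
    """Quadratic representation for the componentwise product: Q_a x = a^2 * x."""
    return [ai * ai * xi for ai, xi in zip(a, x)]

def nt_scaling(x, s):
    """Scaling point p of (9), which reduces to (s_i / x_i)^(1/4) in this case."""
    return [(si / xi) ** 0.25 for xi, si in zip(x, s)]

# Arbitrary strictly positive sample data (assumed for the example).
x = [1.0, 4.0, 0.5]
s = [2.0, 0.25, 3.0]
mu = 0.5

p = nt_scaling(x, s)
p_inv = [1.0 / pi for pi in p]
lhs = q_rep(p, x)                          # Q_p x
rhs = q_rep(p_inv, s)                      # Q_{p^{-1}} s
v = [a / math.sqrt(mu) for a in lhs]       # v from (8)
v_direct = [math.sqrt(xi * si / mu) for xi, si in zip(x, s)]
```

The two lists `lhs` and `rhs` agree up to rounding, and `v` coincides with the familiar LO scaling `v_direct`.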

From Proposition 1.1, $v \in Ω$ . Hence, $L (v)$ is positive definite. Thus, the system (7) is equivalent to the following system:

\begin{cases} \bar{A} d x = 0, \\ {\bar{A}}^{T} Δ y + d s = 0, \\ d x + d s = v^{- 1} - v . \end{cases}

(10)

We call the above $(d x, Δ y, d s)$ the NT search direction for the SOP. Furthermore, $〈 d x, d s 〉 = 0$ , which follows from the first and second equations of (10) or, equivalently, from the orthogonality of Δx and Δs.

For our IPM, we use the following new eligible kernel function:

ψ (t) = \frac{t^{2} - 1}{2} + \frac{e^{p (t^{- q} - 1)} - 1}{p q}, p ≧ 1 and q ≧ 1 for t > 0 .

(11)

For the definition of an eligible kernel function, see [6]. The new kernel function (11) satisfies

ψ^{″} (t) > 1, ψ^{‴} (t) < 0 and lim_{t \to 0^{+}} ψ (t) = lim_{t \to \infty} ψ (t) = \infty .

Note that $ψ (1) = ψ^{'} (1) = 0$ . Hence, $ψ (t)$ is completely determined by its second derivative:

ψ (t) = \int_{1}^{t} \int_{1}^{ξ} ψ^{″} (ζ) d ζ d ξ .

(12)
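The kernel function (11) and its first two derivatives are easy to evaluate numerically. The sketch below is an illustration rather than code from the paper: the derivative formulas were obtained by differentiating (11) by hand, and the function names `psi`, `dpsi`, and `d2psi` are assumptions. It can be used to confirm $ψ (1) = ψ^{'} (1) = 0$ and $ψ^{″} (t) > 1$ at sample points.

```python
import math

def psi(t, p, q):
    """Kernel function (11): growth term plus exponential barrier term."""
    return (t * t - 1.0) / 2.0 + (math.exp(p * (t ** -q - 1.0)) - 1.0) / (p * q)

def dpsi(t, p, q):
    """First derivative: t - t^(-q-1) * exp(p (t^(-q) - 1))."""
    return t - t ** (-q - 1.0) * math.exp(p * (t ** -q - 1.0))

def d2psi(t, p, q):
    """Second derivative: 1 + ((q+1) t^(-q-2) + p q t^(-2q-2)) * exp(p (t^(-q) - 1))."""
    g = math.exp(p * (t ** -q - 1.0))
    return 1.0 + ((q + 1.0) * t ** (-q - 2.0) + p * q * t ** (-2.0 * q - 2.0)) * g

# Tabulate (t, psi, psi', psi'') for p = q = 1 at a few sample points.
table = [(t, psi(t, 1.0, 1.0), dpsi(t, 1.0, 1.0), d2psi(t, 1.0, 1.0))
         for t in (0.5, 1.0, 2.0)]
```

Since both terms added to 1 in `d2psi` are positive for $t > 0$ , the property $ψ^{″} (t) > 1$ is visible directly from the formula.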

The proximity function (measure) for (P) and (D) is

Φ (x, s; μ) : = Ψ (v) : = tr (ψ (v)) = \sum_{i = 1}^{r} ψ (λ_{i} (v)),

(13)

where $ψ (v)$ is defined by (2). Note that $Ψ (v) = 0$ , if $v = e$ (i.e., $x \circ s = μ e$ ) and $Ψ (v) > 0$ , otherwise. Replacing the right-hand side of the last equation in (10) by $- ψ^{'} (v)$ , we have the following system from (10):

\begin{cases} \bar{A} d x = 0, \\ {\bar{A}}^{T} Δ y + d s = 0, \\ d x + d s = - ψ^{'} (v) . \end{cases}

(14)

Let $X = {x \in V ∣ \bar{A} x = 0}$ . Then $X^{⊥} = {{\bar{A}}^{T} y ∣ y \in R^{m}}$ . Hence, dx and ds are the orthogonal projections of $- ψ^{'} (v)$ onto X and $X^{⊥}$ , respectively, and the system (14) has a unique solution. We introduce the norm-based proximity measure as follows:

σ : = ∥ d x + d s ∥ = ∥ ψ^{'} (v) ∥ = \sqrt{{∥ d x ∥}^{2} + {∥ d s ∥}^{2}} .

(15)

The following lemma gives a lower bound of σ in terms of $Ψ (v)$ .

Lemma 2.1 For any $v \in Ω$ ,

σ ≧ \sqrt{2 Ψ (v)} .

Proof Since (11) satisfies $2 ψ (t) ≦ {(ψ^{'} (t))}^{2}$ and $σ^{2} = \sum_{i = 1}^{r} {(ψ^{'} (λ_{i} (v)))}^{2}$ ,

2 Ψ (v) ≦ σ^{2} .

This completes the proof. □

Also, our new kernel function (11) satisfies the following exponential convexity property.

Lemma 2.2 Let $t_{1} > 0$ and $t_{2} > 0$ . Then

ψ (\sqrt{t_{1} t_{2}}) ≦ \frac{1}{2} (ψ (t_{1}) + ψ (t_{2})) .
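The scalar inequalities behind Lemma 2.1 ( $2 ψ (t) ≦ {(ψ^{'} (t))}^{2}$ ) and Lemma 2.2 can be spot-checked on a grid. The following sketch is only a numerical sanity check under assumed parameter choices, not a proof; the helper names, the grid, and the small floating-point tolerances are assumptions for this example.

```python
import math

def psi(t, p, q):
    """Kernel function (11)."""
    return (t * t - 1.0) / 2.0 + (math.exp(p * (t ** -q - 1.0)) - 1.0) / (p * q)

def dpsi(t, p, q):
    """First derivative of (11), computed by hand."""
    return t - t ** (-q - 1.0) * math.exp(p * (t ** -q - 1.0))

def holds_lemma_2_1(t, p, q):
    """Scalar form of Lemma 2.1: 2 psi(t) <= psi'(t)^2 (with rounding slack)."""
    return 2.0 * psi(t, p, q) <= dpsi(t, p, q) ** 2 + 1e-12

def holds_lemma_2_2(t1, t2, p, q):
    """Lemma 2.2: psi(sqrt(t1 t2)) <= (psi(t1) + psi(t2)) / 2 (with rounding slack)."""
    lhs = psi(math.sqrt(t1 * t2), p, q)
    rhs = 0.5 * (psi(t1, p, q) + psi(t2, p, q))
    return lhs <= rhs * (1.0 + 1e-9) + 1e-12

grid = [0.4 + 0.2 * k for k in range(19)]          # 0.4, 0.6, ..., 4.0
params = [(1.0, 1.0), (2.0, 3.0)]                  # sample (p, q) pairs
ok_2_1 = all(holds_lemma_2_1(t, p, q) for p, q in params for t in grid)
ok_2_2 = all(holds_lemma_2_2(a, b, p, q)
             for p, q in params for a in grid for b in grid)
```

Summing the scalar inequality of Lemma 2.1 over the eigenvalues $λ_{i} (v)$ gives $2 Ψ (v) ≦ σ^{2}$ , which is how the lemma is used in the text.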

The following proposition can be found in [20], but for completeness, we give its proof.

Proposition 2.1 (Theorem 4.9 in [20])

Let Ψ be the proximity function defined in (13), then for any $x, s \in Ω$ ,

Ψ ({(Q_{x^{1 / 2}} s)}^{1 / 2}) ≦ \frac{1}{2} (Ψ (x) + Ψ (s)) .

Proof Since $Q_{x^{1 / 2}} s \in Ω$ ,

λ_{i} ({(Q_{x^{1 / 2}} s)}^{1 / 2}) = λ_{i}^{1 / 2} (Q_{x^{1 / 2}} s) and Ψ ({(Q_{x^{1 / 2}} s)}^{1 / 2}) = \sum_{i = 1}^{r} ψ (λ_{i}^{1 / 2} (Q_{x^{1 / 2}} s)) .

By Theorem 3.5 in [20],

\prod_{i = 1}^{k} λ_{i} (Q_{x^{1 / 2}} s) ≦ \prod_{i = 1}^{k} λ_{i} (x) λ_{i} (s), for k = 1, \dots, r - 1,

and

\prod_{i = 1}^{r} λ_{i} (Q_{x^{1 / 2}} s) = \prod_{i = 1}^{r} λ_{i} (x) λ_{i} (s) .

Thus,

\prod_{i = 1}^{k} λ_{i}^{1 / 2} (Q_{x^{1 / 2}} s) ≦ \prod_{i = 1}^{k} λ_{i}^{1 / 2} (x) λ_{i}^{1 / 2} (s), for k = 1, \dots, r - 1,

and

\prod_{i = 1}^{r} λ_{i}^{1 / 2} (Q_{x^{1 / 2}} s) = \prod_{i = 1}^{r} λ_{i}^{1 / 2} (x) λ_{i}^{1 / 2} (s) .

Let $α_{i} = λ_{i}^{1 / 2} (Q_{x^{1 / 2}} s)$ and $β_{i} = λ_{i}^{1 / 2} (x) λ_{i}^{1 / 2} (s)$ . Then $α_{i} > 0$ and $β_{i} > 0$ . Since $α_{i}$ and $β_{i}$ satisfy the assumptions of Corollary 3.3.10 in [29], applying (iii) of that corollary with our kernel function (11) yields

\sum_{i = 1}^{r} ψ (λ_{i}^{1 / 2} (Q_{x^{1 / 2}} s)) ≦ \sum_{i = 1}^{r} ψ (λ_{i}^{1 / 2} (x) λ_{i}^{1 / 2} (s)) .

By Lemma 2.2, we obtain the following result:

\sum_{i = 1}^{r} ψ (λ_{i}^{1 / 2} (x) λ_{i}^{1 / 2} (s)) ≦ \frac{1}{2} (\sum_{i = 1}^{r} ψ (λ_{i} (x)) + \sum_{i = 1}^{r} ψ (λ_{i} (s))) = \frac{1}{2} (Ψ (x) + Ψ (s)) .

□

3 Algorithm and its complexity analysis

Now, we explain our algorithm for the large-update primal-dual IPM for the SOP. Assuming that a starting point in a certain neighborhood of the central path is available, we set out from this point and enter the outer ‘while loop’. If μ satisfies $r μ ≧ ϵ$ , then it is reduced by the factor $1 - θ$ , where $θ \in (0, 1)$ . Then we enter the inner ‘while loop’, in which we apply Newton’s method targeting the new μ-center to obtain a search direction $(Δ x, Δ y, Δ s)$ and take a step along it. The inner loop is repeated until we find iterates that are ‘close’ to $(x (μ), y (μ), s (μ))$ , that is, until the proximity satisfies $Φ (x, s; μ) < τ$ . We then return to the outer ‘while loop’. The whole process is repeated until μ is small enough, that is, until $r μ < ϵ$ .

The choice of the step size α is another crucial issue in the analysis of the algorithm. It has to be chosen so that the closeness of the iterates to the current μ-center improves by a sufficient amount. In the algorithm, the inner ‘while loop’ is called the inner iteration and the outer ‘while loop’ is called the outer iteration. Each outer iteration consists of an update of the parameter μ and a sequence of (one or more) inner iterations. The worst-case iteration bound for our algorithm is an upper bound on the total number of inner iterations.

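The algorithm for our large-update primal-dual IPM for the SOP is given as follows:

Input: a threshold parameter $τ > 0$ ; an accuracy parameter $ϵ > 0$ ; a barrier update parameter $θ \in (0, 1)$ ; a strictly feasible starting point $(x, y, s)$ and $μ > 0$ such that $Φ (x, s; μ) ≦ τ$ .
while $r μ ≧ ϵ$ do
    $μ : = (1 - θ) μ$ ;
    while $Φ (x, s; μ) ≧ τ$ do
        compute the search direction $(Δ x, Δ y, Δ s)$ from (14) and (8);
        choose a step size α;
        $x : = x + α Δ x$ ; $y : = y + α Δ y$ ; $s : = s + α Δ s$ ;
    end
end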
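The two nested loops described above can be sketched in Python for the simplest special case, which is an assumption of this example and not the general setting of the paper: linear optimization ( $V = R^{n}$ with the componentwise product, so $r = n$ and $v = \sqrt{x s / μ}$ ) with a single equality constraint $a^{T} x = b$ , so that the system (14) reduces to one scalar equation for Δy. The step size is the default $\bar{α}$ stated in Section 3.2. The function name `solve_lo`, its parameters, the iteration cap, and the test problem are all hypothetical.

```python
import math

def psi(t, p, q):
    """Kernel function (11)."""
    return (t * t - 1.0) / 2.0 + (math.exp(p * (t ** -q - 1.0)) - 1.0) / (p * q)

def dpsi(t, p, q):
    """First derivative of (11), computed by hand for this sketch."""
    return t - t ** (-q - 1.0) * math.exp(p * (t ** -q - 1.0))

def solve_lo(a, x, y, s, mu, theta=0.5, tau=1.0, eps=1e-4, p=1.0, q=1.0,
             max_inner=200000):
    """Large-update IPM sketch for: min c.x subject to a.x = b, x >= 0.

    b and c enter only through the strictly feasible start (x, y, s).
    """
    n = len(x)                                   # rank r = n in the LO case
    inner = 0
    while n * mu >= eps:                         # outer iteration
        mu *= 1.0 - theta                        # mu-update
        while True:                              # inner iteration
            v = [math.sqrt(xi * si / mu) for xi, si in zip(x, s)]
            if sum(psi(t, p, q) for t in v) < tau:
                break                            # close enough to the mu-center
            g = [dpsi(t, p, q) for t in v]       # psi'(v)
            # Scaled constraint row: (1 / sqrt(mu)) * a * Q_{p^{-1}} = a * x / (v * mu)
            abar = [ai * xi / (vi * mu) for ai, xi, vi in zip(a, x, v)]
            # System (14) with one constraint: (abar.abar) dy = abar.psi'(v)
            dy = sum(ai * gi for ai, gi in zip(abar, g)) / sum(ai * ai for ai in abar)
            ds = [-ai * dy for ai in abar]
            dx = [-gi - di for gi, di in zip(g, ds)]
            sigma = math.sqrt(sum(gi * gi for gi in g))
            # Default step size of Section 3.2 (requires 3 * sigma > 1, true if tau >= 1)
            alpha = 1.0 / (3.0 * (1.0 + 3.0 * sigma * (1.0 + p * q + q)
                                  * (1.0 + math.log(3.0 * sigma) / p) ** ((q + 1.0) / q)))
            # Map the scaled directions back through (8): Dx = x dx / v, Ds = s ds / v
            x = [xi * (1.0 + alpha * di / vi) for xi, di, vi in zip(x, dx, v)]
            s = [si * (1.0 + alpha * di / vi) for si, di, vi in zip(s, ds, v)]
            y += alpha * dy
            inner += 1
            if inner > max_inner:
                raise RuntimeError("inner iteration limit reached")
    return x, y, s, mu, inner

# Hypothetical test problem: min x1 + 2 x2 + 3 x3  s.t.  x1 + x2 + x3 = 3, x >= 0.
a, b, c = [1.0, 1.0, 1.0], 3.0, [1.0, 2.0, 3.0]
x, y, s, mu, inner = solve_lo(a, x=[1.0, 1.0, 1.0], y=0.0, s=[1.0, 2.0, 3.0], mu=2.0)
gap = sum(xi * si for xi, si in zip(x, s))       # duality gap c.x - b*y
```

The starting point is strictly feasible ( $a^{T} x = 3$ and $s = c - a y > 0$ ), and the optimal value of the test problem is 3, attained at $x = (3, 0, 0)$ . Because the theoretical default step size is conservative, the sketch takes many short inner steps; a practical code would use a line search instead.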

3.1 Bound of the proximity function after μ-update

We have $Ψ (v) ≦ τ$ before the update of μ with the factor $1 - θ$ at the start of each outer iteration. After updating μ in an outer iteration, the vector v is divided by the factor $\sqrt{1 - θ}$ , which in general leads to an increase in the value of $Ψ (v)$ . Then during the inner iteration, the value of $Ψ (v)$ decreases until it passes the threshold τ.

As we mentioned, our kernel function (11) is eligible. To obtain an upper bound for the proximity function after a μ-update in each outer iteration of the algorithm, we use the well-known Lemma 3.1, which follows from the fact that the barrier part of the kernel function is decreasing, instead of theorems based on properties of eligible functions (for example, Theorem 3.2 in [6] and Theorem 5.4 in [20]). The following two lemmas make our analysis of the outer while loop easy: using them, we show in Theorem 3.1 that $Ψ (\frac{1}{\sqrt{1 - θ}} v)$ is bounded above in terms of $Ψ (v)$ .

Lemma 3.1 Let $β ≧ 1$ . Then

ψ (β t) ≦ ψ (t) + \frac{(β^{2} - 1)}{2} t^{2} .

Proof Define $ψ_{b} (t) : = \frac{e^{p (t^{- q} - 1)} - 1}{p q}$ . Then $ψ_{b} (t)$ is monotonically decreasing in t. So, we can easily obtain

\begin{array}{rcl} ψ (β t) & = & \frac{β^{2} t^{2} - 1}{2} + ψ_{b} (β t) \\ & = & \frac{t^{2} - 1}{2} + ψ_{b} (t) + \frac{β^{2} t^{2} - t^{2}}{2} + ψ_{b} (β t) - ψ_{b} (t) \\ & ≦ & ψ (t) + \frac{(β^{2} - 1)}{2} t^{2} . \end{array}

□

Lemma 3.2 For any $v \in Ω$ ,

{∥ v ∥}^{2} ≦ 2 (Ψ (v) + 2 r) .

Proof Since $\frac{e^{p (t^{- q} - 1)}}{p q}$ is positive and $p q ≧ 1$ , the kernel function (11) has a lower bound as follows:

ψ (t) ≧ \frac{t^{2} - 1}{2} - \frac{1}{p q} ≧ \frac{t^{2}}{2} - \frac{1}{2} - 1 ≧ \frac{t^{2}}{2} - 2 .

This implies $\frac{1}{2} \sum_{i = 1}^{r} λ_{i}^{2} (v) ≦ Ψ (v) + 2 r$ . □

Theorem 3.1 Let θ be such that $0 < θ < 1$ . Then, for any $v \in Ω$ ,

Ψ (\frac{1}{\sqrt{1 - θ}} v) ≦ \frac{2}{1 - θ} (Ψ (v) + r) .

Proof From Lemma 3.1 with $β = \frac{1}{\sqrt{1 - θ}}$ and Lemma 3.2,

\begin{array}{rcl} Ψ (\frac{1}{\sqrt{1 - θ}} v) & = & \sum_{i = 1}^{r} ψ (\frac{1}{\sqrt{1 - θ}} λ_{i} (v)) \\ & ≦ & Ψ (v) + \frac{1}{2} (\frac{1}{1 - θ} - 1) {∥ v ∥}^{2} \\ & ≦ & Ψ (v) + \frac{θ}{1 - θ} (Ψ (v) + 2 r) \\ & ≦ & \frac{2}{1 - θ} (Ψ (v) + r), \end{array}

where the last inequality follows from $θ \in (0, 1)$ . □

By the assumption $Ψ (v) ≦ τ$ just before the update of μ,

Ψ (\frac{1}{\sqrt{1 - θ}} v) ≦ \frac{2}{1 - θ} (τ + r) .

We define

L (r, θ, τ) = \frac{2}{1 - θ} (τ + r) .

Since $τ = O (r)$ and $θ = Θ (1)$ ,

L = O (r) .
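The bound of Theorem 3.1 can be checked numerically in the LO special case, where $v \in Ω$ is simply a positive vector and $Ψ (v) = \sum_{i} ψ (v_{i})$ . The sketch below is an illustration under assumed sample ranges and parameters, not part of the paper's analysis; the function names `big_psi` and `update_bound` are hypothetical.

```python
import math
import random

def psi(t, p, q):
    """Kernel function (11)."""
    return (t * t - 1.0) / 2.0 + (math.exp(p * (t ** -q - 1.0)) - 1.0) / (p * q)

def big_psi(v, p, q):
    """Proximity (13) when the eigenvalues of v are its components."""
    return sum(psi(t, p, q) for t in v)

def update_bound(v, theta, p, q):
    """Return (Psi(v / sqrt(1 - theta)), right-hand side of Theorem 3.1)."""
    beta = 1.0 / math.sqrt(1.0 - theta)
    after = big_psi([beta * t for t in v], p, q)
    bound = 2.0 / (1.0 - theta) * (big_psi(v, p, q) + len(v))
    return after, bound

rng = random.Random(0)                     # fixed seed for reproducibility
samples = []
for _ in range(200):
    v = [rng.uniform(0.5, 2.0) for _ in range(5)]
    theta = rng.uniform(0.05, 0.95)
    samples.append(update_bound(v, theta, 1.0, 2.0))
all_hold = all(after <= bound for after, bound in samples)
```

In these samples the bound is far from tight, which is consistent with its role as a worst-case estimate of the growth of the proximity after a μ-update.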

3.2 Determining a default step size

In this section, we compute a feasible step size α for which the proximity function decreases, and we bound this decrease during inner iterations. Our default step size is $\bar{α} = {(3 (1 + 3 σ (1 + p q + q) {(1 + p^{- 1} log 3 σ)}^{(q + 1) / q}))}^{- 1}$ . We will show that this step size not only keeps the iterates feasible but also gives rise to a sufficiently large decrease of the barrier function $Ψ (v)$ in each inner iteration. Let us denote the difference between the proximities after and before one step, as a function of the step size, by

f (α) : = Ψ (v_{+}) - Ψ (v) .

(16)

The main task in the rest of this section is to study the decreasing behavior of $f (α)$ .

Now, in equation (16), $v_{+}$ and $p_{+}$ are obtained from (8) and (9) with x and s replaced by $x_{+} : = x + α Δ x$ and $s_{+} : = s + α Δ s$ , respectively, that is,

v_{+} : = \frac{1}{\sqrt{μ}} Q_{p_{+}} (x + α Δ x) = \frac{1}{\sqrt{μ}} Q_{p_{+}^{- 1}} (s + α Δ s) .

Lemma 3.3 (Proposition II.3.3 in [22])

Let x and s be elements in V. Then

(i)
${(Q_{x} s)}^{- 1} = Q_{x^{- 1}} s^{- 1}$ if x and s are invertible.
(ii)
$Q_{Q_{s} x} = Q_{s} Q_{x} Q_{s}$ .

Lemma 3.4 ([16])

Let $x, s, p \in Ω$ . Then

(i)
$Q_{x^{1 / 2}} s$ and $Q_{s^{1 / 2}} x$ have the same eigenvalues.
(ii)
$Q_{x^{1 / 2}} s$ and $Q_{{(Q_{p} x)}^{1 / 2}} (Q_{p^{- 1}} s)$ have the same eigenvalues.

The following proposition was given by Vieira in [20] (see Proposition 5.6 in [20]), but we provide its proof using Lemma 3.3 and Lemma 3.4.

Proposition 3.1 Let Ψ be the proximity function defined in (13). Then we have

Ψ (v_{+}) = Ψ ({(Q_{{(v + α d x)}^{1 / 2}} (v + α d s))}^{1 / 2}) .

Proof From $Q_{p^{- 1}} s = Q_{p} x$ and (i) in Lemma 3.4, we know that $Q_{p} x$ and $Q_{x^{1 / 2}} p^{2}$ have the same eigenvalues. By the definition of p and (i) in Lemma 3.3,

Q_{x^{1 / 2}} p^{2} = Q_{x^{1 / 2}} {(Q_{x^{1 / 2}} {(Q_{x^{1 / 2}} s)}^{- 1 / 2})}^{- 1} = Q_{x^{1 / 2}} Q_{x^{- 1 / 2}} {(Q_{x^{1 / 2}} s)}^{1 / 2} = {(Q_{x^{1 / 2}} s)}^{1 / 2} .

In the same way, $Q_{p_{+}^{- 1}} s_{+}$ and ${(Q_{x_{+}^{1 / 2}} s_{+})}^{1 / 2}$ have the same eigenvalues, where $\sqrt{μ} v_{+} = Q_{p_{+}^{- 1}} s_{+}$ . By the definition (8), $x_{+} = \sqrt{μ} Q_{p^{- 1}} (v + α d x)$ and $s_{+} = \sqrt{μ} Q_{p} (v + α d s)$ , so that ${(Q_{x_{+}^{1 / 2}} s_{+})}^{1 / 2} = \sqrt{μ} {(Q_{{(Q_{p^{- 1}} (v + α d x))}^{1 / 2}} (Q_{p} (v + α d s)))}^{1 / 2}$ . By (ii) in Lemma 3.4, this element and $\sqrt{μ} {(Q_{{(v + α d x)}^{1 / 2}} (v + α d s))}^{1 / 2}$ have the same eigenvalues. Hence $v_{+}$ and ${(Q_{{(v + α d x)}^{1 / 2}} (v + α d s))}^{1 / 2}$ have the same eigenvalues, and the equality for the proximity function follows. □

Then Proposition 2.1 and Proposition 3.1 imply the following inequality:

Ψ (v_{+}) ≦ \frac{1}{2} Ψ (v + α d x) + \frac{1}{2} Ψ (v + α d s) .

So, we can define $f_{1} (α)$ :

f (α) ≦ f_{1} (α) : = \frac{1}{2} (Ψ (v + α d x) + Ψ (v + α d s)) - Ψ (v) .

To facilitate the forthcoming analysis, we also define, for any $x \in V$ ,

λ_{min} (x) : = min {λ_{i} (x) ∣ i = 1, \dots, r} .

The following lemma, which is obtained from Lemma 14 in [16], gives a common lower bound for the eigenvalues of $v + α d x$ and $v + α d s$ ; in particular, it shows that $v + α d x \in Ω$ and $v + α d s \in Ω$ for the step sizes α considered.

Lemma 3.5 For any $α \in (0, \frac{λ_{min} (v)}{σ})$ ,

λ_{min} (v + α d x) ≧ λ_{min} (v) - α σ and λ_{min} (v + α d s) ≧ λ_{min} (v) - α σ,

where σ is a number defined in (15).

Proof Let α be a fixed number in $(0, \frac{λ_{min} (v)}{σ})$ . From Lemma 14 in [16],

λ_{min} (v + α d x) ≧ λ_{min} (v) - α ∥ d x ∥ .

Since $σ ≧ ∥ d x ∥$ , we have

λ_{min} (v + α d x) ≧ λ_{min} (v) - α σ .

Similarly, we obtain

λ_{min} (v + α d s) ≧ λ_{min} (v) - α σ .

□

The proof of the following proposition can be found in [18], but for completeness, we give a detailed proof.

Proposition 3.2 ([18])

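Suppose that the functions $ψ (x)$ and $Ψ (x)$ are defined by (2) and (13), respectively. Then, for any $α \in (0, \frac{λ_{min} (v)}{σ})$ ,

\frac{d}{d α} f_{1} (α) = \frac{1}{2} tr (ψ^{'} (v + α d x) \circ d x) + \frac{1}{2} tr (ψ^{'} (v + α d s) \circ d s)

and

\frac{d^{2}}{d α^{2}} f_{1} (α) ≦ \frac{3}{2} max {Δ ψ^{'} (λ_{i} (v + α d x), λ_{j} (v + α d x)) ∣ i, j = 1, \dots, r} {∥ d x ∥}^{2} + \frac{3}{2} max {Δ ψ^{'} (λ_{i} (v + α d s), λ_{j} (v + α d s)) ∣ i, j = 1, \dots, r} {∥ d s ∥}^{2} ,

where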

Δ ψ^{'} (λ_{i} (\cdot), λ_{j} (\cdot)) = \begin{cases} ψ^{″} (λ_{i} (\cdot)) & \text{if } λ_{i} (\cdot) = λ_{j} (\cdot), \\ \frac{ψ^{'} (λ_{i} (\cdot)) - ψ^{'} (λ_{j} (\cdot))}{λ_{i} (\cdot) - λ_{j} (\cdot)} & \text{if } λ_{i} (\cdot) \neq λ_{j} (\cdot) . \end{cases}

Proof Using Lemma 3.1 in [21], we have

(17)

Then we have

\begin{array}{rcl} \frac{d}{d α} tr (ψ (v + α d x)) & = & \frac{d}{d α} 〈 ψ (v + α d x), e 〉 \\ & = & 〈 \frac{d}{d α} ψ (v + α d x), e 〉 = tr (\frac{d}{d α} ψ (v + α d x)) \\ & = & tr (\sum_{i = 1}^{r} ψ^{'} (λ_{i} (v + α d x)) 〈 c_{i} (v + α d x), d x 〉 c_{i} (v + α d x)) \\ & & \text{(by associativity of trace)} \\ & = & \sum_{i = 1}^{r} ψ^{'} (λ_{i} (v + α d x)) 〈 c_{i} (v + α d x), d x 〉 tr (c_{i} (v + α d x)) \\ & = & 〈 \sum_{i = 1}^{r} ψ^{'} (λ_{i} (v + α d x)) c_{i} (v + α d x), d x 〉 tr (c_{i} (v + α d x)) . \end{array}

Then from Baes [17, 30] we know that $tr (c_{i} (v + α d x)) = 1$ , and hence, from the definition (3), we get

\frac{d}{d α} tr (ψ (v + α d x)) = tr (ψ^{'} (v + α d x) \circ d x) .

Thus, we have

\frac{d}{d α} f_{1} (α) = \frac{1}{2} tr (ψ^{'} (v + α d x) \circ d x) + \frac{1}{2} tr (ψ^{'} (v + α d s) \circ d s) .

So, the first equality holds.

For the inequality, we use (17) with ψ replaced by $ψ^{'}$ :

\begin{matrix} \frac{d^{2}}{d α^{2}} tr (ψ (v + α d x)) \\ = \frac{d}{d α} tr (ψ^{'} (v + α d x) \circ d x) \\ = tr ((\frac{d}{d α} ψ^{'} (v + α d x)) \circ d x) \\ = tr ((\sum_{i = 1}^{r} Δ ψ^{'} (λ_{i} (v + α d x), λ_{i} (v + α d x)) 〈 c_{i} (v + α d x), d x 〉 c_{i} (v + α d x) \\ + \sum_{1 ≦ j < l ≦ r} 4 Δ ψ^{'} (λ_{j} (v + α d x), λ_{l} (v + α d x)) \\ \times c_{j} (v + α d x) \circ (c_{l} (v + α d x) \circ d x)) \circ d x) . \end{matrix}

Here, let $d x = \sum_{j = 1}^{r} λ_{j} (d x) c_{j} (d x)$ . Then we have

\begin{matrix} \sum_{i = 1}^{r} {(tr (c_{i} (v + α d x) \circ d x))}^{2} \\ = \sum_{i = 1}^{r} {(tr (\sum_{j = 1}^{r} λ_{j} (d x) c_{i} (v + α d x) \circ c_{j} (d x)))}^{2} \\ = \sum_{i = 1}^{r} {(\sum_{j = 1}^{r} λ_{j} (d x) tr (c_{i} (v + α d x) \circ c_{j} (d x)))}^{2} . \end{matrix}

Since $c_{i} (v + α d x)$ and $c_{j} (d x)$ belong to $\bar{Ω}$ , which is a self-dual cone, we have

tr (c_{i} (v + α d x) \circ c_{j} (d x)) ≧ 0, for i, j = 1, \dots, r .

Furthermore, $\sum_{j = 1}^{r} tr (c_{i} (v + α d x) \circ c_{j} (d x)) = tr (c_{i} (v + α d x)) = 1$ . Hence, by the convexity of $t \mapsto t^{2}$ (Jensen's inequality), we have

\begin{array}{rcl} \sum_{i = 1}^{r} {(tr (c_{i} (v + α d x) \circ d x))}^{2} & = & \sum_{i = 1}^{r} {(\sum_{j = 1}^{r} tr (c_{i} (v + α d x) \circ c_{j} (d x)) λ_{j} (d x))}^{2} \\ & ≦ & \sum_{i = 1}^{r} (\sum_{j = 1}^{r} λ_{j}^{2} (d x) tr (c_{i} (v + α d x) \circ c_{j} (d x))) \\ & = & \sum_{j = 1}^{r} (λ_{j}^{2} (d x) \sum_{i = 1}^{r} tr (c_{i} (v + α d x) \circ c_{j} (d x))) . \end{array}

Since $\sum_{i = 1}^{r} tr (c_{i} (v + α d x) \circ c_{j} (d x)) = tr (c_{j} (d x)) = 1$ , we have

\sum_{i = 1}^{r} {(tr (c_{i} (v + α d x) \circ d x))}^{2} ≦ {∥ d x ∥}^{2} .

(18)
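As a numerical sanity check of (18), the following sketch specializes to the Jordan algebra of real symmetric matrices. This specialization is an assumption made here for illustration only: in that algebra $x \circ y = (x y + y x) / 2$, every Jordan frame has the form $c_{i} = u_{i} u_{i}^{T}$ for an orthonormal basis $u_{1}, \dots, u_{r}$, so that $tr (c_{i} \circ d x) = u_{i}^{T} d x u_{i}$, and $∥ d x ∥$ is the Frobenius norm. All function names are hypothetical.

```python
import math
import random

def orthonormal_basis(n, rng):
    """Build n orthonormal vectors by Gram-Schmidt on random Gaussian vectors."""
    basis = []
    while len(basis) < n:
        v = [rng.gauss(0.0, 1.0) for _ in range(n)]
        for u in basis:
            dot = sum(a * b for a, b in zip(u, v))
            v = [a - dot * b for a, b in zip(v, u)]
        norm = math.sqrt(sum(a * a for a in v))
        if norm > 1e-8:  # skip (rare) nearly dependent draws
            basis.append([a / norm for a in v])
    return basis

def random_symmetric(n, rng):
    """Random symmetric matrix standing in for the direction dx."""
    m = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(n)]
    return [[0.5 * (m[i][j] + m[j][i]) for j in range(n)] for i in range(n)]

def both_sides_of_18(dx, frame):
    """Return (sum_i tr(c_i o dx)^2, ||dx||^2) for c_i = u_i u_i^T."""
    n = len(dx)
    lhs = 0.0
    for u in frame:
        # tr(c_i o dx) reduces to the quadratic form u^T dx u
        quad = sum(u[i] * dx[i][j] * u[j] for i in range(n) for j in range(n))
        lhs += quad * quad
    rhs = sum(dx[i][j] ** 2 for i in range(n) for j in range(n))
    return lhs, rhs

rng = random.Random(0)
frame = orthonormal_basis(4, rng)
dx = random_symmetric(4, rng)
lhs, rhs = both_sides_of_18(dx, frame)
print(lhs <= rhs + 1e-12)
```

In this matrix setting, equality in (18) holds when dx is diagonal in the basis defining the frame, which matches the role of the weights $tr (c_{i} (v + α d x) \circ c_{j} (d x))$ in the argument above.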

Now, we decompose dx according to Lemma 1.1 as $d x = \sum_{1 ≦ i ≦ k ≦ r} d x_{i k}$ with respect to the system of idempotents $\{c_{1} (v + α d x), \dots, c_{r} (v + α d x)\}$ . Then, for $j < l$ ,

This means, for each $j < l$ ,

tr ((c_{j} (v + α d x) \circ d x) \circ (c_{l} (v + α d x) \circ d x)) ≧ 0 .

Moreover, we have,

(19)

Since ${(tr (c_{i} (v + α d x) \circ d x))}^{2}$ is nonnegative for each i, and $tr ((c_{j} (v + α d x) \circ d x) \circ (c_{l} (v + α d x) \circ d x))$ is nonnegative for each j, l with $j < l$ , we get from (18) and (19)

\frac{d^{2}}{d α^{2}} tr (ψ (v + α d x)) ≦ 3 \max \{Δ ψ^{'} (λ_{i} (v + α d x), λ_{j} (v + α d x)) ∣ i, j = 1, \dots, r\} {∥ d x ∥}^{2} .

Similarly,

\frac{d^{2}}{d α^{2}} tr (ψ (v + α d s)) ≦ 3 \max \{Δ ψ^{'} (λ_{i} (v + α d s), λ_{j} (v + α d s)) ∣ i, j = 1, \dots, r\} {∥ d s ∥}^{2} .

From the definition of $f_{1} (α)$ ,

\frac{d^{2}}{d α^{2}} f_{1} (α) = \frac{d^{2}}{d α^{2}} (\frac{1}{2} tr (ψ (v + α d x)) + \frac{1}{2} tr (ψ (v + α d s))) .

Thus, we have the conclusion. □

The next result gives an upper bound for the second derivative of $f_{1} (α)$ , which is used to establish the polynomial complexity of the algorithm.

Proposition 3.3 For any $α \in (0, \frac{λ_{min} (v)}{σ})$ ,

f_{1}^{″} (α) ≦ \frac{3}{2} ψ^{″} (λ_{min} (v) - α σ) σ^{2} .

Proof Since $ψ^{″} (t)$ is a decreasing function on $t \in (0, \infty)$ , using Lemma 3.5 and the mean value theorem, we have

ψ^{″} (λ_{min} (v) - α σ) ≧ \max \{Δ ψ^{'} (λ_{i} (v + α d x), λ_{j} (v + α d x)) ∣ i, j = 1, \dots, r\}

and

ψ^{″} (λ_{min} (v) - α σ) ≧ \max \{Δ ψ^{'} (λ_{i} (v + α d s), λ_{j} (v + α d s)) ∣ i, j = 1, \dots, r\} .

Thus, by Proposition 3.2,

\begin{array}{rcl} \frac{d^{2}}{d α^{2}} f_{1} (α) & ≦ & \frac{3}{2} ψ^{″} (λ_{min} (v) - α σ) {∥ d x ∥}^{2} + \frac{3}{2} ψ^{″} (λ_{min} (v) - α σ) {∥ d s ∥}^{2} \\ = & \frac{3}{2} ψ^{″} (λ_{min} (v) - α σ) σ^{2} . \end{array}

□

We can easily check that $f_{1} (0) = 0$ and $f_{1}^{'} (0) = - \frac{σ^{2}}{2}$ . By Proposition 3.3, we obtain an upper bound $f_{2} (α)$ for $f_{1} (α)$ as follows:

\begin{array}{rcl} f_{1} (α) & = & f_{1} (0) + {f_{1}}^{'} (0) α + \int_{0}^{α} \int_{0}^{ξ} f_{1}^{″} (ζ) d ζ d ξ \\ & ≦ & f_{2} (α) : = f_{1} (0) + {f_{1}}^{'} (0) α + \frac{3}{2} σ^{2} \int_{0}^{α} \int_{0}^{ξ} ψ^{″} (λ_{min} (v) - ζ σ) d ζ d ξ . \end{array}

Note that $f_{2} (0) = 0$ . Furthermore, since $f_{2}^{'} (α) = - \frac{σ^{2}}{2} + \frac{3 σ}{2} (ψ^{'} (λ_{min} (v)) - ψ^{'} (λ_{min} (v) - α σ))$ , we have $f_{2}^{'} (0) = - \frac{σ^{2}}{2}$ , which equals $f_{1}^{'} (0)$ , and $f_{2}^{″} (α) = \frac{3 σ^{2}}{2} ψ^{″} (λ_{min} (v) - α σ)$ , which is increasing on $α \in [0, \frac{λ_{min} (v)}{σ})$ . Using $f_{1}^{'} (0) = f_{2}^{'} (0)$ and $f_{1}^{″} (α) ≦ f_{2}^{″} (α)$ , we can easily check that

f_{1}^{'} (α) = f_{1}^{'} (0) + \int_{0}^{α} f_{1}^{″} (ξ) d ξ ≦ f_{2}^{'} (α) .

This relation gives that

f_{1}^{'} (α) ≦ 0, if f_{2}^{'} (α) ≦ 0 .

To find a feasible step size α for which the proximity measure decreases when we take a new iterate with μ fixed, we look for the largest α satisfying $f_{2}^{'} (α) ≦ 0$ . Since $f_{2}^{″} (α) > 0$ , that is, $f_{2}^{'} (α)$ is monotonically increasing in α, the largest value of α satisfying $f_{2}^{'} (α) ≦ 0$ occurs when $f_{2}^{'} (α) = 0$ , that is,

- ψ^{'} (λ_{min} (v) - α σ) + ψ^{'} (λ_{min} (v)) = \frac{σ}{3} .

(20)

Since $ψ^{″} (t)$ is monotonically decreasing, the derivative of the left-hand side in (20) with respect to $λ_{min} (v)$ is

- ψ^{″} (λ_{min} (v) - α σ) + ψ^{″} (λ_{min} (v)) < 0 .

So, the left-hand side of (20) is decreasing in $λ_{min} (v)$ . This implies that, for fixed σ, α gets smaller as $λ_{min} (v)$ becomes smaller. Note that

σ = \sqrt{\sum_{i = 1}^{r} {(ψ^{'} (λ_{i} (v)))}^{2}} ≧ | ψ^{'} (λ_{min} (v)) | ≧ - ψ^{'} (λ_{min} (v))

and equality holds if and only if $λ_{min} (v)$ is the only coordinate in $(λ_{1} (v), \dots, λ_{r} (v))$ which is different from 1 and $λ_{min} (v) < 1$ , that is, $ψ^{'} (λ_{min} (v)) < 0$ . Hence, the worst situation for the largest step size occurs when $λ_{min} (v)$ satisfies

- ψ^{'} (λ_{min} (v)) = σ .

(21)

In that case, the largest α satisfying (20) is minimal. For our purpose, we need to deal with the worst case, and so we assume that (21) holds.

From now on, we denote by $ρ : [0, \infty) \to (0, 1]$ the inverse function of the restriction of $- ψ^{'} (t)$ to the interval $(0, 1]$ . Then (21) implies

λ_{min} (v) = ρ (σ) .

(22)

By using (20) and (21), we immediately obtain

- ψ^{'} (λ_{min} (v) - α σ) = \frac{4}{3} σ .

By the definition of ρ and (22), the largest step size α in the worst case is given as follows:

α^{*} = \frac{ρ (σ) - ρ (\frac{4}{3} σ)}{σ} .

(23)

To find an upper bound for $f (α)$ , we need a default step size $\bar{α}$ that is a lower bound for $α^{*}$ and depends only on σ.

Lemma 3.6 Let $σ ≧ 1$ . Then, for $0 < t ≦ ρ (\frac{4}{3} σ)$ ,

ψ^{″} (t) ≦ 1 + 3 σ (1 + p q + q) {(1 + \frac{1}{p} log 3 σ)}^{\frac{q + 1}{q}} .

Proof From $ψ^{'} (t) = t - t^{- q - 1} \cdot e^{p (t^{- q} - 1)}$ , let $- ψ_{b}^{'} (t) = t^{- q - 1} \cdot e^{p (t^{- q} - 1)}$ and let $\underset{̲}{ρ} : [1, \infty) \to (0, 1]$ denote the inverse function of the restriction of $- ψ_{b}^{'} (t)$ to the interval $(0, 1]$ . Let $ρ (\frac{4}{3} σ) = \tilde{t}$ . Then $0 < \tilde{t} ≦ 1$ and $\frac{4}{3} σ = - ψ^{'} (\tilde{t}) = - \tilde{t} - ψ_{b}^{'} (\tilde{t})$ . So, $- ψ_{b}^{'} (\tilde{t}) = \tilde{t} + \frac{4}{3} σ ≦ 1 + 2 σ ≦ 3 σ$ . Since $\underset{̲}{ρ}$ is a decreasing function, $(ρ (\frac{4}{3} σ) = \tilde{t} =) \underset{̲}{ρ} (- ψ_{b}^{'} (\tilde{t})) ≧ \underset{̲}{ρ} (3 σ)$ . Let $\underset{̲}{ρ} (3 σ) = \hat{t}$ . Then

3 σ = - ψ_{b}^{'} (\hat{t}) = {(\underset{̲}{ρ} (3 σ))}^{- q - 1} \cdot e^{p ({(\underset{̲}{ρ} (3 σ))}^{- q} - 1)}

(24)

implies

(25)

the last inequality comes from $\hat{t} \in (0, 1]$ and (25). □

Now, we present a lower bound of the value of $α^{*}$ .

Theorem 3.2 Let $α^{*}$ be as defined in (23). Then

α^{*} ≧ \frac{1}{3 (1 + 3 σ (1 + p q + q) {(1 + \frac{1}{p} log 3 σ)}^{\frac{q + 1}{q}})} .

Proof Since $- ψ^{'} (ρ (σ)) = σ$ , differentiating both sides with respect to σ, we get

ρ^{'} (σ) = - \frac{1}{ψ^{″} (ρ (σ))} .

Moreover, we have

α^{*} = \frac{1}{σ} \int_{\frac{4}{3} σ}^{σ} ρ^{'} (ξ) d ξ = \frac{1}{σ} \int_{σ}^{\frac{4}{3} σ} \frac{1}{ψ^{″} (ρ (ξ))} d ξ ≧ \frac{1}{σ} {[\frac{ξ}{ψ^{″} (ρ (\frac{4}{3} σ))}]}_{σ}^{\frac{4}{3} σ} = \frac{1}{3 ψ^{″} (ρ (\frac{4}{3} σ))},

where the inequality follows from $σ ≦ ξ ≦ \frac{4}{3} σ$ and the fact that ρ and $ψ^{″}$ are monotonically decreasing. The claim then follows from Lemma 3.6. □

To obtain a default step size for the algorithm for the SOP, we define $\bar{α}$ as follows:

\bar{α} = \frac{1}{3 (1 + 3 σ (1 + p q + q) {(1 + \frac{1}{p} log 3 σ)}^{\frac{q + 1}{q}})} .

(26)

We will use $\bar{α}$ as the default step size in our algorithm.
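The relation between $α^{*}$ in (23) and $\bar{α}$ in (26) can be explored numerically. The sketch below is illustrative only: it evaluates ρ by bisection on $(0, 1]$, using the expression for $ψ^{'} (t)$ given in the proof of Lemma 3.6, and takes log to be the natural logarithm. The function names, the bisection scheme, and the sample values of σ, p and q are assumptions made here, not part of the algorithm in the paper.

```python
import math

def neg_dpsi(t, p, q):
    """-psi'(t) = t^(-q-1) * exp(p * (t^(-q) - 1)) - t, as in the proof of Lemma 3.6."""
    try:
        return t ** (-q - 1) * math.exp(p * (t ** (-q) - 1.0)) - t
    except OverflowError:
        return math.inf

def rho(s, p, q, iters=200):
    """Inverse of -psi' restricted to (0, 1], found by bisection (s >= 0)."""
    lo, hi = 1.0, 1.0
    while neg_dpsi(lo, p, q) < s:  # move lo left until -psi'(lo) >= s
        lo *= 0.5
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if neg_dpsi(mid, p, q) > s:
            lo = mid  # -psi' is decreasing, so the root lies to the right
        else:
            hi = mid
    return 0.5 * (lo + hi)

def alpha_star(sigma, p, q):
    """Worst-case largest step size, equation (23)."""
    return (rho(sigma, p, q) - rho(4.0 * sigma / 3.0, p, q)) / sigma

def alpha_bar(sigma, p, q):
    """Default step size, equation (26)."""
    growth = (1.0 + math.log(3.0 * sigma) / p) ** ((q + 1.0) / q)
    return 1.0 / (3.0 * (1.0 + 3.0 * sigma * (1.0 + p * q + q) * growth))

# Compare the default step size with the worst-case step size for sample sigma >= 1.
for sigma in (1.0, 2.0, 5.0, 20.0):
    print(sigma, alpha_bar(sigma, 1.0, 1.0), alpha_star(sigma, 1.0, 1.0))
```

For each σ ≧ 1 the second printed number should not exceed the third, in line with Theorem 3.2. The closed form (26) needs no root finding, which is what makes it convenient as a default step size.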

3.3 Decrease of the proximity function during an inner iteration

Now, we show that our proximity function Ψ with our default step size $\bar{α}$ is decreasing. It can be easily established by using the following result.

Lemma 3.7 ([4])

Let $h (t)$ be a twice differentiable convex function with $h (0) = 0$ , $h^{'} (0) < 0$ and let $h (t)$ attain its (global) minimum at $t^{*} > 0$ . If $h^{″} (t)$ is increasing for $t \in [0, t^{*}]$ , then

h (t) ≦ \frac{t h^{'} (0)}{2}, 0 ≦ t ≦ t^{*} .

Since $f_{2} (α)$ satisfies the assumptions of the above lemma,

f (α) ≦ f_{1} (α) ≦ f_{2} (α) ≦ \frac{f_{2}^{'} (0)}{2} α for all 0 ≦ α ≦ α^{*} .

Since $f_{2}^{'} (0) = - \frac{σ^{2}}{2}$ , we can obtain the upper bound for the decreasing value of the proximity in the inner iteration by Lemma 3.7.

Theorem 3.3 Let $\bar{α}$ be the default step size as defined in (26). Then we have

f (\bar{α}) ≦ - \frac{1}{6} \cdot \frac{\sqrt{Ψ}}{1 + 3 \sqrt{2} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2 Ψ_{0}})}^{\frac{q + 1}{q}}} .

Proof Since $f_{2}^{'} (0) = - \frac{σ^{2}}{2}$ and $\bar{α} \in [0, α^{*}]$ , we have

\begin{array}{rcl} f (\bar{α}) & ≦ & \frac{1}{2} \bar{α} f_{2}^{'} (0) = \frac{1}{2} \cdot \frac{1}{3 (1 + 3 σ (1 + p q + q) {(1 + \frac{1}{p} log 3 σ)}^{\frac{q + 1}{q}})} \cdot (- \frac{σ^{2}}{2}) \\ = & - \frac{1}{12} \cdot \frac{σ^{2}}{1 + 3 σ (1 + p q + q) {(1 + \frac{1}{p} log 3 σ)}^{\frac{q + 1}{q}}} . \end{array}

This expresses the decrease in one inner iteration in terms of σ. Since the decrease depends monotonically on σ, we can express the decrease in terms of $Ψ = Ψ (v)$ by Lemma 2.1 as follows:

\begin{array}{rcl} f (\bar{α}) & ≦ & - \frac{1}{6} \cdot \frac{Ψ}{1 + 3 \sqrt{2 Ψ} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2 Ψ})}^{\frac{q + 1}{q}}} \\ ≦ & - \frac{1}{6} \cdot \frac{\sqrt{Ψ} \cdot \sqrt{Ψ}}{\sqrt{Ψ} + 3 \sqrt{2 Ψ} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2 Ψ_{0}})}^{\frac{q + 1}{q}}} \\ = & - \frac{1}{6} \cdot \frac{\sqrt{Ψ}}{1 + 3 \sqrt{2} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2 Ψ_{0}})}^{\frac{q + 1}{q}}}, \end{array}

where the second inequality follows from $Ψ_{0} ≧ Ψ ≧ τ ≧ 1$ . This proves the theorem. □

3.4 Iteration bound

We need to count how many inner iterations are required to return to the situation where $Ψ (v) ≦ τ$ after a μ-update. We denote the value of $Ψ (v)$ after μ-update as $Ψ_{0}$ ; the subsequent values in the same outer iteration are denoted as $Ψ_{k}$ , $k = 1, \dots$ . If K denotes the total number of inner iterations in the outer iteration, then we have

Ψ_{0} ≦ L (r, θ, τ) = O (r), Ψ_{K - 1} > τ, 0 ≦ Ψ_{K} ≦ τ

and according to Theorem 3.3,

Ψ_{k + 1} ≦ Ψ_{k} - \frac{1}{6 + 18 \sqrt{2} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2 Ψ_{0}})}^{\frac{q + 1}{q}}} Ψ_{k}^{\frac{1}{2}} .

At this stage, we invoke Lemma 14 in [4].

Lemma 3.8 ([4])

Let $t_{0}, t_{1}, \dots, t_{K}$ be a sequence of positive numbers such that

t_{k + 1} ≦ t_{k} - β t_{k}^{1 - γ}, k = 0, 1, \dots, K - 1,

where $β > 0$ and $0 < γ ≦ 1$ . Then

K ≦ \frac{t_{0}^{γ}}{β γ} .
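The bound in Lemma 3.8 can be checked on the slowest admissible sequence, namely the one for which the recursion holds with equality. The following sketch is illustrative only: the stopping threshold `tau`, the function names and the sample values are assumptions made here, and $0 < β < 1$ with `tau` at least 1 is assumed so that all iterates stay positive.

```python
def count_steps(t0, beta, gamma, tau=1.0):
    """Count steps of t_{k+1} = t_k - beta * t_k**(1 - gamma) until t_k <= tau."""
    t, k = t0, 0
    while t > tau:
        t -= beta * t ** (1.0 - gamma)
        k += 1
    return k

def lemma_bound(t0, beta, gamma):
    """Upper bound K <= t0**gamma / (beta * gamma) from Lemma 3.8."""
    return t0 ** gamma / (beta * gamma)

t0, beta, gamma = 100.0, 0.1, 0.5
print(count_steps(t0, beta, gamma), lemma_bound(t0, beta, gamma))
```

With $γ = \frac{1}{2}$ each step reduces $\sqrt{t_{k}}$ by slightly more than $β / 2$, which is why the number of steps stays below $2 \sqrt{t_{0}} / β$.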

Letting $t_{k} = Ψ_{k}$ , $β = \frac{1}{6 + 18 \sqrt{2} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2 Ψ_{0}})}^{\frac{q + 1}{q}}}$ and $γ = \frac{1}{2}$ , we can get the following lemma from Lemma 3.8.

Lemma 3.9 Let K be the total number of inner iterations in the outer iteration. Then we have

K ≦ 2 (6 + 18 \sqrt{2} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2} \sqrt{Ψ_{0}})}^{\frac{q + 1}{q}}) Ψ_{0}^{1 / 2},

where $Ψ_{0}$ is the value of $Ψ (v)$ after the μ-update in the outer iteration.

Now, we estimate the total number of iterations of our algorithm.

Theorem 3.4 If $τ ≧ 1$ and $0 < θ < 1$ , the total number of iterations is not more than

⌈ 2 (6 + 18 \sqrt{2} (1 + p q + q) {(1 + \frac{1}{p} log 3 \sqrt{2} \sqrt{Ψ_{0}})}^{\frac{q + 1}{q}}) Ψ_{0}^{1 / 2} ⌉ ⌈ \frac{1}{θ} log \frac{r}{ϵ} ⌉ .

Proof In the algorithm, $r μ ≧ ϵ$ , $μ_{k} : = {(1 - θ)}^{k} μ_{0}$ and $μ_{0} = 1$ . By simple computation, we have

k ≦ \frac{1}{θ} log \frac{r}{ϵ} .

Therefore, the number of outer iterations is bounded above by

\frac{1}{θ} log \frac{r}{ϵ} .

Multiplying this bound by the bound on K in Lemma 3.9 yields the theorem. □

Since $Ψ_{0}^{1 / 2} = O (\sqrt{r})$ , if we take $p = O (log r)$ and $q = 1$ , then we obtain the best known upper bound for the number of inner iterations in an outer iteration:

O (\sqrt{r} log r) .

Also, we take θ to be a constant (not depending on r), namely $\frac{1}{θ} = Θ (1)$ . With $τ = O (r)$ , the best complexity of the primal-dual interior-point method for a linear optimization problem based on our new proximity function with $p = log r$ and $q = 1$ is given by

O (\sqrt{r} log r log \frac{r}{ϵ}) .
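To get a feel for the size of the bound in Theorem 3.4, the following sketch evaluates it for given parameters. It is illustrative only: the function names are hypothetical, log is taken to be the natural logarithm, and the value used for $Ψ_{0}$ is a stand-in for a quantity of order r, not a value from the paper.

```python
import math

def inner_iteration_bound(psi0, p, q):
    """Bound on K from Lemma 3.9, given Psi_0, p and q."""
    growth = (1.0 + math.log(3.0 * math.sqrt(2.0) * math.sqrt(psi0)) / p) ** ((q + 1.0) / q)
    return 2.0 * (6.0 + 18.0 * math.sqrt(2.0) * (1.0 + p * q + q) * growth) * math.sqrt(psi0)

def total_iteration_bound(psi0, p, q, theta, r, eps):
    """Product of the inner and outer bounds as in Theorem 3.4."""
    outer = math.ceil(math.log(r / eps) / theta)
    return math.ceil(inner_iteration_bound(psi0, p, q)) * outer

r, eps, theta = 100, 1e-6, 0.5
psi0 = float(r)  # illustrative stand-in for Psi_0 = O(r); not a value from the paper
print(total_iteration_bound(psi0, math.log(r), 1.0, theta, r, eps))
```

With $p = log r$ and $q = 1$ the factor $(1 + p q + q)$ grows like log r while the power term stays bounded when $Ψ_{0} = O (r)$, which is the source of the $\sqrt{r} log r$ behaviour of the inner bound.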

References

Andersen ED, Gondzio J, Mészáros C, Xu X: Implementation of interior point methods for large scale linear programming. In Interior Point Methods of Mathematical Programming. Edited by: Terlaky T. Kluwer Academic, Dordrecht; 1996:189–252.
Renegar J: A Mathematical View of Interior-Point Methods in Convex Optimization. MPS/SIAM Ser. Optim. SIAM, Philadelphia; 2001.
Peng J, Roos C, Terlaky T: Primal-dual interior-point methods for second-order conic optimization based on self-regular proximities. SIAM J. Optim. 2002, 13: 179–203. 10.1137/S1052623401383236
Peng J, Roos C, Terlaky T: Self-regular functions and new search directions for linear and semidefinite optimization. Math. Program. 2002, 93: 129–171. 10.1007/s101070200296
Peng J, Roos C, Terlaky T: Self-Regularity: A New Paradigm for Primal-Dual Interior-Point Algorithms. Princeton University Press, Princeton; 2002.
Bai YQ, Ghami ME, Roos C: A comparative study of kernel functions for primal-dual interior-point algorithms in linear optimization. SIAM J. Optim. 2004, 15: 101–128. 10.1137/S1052623403423114
Wang GQ, Bai YQ, Roos C: Primal-dual interior-point algorithms for semidefinite optimization based on a simple kernel function. J. Math. Model. Algorithms 2005, 4: 409–433. 10.1007/s10852-005-3561-3
Bai YQ, Wang GQ: Primal-dual interior-point algorithms for second-order cone optimization based on a new parametric kernel function. Acta Math. Sin. Engl. Ser. 2007, 23: 2027–2042. 10.1007/s10114-007-0967-z
Choi BK, Lee GM: On complexity analysis of the primal-dual interior-point methods for semidefinite optimization problem based on a new proximity function. Nonlinear Anal., Theory Methods Appl. 2009, 71: e2628-e2640. 10.1016/j.na.2009.05.078
Choi BK, Lee GM: On complexity analysis of the primal-dual interior-point method for second-order cone optimization problem. J. Korean Soc. Ind. Appl. Math. 2010, 14: 93–111.
Nesterov YE, Todd MJ: Primal-dual interior-point methods for self-scaled cones. SIAM J. Optim. 1998, 8: 324–364. 10.1137/S1052623495290209
Muramatsu M: On a commutative class of search directions for linear programming over symmetric cones. J. Optim. Theory Appl. 2002, 112: 595–625. 10.1023/A:1017920200889
Faybusovich L: Linear systems in Jordan algebras and primal-dual interior-point algorithms. J. Comput. Appl. Math. 1997, 86: 149–175. 10.1016/S0377-0427(97)00153-2
Faybusovich L: Euclidean Jordan algebras and interior-point algorithms. Positivity 1997, 1: 331–357. 10.1023/A:1009701824047
Faybusovich L, Arana R: A long-step primal-dual algorithm for symmetric programming problems. Syst. Control Lett. 2001, 43: 3–7. 10.1016/S0167-6911(01)00092-5
Schmieta S, Alizadeh F: Extensions of primal-dual interior-point algorithms to symmetric cones. Math. Program. 2003, 96: 409–438. 10.1007/s10107-003-0380-z
Baes M: Optimization Methods for Convex Symmetric Problems. Monograph, April (2007)
Choi BK, Lee GM: Complexity analysis for primal-dual interior-point methods for self-scaled optimization problems (submitted)
Vieira MVC: Jordan algebraic approach to symmetric optimization. Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, The Netherlands (2007)
Vieira MVC: Interior-point methods based on kernel functions for symmetric optimization. Optim. Methods Softw. 2011, 27: 513–537.
Sun D, Sun J: Löwner's operator and spectral functions in Euclidean Jordan algebras. Math. Oper. Res. 2008, 33: 421–445. 10.1287/moor.1070.0300
Faraut J, Korányi A: Analysis on Symmetric Cones. Oxford University Press, London; 1994.
Nesterov YE, Nemirovskii A: Interior Point Polynomial Algorithms in Convex Programming. SIAM Stud. Appl. Math. 13. SIAM, Philadelphia; 1994.
Gowda MS, Sznajder R, Tao J: Some P-properties for linear transformations on Euclidean Jordan algebras. Linear Algebra Appl. 2004, 393: 203–232.
Faybusovich L: A Jordan-algebraic approach to potential-reduction algorithms. Math. Z. 2002, 239: 117–129. 10.1007/s002090100286
Lim Y: Applications of geometric means on symmetric cones. Math. Ann. 2001, 319: 457–468. 10.1007/PL00004442
Tunçel L: Potential reduction and primal-dual methods. In Handbook of Semidefinite Programming, Theory, Algorithms and Applications. Edited by: Wolkowicz H, Saigal R, Vandenberghe L. Kluwer Academic, Boston; 2000:235–265.
Alizadeh F, Schmieta S: Symmetric cones, potential reduction methods and word-by-word extensions. In Handbook of Semidefinite Programming, Theory, Algorithms and Applications. Edited by: Wolkowicz H, Saigal R, Vandenberghe L. Kluwer Academic, Boston; 2000:195–233.
Horn RA, Johnson CR: Topics in Matrix Analysis. Cambridge University Press, Cambridge; 1991.
Baes M: Convexity and differentiability properties of spectral functions and spectral mappings on Euclidean Jordan algebras. Linear Algebra Appl. 2007, 422: 664–700. 10.1016/j.laa.2006.11.025

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MEST) (No. 2012-0006236).

Author information

Authors and Affiliations

Department of Applied Mathematics, Pukyong National University, Busan, 608-737, Korea
Bo Kyung Choi & Gue Myung Lee

Corresponding author

Correspondence to Gue Myung Lee.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The authors together discussed and solved the problems in the manuscript. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Choi, B.K., Lee, G.M. New complexity analysis for primal-dual interior-point methods for self-scaled optimization problems. Fixed Point Theory Appl 2012, 213 (2012). https://doi.org/10.1186/1687-1812-2012-213

Received: 30 June 2012
Accepted: 07 November 2012
Published: 26 November 2012
DOI: https://doi.org/10.1186/1687-1812-2012-213
