# Generalized metrics and Caristi’s theorem

- William A Kirk
^{1}and - Naseer Shahzad
^{2}Email author

**2013**:129

**DOI: **10.1186/1687-1812-2013-129

© Kirk and Shahzad; licensee Springer 2013

**Received: **4 February 2013

**Accepted: **29 April 2013

**Published: **15 May 2013

## Abstract

A ‘generalized metric space’ is a semimetric space which does not satisfy the triangle inequality, but which satisfies a weaker assumption called the quadrilateral inequality. After reviewing various related axioms, it is shown that Caristi’s theorem holds in complete generalized metric spaces without further assumptions. This is noteworthy because Banach’s fixed point theorem seems to require more than the quadrilateral inequality, and because standard proofs of Caristi’s theorem require the triangle inequality.

**MSC:**54H25, 47H10.

### Keywords

fixed points contraction mappings metric spaces semimetric spaces generalized metric spaces Caristi’s theorem## 1 Introduction

In an effort to generalize Banach’s contraction mapping principle, which holds in all complete metric spaces, to a broader class of spaces, Branciari [1] conceived of the notion to replace the triangle inequality with a weaker assumption he called the quadrilateral inequality. He called these spaces ‘generalized metric spaces’. These spaces retain the fundamental notion of distance. However, as we shall see, the quadrilateral inequality, while useful in some sense, ignores the importance of such things as the continuity of the distance function, uniqueness of limits, *etc.* In fact it has been asserted (see, *e.g.*, [2]) that for an accurate generalization of Banach’s fixed point theorem along the lines envisioned by Branciari, one needs the quadrilateral inequality in conjunction with the assumption that the space is Hausdorff.

We begin by discussing the relationship of Branciari’s concept to the classical axioms of semimetric spaces. Then we show that Caristi’s fixed point theorem holds within Branciari’s framework without any additional assumptions. This is possibly surprising. All proofs of Caristi’s theorem that the writers are aware of rely in some way on use of the triangle inequality. (In contrast, it has been noted that the proof of the first author’s fundamental fixed point theorem for nonexpansive mappings does not require the triangle inequality; see [3].)

## 2 Semimetric spaces

In the absence of relevant examples, it is not clear whether Branciari’s concept of weakening the triangle inequality will prove useful in analysis. However, the notion of assigning a ‘distance’ between each two points of an abstract set is fundamental in geometry. According to Blumenthal [[4], p.31], this notion has its origins in the late nineteenth century in axiomatic studies of de Tilly [5]. In his 1928 treatise [6], Karl Menger used the term *halb-metrischer Raume*, *or semimetric space*, to describe the same concept. We begin by summarizing the results of Wilson’s seminal paper [7] on semimetric spaces.

**Definition 1**Let

*X*be a set and let $D:X\times X\to \mathbb{R}$ be a mapping satisfying for each $a,b\in X$:

- I.
$d(a,b)\ge 0$, and $d(a,b)=0\iff a=b$;

- II.
$d(a,b)=d(b,a)$. Then the pair $(X,d)$ is called a

*semimetric space*.

In such a space, convergence of sequences is defined in the usual way: A sequence $\{{x}_{n}\}\subseteq X$ is said to *converge* to $x\in X$ if ${lim}_{n\to \mathrm{\infty}}d({x}_{n},x)=0$. Also, a sequence is said to be *Cauchy* (or *d*-Cauchy) if for each $\epsilon >0$ there exists $N\in \mathbb{N}$ such that $m,n\ge N\Rightarrow d({x}_{m},{x}_{n})<\epsilon $. The space $(X,d)$ is said to be *complete* if every Cauchy sequence has a limit.

With such a broad definition of distance, three problems are immediately obvious: (i) *There is nothing to assure that limits are unique* (*thus the space need not be Hausdorff*); (ii) *a convergent sequence need not be a Cauchy sequence*; (iii) *the mapping* $d(a,\cdot ):X\to \mathbb{R}$ *need not even be continuous*. Therefore it is unlikely there could be an effective topological theory in such a setting.

- VI.(Triangle inequality)
*With**X**and**d**as in Definition*1,*assume also that for each*$a,b,c\in X$,$d(a,b)\le d(a,c)+d(c,b).$

**Definition 2** A pair $(X,d)$ satisfying Axioms I, II, and VI is called a *metric space*.^{a}

- III.
*For each pair of*(*distinct*)*points*$a,b\in X$,*there is a number*${r}_{a,b}>0$*such that for every*$c\in X$,${r}_{a,b}\le d(a,c)+d(c,b).$ - IV.
*For each point*$a\in X$*and each*$k>0$,*there is a number*${r}_{a,k}>0$*such that if*$b\in X$ satisfies $d(a,b)\ge k$, then*for every*$c\in X$,${r}_{a,k}\le d(a,c)+d(c,b).$ - V.
*For each*$k>0$, there is a number ${r}_{k}>0$ such that if $a,b\in X$ satisfy $d(a,b)\ge k$, then for every $c\in X$,${r}_{k}\le d(a,c)+d(c,b).$

Obviously, if Axiom V is strengthened to ${r}_{k}=k$, then the space becomes metric. Chittenden [8] has shown (using an equivalent definition) that a semimetric space satisfying Axiom V is always *homeomorphic* to a metric space.

Axiom III is equivalent to the assertion that there do not exist distinct points $a,b\in X$ and a sequence $\{{c}_{n}\}\subseteq X$ such that $d(a,{c}_{n})+d(b,{c}_{n})\to 0$ as $n\to \mathrm{\infty}$. Thus, as Wilson observes, the following is self-evident.

**Proposition 1** *In a semimetric space*, *Axiom * III *is equivalent to the assertion that limits are unique*.

For $r>0$, let $U(p;r)=\{x\in X:d(x,p)<r\}$. Then Axiom III is also equivalent to the assertion that *X* is Hausdorff in the sense that given any two distinct points $a,b\in X$, there exist positive numbers ${r}_{a}$ and ${r}_{b}$ such that $U(a;{r}_{a})\cap U(b;{r}_{b})=\mathrm{\varnothing}$. This suggests the presence of a topology.

**Definition 3** Let $(X,d)$ be a semimetric space. Then the distance function *d* is said to be *continuous* if for any sequences $\{{p}_{n}\},\{{q}_{n}\}\subseteq X$, ${lim}_{n}d({p}_{n},p)=0$ and ${lim}_{n}d({q}_{n},q)=0\Rightarrow {lim}_{n}d({p}_{n},{q}_{n})=d(p,q)$.

**Remark** Some writers call a space satisfying Axioms I and II a ‘symmetric space’ and reserve the term semimetric space for a symmetric space with a continuous distance function (see, *e.g.*, [9]; *cf.* also [10, 11]). Here we use Menger’s original terminology.

A point *p* in a semimetric space *X* is said to be an *accumulation point* of a subset *E* of *X* if, given any $\epsilon >0$, $U(p;\epsilon )\cap E\ne \mathrm{\varnothing}$. A subset of a semimetric space is said to be *closed* if it contains each of its accumulation points. A subset of a semimetric space is said to be *open* if its complement is closed. With these definitions, if *X* is a semimetric space with a continuous distance function, then $U(p;r)$ is an open set for each $p\in X$ and $r>0$ and, moreover, *X* is a Hausdorff topological space [4].

We now turn to the concept introduced by Branciari.

**Definition 4** ([1])

*X*be a nonempty set, and let $d:X\times X\to [0,\mathrm{\infty})$ be a mapping such that for all $x,y\in X$ and all distinct points $u,v\in X$, each distinct from

*x*and

*y*:

- (i)
$d(x,y)=0\iff x=y$;

- (ii)
$d(x,y)=d(y,x)$;

- (iii)
$d(x,y)\le d(x,u)+d(u,v)+d(v,y)$ (quadrilateral inequality).

Then *X* is called a *generalized metric space* (g.m.s.).

**Proposition 2** *If* $(X,d)$ *is a generalized metric space which satisfies Axiom * III, *then the distance function is continuous*.

*Proof*Suppose that $\{{p}_{n}\},\{{q}_{n}\}\subseteq X$ satisfy ${lim}_{n}d({p}_{n},p)=0$ and ${lim}_{n}d({q}_{n},q)=0$, where $p\ne q$. Also assume that for

*n*arbitrarily large, ${p}_{n}\ne p$ and ${q}_{n}\ne q$. In view of Axiom III, we may also assume that for

*n*sufficiently large, ${p}_{n}\ne {q}_{n}$. Then

Thus ${lim}_{n}d({p}_{n},{q}_{n})=d(p,q)$. □

Therefore if a generalized metric space satisfies Axiom III, it is a Hausdorff topological space. However, the following observation shows that the quadrilateral inequality implies a weaker but useful form of distance continuity. (This is a special case of Proposition 1 of [12].)

**Proposition 3** *Suppose that* $\{{q}_{n}\}$ *is a Cauchy sequence in a generalized metric space* *X* *and suppose* ${lim}_{n}d({q}_{n},q)=0$. *Then* ${lim}_{n}d(p,{q}_{n})=d(p,q)$ *for all* $p\in X$. *In particular*, $\{{q}_{n}\}$ *does not converge to* *p* *if* $p\ne q$.

*Proof*We may assume that $p\ne q$. If ${q}_{n}=p$ for arbitrarily large

*n*, it must be the case that $p=q$. So, we may also assume that $p\ne {q}_{n}$ for all

*n*. Also, ${q}_{n}\ne q$ for infinitely many

*n*; otherwise, the result is trivial. So, we may assume that ${q}_{n}\ne {q}_{m}\ne q$ and ${q}_{n}\ne {q}_{m}\ne p$ for all $m,n\in \mathbb{N}$ with $m\ne n$. Then, by the quadrilateral inequality,

□

with the aid of Proposition 3, Branciari’s proof carries over with only a minor change. The assertion in [2] that the space needs to be Hausdorff is superfluous, a fact first noted in [12]. See also the example in [13].

**Theorem 1** ([1])

*Let* $(X,d)$ *be a complete generalized metric space*, *and suppose that the mapping* $f:X\to X$ *satisfies* $d(f(x),f(y))\le \lambda d(x,y)$ *for all* $x,y\in X$ *and fixed* $\lambda \in (0,1)$. *Then* *f* *has a unique fixed point* ${x}_{0}$, *and* ${lim}_{n}{f}^{n}(x)={x}_{0}$ *for each* $x\in X$.

It is possible to prove this theorem by following the proof given by Branciari up to the point of showing that $\{{f}^{n}(x)\}$ is a Cauchy sequence for each $x\in X$. Then, by completeness of *X*, there exists ${x}_{0}\in X$ such that ${lim}_{n}{f}^{n}(x)={x}_{0}$. But ${lim}_{n}d({f}^{n+1}(x),f({x}_{0}))\le \lambda {lim}_{n}d({f}^{n}(x),{x}_{0})=0$, so ${lim}_{n}{f}^{n+1}x=f({x}_{0})$. In view of Proposition 3, $f({x}_{0})={x}_{0}$.

## 3 Caristi’s theorem

We now turn to a proof of Caristi’s theorem in a complete g.m.s.

**Theorem 2** (*cf.* Caristi [14])

*Let*$(X,d)$

*be a complete g*.

*m*.

*s*.

*Let*$f:X\to X$

*be a mapping*,

*and let*$\phi :X\to {\mathbb{R}}^{+}$

*be a lower semicontinuous function*.

*Suppose that*

*Then* *f* *has a fixed point*.

Typically, proofs of Caristi’s theorem (and there have been many) involve assigning a partial order ⪯ to *X* by setting $x\u2aafy\iff d(x,y)\le \phi (x)-\phi (y)$, and then either using Zorn’s lemma or the Brézis-Browder order principle (see Section 4). However, the triangle inequality is needed for these approaches in order to show that $(X,\u2aaf)$ is transitive. The proof we give below is based on Wong’s modification [15] of Caristi’s original transfinite induction argument [14]. (Recall that if *M* is a metric space, a mapping $\phi :M\to \mathbb{R}$ is said to be *lower semicontinuous* (l.s.c.) if given $x\in X$ and a net $\{{x}_{\alpha}\}$ in *M*, the conditions ${x}_{\alpha}\to x$ and $\phi ({x}_{\alpha})\to r$ imply $\phi (x)\le r$.)

*Proof of Theorem 2*Let $n\in \mathbb{N}$. Then

This proves that $\{{f}^{n}(x)\}$ is a Cauchy sequence. If *f* were continuous, one could immediately conclude that there exists ${x}_{0}\in X$ such that ${lim}_{n}{f}^{n}(x)={x}_{0}=f({x}_{0})$. (The quadrilateral inequality is not needed in this case, but it is necessary for Cauchy sequences to have unique limits.)

- (i)
${x}_{\alpha +1}=f({x}_{\alpha})$ for all $\alpha <\beta $;

- (ii)
if $\gamma <\beta $ is a limit ordinal, then the net ${\{{x}_{\alpha}\}}_{\alpha <\gamma}$ converges to ${x}_{\gamma}$;

- (iii)
if $0\le \alpha \le \mu <\beta $ and $|[\alpha ,\mu ]|\ge 4$, then $d({x}_{\alpha},{x}_{\mu})\le \phi ({x}_{\alpha})-\phi ({x}_{\mu})$.

*β*is a limit ordinal. We claim that ${\{{x}_{\alpha}\}}_{\alpha <\beta}$ is a Cauchy net. If not, there exists $\epsilon >0$ and a strictly increasing sequence $\{{\alpha}_{n}\}$ in $(0,\beta )$ such that $|[{\alpha}_{n},{\alpha}_{n+1}]|\ge 4$ and $d({x}_{{\alpha}_{n}},{x}_{{\alpha}_{n+1}})\ge \epsilon $. This leads to the contradiction

Therefore ${\{{x}_{\alpha}\}}_{\alpha <\beta}$ is a Cauchy net and, since *X* is complete, it is possible to take ${x}_{\beta}={lim}_{\alpha <\beta}{x}_{\alpha}$.

*β*is a limit ordinal, the cardinality of $[\alpha ,\beta ]$ is infinite for all $\alpha <\beta $. Consequently, since

*φ*is lower semicontinuous,

Therefore a net $\{{x}_{\alpha}\}$ has been defined satisfying (i), (ii), and (iii) for all $\alpha \in \mathrm{\Gamma}$. Let ${\mathrm{\Gamma}}^{\mathrm{\prime}}$ denote the set of limit ordinals in Γ. If *f* has no fixed point, the net ${\{\phi ({x}_{\alpha})\}}_{\alpha \in {\mathrm{\Gamma}}^{\mathrm{\prime}}}$ is strictly decreasing. This is a contradiction because ${\mathrm{\Gamma}}^{\mathrm{\prime}}$ is uncountable and any strictly decreasing net of real numbers must be countable. □

## 4 Another approach

We now examine an easy proof of Caristi’s original theorem based on Zorn’s lemma. (A more constructive proof which uses the Brézis-Browder order principle is given in [16].)

**Theorem 3**

*Let*$(X,d)$

*be a complete metric space*.

*Let*$f:X\to X$

*be a mapping*,

*and let*$\phi :X\to {\mathbb{R}}^{+}$

*be a lower semicontinuous function*.

*Suppose that*

*Then* *f* *has a fixed point*.

*Proof*Introduce the Brøndsted partial order on

*X*by setting $x\u2aafy\iff d(x,y)\le \phi (x)-\phi (y)$. Let

*I*be a totally ordered set, and let ${\{{x}_{\gamma}\}}_{\gamma \in I}$ be a chain in $(X,\u2aaf)$. Then $\alpha \le \beta \Rightarrow {x}_{\alpha}\u2aaf{x}_{\beta}\iff d({x}_{\alpha},{x}_{\beta})\le \phi ({x}_{\alpha})-\phi ({x}_{\beta})$. Therefore ${\{\phi ({x}_{\gamma})\}}_{\gamma \in I}$ is decreasing. Since

*φ*is bounded below, ${lim}_{\gamma}\phi ({x}_{\gamma})=r$. This implies ${lim}_{\alpha ,\beta}d({x}_{\alpha},{x}_{\beta})=0$; hence ${\{{x}_{\gamma}\}}_{\gamma \in I}$ is a Cauchy net. Since

*X*is complete, there exists $x\in X$ such that ${lim}_{\gamma}{x}_{\gamma}=x$. Thus for $\alpha \in I$,

Therefore ${x}_{\alpha}\u2aafx$ for each $\alpha \in I$, so *x* is an upper bound for the chain ${\{\phi ({x}_{\gamma})\}}_{\gamma \in I}$. By Zorn’s lemma, $(X,\u2aaf)$ has a maximal element $\overline{x}$. But condition (C) implies $\overline{x}\u2aaff(\overline{x})$, so it must be the case that $\overline{x}=f(\overline{x})$. □

*X*that are limits of nontrivial Cauchy sequences. The proof of Theorem 2 implies that nontrivial Cauchy sequences exist. So, let

*x*,

*y*, and

*z*be three distinct points in $({X}_{C},d)$, and let $\{{z}_{n}\}$ be a Cauchy sequence converging to

*z*. Then, by the quadrilateral inequality,

Letting $n\to \mathrm{\infty}$ and applying Proposition 3, we see that $d(x,y)\le d(x,z)+d(z,y)$. Therefore $({X}_{C},d)$ is a metric space. In the proof of Theorem 3 $\overline{x}\in {X}_{C}$. To show that $\overline{x}\u2aaff(\overline{x})$, it is necessary to show that $f(\overline{x})\in {X}_{C}$. Assume that $\overline{x}\ne f(\overline{x})$. Then $\{{f}^{n}(\overline{x})\}$ is a Cauchy sequence. So, let ${x}_{\mathrm{\infty}}={lim}_{n}{f}^{n}(\overline{x})$.

**Remark** In view of Proposition 3, it seems reasonable to introduce the following definition.

**Definition 5** A point *p* in a generalized metric space *X* is said to be an *accumulation point* of a subset *E* of *X* if some infinite Cauchy sequence in *E* converges to *p*. A set *E* in *X* is said to be *closed* if it contains all of its accumulation points.

Observe that with convergence defined as above, ${lim}_{n}{x}_{n}=x\iff \{{x}_{n}\}$ is a Cauchy sequence and ${lim}_{n}d({x}_{n},x)=0$.

## Endnote

The term ‘metric space’ for spaces satisfying Axioms I, II, and VI is apparently due to Hausdorff [17].

## Notes

## Declarations

### Acknowledgements

We thank a referee for pointing out some oversights in the original draft of this manuscript. The research of N. Shahzad was partially supported by the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, Saudi Arabia.

## Authors’ Affiliations

## References

- Branciari A: A fixed point theorem of Banach-Caccioppoli type on a class of generalized metric spaces.
*Publ. Math. (Debr.)*2000, 57: 31–37.MathSciNetGoogle Scholar - Sarma IR, Rao JM, Rao SS: Contractions over generalized metric spaces.
*J. Nonlinear Sci. Appl.*2009, 2: 180–182.MathSciNetGoogle Scholar - Kirk WA, Kang BG: A fixed point theorem revisited.
*J. Korean Math. Soc.*1997, 34(2):285–291.MathSciNetGoogle Scholar - Blumenthal LM:
*Theory and Applications of Distance Geometry*. 2nd edition. Chelsea, New York; 1970.Google Scholar - de Tilly, J: ‘Essai de géométrie analytique gén érale’, Mémoires couronnés et autres mémoires publiés par l’Académie Royale de Belgique, 47, mémoire 5 (1892–93)
- Menger K: Untersuchungen über allgemeine Metrik.
*Math. Ann.*1928, 100: 75–163. 10.1007/BF01448840MathSciNetView ArticleGoogle Scholar - Wilson WA: On semimetric spaces.
*Am. J. Math.*1931, 53(2):361–373. 10.2307/2370790View ArticleGoogle Scholar - Chittenden EW: On the equivalence of ecart and voisinage.
*Trans. Am. Math. Soc.*1917, 18(2):161–166.MathSciNetGoogle Scholar - Jachymski J, Matkowski J, Świątkowski T: Nonlinear contractions on semimetric spaces.
*J. Appl. Anal.*1995, 1(2):125–134.MathSciNetView ArticleGoogle Scholar - Hicks TL, Rhoades BE: Fixed point theory in symmetric spaces with applications to probabilistic spaces.
*Nonlinear Anal., Theory Methods Appl.*1999, 36(3):331–344. 10.1016/S0362-546X(98)00002-9MathSciNetView ArticleGoogle Scholar - Miheţ DL: A note on a paper of T. L. Hicks and B. E. Rhoades: ‘Fixed point theory in symmetric spaces with applications to probabilistic spaces’ [Nonlinear Anal. 36 (1999), no. 3, Ser. A: Theory Methods, 331–344; MR1688234].
*Nonlinear Anal.*2006, 65(7):1411–1413. 10.1016/j.na.2005.10.021MathSciNetView ArticleGoogle Scholar - Turinici, M: Functional contractions in local Branciari metric spaces. arXiv:1208.4610v1 [math.GN] 22 Aug 2012Google Scholar
- Samet B: Discussion on: a fixed point theorem of Banach-Caccioppoli type on a class of generalized metric spaces by A. Branciari.
*Publ. Math. (Debr.)*2010, 76(4):493–494.MathSciNetGoogle Scholar - Caristi J: Fixed point theorems for mappings satisfying inwardness conditions.
*Trans. Am. Math. Soc.*1976, 215: 241–251.MathSciNetView ArticleGoogle Scholar - Wong CS: On a fixed point theorem of contractive type.
*Proc. Am. Math. Soc.*1976, 57(2):283–284. 10.1090/S0002-9939-1976-0407826-5View ArticleGoogle Scholar - Brézis H, Browder FE: A general principle on ordered sets in nonlinear functional analysis.
*Adv. Math.*1976, 21(3):355–364. 10.1016/S0001-8708(76)80004-7View ArticleGoogle Scholar - Hausdorff, F: Grundzüge der Mengenlehre. Leipzig (1914)Google Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.