Lecture 6: Spectral Theorem for Compact Operators

More on Compact Operators

In the last lecture we defined compact operators, and stated that they are equivalent to norm limits of finite rank operators. We begin by proving one direction of this theorem (the other direction is left to the homework).

Proof. Assume

T \in K (H)

, let

B \subset H

be the unit ball, and let

K = \overset{―}{T (B)}

. Let

n

be an integer. The set

\cup_{z \in K} {y : ‖ z - y ‖ < 1 / n}

is an open cover of

K

so it must have a finite subcover; call this

S_{n}

. Let

P_{n}

be the orthogonal projection onto

s p a n (S_{n})

. Note that

P_{n} T

is a finite rank operator since

S_{n}

is finite.

Consider a point

x \in H

. Choose

z \in S_{n}

such that

‖ T x - z ‖ < 1 / n

. Since

P_{n} T x

is the closest point to

T x

s p a n (S_{n})

, we must have

‖ P_{n} T x - T x ‖ \leq ‖ z - T x ‖ < 1 / n,

whence

‖ P_{n} T - T ‖ < 1 / n

, so that

‖ P_{n} T - T ‖ \to 0

n \to \infty .

◻

Two interesting families of compact operators are the following.

Integral Kernel Operators. Suppose

K \in L^{2} (0, 1)^{2}

. A Cauchy-Schwartz exercise shows that

‖ T_{K} ‖ \leq ‖ K ‖_{L^{2} (0, 1)^{2}}

, where

T_{K} f := \int_{0}^{1} K (x, y) f (y) d y .

Let

{ϕ_{i}}

be an ONB of

L^{2} (0, 1)

. It is easy to check that the family of bivariate functions

{ϕ_{i} (x) ϕ_{j} (y)}_{i, j}

is an ONB of

L^{2} (0, 1)^{2}

, so we may expand

K (x, y) = \sum_{i j} c_{i j} ϕ_{i} (x) ϕ_{j} (y) .

Let

K_{n} (x, y) = \sum_{i, j \leq n} c_{i j} ϕ_{i} (x) ϕ_{j} (y) .

As the difference

T_{K_{n}} - T_{K}

is an integral kernel operator of the same type, we have

‖ T_{K} - T_{K_{n}} ‖ \leq ‖ K - K_{n} ‖_{L^{2}} = \sum_{i, j > n} | c_{i j} |^{2} \to 0

n \to \infty

, showing that

T_{k}

is compact by the previous theorem.

Diagonal Operators. If

α_{n}

is any sequence with

| α_{n} | \to 0

, then the diagonal multiplication operator

T (e_{i}) = α_{i} e_{i}

is compact, since it is approximated by

T_{n} (e_{i}) = α_{i} e_{i} {i \leq n}

Note that diagonal multiplication operators in

L^{2} (0, 1)

T_{g} f = g (x) f (x)

for

g \in L^{\infty} (0, 1)

are not compact, as seen by considering

g

to be the indicator of any set of positive Lebesgue measure, since then the image

\overset{―}{T_{g} (B)}

contains infinitely many vectors pairwise separated by a constant distance.

Finally, we mention that the set

K (H)

of compact operators on

H

is a 2-sided

* -

ideal in

L (H)

, which means that if

T \in K (H)

then

T^{*}, B T, T B \in K (H)

for any

B \in L (H)

. These properties are easy to verify from the characterization as of

K (H)

as norm limits of finite rank operators.

The Spectral Theorem for Compact Operators

We will now show that the diagonal operators above are in a sense the only examples of compact operators, up to isomorphism.

Definition. If

T f = λ f

for some

f \in H ∖ {0}

then

f

is called an eigenvector of

T

and

λ

is called an eigenvalue.

Theorem. Suppose

T = T^{*} \in K (H)

. Then:

The eigenvalues of
$T$ are real and may be ordered
$λ_{1}, λ_{2}, \dots \to 0$ .
If
$λ \neq 0$ is an eigenvalue of
$T$ , the eigenspace
$V_{λ} := \ker (λ - T)$ is finite dimensional.
There is an orthonormal basis of
$H$ consisting of eigenvectors of
$T$ .

The last property implies that we have the expansion

T = \sum_{n} λ_{n} ϕ_{n} ϕ_{n}^{*}

where

ϕ_{n}^{*}

denotes the linear functional

(ϕ_{n}, x)

dual to

ϕ_{n}

. Alternately, this can be expressed as

T = U D U^{*}

where

U (e_{n}) = ϕ_{n}

is unitary and

D

is a diagonal multiplication operator.

Proof of Theorem. We begin by observing that for any eigenvectors

T v = λ v

and

T w = μ w

μ (v, w) = (v, T w) = (T^{*} v, w) = (T v, w) = \overset{―}{λ} (v, w) .

Plugging in

v = w

shows that

λ = \overset{―}{λ}

λ

must be real. For

μ \neq λ

we then must have

(v, w) = 0

, so eigenvectors from distinct eigenvalues must be orthogonal.

Lemma 1. For every

ϵ > 0

, the subspace

S_{ϵ} := s p a n {x : T x = λ x, | λ | \geq ϵ}

is finite dimensional.
Proof of Lemma. We first show that for every nonzero eigenvalue

λ

V_{λ}

is finite dimensional. Assume not, i.e., there is an infinite sequence of orthonormal vectors

{x_{n}}

such that

T x_{n} = λ x_{n}

. Then

T x_{n} \in K = \overset{―}{T (B)}

and we have

‖ T x_{n} - T x_{m} ‖ = | λ | ‖ x_{n} - x_{m} ‖ = \sqrt{2} | λ |

whenever

n \neq m

. But compactness implies that every sequence in

K

must have a convergent subsequence, so this is impossible.

The orthogonality of distinct eigenspaces (which are closed since they are kernels) implies that

S_{ϵ} = ⨁_{| λ | \geq ϵ} V_{λ} .

Assume for contradiction that there are infinitely many direct summands, and choose one unit eigenvector

x_{n}

from each eigenspace. By the same argument above, we have

‖ T x_{n} - T x_{m} ‖ = ‖ λ_{n} x_{n} - λ_{m} x_{m} ‖ \geq ϵ

whenever

n \neq m

, since the hypotenuse of a right triangle is longer than its shorter side. This is impossible by compactness.

◻

Lemma 1 implies properties (1) and (2) and that there are at most countably many eigenvalues. We now show that there are enough to form an orthonormal basis; the key is to show that we can always find one eigenvector, and the rest will follow by induction.

Lemma 2. Either

‖ T ‖

- ‖ T ‖

is an eigenvalue of

T

Proof of Lemma. Let

c = sup_{‖ x ‖ = 1} ‖ T x ‖

. If

c = 0

then we are done since

T = 0

and any unit vector in the kernel will do. Otherwise assume

c > 0

and let

{x_{n}} \subset B

be a sequence such that

‖ T x_{n} ‖ \to c

. By compactness of

K

we may pass to a subsequence such that

T x_{n}

converges to some vector

y \neq 0

Let

A = c^{2} - T^{2}

, and observe that

(x, A x) \geq 0

for every

x \in H

, since

(x, T^{2} x) = ‖ T x ‖^{2} \leq c^{2}

for every

x

. Observe that

(x_{n}, A x_{n}) = c^{2} - ‖ T x_{n} ‖^{2} \to 0.

Since

A

is positive it has a square root

\sqrt{A}

, so we have

(x_{n}, {\sqrt{A}}^{2} x_{n}) = ‖ \sqrt{A} x_{n} ‖^{2} \to 0.

Since

T \sqrt{A}

is a bounded operator and

T A = A T

, this implies

‖ T A x_{n} ‖ = ‖ A T x_{n} ‖ \to 0,

which by continuity of

A

implies

A y = 0

. Thus we have

(c^{2} - T^{2}) y = (c - T) (c + T) y = 0.

(c + T) y = 0

then

- c

is an eigenvalue; otherwise

c

is an eigenvalue with eigenvector

(c + T) y \neq 0

◻

To finish the proof of the theorem, let

γ_{n}

denote the dimension of

V_{λ_{n}}

and let

{ϕ_{n j}}_{n = 1, j = 1}^{\infty, γ_{n}}

denote a sequence of orthonormal eigenvectors for all of the (countably many by Lemma 1) nonzero eigenvalues

λ_{n}

. Set

M = s p a n {ϕ_{n j}}_{n, j}

(meaning the set of finite linear combinations). Observe that

T (M) \subset M

by construction, and since

T

is continuous this implies

T (\overset{―}{M}) \subset \overset{―}{M} .

On the other hand, if

y \in M^{⊥}

and

x \in M

we have

(T y, x) = (y, T x) = 0

since

T x \in M

. Thus, both

\overset{―}{M}

and

M^{⊥}

are closed invariant subspaces of

T

Observe that if

P = p r o j (M^{⊥})

, the restricted operator

P T P

is also compact. Since it has no nonzero eigenvectors (by construction), Lemma 2 implies that it must satisfy

‖ P T P ‖ = 0

, which means

T x = 0

whenever

x \in M^{⊥}

. Thus, every vector in

M^{⊥}

is an eigenvector of

T

with eigenvalue

0

To complete the proof, take

{ψ_{k}}

to be any orthonormal basis of

M^{⊥}

. Since

H = \overset{―}{M} \oplus M^{⊥}

the union

{ϕ_{n j}}_{n, j} \cup {ψ_{k}}

is an ONB of

H

consisting of eigenvectors of

T

, as desired.

◻

Remark. As pointed out by Tarun in class, it is possible to prove Lemma 2 by considering

c I - T

instead of

c^{2} I - T^{2}

if one assumes

‖ T ‖ = sup_{‖ x ‖ = 1} (x, T x)

, which can be achieved by possibly replacing

T

with

- T

Remark. Yeshwanth pointed out that one can also prove Lemma 2 by showing that

‖ y / c - x_{n} ‖^{2} \to 0

, which can be seen by expanding the left hand side in inner products. This proof has the advantage of not using a square root.