Try   HackMD

Lecture 7: Consequences of the Spectral Theorem; the Fredholm Alternative



We begin by recording some nice consequences of the spectral theorem for s.a. compact operators proved in the last lecture. We will use the notation
σ(T)
for the set of eigenvalues of a compact operator
T
.

Functional Calculus. If

f:σ(T)R
is any bounded function and
TK(H)
is selfadjoint with eigendecomposition
λn,ϕn
, define
f(T):=nf(λn)ϕnϕn.

Then the above definition is an algebra homomorphism, in that
af(T)+bg(T)=(af+bg)(T)

f(T)g(T)=g(T)f(T)=(fg)(T)

for such functions
f,g
.

The proof follows immediately from orthogonality of the eigenvectors

ϕn. Since the norm of a diagonal operator is its largest (in magnitude) entry, we have for compact s.a.
T
:
f(T)=maxλσ(T)|f(λ)|.
Some interesting consequences, by considering
f(t)=t
and
f(t)=|t|
are that for a compact s.a. operator
T
,
T
and
|T|
may be calculated "eigenvalue-wise" using the above calculus, and these are also are compact since
f(t)0
as
t0
for these functions.

Theorem 1. (Singular Value Decomposition) If

TK(H), there are orthnormal sets
ψn
and
ϕn
and nonnegative numbers
σn0
such that
T=nσnψnϕn.

Proof. By the polar decomposition, we have

T=U|T|, and
|T|
is compact by the functional calculus. Since it is also selfadjoint, the spectral theorem yields
|T|=nσnψnψn

for some nonzero eigenvalues
σn
which must be positive since
|T|0
. Moreover
U
is an isometry on
ker(|T|)
, which contains all the
ϕn
; so
ψn:=Uϕn
is an orthonormal set. This gives the desired result.

The second consequence is a variational characterization of the eigenvalues, often useful in proving bounds when it is hard to calculate them exactly (which it often is).

Theorem 2. (Courant-Fisher) If

T0 is compact, then its eigenvalues may be ordered
λ1λ2,λk
. We then have:
λk=maxdim(V)=kminxV{0}(x,Tx)/x2

=mindim(V)=kmaxxV{0}(x,Tx)/x2.

where the max/min is over subspaces of
H
.

This theorem is also true and extremely useful in finite dimensions. The proof is left to the homework. Two of its important conceptual consequences, immediate since the right hand sides are expressed entirely in terms of quadratic forms, are:

  1. If
    0AB
    for compact operators, then
    λk(A)λk(B)
    for every
    k
    .
  2. If
    ABϵ
    for positive compact operators, then
    |λk(A)λk(B)|ϵ
    .

The second part implies in particular that if finite rank

TnT0, then for every
k
λk(Tn)λk(T)
. The result may be extended to nonpositive (but still selfadjoint) operators by writing
A=A+A
for
A+,A0
using the spectral theorem.

Trace-Class and Hilbert-Schmidt Operators

Definition. If

AL(H) is positive and
ϕn
is an ONB of
H
then
tr(A):=n(ϕn,Aϕn)[0,].

The trace is well-defined as a function of

A because the above definition is independent of the choice of ONB; see RS Theorem VI.18 for a proof. Note that the sum above may not necessarily converge, though it converges absolutely whenever it does due to positivity; there can be more subtle convergence issues when dealing with operators which are not positive. The trace is well-behaved on a restricted class of operators defined below.

An operator is called trace class (denoted

S1 for "Schatten-1") if
tr(|A|)<

and Hilbert-Schmidt (denoted
S2
for "Schatten-2") if
tr(|A|2)<.

Theorem 3. If

tr(|A|p)< for
p=1
(resp.
p=2
) then
A
is compact.
Proof. Choose any ONB
ϕn
and notice that
n(ϕn,|A|pϕn)=n|A|p/2ϕn||2<.

Let
PN
be the projection onto the span of the first
N
basis vectors. For any
x=1
:
(IPN)|A|p/2x=n>Nϕn(ϕn,|A|p/2x)=n>N|((|A|p/2)ϕn,x)|2

n>N|A|p/2ϕn2x20,

uniformly in
x
, where the last inequality is Cauchy-Schwartz and we used
|A|=|A|
.
Thus,
|A|p/2
is a limit of finite rank operators and must be compact; so
|A|
(resp.
|A|
) is compact. Since compact operators are an ideal,
|A|
is compact in both cases, whence
A=U|A|
is compact.

Remark. The above proof actually shows that

tr(|A|p)< for any
p>0
is sufficient for compactness, since
|A|p/2
compact implies
|A|
compact by the functional calculus with
f(t)=t2/p
.

Corollary. A trace class operator is Hilbert-Schmidt.
Proof. If

TS1 then
|T|
is compact, so by the SVD and invariance of the trace
tr(|T|)=nσn
, where
σn
are the singular values of
T
. But since this is convergent,
tr(|T|2)=nσn2
must also be convergent.

The punch line is that the definition (and orthogonal invariance) of the trace extends to all of

S1 (including not necessarily positive operators). The following theorem is proven on the homework.

Theorem 4. If

A is trace-class, then for any ONB
ϕn
,
tr(A):=n(ϕn,Aϕn)

is independent of the choice of ONB.

This implies in particular that

tr(A)=nλn for any any selfadjoint trace class
A
.

Remark. There is a deep theorem of Lidskii which shows that

tr(A)=nλn for any trace class
A
(not necessarily self adjoint!).

The Fredholm Alternative

In finite dimensional linear algebra, the solvability of the equation

Ax=b is nicely characterized by the fact that
ran(A)=ker(A),

but this is no longer true in infinite dimensions, since the range may not be closed as we saw with the example of the Volterra operator. The Fredholm alternative states that the finite-dimensional relation can be recovered for operators of type
λIT
where
T
is compact and
λ0
. We will denote
λI
by
λ
as is customary.

Theorem 5. (Fredholm Alternative) If

TK(H) and
λC{0}
then for
A=λT
:

  1. ran(A)=ran(A)=ker(A).
  2. dim(ker(A))=dim(ker(A))<.

This theorem has the following important conceptual consequences:

  1. λ
    is an eigenvalue of
    T
    iff
    λ
    is an eigenvalue of
    T
    (since these correspond to
    ker(A){0}
    and
    ker(A){0}
    .
  2. (λT)x=b
    has a solution for every
    b
    iff
    λ
    is not an eigenvalue of
    T
    . Otherwise, the set of
    b
    for which a solution exists is precisely
    ker(A)
    .

Proof of (1).
Suppose

yn=(λT)xn()
is a Cauchy sequence in
R:=ran(λT)
, converging to some
yH
. We want to show that
yR
.

Let

V=ker(λT), and note that
V
is closed. Decompose each vector
xn=un+vn
for
vnV
. Assume for now that
vn
is a bounded sequence. By compactness,
Tvn
must have a convergent subsequence; passing to this subsequence, let
Tvnx
. Since
unker(λT)
, equation (*) now implies
yn=λvnTvnλvn=yn+Tvny+x,

as
n
. Since
λ0
we have
vn(x+y)/λ
, which satisfies
(λT)v=limn(λT)vn=y,

as desired.

To see that

vn is bounded, assume for contradiction that
vn.
The normalized sequence
vn/vn
is bounded, so
Tvn/vn
must have a convergent subsequence by compactness; let
z
be the limit of this sequence.

Observe that again by (*):

Tvn/vn=λvn/vnyn/vn.
Since the LHS converges to
z
, the RHS must also converge to
z
, revealing that
z=|λ|0
and
zV
since
vnV
for all
n
and that subspace is closed.

On the other hand, we also have

(λT)z=limn(λT)Tvn/vn=limnTyn/vn=0
since
vn
and
Tyn
is bounded, so
zV
, a contradiction.

Proof of (2). Given

λ0, decompose
T=T0+T1
where
T0<|λ|
and
T1
is finite rank (this is possible because
T
is a norm limit of finite rank operators). Observe that since the power series
(1x)1=k0xk=:S(x)

is absolutely geometrically convergent in
{|x|<1}
and satisfies
(1x)S(x)=1
for such
x
, we have that
S(T0/λ)=k0(T0/λ)k

is convergent in norm (by applying the triangle inequality to
S
minus its partial sums). Thus,
(1/λ)S(T0/λ)=(λT0)1L(H)
. We now have
(λT0)1(λT)=I(λT0)1T1=:IT2,

where
T2
has finite rank. As left or right multiplying by an invertible transformation doesn't change the dimension of the kernel,
dim(ker(λT))=dim(ker(IT2))

and
dim(ker(λT))=dim(ker(IT2))

the latter can be shown to be equal by elementary linear algebra (homework) since
T2
is finite rank.