Lecture 7: Consequences of the Spectral Theorem; the Fredholm Alternative

# Lecture 7: Consequences of the Spectral Theorem; the Fredholm Alternative $\newcommand{\tr}{\mathrm{tr}}$ $\newcommand{\ran}{\mathrm{ran}}$ We begin by recording some nice consequences of the spectral theorem for s.a. compact operators proved in the last lecture. We will use the notation $\sigma(T)$ for the set of eigenvalues of a compact operator $T$. **Functional Calculus.** If $\renewcommand{\R}{\mathbb{R}}$ $f:\sigma(T)\rightarrow\R$ is any bounded function and $T\in K(H)$ is selfadjoint with eigendecomposition $\lambda_n,\phi_n$, define $$f(T):=\sum_n f(\lambda_n)\phi_n\phi_n^*.$$ Then the above definition is an algebra homomorphism, in that $$af(T)+bg(T) = (af+bg)(T)$$ $$f(T)g(T)=g(T)f(T)=(fg)(T)$$ for such functions $f,g$. The proof follows immediately from orthogonality of the eigenvectors $\phi_n$. Since the norm of a diagonal operator is its largest (in magnitude) entry, we have for compact s.a. $T$: $$\|f(T)\|=\max_{\lambda\in\sigma(T)} |f(\lambda)|.$$ Some interesting consequences, by considering $f(t)=\sqrt{t}$ and $f(t)=|t|$ are that for a compact s.a. operator $T$, $\sqrt{T}$ and $|T|$ may be calculated "eigenvalue-wise" using the above calculus, and these are also are compact since $f(t)\rightarrow 0$ as $t\rightarrow 0$ for these functions. **Theorem 1. (Singular Value Decomposition)** If $T\in K(H)$, there are orthnormal sets $\psi_n$ and $\phi_n$ and nonnegative numbers $\sigma_n\rightarrow 0$ such that $$T = \sum_n \sigma_n \psi_n \phi_n^*.$$ *Proof.* By the polar decomposition, we have $T=U|T|$, and $|T|$ is compact by the functional calculus. Since it is also selfadjoint, the spectral theorem yields $$|T| = \sum_n \sigma_n \psi_n\psi_n^*$$ for some nonzero eigenvalues $\sigma_n$ which must be positive since $|T|\ge 0$. Moreover $U$ is an isometry on $\ker(|T|)^\perp$, which contains all the $\phi_n$; so $\psi_n := U\phi_n$ is an orthonormal set. This gives the desired result. $\square$ The second consequence is a variational characterization of the eigenvalues, often useful in proving bounds when it is hard to calculate them exactly (which it often is). **Theorem 2. (Courant-Fisher)** If $T\ge 0$ is compact, then its eigenvalues may be ordered $\lambda_1\ge \lambda_2,\ldots\lambda_k\ldots$. We then have: $$ \lambda_k = \max_{\dim(V)=k}\min_{x\in V-\{0\}} (x,Tx)/\|x\|^2$$ $$= \min_{\dim(V)=k}\max_{x\in V^\perp-\{0\}} (x,Tx)/\|x^2\|.$$ where the max/min is over subspaces of $H$. This theorem is also true and extremely useful in finite dimensions. The proof is left to the homework. Two of its important conceptual consequences, immediate since the right hand sides are expressed entirely in terms of quadratic forms, are: 1. If $0\le A\le B$ for compact operators, then $\lambda_k(A)\le \lambda_k(B)$ for every $k$. 2. If $\|A-B\|\le \epsilon$ for positive compact operators, then $|\lambda_k(A)-\lambda_k(B)|\le \epsilon$. The second part implies in particular that if finite rank $T_n\rightarrow T\ge 0$, then for every $k$ $\lambda_k(T_n)\rightarrow \lambda_k(T)$. The result may be extended to nonpositive (but still selfadjoint) operators by writing $A=A_+-A_-$ for $A_+,A_-\ge 0$ using the spectral theorem. ## Trace-Class and Hilbert-Schmidt Operators **Definition.** If $A\in L(H)$ is positive and $\phi_n$ is an ONB of $H$ then $$ \tr(A):=\sum_n (\phi_n,A\phi_n) \in [0,\infty].$$ The trace is well-defined as a function of $A$ because the above definition is *independent* of the choice of ONB; see RS Theorem VI.18 for a proof. Note that the sum above may not necessarily converge, though it converges absolutely whenever it does due to positivity; there can be more subtle convergence issues when dealing with operators which are not positive. The trace is well-behaved on a restricted class of operators defined below. An operator is called *trace class* (denoted $S_1$ for "Schatten-1") if $$ \tr(|A|)<\infty$$ and Hilbert-Schmidt (denoted $S_2$ for "Schatten-2") if $$ \tr(|A|^2)<\infty.$$ **Theorem 3.** If $\tr(|A|^p)<\infty$ for $p=1$ (resp. $p=2$) then $A$ is compact. *Proof.* Choose any ONB $\phi_n$ and notice that $$\sum_n (\phi_n,|A|^p\phi_n) = \sum_n \||A|^{p/2}\phi_n||^2<\infty.$$ Let $P_N$ be the projection onto the span of the first $N$ basis vectors. For any $\|x\|=1$: $$ \|(I-P_N)|A|^{p/2}x\| = \|\sum_{n>N} \phi_n (\phi_n, |A|^{p/2}x)\| = \sum_{n>N} |((|A|^{p/2})^*\phi_n, x)|^2$$ $$ \le \sum_{n>N} \||A|^{p/2}\phi_n\|^2\|\|x\|^2\rightarrow 0,$$ uniformly in $x$, where the last inequality is Cauchy-Schwartz and we used $|A|=|A|^*$. Thus, $|A|^{p/2}$ is a limit of finite rank operators and must be compact; so $\sqrt{|A|}$ (resp. $|A|$) is compact. Since compact operators are an ideal, $|A|$ is compact in both cases, whence $A=U|A|$ is compact.$\square$ **Remark.** The above proof actually shows that $\tr(|A|^p)<\infty$ for *any* $p>0$ is sufficient for compactness, since $|A|^{p/2}$ compact implies $|A|$ compact by the functional calculus with $f(t)=t^{2/p}$. **Corollary.** A trace class operator is Hilbert-Schmidt. *Proof.* If $T\in S_1$ then $|T|$ is compact, so by the SVD and invariance of the trace $\tr(|T|)=\sum_n \sigma_n$, where $\sigma_n$ are the singular values of $T$. But since this is convergent, $\tr(|T|^2)=\sum_n \sigma_n^2$ must also be convergent.$\square$ The punch line is that the definition (and orthogonal invariance) of the trace extends to all of $S_1$ (including not necessarily positive operators). The following theorem is proven on the homework. **Theorem 4.** If $A$ is trace-class, then for any ONB $\phi_n$, $$\tr(A):=\sum_n(\phi_n, A \phi_n)$$ is independent of the choice of ONB. This implies in particular that $\tr(A)=\sum_n \lambda_n$ for any any selfadjoint trace class $A$. **Remark.** There is a deep theorem of Lidskii which shows that $\tr(A)=\sum_n \lambda_n$ for *any* trace class $A$ (not necessarily self adjoint!). ## The Fredholm Alternative In finite dimensional linear algebra, the solvability of the equation $Ax=b$ is nicely characterized by the fact that $$ \ran(A) = \ker(A^*)^\perp,$$ but this is no longer true in infinite dimensions, since the range may not be closed as we saw with the example of the Volterra operator. The Fredholm alternative states that the finite-dimensional relation can be recovered for operators of type $\lambda I - T$ where $T$ is compact and $\lambda\neq 0$. We will denote $\lambda I$ by $\lambda$ as is customary. **Theorem 5. (Fredholm Alternative)** If $T\in K(H)$ and $\lambda\in \mathbb{C}-\{0\}$ then for $A=\lambda - T$: 1. $\ran(A)=\overline{\ran(A)} = \ker(A^*)^\perp.$ 2. $\dim(\ker(A))=\dim(\ker(A^*))<\infty.$ This theorem has the following important conceptual consequences: 1. $\lambda$ is an eigenvalue of $T$ iff $\overline{\lambda}$ is an eigenvalue of $T^*$ (since these correspond to $\ker(A)\neq \{0\}$ and $\ker(A^*)\neq\{0\}$. 2. $(\lambda-T)x=b$ has a solution for every $b$ iff $\lambda$ is not an eigenvalue of $T$. Otherwise, the set of $b$ for which a solution exists is precisely $\ker(A^*)^\perp$. *Proof of (1).* Suppose $$y_n=(\lambda-T)x_n\qquad (*)$$ is a Cauchy sequence in $R:=\ran(\lambda-T)$, converging to some $y\in H$. We want to show that $y\in R$. Let $V=\ker(\lambda-T)$, and note that $V$ is closed. Decompose each vector $x_n=u_n+v_n$ for $v_n\in V^\perp$. Assume for now that $v_n$ is a bounded sequence. By compactness, $Tv_n$ must have a convergent subsequence; passing to this subsequence, let $Tv_n\rightarrow x$. Since $u_n\in\ker(\lambda-T)$, equation (*) now implies $$ y_n = \lambda v_n - Tv_n \Rightarrow \lambda v_n = y_n + Tv_n \rightarrow y + x,$$ as $n\rightarrow\infty$. Since $\lambda\neq 0$ we have $v_n\rightarrow (x+y)/\lambda$, which satisfies $$ (\lambda-T)v=\lim_n (\lambda-T)v_n = y,$$ as desired. To see that $v_n$ is bounded, assume for contradiction that $\|v_n\|\rightarrow\infty.$ The normalized sequence $v_n/\|v_n\|$ is bounded, so $Tv_n/\|v_n\|$ must have a convergent subsequence by compactness; let $z$ be the limit of this sequence. Observe that again by (*): $$ Tv_n/\|v_n\| = \lambda v_n/\|v_n\|-y_n/\|v_n\|.$$ Since the LHS converges to $z$, the RHS must also converge to $z$, revealing that $\|z\|=|\lambda|\neq 0$ and $z\in V^\perp$ since $v_n\in V^\perp$ for all $n$ and that subspace is closed. On the other hand, we also have $$ (\lambda-T)z = \lim_n (\lambda-T) Tv_n/\|v_n\| = \lim_n T y_n/\|v_n\| = 0$$ since $\|v_n\|\rightarrow \infty$ and $Ty_n$ is bounded, so $z\in V$, a contradiction.$\square$ *Proof of (2).* Given $\lambda\neq 0$, decompose $T=T_0+T_1$ where $\|T_0\|<|\lambda|$ and $T_1$ is finite rank (this is possible because $T$ is a norm limit of finite rank operators). Observe that since the power series $$(1-x)^{-1}=\sum_{k\ge 0}x^k=:S(x)$$ is absolutely geometrically convergent in $\{|x|<1\}$ and satisfies $(1-x)S(x)=1$ for such $x$, we have that $$S(T_0/\lambda)=\sum_{k\ge 0} (T_0/\lambda)^k$$ is convergent in norm (by applying the triangle inequality to $S$ minus its partial sums). Thus, $(1/\lambda)S(T_0/\lambda)=(\lambda-T_0)^{-1}\in L(H)$. We now have $$ (\lambda-T_0)^{-1} (\lambda-T) = I - (\lambda-T_0)^{-1}T_1=:I-T_2,$$ where $T_2$ has finite rank. As left or right multiplying by an invertible transformation doesn't change the dimension of the kernel, $$\dim(\ker(\lambda-T))=\dim(\ker(I-T_2))$$ and $$\dim(\ker(\lambda^*-T^*))=\dim(\ker(I-T_2^*))$$ the latter can be shown to be equal by elementary linear algebra (homework) since $T_2$ is finite rank.$\square$