---
title: Ch6-1
tags: Linear algebra
GA: G-77TT93X4N1
---
# Chapter 6 extra note 1
> self-adjoint operator
> matrix representation of an adjoint linear transformation
## YouTube
* [3Blue1Brown - Eigenvectors and eigenvalues](https://youtu.be/PFDu9oVAE-g?si=RT4LB1LccIssxBmp)
## Selected lecture notes
### Self-adjoint operator
:::info
**Definition:**
Let $V$ be an inner product space. $T\in\mathcal{L}(V)$ is called **self-adjoint** if $T=T^*$; in other words,
$$
\tag{1}
\langle T({\bf v}), {\bf w}\rangle = \langle {\bf v}, T({\bf w})\rangle, \quad \forall {\bf v}, {\bf w}\in V.
$$
:::
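In $\mathbb{C}^n$ with the standard inner product, a self-adjoint operator is represented by a Hermitian matrix. A minimal numerical sketch of the definition, assuming NumPy (the matrix `A` below is an arbitrary example, not taken from these notes):
```python
import numpy as np

# In C^n with the standard inner product, T is self-adjoint
# exactly when its matrix A equals its conjugate transpose.
A = np.array([[2.0, 1 - 1j],
              [1 + 1j, 3.0]])
print(np.allclose(A, A.conj().T))                    # True: A is Hermitian

# Check definition (1): <A v, w> == <v, A w> for random v, w.
# With <x, y> = sum_i x_i * conj(y_i), we have <x, y> = np.vdot(y, x).
rng = np.random.default_rng(0)
v = rng.standard_normal(2) + 1j * rng.standard_normal(2)
w = rng.standard_normal(2) + 1j * rng.standard_normal(2)
print(np.allclose(np.vdot(w, A @ v), np.vdot(A @ w, v)))  # True
```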
**Theorem:**
Let $V$ be an inner product space over $\mathbb{C}$ and let $T\in\mathcal{L}(V)$ with $T=T^*$. Then
1. All eigenvalues of $T$ are real.
2. Eigenvectors corresponding to different eigenvalues are orthogonal.
* Proof:
> 1)
> Let ${\bf v}$ be an eigenvector of $T$ corresponding to $\lambda$, i.e.,
> $$
> T({\bf v}) = \lambda {\bf v}, \quad {\bf v}\ne {\bf 0}.
> $$
> Then
> $$
> \tag{2}
> \langle T({\bf v}), {\bf v}\rangle = \langle \lambda{\bf v}, {\bf v}\rangle = \lambda\langle {\bf v}, {\bf v}\rangle = \lambda\|{\bf v}\|^2,
> $$
> and
> $$
> \tag{3}
> \langle T({\bf v}), {\bf v}\rangle = \langle {\bf v}, T({\bf v})\rangle = \langle {\bf v}, \lambda{\bf v}\rangle = \bar{\lambda} \|{\bf v}\|^2.
> $$
> Therefore
> $$
> \tag{4}
> (\lambda-\bar{\lambda}) \|{\bf v}\|^2 =0.
> $$
> Since ${\bf v}$ is an eigenvector, ${\bf v}\ne {\bf 0}$ and hence $\|{\bf v}\|\ne 0$, so
> $$
> \tag{5}
> \lambda-\bar{\lambda} =0,
> $$
> which gives $\lambda\in\mathbb{R}$.
>
> 2)
> Let ${\bf u}$ and ${\bf v}$ be non-zero vectors such that
> $$
> \tag{6}
> T({\bf u}) = \mu {\bf u}, \quad T({\bf v}) = \lambda {\bf v},
> $$
> where $\mu\ne\lambda$.
> Then
> $$
> \tag{7}
> \langle T({\bf u}), {\bf v}\rangle = \langle \mu{\bf u}, {\bf v}\rangle = \mu\langle {\bf u}, {\bf v}\rangle,
> $$
> and
> $$
> \tag{8}
> \langle T({\bf u}), {\bf v}\rangle = \langle {\bf u}, T({\bf v})\rangle = \langle {\bf u}, \lambda{\bf v}\rangle = \lambda\langle {\bf u}, {\bf v}\rangle.
> $$
> Therefore
> $$
> \tag{9}
> (\lambda-\mu) \langle {\bf u}, {\bf v}\rangle =0.
> $$
> Since $\lambda\ne\mu$, we must have $\langle {\bf u}, {\bf v}\rangle =0$.
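Both claims can be verified numerically. A sketch assuming NumPy (the random Hermitian matrix is an arbitrary example):
```python
import numpy as np

rng = np.random.default_rng(1)
B = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
A = B + B.conj().T                       # Hermitian, hence self-adjoint

eigvals, eigvecs = np.linalg.eig(A)
# 1) all eigenvalues are real (up to floating-point noise)
print(np.allclose(eigvals.imag, 0))      # True
# 2) eigenvectors for distinct eigenvalues are orthogonal
u, v = eigvecs[:, 0], eigvecs[:, 1]
print(np.isclose(np.vdot(u, v), 0))      # True
```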
---
**Lemma:**
Let $V$ be a *finite dimensional* inner product space, $T\in\mathcal{L}(V)$ with $T=T^*$, and $b,c\in\mathbb{R}$ such that $b^2<4c$. Then the operator $T^2+bT+cI$ is invertible.
* Proof:
> Choose ${\bf v}\ne {\bf 0}$ and consider the inner product
> $$
> \tag{10}
> \begin{align}
> \langle (T^2+bT+cI){\bf v}, {\bf v}\rangle &= \langle T^2{\bf v}, {\bf v}\rangle+\langle bT{\bf v}, {\bf v}\rangle+\langle c{\bf v}, {\bf v}\rangle\\
> &=\langle T{\bf v}, T{\bf v}\rangle+\langle bT{\bf v}, {\bf v}\rangle+\langle c{\bf v}, {\bf v}\rangle\\
> &=\|T{\bf v}\|^2 +\langle bT{\bf v}, {\bf v}\rangle+c\|{\bf v}\|^2\\
> &\ge \|T{\bf v}\|^2 - |b|\|T{\bf v}\| \|{\bf v}\|+c\|{\bf v}\|^2\\
> &=\left(\|T{\bf v}\| - \frac{|b|}{2}\|{\bf v}\|\right)^2-\frac{|b|^2}{4}\|{\bf v}\|^2+c\|{\bf v}\|^2\\
> &= \left(\|T{\bf v}\| - \frac{|b|}{2}\|{\bf v}\|\right)^2+\frac{-|b|^2+4c}{4}\|{\bf v}\|^2\\
> &> 0,
> \end{align}
> $$
> where the first inequality comes from the Cauchy–Schwarz inequality, and the final inequality follows from ${\bf v}\ne {\bf 0}$ and $4c-b^2>0$.
>
> We therefore conclude that $(T^2+bT+cI){\bf v}\ne {\bf 0}$ for all ${\bf v}\ne {\bf 0}$, which gives $\text{Ker}(T^2+bT+cI)=\{{\bf 0}\}$.
>
> Since $V$ is finite dimensional and $T^2+bT+cI\in\mathcal{L}(V)$ is injective, it must be invertible.
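A numerical check of the lemma, assuming NumPy (the symmetric matrix and the choice $b=1$, $c=2$ are arbitrary): the eigenvalues of $A^2+bA+cI$ are $\lambda^2+b\lambda+c$, which never vanish when $b^2<4c$, so the operator is in fact positive definite.
```python
import numpy as np

rng = np.random.default_rng(2)
S = rng.standard_normal((5, 5))
A = (S + S.T) / 2                        # real symmetric: A = A^T
b, c = 1.0, 2.0                          # b^2 = 1 < 8 = 4c

M = A @ A + b * A + c * np.eye(5)        # M = A^2 + bA + cI
# The eigenvalues of M are p(lambda) = lambda^2 + b*lambda + c,
# which never vanish since b^2 < 4c, so M is invertible.
print(np.all(np.linalg.eigvalsh(M) > 0))             # True: positive definite
print(np.allclose(M @ np.linalg.inv(M), np.eye(5)))  # True: invertible
```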
---
**Theorem:**
Let $V$ be a non-zero, *finite dimensional*, *real* inner product space with $T\in\mathcal{L}(V)$ and $T=T^*$. Then $T$ has an eigenvalue.
* Proof:
> Assume $\text{dim}(V)=n$ and let ${\bf v}\in V\setminus\{\bf 0\}$. Then the set
> $$
> \{{\bf v}, T{\bf v}, \cdots, T^n{\bf v}\}
> $$
> contains $n+1$ vectors and is therefore linearly dependent. Hence there exist $a_0, \cdots, a_n\in\mathbb{R}$, not all zero, such that
> $$
> \tag{11}
> a_0{\bf v}+a_1T{\bf v}+ \cdots +a_nT^n{\bf v} = {\bf 0}.
> $$
>
> Let $m$ be the largest index with $a_m\ne 0$, that is, $a_m\ne 0$ and $a_{m+1}=\cdots =a_n=0$.
> > Note that $m\ge 1$: if $m=0$, then (11) reads $a_0{\bf v}={\bf 0}$ with ${\bf v}\ne{\bf 0}$, forcing $a_0=0$, a contradiction.
>
> We define the polynomial $p(x)=a_0+a_1 x+\cdots +a_m x^m$. According to the [Fundamental theorem of Algebra](https://en.wikipedia.org/wiki/Fundamental_theorem_of_algebra), and since $p$ has real coefficients, it factors as
> $$
> p(x) = a_m(x-\mu_1)\cdots(x-\mu_M)(x^2+b_1x+c_1)\cdots (x^2+b_Nx+c_N),
> $$
> where $b_i^2<4c_i$ for all $i$ and $M+2N=m$.
>
> We can then rewrite (11) as
> $$
> \tag{12}
> \begin{align}
> {\bf 0} &= a_0{\bf v}+a_1T{\bf v}+ \cdots +a_nT^n{\bf v} \\
> &= p(T){\bf v}\\
> &= a_m(T-\mu_1 I)\cdots(T-\mu_M I)(T^2+b_1T+c_1 I)\cdots (T^2+b_NT+c_NI){\bf v}.
> \end{align}
> $$
> According to the previous lemma, the operators $T^2+b_i T + c_iI$ are invertible for all $i$. Applying their inverses to both sides of (12) gives
> $$
> \tag{13}
> {\bf 0} = a_m(T-\mu_1 I)\cdots(T-\mu_M I){\bf v}.
> $$
> Since ${\bf v}\ne {\bf 0}$ and $a_m\ne 0$, we have $M\ge 1$, and not every factor $T-\mu_i I$ can be injective, for otherwise the composition in (13) would be injective and could not map ${\bf v}\ne{\bf 0}$ to ${\bf 0}$. Hence there is an $i\in \{1, \cdots, M\}$ such that $\text{Ker}(T-\mu_i I)\ne \{{\bf 0}\}$; that is, $\mu_i$ is an eigenvalue of $T$.
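The two ingredients of the proof can be illustrated numerically, assuming NumPy (the polynomial $x^3-1$ and the random symmetric matrix are arbitrary examples):
```python
import numpy as np

# A real polynomial factors into real linear factors and irreducible
# quadratics, e.g. x^3 - 1 = (x - 1)(x^2 + x + 1) with b = c = 1, b^2 < 4c.
roots = np.roots([1, 0, 0, -1])
real_roots = roots[np.abs(roots.imag) < 1e-10].real
print(real_roots)                        # [1.]

# The theorem: a real symmetric matrix always has a (real) eigenvalue.
rng = np.random.default_rng(3)
S = rng.standard_normal((4, 4))
A = (S + S.T) / 2
print(np.linalg.eigvalsh(A))             # n real eigenvalues
```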
---
**Theorem:**
Let $V$ be a *finite dimensional*, *real* inner product space with $T\in\mathcal{L}(V)$ and $T=T^*$. Then $V$ has an orthonormal basis consisting of eigenvectors of $T$.
* Proof:
> Assume $\text{dim}(V)=n$.
>
> Since $T\in\mathcal{L}(V)$ and $T=T^*$, based on the previous Theorem, there exists $\lambda_1\in\mathbb{R}$ and ${\bf v}_1\in V$ such that
> $$
> T({\bf v}_1) = \lambda_1{\bf v}_1.
> $$
> Let $U_1=\text{span}\{{\bf v}_1\}$. Then $U_1^\perp\subseteq V$ is a subspace and is itself an inner product space, with $\text{dim}(U_1^\perp)=n-1$.
>
> Now we want to define a new map by restricting $T$ to the domain $U_1^\perp$. We first check that $U_1^\perp$ is invariant under $T$.
>
> * Claim: $T({\bf v})\in U_1^\perp$ for all ${\bf v}\in U_1^\perp$.
> > $$
> > \tag{14}
> > \langle T{\bf v}, {\bf v}_1\rangle = \langle{\bf v}, T{\bf v}_1\rangle=\lambda_1\langle{\bf v}, {\bf v}_1\rangle.
> > $$
> > If ${\bf v}\in U_1^\perp$, then $\langle{\bf v}, {\bf v}_1\rangle=0$, which gives $\langle T{\bf v}, {\bf v}_1\rangle=0$. That is, $T({\bf v})\in U_1^\perp$.
>
> Therefore, we can define the restriction $T_1 = T|_{U_1^\perp}:U_1^\perp\to U_1^\perp$. It is clear that $T_1$ is linear, so $T_1\in\mathcal{L}(U_1^\perp)$, and that $T_1$ is self-adjoint.
>
> We use the previous Theorem again on $T_1$. There exists $\lambda_2\in\mathbb{R}$ and ${\bf v}_2\in U_1^\perp$ such that
> $$
> T({\bf v}_2) = \lambda_2{\bf v}_2.
> $$
> Since ${\bf v}_2\in U_1^\perp$, $\{{\bf v}_1, {\bf v}_2\}$ is orthogonal.
>
> Repeating this process, we obtain $n$ eigenvalues and mutually orthogonal eigenvectors. Normalizing the eigenvectors then yields an orthonormal basis.
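Numerically, this is exactly what `np.linalg.eigh` produces for a real symmetric matrix. A sketch assuming NumPy (the matrix is an arbitrary example):
```python
import numpy as np

rng = np.random.default_rng(4)
S = rng.standard_normal((4, 4))
A = (S + S.T) / 2                        # real symmetric, self-adjoint

lam, Q = np.linalg.eigh(A)               # columns of Q: orthonormal eigenvectors
print(np.allclose(Q.T @ Q, np.eye(4)))   # True: orthonormal basis
print(np.allclose(A @ Q, Q * lam))       # True: A v_i = lambda_i v_i
print(np.allclose(Q.T @ A @ Q, np.diag(lam)))  # True: diagonal in this basis
```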
---
### Matrix representation of an adjoint linear transformation
#### Inner product with orthonormal basis
Let $V$ be an inner product space and $\beta=\{{\bf v}_1, \cdots, {\bf v}_n\}\subset V$ an orthonormal basis. Given ${\bf u}, {\bf v}\in V$, there exist ${\bf x}, {\bf y}\in\mathbb{C}^n$ such that
$$
{\bf u} = \sum_i x_i{\bf v}_i=
\begin{bmatrix}{\bf v}_1, \cdots, {\bf v}_n\end{bmatrix}{\bf x}, \quad
{\bf v} = \sum_i y_i{\bf v}_i=
\begin{bmatrix}{\bf v}_1, \cdots, {\bf v}_n\end{bmatrix}{\bf y}.
$$
Furthermore, we have
$$
\tag{15}
\langle{\bf u}, {\bf v}\rangle_V = \sum_i x_i \bar{y_i}=\langle{\bf x}, {\bf y}\rangle_{\mathbb{C}^n}.
$$
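Equation (15) can be checked numerically by generating an orthonormal basis with a QR factorization. A sketch assuming NumPy (all names are illustrative):
```python
import numpy as np

rng = np.random.default_rng(5)
n = 4
M = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
V_basis, _ = np.linalg.qr(M)             # columns: an orthonormal basis of C^n

x = rng.standard_normal(n) + 1j * rng.standard_normal(n)   # coordinates of u
y = rng.standard_normal(n) + 1j * rng.standard_normal(n)   # coordinates of v
u = V_basis @ x
v = V_basis @ y

# <u, v>_V == <x, y>_{C^n}; with <a, b> = sum_i a_i conj(b_i) = np.vdot(b, a)
print(np.allclose(np.vdot(v, u), np.vdot(y, x)))           # True
```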
#### Linear transformation between two inner product spaces
Let $V$ and $W$ be inner product spaces with orthonormal bases $\beta=\{{\bf v}_1, \cdots, {\bf v}_n\}\subset V$ and $\mu=\{{\bf w}_1, \cdots, {\bf w}_m\}\subset W$, respectively. For $T\in\mathcal{L}(V, W)$, we write
$$
\tag{16}
[T]^{\mu}_{\beta} = A\in M_{m\times n}.
$$
Given ${\bf v}\in V$ and ${\bf w}\in W$, there exist ${\bf x}\in\mathbb{C}^n$ and ${\bf y}\in\mathbb{C}^m$ such that
$$
{\bf v} = \sum^n_{i=1} x_i{\bf v}_i, \quad
{\bf w} = \sum^m_{i=1} y_i{\bf w}_i.
$$
Also we know that the "coordinate" of $T{\bf v}$ in the basis $\mu$ is given by $A{\bf x}$. Therefore,
$$
\tag{17}
\begin{align}
\langle T{\bf v}, {\bf w}\rangle_W &= \langle A{\bf x}, {\bf y}\rangle_{\mathbb{C}^m}\\
&={\bf y}^*A{\bf x}\\
&=(A^*{\bf y})^*{\bf x}\\
&=\langle {\bf x}, A^*{\bf y}\rangle_{\mathbb{C}^n},
\end{align}
$$
where the superscript $^*$ denotes the conjugate transpose. By the definition of the adjoint, $\langle T{\bf v}, {\bf w}\rangle_W = \langle {\bf v}, T^*{\bf w}\rangle_V$, so $\langle {\bf v}, T^*{\bf w}\rangle_V = \langle {\bf x}, A^*{\bf y}\rangle_{\mathbb{C}^n}$.
As a result, we know that the "coordinate" of $T^*{\bf w}$ in the basis $\beta$ is given by $A^*{\bf y}$, and the matrix representation for $T^*$ must be
$$
\tag{18}
[T^*]^{\beta}_{\mu} = A^*.
$$
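In the standard (orthonormal) bases of $\mathbb{C}^n$ and $\mathbb{C}^m$, (18) says the matrix of $T^*$ is simply the conjugate transpose of the matrix of $T$. A minimal numerical check, assuming NumPy (the random matrix is arbitrary):
```python
import numpy as np

rng = np.random.default_rng(6)
m, n = 3, 4
A = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
A_star = A.conj().T                      # matrix of the adjoint, as in (18)

x = rng.standard_normal(n) + 1j * rng.standard_normal(n)
y = rng.standard_normal(m) + 1j * rng.standard_normal(m)

# Defining property of the adjoint: <A x, y>_{C^m} == <x, A* y>_{C^n},
# with <a, b> = np.vdot(b, a) in numpy's convention.
print(np.allclose(np.vdot(y, A @ x), np.vdot(A_star @ y, x)))  # True
```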