Chapter 6 extra note 1

self-adjoint operator
matrix representation of an adjoint linear transformation

Youtube

Selected lecture notes

self-adjoint operator

Definition:
Let V be an inner product space.

TL(V) is called self-adjoint if
T=T
, in other word,
(1)T(v),w=v,T(w),v,wV.

Theorem:
Let V be an inner product space over

C,
TL(V)
and
T=T
, then

  1. All the eigenvalues are real.
  2. Eigenvectors correspond to different eigenvalues are orthogonal.
  • Proof:

    Let

    v be an eigenvector of
    T
    correspond to
    λ
    , i.e.,
    T(v)=λv,v0.

    Then
    (2)T(v),v=λv,v=λv,v=λv2,

    and
    (3)T(v),v=v,T(v)=v,λv=λ¯v2.

    Therefore
    (4)(λλ¯)v2=0.

    But since
    v
    is an eigenvector,
    v0
    and
    v0
    , we have
    (5)λλ¯=0,

    that gives
    λR
    .

    Let

    u and
    v
    be non-zero vectors such that
    (6)T(u)=μu,T(v)=λv,

    where
    μλ
    .
    Then
    (7)T(u),v=μu,v=μu,v,

    and
    (8)T(u),v=u,T(v)=u,λv=λu,v.

    Therefore
    (9)(λμ)u,v=0.

    Since
    λμ
    , we must have
    u,v=0
    .


Lemma:
Let V be an inner product space,

TL(V) and
T=T
, and
b,cR
such that
b2<4c
, then the operator
T2+bT+cI
is invertible.

  • Proof:

    Choose

    v0, we consider the following inner product
    (10)(T2+bT+cI)v,v=T2v,v+bTv,v+cv,v=Tv,Tv+bTv,v+cv,v=Tv2+bTv,v+cv2Tv2|b|Tvv+cv2=(Tv|b|2v)2|b|24v2+cv2=(Tv|b|2v)2+|b|2+4c4v2>0,

    where the first inequality comes from Cauchy-Schwartz inequality, and the final inequality is the restul of
    v0
    and
    4cb2>0
    .

    We therefore conclude that

    (T2+bT+cI)v0 for
    v0
    , that gives
    Ker(T2+bT+cI)={0}
    .

    Since

    T2+bT+cIL(V), it must be invertible.

Theorem:
Let V be an non-zero, finite dimensional, real inner product space,

TL(V) and
T=T
, then
T
has an eigenvalue.

  • Proof:

    Assume

    dim(V)=n, given
    vV{0}
    , then the set
    {v,Tv,,Tnv}

    contains
    n+1
    vectors and is a linearly dependent set. There exists
    a0,,anR
    not all zero such that
    (11)a0v+a1Tv++anTnv=0.

    Let

    m be such that
    am0
    and
    am+1==an=0
    .

    It is easy to see that

    m1.

    We define a polynomial

    p(x)=a0+a1x+amxm. According to the Fundamental theorem of Algebra, and since this is a polynomial of real coefficients, we have
    p(x)=am(xμ1)(xμM)(x2+b1x+c1)(x2+bNx+cN),

    where
    bi2<4ci
    for all
    i
    and
    M+2N=m
    .

    We can then rewrite (11) similarly as

    (12)0=a0v+a1Tv++anTnv=p(T)v=am(Tμ1I)(TμMI)(T2+b1T+c1I)(T2+bNT+cNI)v.
    According to the previous lemma, we have
    T2+biT+ciI
    are invertible for all
    i
    , we can then apply the inverse of these operator to both side of (11) to have
    (13)0=am(Tμ1I)(TμMI)v.

    Since
    v0
    and
    am0
    , there must be a
    i{1,,M}
    such that
    Ker(TμiI){0}
    , that is,
    μi
    is an eigenvalue of
    T
    .


Theorem:
Let V be a finite dimensional, real inner product space,

TL(V) and
T=T
, then there exists a basis of orthonormal eigenvectors.

  • Proof:

    Assume

    dim(V)=n.

    Since

    TL(V) and
    T=T
    , based on the previous Theorem, there exists
    λ1R
    and
    v1V
    such that
    T(v1)=λ1v1.

    Let
    U1=span{v1}
    , then
    U1V
    is a vector subspace and is also an inner product space. We also have
    dim(U1)=n1
    .

    Now we want to define a new map by restricting

    T on the domain
    U1
    . Let's find out its range (or codomain) first.

    • claim:
      T(v)U1
      for
      vU1
      .

    (14)Tv,v1=v,Tv1=λ1v,v1.
    If
    vU1
    , we have
    v,v1=0
    that gives
    Tv,v1=0
    . That is,
    T(v)U1
    .

    Therefore, we can then define a map

    T1:U1U1. It should be clear that
    T1
    is linear,
    T1L(U1)
    , and
    T1
    is self-adjoint.

    We use the previous Theorem again on

    T1. There exists
    λ2R
    and
    v2U1
    such that
    T(v2)=λ2v2.

    Since
    v2U1
    ,
    {v1,v2}
    is orthogonal.

    We can repeat the process to find

    n eigenvalue and orthogonal eigenvectors. The eigenvectors can then be normalized to be orthonormal.


Matrix representation of an adjoint linear transformation

Inner product with orthonormal basis

Let

V be an innver product space and
β={v1,,vn}V
be an orthonormal basis. Given
u
and
vV
, there exists
x
and
yCn
such that
u=ixivi=[v1,,vn]x,v=iyivi=[v1,,vn]y.

Furthermore, we have
(15)u,vV=ixyyi¯=x,yCn.

Linear transformation between two inner product spaces

Let

V and
W
be inner product spaces with orthonormal basis
β={v1,,vn}V
and
μ={w1,,wm}W
, respectively. Let
TL(V,W)
, we have
(16)[T]βμ=AMm×n.

Given

vV and
wW
, there exists
xCn
and
yCm
such that
v=i=1nxivi,w=i=1myiwi.

Also we know that the "coordinate" of
Tv
in the basis
μ
is given by
Ax
. Therefore,
(17)Tv,wW=Ax,yCm=yAx=(Ay)xv,TwV=x,AyCn,

where the superscript
denotes conjugate transpose.

As a result, we know that the "coordinate" of

Tw in the basis
β
is given by
Ay
, and the matrix representation for
T
must be
(18)[T]μβ=A.