--- title: Ch5-3 tags: Linear algebra GA: G-77TT93X4N1 --- # Chapter 5 extra note 3 ## Selected lecture notes > orthogonal projection > Gram-Schmidt method ### Orthogonal projection :::info **Definition:** Let $E\subset V$ be a vector subspace and ${\bf u}\in V$ be a vector, then ${\bf u}$ is orthogonal to $E$ if $\langle{\bf u}, {\bf v}\rangle=0$ for all ${\bf v}\in E$, and we write denoted by ${\bf u}\perp E$. ::: **Lemma:** Let $E = \text{span}\{{\bf v}_1, \cdots, {\bf v}_n\}$, then ${\bf u}\perp E$ if and only if ${\bf u}\perp {\bf v}_k$ for all $k$. :::info **Definition:** Let $E,F\subset V$ be vector subspaces, then $E\perp F$ if $\langle{\bf u}, {\bf v}\rangle=0$ for all ${\bf u}\in E$ and ${\bf v}\in F$. ::: :::info **Definition:** The orthogonal projection of a vector ${\bf u}$ on a vector subspace $E$ is a vector ${\bf w}$ satisfies 1. ${\bf w}\in E$; 2. $({\bf u}-{\bf w})\perp E$. ::: **Theorem:** Let ${\bf w}$ be the orthogonal projection of ${\bf u}$, then $$ \|{\bf u}-{\bf w}\|\le \|{\bf u}-{\bf v}\|, \quad \forall {\bf v}\in E, $$ and the equality holds if ${\bf v}={\bf w}$. * Proof: > Given ${\bf v}\in E$, > $$ > {\bf u}-{\bf v} = ({\bf u}-{\bf w}) + ({\bf w}-{\bf v}). > $$ > Since ${\bf w, v}\in E$, we have $({\bf w}-{\bf v})\in E$. > Since ${\bf w}$ is the orthogonal projection of ${\bf u}$, we have $({\bf u}-{\bf w})\perp E$, and hence $({\bf u}-{\bf w})\perp ({\bf w}-{\bf v})$. > > By Pythagorean theorem, we obtain > $$ > \|{\bf u}-{\bf v}\|^2 = \|{\bf u}-{\bf w}\|^2 + \|{\bf w}-{\bf v}\|^2. > $$ > Therefore > $$ > \|{\bf u}-{\bf w}\|^2 \le \|{\bf u}-{\bf v}\|^2, > $$ > and the equality holds if $\|{\bf w}-{\bf v}\|^2=0$, which means ${\bf v}={\bf w}$. **Remarks:** The Theorem ensures the ***uniqueness*** of the projection, and therefore we can denote the orthogonal projection of a vector ${\bf u}$ on a vector subspace $E$ by $P_E{\bf u}$. * Proof of the uniqueness: > > Suppose there exists ${\bf w}_2$ that is also an orthogonal projection of ${\bf u}$ on $E$. That is, ${\bf w}_2\in E$ and $({\bf u}-{\bf w}_2)\perp E$. > > Then, since ${\bf w}_2\in E$, according to the above theorem we have > $$ > \|{\bf u}-{\bf w}\|\le \|{\bf u}-{\bf w}_2\|. > $$ > Similarly, ${\bf w}_2\in E$ and $({\bf u}-{\bf w}_2)\perp E$ implies > $$ > \|{\bf u}-{\bf w}_2\|\le \|{\bf u}-{\bf v}\|, \quad \forall {\bf v}\in E, > $$ > so we must have > $$ > \|{\bf u}-{\bf w}_2\|\le \|{\bf u}-{\bf w}\|. > $$ > Therefore, $\|{\bf u}-{\bf w}_2\|=\|{\bf u}-{\bf w}\|$, and, based on the Theorem again, ${\bf w}={\bf w}_2$. **Proposition:** Let $\{{\bf v}_1, \cdots, {\bf v}_n\}$ be an orthogonal basis of a vector subspace $E$, then, given ${\bf u}\in V$, $$ P_E{\bf u} = \sum^n_{k=1} \frac{\langle{\bf u}, {\bf v}_k\rangle}{\langle{\bf v}_k, {\bf v}_k\rangle}{\bf v}_k. $$ * Proof: > 1. $P_E{\bf u}$ is a linear combination of ${\bf v}_k$'s, so $P_E{\bf u}\in E$. > 2. > $$ > \langle{\bf u}-{\bf w}, {\bf v}_j\rangle = \langle{\bf u}, {\bf v}_j\rangle - \langle{\bf w}, {\bf v}_j\rangle. > $$ > We also have, since ${\bf v}_k$'s are orthogonal, > $$ > \langle{\bf w}, {\bf v}_j\rangle = \langle P_E{\bf u}, {\bf v}_j\rangle = \langle\frac{\langle{\bf u}, {\bf v}_j\rangle}{\langle{\bf v}_j, {\bf v}_j\rangle}{\bf v}_j, {\bf v}_j\rangle = \left(\frac{\langle{\bf u}, {\bf v}_j\rangle}{\langle{\bf v}_j, {\bf v}_j\rangle}\right)\langle{\bf v}_j, {\bf v}_j\rangle = \langle{\bf u}, {\bf v}_j\rangle. > $$ > Therefore, $\langle{\bf u}-{\bf w}, {\bf v}_j\rangle=0$ for all $j$, and we have $({\bf u}-{\bf w})\perp E$. **Remarks:** The Proposition ensures the ***existence*** of the projection. ### Gram-Schmidt method Given $S=\{{\bf u}_1, \cdots, {\bf u}_n\}$ be a linearly independent set. We can use Gram-Schmidt process to generate an orthogonal set $\beta=\{{\bf v}_1, \cdots, {\bf v}_n\}$ such that $\text{span}\{\beta\} = \text{span}\{S\}$. **Gram-Schmidt process (the basic idea):** 1. ${\bf v}_1 = {\bf u}_1$, $\quad E_1 = \text{span}\{{\bf v}_1\}$. 2. ${\bf v}_2 = {\bf u}_2 - P_{E_1}{\bf u}_2$, $\quad E_2 = \text{span}\{{\bf v}_1, {\bf v}_2\}$. 3. ${\bf v}_3 = {\bf u}_3 - P_{E_2}{\bf u}_3$, $\quad E_3 = \text{span}\{{\bf v}_1, {\bf v}_2, {\bf v}_3\}$. 4. $\cdots$ **Gram-Schmidt process (algorithm):** $$ \begin{align} {\bf v}_1 &= {\bf u}_1;\\ {\bf v}_2 &= {\bf u}_2 - \frac{\langle{\bf u}_2, {\bf v}_1\rangle}{\langle{\bf v}_1, {\bf v}_1\rangle}{\bf v}_1;\\ {\bf v}_3 &= {\bf u}_3 - \frac{\langle{\bf u}_3, {\bf v}_1\rangle}{\langle{\bf v}_1, {\bf v}_1\rangle}{\bf v}_1- \frac{\langle{\bf u}_3, {\bf v}_2\rangle}{\langle{\bf v}_2, {\bf v}_2\rangle}{\bf v}_2;\\ \vdots \end{align} $$