How to solve a system of linear equations?

In high school, we have seen many system of linear equations with a unique solution, and we know how to solve it. Here we are going to learn how to obtain all solutions of a system of linear equation when its solution is not unique.

Consider the system of linear equations

\begin{aligned} x + y + z + w + u & = 3, \\ x + 2 y + 2 z + 2 w + 2 u & = 4, \\ x + 3 y + 3 z + 4 w + 4 u & = 5. \end{aligned}

Step 1: Transform the system into the augmented matrix

We may record the coefficients on the left into a matrix

A

and the constants on the right into a vector

b

. By setting

A = [\begin{matrix} 1 & 1 & 1 & 1 & 1 \\ 1 & 2 & 2 & 2 & 2 \\ 1 & 3 & 3 & 4 & 4 \end{matrix}], x = [\begin{matrix} x \\ y \\ z \\ w \\ u \end{matrix}], and b = [\begin{matrix} 3 \\ 4 \\ 5 \end{matrix}],

we know the system of linear equations is equivalent to

A x = b

, and we may represent this system by the augmented matrix

[\begin{array}{cc} A & b \end{array}] = [\begin{array}{cccccc} 1 & 1 & 1 & 1 & 1 & 3 \\ 1 & 2 & 2 & 2 & 2 & 4 \\ 1 & 3 & 3 & 4 & 4 & 5 \end{array}] .

Step 2: Make it into an echelon form

By running Gaussian elimination, which is a sequence of row operations, we may obtain

[\begin{array}{cccccc} 1 & 1 & 1 & 1 & 1 & 3 \\ 0 & 1 & 1 & 1 & 1 & 1 \\ 0 & 0 & 0 & 1 & 1 & 0 \end{array}] .

This stair-like structure is called an echelon form . The term echelon is a way to arrange the troops in military; see the pictures in Wikipedia: Echelon formation to get a better sense of this name.

Note that, if preferred, one may run Gaussian elimination further to get the reduced echelon form

[\begin{array}{cccccc} 1 & 0 & 0 & 0 & 0 & 2 \\ 0 & 1 & 1 & 0 & 0 & 1 \\ 0 & 0 & 0 & 1 & 1 & 0 \end{array}] .

Step 3: Recognize the leading variables and the free variables

The echelon form we used above is equivalent to

\begin{aligned} x + & y + z + & w + u & = 3, \\ y + z + & w + u & = 1, \\ w + u & = 0. \end{aligned}

The first (left-most) variable with nonzero coefficient on each equation is called a leading variable . In this case, we have three leading variables

x

y

, and

w

. Any variable that is not a leading variable is called a free variable . In this case,

z

and

u

are the free variables.

Note that given any numbers for the free variables, the leading variables are uniquely determined, as we will see in the next two steps.

Step 4: Find a special solution

For example, we may assign

z = 0

and

u = 0

. Thus, we solve, from right to left, that

w = 0

y = 1

, and

x = 2

. Recording these numbers as a vector

p = [\begin{matrix} 2 \\ 1 \\ 0 \\ 0 \\ 0 \end{matrix}],

we call it as a special solution of the

A x = b

, which means it is one of the solution.

Step 5: Find the homogeneous solutions

In fact, we may assign

z = c_{1}

and

u = c_{2}

to get all solutions. Thus, we have

[\begin{matrix} x \\ y \\ z \\ w \\ u \end{matrix}] = [\begin{matrix} 2 \\ 1 & - c_{1} \\ 0 & c_{1} \\ 0 & - c_{2} \\ 0 & c_{2} \end{matrix}] = [\begin{matrix} 2 \\ 1 \\ 0 \\ 0 \\ 0 \end{matrix}] + c_{1} [\begin{matrix} 0 \\ - 1 \\ 1 \\ 0 \\ 0 \end{matrix}] + c_{2} [\begin{matrix} 0 \\ 0 \\ 0 \\ - 1 \\ 1 \end{matrix}] = p + c_{1} h_{1} + c_{2} h_{2} .

Note that

h_{1}

is usually not a solution of

A x = b

; instead, it is a solution of

A x = 0

. This is not suprising after a second thought. By assigning

z = 0

and

u = 0

, we get the solution

p

, so

A p = b

. By assigning

z = 1

and

u = 0

, we get the solution

p + h_{1}

, so

A (p + h_{1}) = b

. Combining these two facts along with some algebra, it is straightforward to see

A h_{1} = A (p + h_{1}) - A p = b - b = 0 .

Indeed,

h_{1}

is the unique solution of

A x = 0

with

z = 1

and

u = 0

. Therefore, here is another way to obtain

h_{1}

. First, consider the homogeneous equation

A x = 0

. By running the Gaussian elimination, we know it is equivalent to

\begin{aligned} x + & y + z + & w + u & = 0, \\ y + z + & w + u & = 0, \\ w + u & = 0, \end{aligned}

which are the same equations we have been using except that the constants on the right are replaced by zeros. By solving this homogeneous system with

z = 1

and

u = 0

, we see again that

h_{1} = [\begin{matrix} 0 \\ - 1 \\ 1 \\ 0 \\ 0 \end{matrix}] .

In a similar way, one may obtain

h_{2}

without calculating all the parametrization. (Give it a try!)

Now, the set

span ({h_{1}, h_{2}}) = {c_{1} h_{1} + c_{2} h_{2} : c_{1}, c_{2} \in R}

is the solution set of

A x = 0

. It is also called the homogeneous solution of

A x = b

, since we have to replace

b

0

. On the other hand, the set

p + span ({h_{1}, h_{2}}) = {p + c_{1} h_{1} + c_{2} h_{2} : c_{1}, c_{2} \in R}

is the solution set of

A x = b

. It is also called the general solution of

A x = b

, in contrast to a special solution.

As a summary, the general solution is equal to a special solution plus the homogeneous solution.

This note can be found at Course website > Learning resources.