Linear Algebra/Eigenvalues and eigenvectors

Eigenvalues and eigenvectors are related to fundamental properties of matrices.

The word eigenvalue comes from the German Eigenwert which means "proper or characteristic value."

Motivations

Large matrices can be costly, in terms of computational time, to use, and may have to be iterated hundreds or thousands of times for a calculation. Additionally, the behavior of matrices would be hard to explore without important mathematical tools. One mathematical tool, which has applications not only for Linear Algebra but for differential equations, calculus, and many other areas, is the concept of eigenvalues and eigenvectors. Eigenvalues and eigenvectors are based upon a common behavior in linear systems. Let's look at an example.

Let

A={\begin{pmatrix}1&2\\0&-2\\\end{pmatrix}}

and

\mathbf {x} ={\begin{pmatrix}-2\\3\\\end{pmatrix}},\quad \mathbf {y} ={\begin{pmatrix}1\\0\\\end{pmatrix}}.

What happens with x and y if they are transformed by A? Well,

A\mathbf {x} ={\begin{pmatrix}4\\-6\\\end{pmatrix}}

A\mathbf {y} ={\begin{pmatrix}1\\0\\\end{pmatrix}}

But what is remarkable is that

A\mathbf {x} =(-2){\begin{pmatrix}-2\\3\\\end{pmatrix}}=-2\mathbf {x}

A\mathbf {y} =(1){\begin{pmatrix}1\\0\\\end{pmatrix}}=\mathbf {y} =(1)\mathbf {y}

So when we operate on the vector x with the matrix A, instead of getting a different vector (as we would normally do), we get the same vector x multiplied by some constant. And the same goes for vector y.

We call the values 1 and -2 the eigenvalues of the matrix A, and the vectors x and y are called eigenvectors for the matrix A.

Definitions

We now generalize this concept of when a matrix/vector product is the same as a product by a scalar as above: essentially if we have a n×n matrix A, we seek solutions in v to find the eigenvectors, and solutions in λ to find the eigenvalues for the equation

Av=λv

How are we to do this? Let us rearrange the equation

Av-λv=0

(A-λI)v=0 (note we must multiply the scalar by the identity matrix otherwise A-λ makes no sense)

But (A-λI) is a matrix, so we are trying to solve Bv=0 where B=(A-λI), and this solution is merely the kernel of B, ker B. So the eigenvectors are in ker (A-λI), where λ is an eigenvalue. But how do we find the eigenvalues?

Bv=0 has nonzero solution if |B| = det(B) is zero. So to find the eigenvalues, we let |A-λI|=0 and then solve for λ. We will thus obtain a polynomial equation over the complex numbers (eigenvalues can be complex), known as the characteristic equation. The roots of the characteristic equation are the eigenvalues.

Note that we exclude 0 as an eigenvector, because it is trivially a solution to Av=λv and is not really interesting to consider. Additionally, if the zero vector were to be included, it would allow for an infinite number of eigenvalues, since any value of λ satisfies A0=λ0.

If we have an eigenvalue λ of a matrix A, together with a corresponding eigenvector, x, then any multiple of x is also an eigenvector for the same eigenvalue. To see that kx is also an eigenvector, follow this argument: If Ax=λx, then A(kx)=kAx=kλx=λ(kx). (Here k may be any scalar.) Thus, every multiple of an eigenvector is also an eigenvector.

Note the asymmetry here: eigenvalues are unique, while an eigenvalue has many eigenvectors. </gallery> </gallery> </gallery> ~~Bold text~~Æə=== Finding eigenvalues and eigenvectors === Here are some examples of finding eigenvalues and eigenvectors using our definitions.

Let

A={\begin{pmatrix}3&0\\-1&2\end{pmatrix}}

Firstly, we expand |A-λI|=0 to find the eigenvalues:

\left|{\begin{pmatrix}3&0\\-1&2\end{pmatrix}}-{\begin{pmatrix}\lambda &0\\0&\lambda \end{pmatrix}}\right|=0

{\begin{vmatrix}3-\lambda &0\\-1&2-\lambda \end{vmatrix}}=0

(3-\lambda )(2-\lambda )-(0)(-1)=0

(3-\lambda )(2-\lambda )=0

Now, elementary algebra tells us the roots of this equation are 3 and 2, and thus these are our eigenvalues.

(Exercise: prove that in a 2×2 triangular matrix the eigenvalues are on the principal diagonal. Harder: generalize this result)

Now we can find our eigenvectors. Consider the first eigenvalue λ=3. To find our first eigenvector

{\mbox{ker}}(A-3I)={\mbox{ker}}{\begin{pmatrix}3-3&0\\-1&2-3\end{pmatrix}}={\mbox{ker}}{\begin{pmatrix}0&0\\-1&-1\end{pmatrix}}

At this point we can row-reduce and back-substitute, but usually it suffices to guess the kernel since our matrix is small and we have linearly dependent columns. Now, observe:

{\begin{pmatrix}0&0\\-1&-1\end{pmatrix}}{\begin{pmatrix}a\\-a\end{pmatrix}}=\mathbf {0}

So, for any scalar a, the vector

{\begin{pmatrix}a\\-a\end{pmatrix}}

is an eigenvector. Stated another way, the set of all eigenvectors of the matrix A includes the set

{\mbox{span}}\{{\begin{pmatrix}1\\-1\end{pmatrix}}\}

. In the plane, this represents a line of slope -1 through the origin.

As noted above the eigenvalues of a matrix are uniquely determined, but for each eigenvalue there are many eigenvectors. We usually choose an eigenvector for some convenience such as "most whole number entries", "first entry is 1", or "length of the eigenvector is 1". Most Computer Algebra Systems choose unit vectors for eigenvectors.

So here we may take ${\begin{pmatrix}1\\-1\end{pmatrix}}$ to be the eigenvector, for example.

Similarly for our second eigenvalue λ=2, to find our second eigenvector:

{\mbox{ker}}(A-2I)={\mbox{ker}}{\begin{pmatrix}1&0\\-1&0\end{pmatrix}}={\mbox{span}}\{{\begin{pmatrix}0\\1\end{pmatrix}}\}=\mathbf {0}

And so, our second eigenvector is chosen as

{\begin{pmatrix}0\\1\end{pmatrix}}.

Our eigenvalues then are λ=2,3, with eigenvectors ${\begin{pmatrix}1\\-1\end{pmatrix}},{\begin{pmatrix}0\\1\end{pmatrix}}$ , as may be checked by multiplying each by the given matrix.

(We also could choose ${\begin{pmatrix}1/{\sqrt {(}}2)\\-1/{\sqrt {(}}2)\end{pmatrix}}$ as an eigenvector for the eigenvalue λ=3 . Check this.)

Problem set

Given the above, find the eigenvalues and eigenvectors of the following matrices (Answers follow to even-numbered questions):

${\begin{pmatrix}3&0\\-4&5\end{pmatrix}}$
${\begin{pmatrix}1&1\\3&-1\end{pmatrix}}$
${\begin{pmatrix}-2&0&3\\2&4&0\\1&0&0\end{pmatrix}}$

(Harder. Hint: one eigenvalue is 4.)

Answers

eigenvalues: 3, 5; eigenvectors: ${\begin{pmatrix}1\\2\end{pmatrix}},{\begin{pmatrix}0\\1\end{pmatrix}}$
eigenvalues: -2, 2; eigenvectors: ${\begin{pmatrix}-1\\3\end{pmatrix}},{\begin{pmatrix}1\\1\end{pmatrix}}$
eigenvalues: -3, 1, 4; eigenvectors: ${\begin{pmatrix}21\\-6\\-7\end{pmatrix}},{\begin{pmatrix}3\\-2\\3\end{pmatrix}},{\begin{pmatrix}0\\1\\0\end{pmatrix}}$

Applications

Eigenvalues and eigenvectors are not mere pretty facts about these vectors; they have relevant and important applications.

Matrix powers

Let us first examine a certain class of matrices known as diagonal matrices: these are matrices in the form

{\begin{pmatrix}a_{0}&0&0&\ldots &0\\0&a_{1}&0&\ldots &0\\0&0&a_{2}&\ldots &0\\0&0&0&\ldots &a_{k}\end{pmatrix}}

Now, observe that

{\begin{pmatrix}a_{0}&0&0&\ldots &0\\0&a_{1}&0&\ldots &0\\0&0&a_{2}&\ldots &0\\0&0&0&\ldots &a_{k}\end{pmatrix}}^{k}={\begin{pmatrix}a_{0}^{k}&0&0&\ldots &0\\0&a_{1}^{k}&0&\ldots &0\\0&0&a_{2}^{k}&\ldots &0\\0&0&0&\ldots &a_{k}^{k}\end{pmatrix}}

This is a useful property! However, the number of matrices to which we can apply this fact is clearly limited, so we ask ourselves whether we can transform a given matrix into a diagonal matrix.

The answer to this question is "sometimes", but for the moment, we will only look at matrices for which this answer is "yes".

What we seek is a matrix P such that

PAP^-1=D

where D is diagonal.

If such a matrix P exists, we say that A is diagonalizable. (Note that xyx^-1 is often called a similarity transformation).

Then

PAP^-1=D

AP^-1=P^-1D

by multiplying throughout forward by P^-1, then

A=P^-1DP

by multiplying backward by P.

Now, we have

A^k=(P^-1DP)^k

=(P^-1DP)(P^-1DP)(P^-1DP)... (k times)

=P^-1D(PP^-1)D(PP^-1)DP... (k times)

The PP^-1 terms cancel to give

=P^-1DDD...P (k times)

=P^-1D^kP

We can calculate D^k easily, so we need to find P.

It turns out (the entire proof is quite difficult) that we simply create a matrix from concatenating the linearly independent eigenvectors to create P.

D, then, is the diagonal matrix containing the eigenvalues on the main diagonal corresponding to the associated eigenvectors (the eigenvalue in the first place corresponds to the eigenvector it is created from, in the first column).

Example

Let's work through an example to show these ideas.

A={\begin{pmatrix}3&1\\4&0\\\end{pmatrix}}

So what do we do if we want to find A¹⁴? Let's use the method we've just described.

Find the eigenvalues:

|A-λI|=0

(3-λ)(-λ)-4=0

λ²-3λ-4=0

λ=-1, 4

Find the eigenvectors:

for λ=-1

{\mbox{ker}}{\begin{pmatrix}4&1\\4&1\\\end{pmatrix}}={\mbox{span}}\{{\begin{pmatrix}-1\\4\end{pmatrix}}\}

for λ=4

{\mbox{ker}}{\begin{pmatrix}-1&1\\4&-4\\\end{pmatrix}}={\mbox{span}}\{{\begin{pmatrix}1\\1\end{pmatrix}}\}

The eigenvectors are then

{\begin{pmatrix}-1\\4\end{pmatrix}},{\begin{pmatrix}1\\1\end{pmatrix}}

so put the eigenvectors together to form the matrix P

P={\begin{pmatrix}-1&1\\4&1\end{pmatrix}}

Now -1 generated the eigenvector in the first column, and 4 generated the eigenvector in the second column, so form D in this way:

D={\begin{pmatrix}-1&0\\0&4\\\end{pmatrix}}

We can easily calculate (-1)¹⁴=1, so we get

D^{14}={\begin{pmatrix}1&0\\0&4^{14}\\\end{pmatrix}}

and we have the fast method for creating inverses of 2×2 matrices:

P^{-1}=-{\frac {1}{5}}{\begin{pmatrix}1&-1\\-4&-1\end{pmatrix}}

So now we can now directly multiply out

-{\frac {1}{5}}{\begin{pmatrix}1&-1\\-4&-1\end{pmatrix}}{\begin{pmatrix}1&0\\0&4^{14}\end{pmatrix}}{\begin{pmatrix}-1&1\\4&1\end{pmatrix}}

Simplifying we get

{\begin{pmatrix}214748365&53687091\\214748364&53687092\end{pmatrix}}