Linear Algebra/Jordan Canonical Form
This subsection moves from the canonical form for nilpotent matrices to the one for all matrices.
We have shown that if a map is nilpotent then all of its eigenvalues are zero. We can now prove the converse.
- Lemma 2.1
A linear transformation whose only eigenvalue is zero is nilpotent.
- Proof
If a transformation t on an n-dimensional space has only the single eigenvalue of zero then its characteristic polynomial is xn. The Cayley-Hamilton Theorem says that a map satisfies its characteristic polynimial so tn is the zero map. Thus t is nilpotent.
We have a canonical form for nilpotent matrices, that is, for each matrix whose single eigenvalue is zero: each such matrix is similar to one that is all zeroes except for blocks of subdiagonal ones. (To make this representation unique we can fix some arrangement of the blocks, say, from longest to shortest.) We next extend this to all single-eigenvalue matrices.
Observe that if t's only eigenvalue is λ then t − λ's only eigenvalue is 0 because
if and only if
. The natural way to extend the results for nilpotent matrices is to represent t − λ in the canonical form N, and try to use that to get a simple representation T for t. The next result says that this try works.
- Lemma 2.2
If the matrices T − λI and N are similar then T and N + λI are also similar, via the same change of basis matrices.
- Proof
With N = P(T − λI)P − 1 = PTP − 1 − P(λI)P − 1 we have N = PTP − 1 − PP − 1(λI) since the diagonal matrix λI commutes with anything, and so N = PTP − 1 − λI. Therefore N + λI = PTP − 1, as required.
- Example 2.3
The characteristic polynomial of
is (x − 3)2 and so T has only the single eigenvalue 3. Thus for
the only eigenvalue is 0, and T − 3I is nilpotent. The null spaces are routine to find; to ease this computation we take T to represent the transformation
with respect to the standard basis (we shall maintain this convention for the rest of the chapter).
The dimensions of these null spaces show that the action of an associated map t − 3 on a string basis is
. Thus, the canonical form for t − 3 with one choice for a string basis is
and by Lemma 2.2, T is similar to this matrix.
We can produce the similarity computation. Recall from the Nilpotence section how to find the change of basis matrices P and P − 1 to express N as P(T − 3I)P − 1. The similarity diagram
describes that to move from the lower left to the upper left we multiply by
and to move from the upper right to the lower right we multiply by this matrix.
So the similarity is expressed by
which is easily checked.
- Example 2.4
This matrix has characteristic polynomial (x − 4)4
and so has the single eigenvalue 4. The nullities of t − 4 are: the null space of t − 4 has dimension two, the null space of (t − 4)2 has dimension three, and the null space of (t − 4)3 has dimension four. Thus, t − 4 has the action on a string basis of
and
. This gives the canonical form N for t − 4, which in turn gives the form for t.
An array that is all zeroes, except for some number λ down the diagonal and blocks of subdiagonal ones, is a Jordan block. We have shown that Jordan block matrices are canonical representatives of the similarity classes of single-eigenvalue matrices.
- Example 2.5
The
matrices whose only eigenvalue is 1 / 2 separate into three similarity classes. The three classes have these canonical representatives.
In particular, this matrix
belongs to the similarity class represented by the middle one, because we have adopted the convention of ordering the blocks of subdiagonal ones from the longest block to the shortest.
We will now finish the program of this chapter by extending this work to cover maps and matrices with multiple eigenvalues. The best possibility for general maps and matrices would be if we could break them into a part involving their first eigenvalue λ1 (which we represent using its Jordan block), a part with λ2, etc.
This ideal is in fact what happens. For any transformation
, we shall break the space V into the direct sum of a part on which t − λ1 is nilpotent, plus a part on which t − λ2 is nilpotent, etc. More precisely, we shall take three steps to get to this section's major theorem and the third step shows that
where
are t's eigenvalues.
Suppose that
is a linear transformation. Note that the restriction[1] of t to a subspace M need not be a linear transformation on M because there may be an
with
. To ensure that the restriction of a transformation to a "part" of a space is a transformation on the partwe need the next condition.
- Definition 2.6
Let
be a transformation. A subspace M is t invariant if whenever
then
(shorter:
).
Two examples are that the generalized null space
and the generalized range space
of any transformation t are invariant. For the generalized null space, if
then
where n is the dimension of the underlying space and so
because
is zero also. For the generalized range space, if
then
for some
and then
shows that
is also a member of
.
Thus the spaces
and
are t − λi invariant. Observe also that t − λi is nilpotent on
because, simply, if
has the property that some power of t − λi maps it to zero— that is, if it is in the generalized null space— then some power of t − λi maps it to zero. The generalized null space
is a "part" of the space on which the action of t − λi is easy to understand.
The next result is the first of our three steps. It establishes that t − λj leaves t − λi's part unchanged.
- Lemma 2.7
A subspace is t invariant if and only if it is t − λ invariant for any scalar λ. In particular, where λi is an eigenvalue of a linear transformation t, then for any other eigenvalue λj, the spaces
and
are t − λj invariant.
- Proof
For the first sentence we check the two implications of the "if and only if" separately. One of them is easy: if the subspace is t − λ invariant for any λ then taking λ = 0 shows that it is t invariant. For the other implication suppose that the subspace is t invariant, so that if
then
, and let λ be any scalar. The subspace M is closed under linear combinations and so if
then
. Thus if
then
, as required.
The second sentence follows straight from the first. Because the two spaces are t − λi invariant, they are therefore t invariant. From this, applying the first sentence again, we conclude that they are also t − λj invariant.
The second step of the three that we will take to prove this section's major result makes use of an additional property of
and
, that they are complementary. Recall that if a space is the direct sum of two others
then any vector
in the space breaks into two parts
where
and
, and recall also that if
and
are bases for
and
then the concatenation
is linearly independent (and so the two parts of
do not "overlap"). The next result says that for any subspaces
and
that are complementary as well as t invariant, the action of t on
breaks into the "non-overlapping" actions of t on
and on
.
- Lemma 2.8
Let
be a transformation and let
and
be t invariant complementary subspaces of V. Then t can be represented by a matrix with blocks of square submatrices T1 and T2
where Z1 and Z2 are blocks of zeroes.
- Proof
Since the two subspaces are complementary, the concatenation of a basis for
and a basis for
makes a basis
for V. We shall show that the matrix
has the desired form.
Any vector
is in
if and only if its final q components are zeroes when it is represented with respect to B. As
is t invariant, each of the vectors
, ...,
has that form. Hence the lower left of RepB,B(t) is all zeroes.
The argument for the upper right is similar.
To see that t has been decomposed into its action on the parts, observe that the restrictions of t to the subspaces
and
are represented, with respect to the obvious bases, by the matrices T1 and T2. So, with subspaces that are invariant and complementary, we can split the problem of examining a linear transformation into two lower-dimensional subproblems. The next result illustrates this decomposition into blocks.
- Lemma 2.9
If T is a matrices with square submatrices T1 and T2
where the Z's are blocks of zeroes, then
.
- Proof
Suppose that T is
, that T1 is
, and that T2 is
. In the permutation formula for the determinant
each term comes from a rearrangement of the column numbers
into a new order
. The upper right block Z2 is all zeroes, so if a φ has at least one of
among its first p column numbers
then the term arising from φ is zero, e.g., if ϕ(1) = n then
.
So the above formula reduces to a sum over all permutations with two halves: any significant ϕ is the composition of a ϕ1 that rearranges only
and a ϕ2 that rearranges only
. Now, the distributive law (and the fact that the signum of a composition is the product of the signums) gives that this
equals
.
- Example 2.10
From Lemma 2.9 we conclude that if two subspaces are complementary and t invariant then t is nonsingular if and only if its restrictions to both subspaces are nonsingular.
Now for the promised third, final, step to the main result.
- Lemma 2.11
If a linear transformation
has the characteristic polynomial
then (1)
and (2)
.
- Proof
Because dim(V) is the degree
of the characteristic polynomial, to establish statement (1) we need only show that statement (2) holds and that
is trivial whenever
.
For the latter, by Lemma 2.7, both
and
are t invariant. Notice that an intersection of t invariant subspaces is t invariant and so the restriction of t to
is a linear transformation. But both t − λi and t − λj are nilpotent on this subspace and so if t has any eigenvalues on the intersection then its "only" eigenvalue is both λi and λj. That cannot be, so this restriction has no eigenvalues:
is trivial (Lemma V.II.3.10 shows that the only transformation without any eigenvalues is on the trivial space).
To prove statement (2), fix the index i. Decompose V as 
and apply Lemma 2.8.
By Lemma 2.9,
. By the uniqueness clause of the Fundamental Theorem of Arithmetic, the determinants of the blocks have the same factors as the characteristic polynomial
and
, and the sum of the powers of these factors is the power of the factor in the characteristic polynomial: q1 + r1 = p1, ...,
. Statement (2) will be proved if we will show that qi = pi and that qj = 0 for all
, because then the degree of the polynomial
— which equals the dimension of the generalized null space— is as required.
For that, first, as the restriction of t − λi to
is nilpotent on that space, the only eigenvalue of t on it is λi. Thus the characteristic equation of t on
is
. And thus qj = 0 for all
.
Now consider the restriction of t to
. By Note V.III.2.2, the map t − λi is nonsingular on
and so λi is not an eigenvalue of t on that subspace. Therefore, x − λi is not a factor of
, and so qi = pi.
Our major result just translates those steps into matrix terms.
- Theorem 2.12
Any square matrix is similar to one in Jordan form
where each Jλ is the Jordan block associated with the eigenvalue λ of the original matrix (that is, is all zeroes except for λ's down the diagonal and some subdiagonal ones).
- Proof
Given an
matrix T, consider the linear map
that it represents with respect to the standard bases. Use the prior lemma to write
where
are the eigenvalues of t. Because each
is t invariant, Lemma 2.8 and the prior lemma show that t is represented by a matrix that is all zeroes except for square blocks along the diagonal. To make those blocks into Jordan blocks, pick each
to be a string basis for the action of t − λi on
.
Jordan form is a canonical form for similarity classes of square matrices, provided that we make it unique by arranging the Jordan blocks from least eigenvalue to greatest and then arranging the subdiagonal 1 blocks inside each Jordan block from longest to shortest.
- Example 2.13
This matrix has the characteristic polynomial (x − 2)2(x − 6).
We will handle the eigenvalues 2 and 6 separately.
Computation of the powers, and the null spaces and nullities, of T − 2I is routine. (Recall from Example 2.3 the convention of taking T to represent a transformation, here
, with respect to the standard basis.)

So the generalized null space
has dimension two. We've noted that the restriction of t − 2 is nilpotent on this subspace. From the way that the nullities grow we know that the action of t − 2 on a string basis
. Thus the restriction can be represented in the canonical form
where many choices of basis are possible. Consequently, the action of the restriction of t to
is represented by this matrix.
The second eigenvalue's computations are easier. Because the power of x − 6 in the characteristic polynomial is one, the restriction of t − 6 to
must be nilpotent of index one. Its action on a string basis must be
and since it is the zero map, its canonical form N6 is the
zero matrix. Consequently, the canonical form J6 for the action of t on
is the
matrix with the single entry 6. For the basis we can use any nonzero vector from the generalized null space.
Taken together, these two give that the Jordan form of T is
where B is the concatenation of B2 and B6.
- Example 2.14
Contrast the prior example with
which has the same characteristic polynomial (x − 2)2(x − 6).
While the characteristic polynomial is the same,

here the action of t − 2 is stable after only one application— the restriction of of t − 2 to
is nilpotent of index only one. (So the contrast with the prior example is that while the characteristic polynomial tells us to look at the action of the t − 2 on its generalized null space, the characteristic polynomial does not describe completely its action and we must do some computations to find, in this example, that the minimal polynomial is (x − 2)(x − 6).) The restriction of t − 2 to the generalized null space acts on a string basis as
and
, and we get this Jordan block associated with the eigenvalue 2.
For the other eigenvalue, the arguments for the second eigenvalue of the prior example apply again. The restriction of t − 6 to
is nilpotent of index one (it can't be of index less than one, and since x − 6 is a factor of the characteristic polynomial to the power one it can't be of index more than one either). Thus t − 6's canonical form N6 is the
zero matrix, and the associated Jordan block J6 is the
matrix with entry 6.
Therefore, T is diagonalizable.
(Checking that the third vector in B is in the nullspace of t − 6 is routine.)
- Example 2.15
A bit of computing with
shows that its characteristic polynomial is (x − 3)3(x + 1)2. This table

shows that the restriction of t − 3 to
acts on a string basis via the two strings
and
.
A similar calculation for the other eigenvalue

shows that the restriction of t + 1 to its generalized null space acts on a string basis via the two separate strings
and
.
Therefore T is similar to this Jordan form matrix.
We close with the statement that the subjects considered earlier in this Chpater are indeed, in this sense, exhaustive.
- Corollary 2.16
Every square matrix is similar to the sum of a diagonal matrix and a nilpotent matrix.
[edit] Exercises
- Problem 1
Do the check for Example 2.3.
- Problem 2
Each matrix is in Jordan form. State its characteristic polynomial and its minimal polynomial.
- This exercise is recommended for all readers.
- Problem 3
Find the Jordan form from the given data.
- The matrix T is
with the single eigenvalue 3. The nullities of the powers are: T − 3I has nullity two, (T − 3I)2 has nullity three, (T − 3I)3 has nullity four, and (T − 3I)4 has nullity five. - The matrix S is
with two eigenvalues. For the eigenvalue 2 the nullities are: S − 2I has nullity two, and (S − 2I)2 has nullity four. For the eigenvalue − 1 the nullities are: S + 1I has nullity one.
- Problem 4
Find the change of basis matrices for each example.
- This exercise is recommended for all readers.
- Problem 5
Find the Jordan form and a Jordan basis for each matrix.
- This exercise is recommended for all readers.
- Problem 6
Find all possible Jordan forms of a transformation with characteristic polynomial (x − 1)2(x + 2)2.
- Problem 7
Find all possible Jordan forms of a transformation with characteristic polynomial (x − 1)3(x + 2).
- This exercise is recommended for all readers.
- Problem 8
Find all possible Jordan forms of a transformation with characteristic polynomial (x − 2)3(x + 1) and minimal polynomial (x − 2)2(x + 1).
- Problem 9
Find all possible Jordan forms of a transformation with characteristic polynomial (x − 2)4(x + 1) and minimal polynomial (x − 2)2(x + 1).
- This exercise is recommended for all readers.
- Problem 10
- Diagonalize these.
- This exercise is recommended for all readers.
- Problem 11
Find the Jordan matrix representing the differentiation operator on
.
- This exercise is recommended for all readers.
- Problem 12
Decide if these two are similar.
- Problem 13
Find the Jordan form of this matrix.
Also give a Jordan basis.
- Problem 14
How many similarity classes are there for
matrices whose only eigenvalues are − 3 and 4?
- This exercise is recommended for all readers.
- Problem 15
Prove that a matrix is diagonalizable if and only if its minimal polynomial has only linear factors.
- Problem 16
Give an example of a linear transformation on a vector space that has no non-trivial invariant subspaces.
- Problem 17
Show that a subspace is t − λ1 invariant if and only if it is t − λ2 invariant.
- Problem 18
Prove or disprove: two
matrices are similar if and only if they have the same characteristic and minimal polynomials.
- Problem 19
The trace of a square matrix is the sum of its diagonal entries.
- Find the formula for the characteristic polynomial of a
matrix. - Show that trace is invariant under similarity, and so we can sensibly speak of the "trace of a map". (Hint: see the prior item.)
- Is trace invariant under matrix equivalence?
- Show that the trace of a map is the sum of its eigenvalues (counting multiplicities).
- Show that the trace of a nilpotent map is zero. Does the converse hold?
- Problem 20
To use Definition 2.6 to check whether a subspace is t invariant, we seemingly have to check all of the infinitely many vectors in a (nontrivial) subspace to see if they satisfy the condition. Prove that a subspace is t invariant if and only if its subbasis has the property that for all of its elements,
is in the subspace.
- This exercise is recommended for all readers.
- Problem 21
Is t invariance preserved under intersection? Under union? Complementation? Sums of subspaces?
- Problem 22
Give a way to order the Jordan blocks if some of the eigenvalues are complex numbers. That is, suggest a reasonable ordering for the complex numbers.
- Problem 23
Let
be the vector space over the reals of degree j polynomials. Show that if
then
is an invariant subspace of
under the differentiation operator. In
, does any of
, ...,
have an invariant complement?
- Problem 24
In
, the vector space (over the reals) of degree n polynomials,
and
are the even and the odd polynomials; p(x) = x2 is even while p(x) = x3 is odd. Show that they are subspaces. Are they complementary? Are they invariant under the differentiation transformation?
- Problem 25
Lemma 2.8 says that if M and N are invariant complements then t has a representation in the given block form (with respect to the same ending as starting basis, of course). Does the implication reverse?
- Problem 26
A matrix S is the square root of another T if S2 = T. Show that any nonsingular matrix has a square root.
[edit] Footnotes
- ↑ More information on restrictions of functions is in the appendix.
This page may need to be 















![\cdot
\bigg(\sum_{\begin{array}{c}\\[-19pt]
\scriptstyle\text{perms }\phi_2 \\[-5pt]
\scriptstyle\text{of } p+1,\dots,p+q
\end{array}}
\!\!\! t_{p+1,\phi_2(p+1)}\cdots t_{p+q,\phi_2(p+q)}\sgn(\phi_2)
\bigg)](http://upload.wikimedia.org/wikibooks/en/math/f/0/e/f0ed33897b448ab5098ba44dfda62196.png)


































