For matrices that resist diagonalization, we develop the Jordan normal form as the next-best canonical structure. After introducing Jordan blocks and generalized eigenspaces, we state the existence and uniqueness theorem, explore the connection with the minimal polynomial, and apply the theory to compute the matrix exponential.
Not every matrix is diagonalizable. For instance, A=(2012) has eigenvalue 2 with algebraic multiplicity 2 but geometric multiplicity 1, so it cannot be diagonalized.
Nevertheless, even for such matrices there exists a simplest possible form to which they can be reduced. That form is the Jordan normal form.
2 Jordan Blocks
Definition 1 (Jordan block).
For λ∈K and a positive integer k, the k×k matrix
Jk(λ)=λO1λ1⋱O1λ∈Mk(K)
is called a Jordan block of size k with eigenvalue λ. Its diagonal entries are all λ, the superdiagonal entries are all 1, and every other entry is 0.
A 1×1 Jordan block J1(λ)=(λ) is just a diagonal entry. A diagonal matrix is therefore the special case in which every Jordan block has size 1.
3 Generalized Eigenspaces
Definition 4 (Generalized eigenspace).
Let λ be an eigenvalue of A with algebraic multiplicity m. The subspace
Wλ=ker(A−λI)m
is called the generalized eigenspace of λ.
Theorem 5.
The generalized eigenspace satisfies dimWλ=m (equal to the algebraic multiplicity).
Theorem 6.
If the distinct eigenvalues of A are λ1,…,λs with algebraic multiplicities m1,…,ms, then the whole space decomposes as a direct sum of generalized eigenspaces:
Kn=Wλ1⊕Wλ2⊕⋯⊕Wλs.
4 Existence and Uniqueness of the Jordan Normal Form
Theorem 7 (Jordan normal form).
Let K be an algebraically closed field (in particular, K=C). For any n×n matrix A, there exists an invertible matrix P such that
P−1AP=Jk1(λ1)OJk2(λ2)O⋱Jkr(λr).
The right-hand side is called the Jordan normal form of A. Up to the ordering of the Jordan blocks, it is uniquely determined by A.
Example 8.
Suppose a 4×4 matrix A has characteristic polynomial (λ−1)2(λ−3)2, giving eigenvalues λ=1 (algebraic multiplicity 2) and λ=3 (algebraic multiplicity 2).If dimV1=1 and dimV3=1, the Jordan normal form is
J=1000110000300013.
If instead dimV1=2 and dimV3=1, the Jordan normal form becomes
J=1000010000300013.
5 The Minimal Polynomial
Definition 9 (Minimal polynomial).
The minimal polynomialmA(λ) of a matrix A is the monic polynomial of least degree satisfying mA(A)=O.
Theorem 10.
The minimal polynomial divides the characteristic polynomial.
The minimal polynomial and the characteristic polynomial share the same roots (eigenvalues).
A is diagonalizable if and only if mA(λ) has no repeated roots.
Theorem 11.
The minimal polynomial is determined by the Jordan normal form as follows. If the largest Jordan block corresponding to each eigenvalue λi has size ki, then
mA(λ)=i=1∏s(λ−λi)ki.
Example 12.
Consider the Jordan matrix J=2000120000200003. The largest Jordan block for λ=2 is 2×2, and for λ=3 it is 1×1. Hence mA(λ)=(λ−2)2(λ−3), while the characteristic polynomial is pA(λ)=(λ−2)3(λ−3).
6 The Matrix Exponential
Definition 13 (Matrix exponential).
For an n×n matrix A, the matrix exponential is defined by the convergent power series
eA=k=0∑∞k!Ak=I+A+2!A2+3!A3+⋯
Theorem 14.
The matrix exponential of a Jordan block is given by
Decompose the Jordan block as Jk(λ)=λI+N, where N is the nilpotent matrix with ones on the superdiagonal and zeros elsewhere. Since Nk=O, the matrix N is nilpotent of index k. Because λI and N commute, we have etJk(λ)=eλtIetN=eλtetN. The series for etN terminates after k terms because Nk=O, producing the stated upper triangular matrix.□
Example 15.
The system of linear ordinary differential equations x′(t)=Ax(t) has the solution x(t)=etAx(0). When A=PJP−1, we compute etA=PetJP−1.For A=(2012)=J2(2), we obtain
etA=e2t(10t1).
With initial condition x(0)=(11), the solution is x(t)=e2t(1+t1).
Remark 16.
The Jordan normal form is a natural generalization of diagonalization. Where a diagonal matrix represents independent scalings along each coordinate axis, a Jordan block captures scaling together with an infinitesimal shearing. This shearing manifests in the matrix exponential etJk(λ) as polynomial factors in t multiplying the exponential eλt— the hallmark of the interaction between exponential growth and algebraic (polynomial) growth that governs the behavior of non-diagonalizable linear systems.