Given an endomorphism T on a finite-dimensional vector space V over a field F, let IT be the set defined as
where F[t ] is the space of all polynomials over the field F. IT is a proper ideal of F[t ]. Since F is a field, F[t ] is a principal ideal domain, thus any ideal is generated by a single polynomial, which is unique up to a unit in F. A particular choice among the generators can be made, since precisely one of the generators is monic. The minimal polynomial is thus defined to be the monic polynomial that generates IT. It is the monic polynomial of least degree in IT.
An endomorphism φ of a finite-dimensional vector space over a field F is diagonalizable if and only if its minimal polynomial factors completely over F into distinct linear factors. The fact that there is only one factor X − λ for every eigenvalue λ means that the generalized eigenspace for λ is the same as the eigenspace for λ: every Jordan block has size 1. More generally, if φ satisfies a polynomial equation P(φ) = 0 where P factors into distinct linear factors over F, then it will be diagonalizable: its minimal polynomial is a divisor of P and therefore also factors into distinct linear factors. In particular one has:
These cases can also be proved directly, but the minimal polynomial gives a unified perspective and proof.
For a nonzero vector v in V define:
This definition satisfies the properties of a proper ideal. Let μT,v be the monic polynomial which generates it.
and for these coefficients one has
Define T to be the endomorphism of R3 with matrix, on the canonical basis,
Taking the first canonical basis vector e1 and its repeated images by T one obtains
of which the first three are easily seen to be linearly independent, and therefore span all of R3. The last one then necessarily is a linear combination of the first three, in fact
so that:
This is in fact also the minimal polynomial μT and the characteristic polynomial χT : indeed μT, e1 divides μT which divides χT, and since the first and last are of degree 3 and all are monic, they must all be the same. Another reason is that in general if any polynomial in T annihilates a vector v, then it also annihilates T ⋅v (just apply T to the equation that says that it annihilates v), and therefore by iteration it annihilates the entire space generated by the iterated images by T of v; in the current case we have seen that for v = e1 that space is all of R3, so μT, e1(T ) = 0. Indeed one verifies for the full matrix that T 3 + 4T 2 + T − I3 is the zero matrix: