Table of Contents
Last modified on December 9th, 2024
The characteristic polynomial of a square matrix A is a polynomial function f(λ) defined as:
f(λ) = det(A – λIn)
Here,
The characteristic polynomial is primarily used to find the eigenvalues of a matrix as its roots correspond directly to the eigenvalues of the given matrix.
Let us find the characteristic polynomial of the matrix ${A=\begin{pmatrix} 2 & 1 \\ 1 & 3 \end{pmatrix}}$
Using the formula, we have
f(λ) = det(A – λI2)
Here, (A – λI2)
= ${\begin{pmatrix} 2 & 1 \\ 1 & 3 \end{pmatrix}-\lambda \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}}$
= ${\begin{pmatrix} 2 & 1 \\ 1 & 3 \end{pmatrix}-\begin{pmatrix} \lambda & 0 \\ 0 & \lambda \end{pmatrix}}$
= ${\begin{pmatrix} 2-\lambda & 1 \\ 1 & 3-\lambda \end{pmatrix}}$
Now, det(A – λI2)
= ${\det \begin{pmatrix} 2-\lambda & 1 \\ 1 & 3-\lambda \end{pmatrix}}$
= ${\begin{vmatrix} 2-\lambda & 1 \\ 1 & 3-\lambda \end{vmatrix}}$
= (2 – λ)(3 – λ) – (1)(1)
= 6 – 5λ + λ2 – 1
= λ2 – 5λ +5
Thus, the characteristic polynomial is f(λ) = λ2 – 5λ +5
Let us find the characteristic polynomial of the matrix ${A=\begin{pmatrix} 4 & 2 & 0 \\ 2 & 4 & 2 \\ 0 & 2 & 4 \end{pmatrix}}$
As we know, the formula of the characteristic polynomial is f(λ) = det(A – λI2)
Given,
${A=\begin{pmatrix} 4 & 2 & 0 \\ 2 & 4 & 2 \\ 0 & 2 & 4 \end{pmatrix}}$
${I=\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}}$
Here, (A – λI2)
= ${\begin{pmatrix} 4 & 2 & 0 \\ 2 & 4 & 2 \\ 0 & 2 & 4 \end{pmatrix}-\lambda \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}}$
= ${\begin{pmatrix} 4 & 2 & 0 \\ 2 & 4 & 2 \\ 0 & 2 & 4 \end{pmatrix}-\begin{pmatrix} \lambda & 0 & 0 \\ 0 & \lambda & 0 \\ 0 & 0 & \lambda \end{pmatrix}}$
= ${\begin{pmatrix} 4-\lambda & 2 & 0 \\ 2 & 4-\lambda & 2 \\ 0 & 2 & 4-\lambda \end{pmatrix}}$
Now, det(A – λI2)
= ${\det \begin{pmatrix} 4-\lambda & 2 & 0 \\ 2 & 4-\lambda & 2 \\ 0 & 2 & 4-\lambda \end{pmatrix}}$
= ${\left( 4-\lambda \right) \begin{vmatrix} 4-\lambda & 2 \\ 2 & 4-\lambda \end{vmatrix}-2\begin{vmatrix} 2 & 2 \\ 0 & 4-\lambda \end{vmatrix}}$
= (4 – λ)[(4 – λ)(4 – λ) – (2)(2)] – 2[(2)(4 – λ) – (0)(2)]
= (4 – λ)(16 – 4λ – 4λ + λ2 – 4) – 2(8 – 2λ)
= (4 – λ)(λ2 – 8λ + 12) – 16 + 4λ
= 4λ2 – 32λ + 48 – λ3 + 8λ2 – 12λ – 16 + 4λ
= -λ3 + 12λ2 – 40λ + 32
Thus, the characteristic polynomial is f(λ) = -λ3 + 12λ2 – 40λ + 32
The roots of the characteristic polynomial are the eigenvalues of the matrix. To find the eigenvalues of a square matrix, we follow the theorem given below:
Statement
If A is an n × n matrix, and f(λ) = det(A – λIn) is its characteristic polynomial, then a number λ0 is an eigenvalue of A if and only if f(λ0) = 0
Proof
As we know, the matrix equation (A – λ0In)x = 0 has a nontrivial solution if and only if det (A – λ0In) = 0 (by the matrix theorem)
Thus,
λ0 is an eigenvalue of the matrix A ⇔ (A – λ0In)x = 0 has a nontrivial solution
This means,
A – λ0In is not invertible
⇒ det(A – λ0In) = 0
⇒ f(λ0) = 0
Also, if f(λ0) = 0
⇒ det(A – λ0In) = 0, indicating A – λ0In is not invertible
⇒ (A – λ0In)x = 0 has a nontrivial solution
⇒ λ0 is an eigenvalue of A
Here, we observe that eigenvalues are the scaling factor of the matrix, and the linear transformation of this factor is known as the eigenvector.
Eigenvectors are the vector quantities that take the form Ax = λx, where λ is the eigenvalue.
Let us find the eigenvalues and eigenvectors of the matrix ${A=\begin{pmatrix} 4 & 1 \\ 2 & 3 \end{pmatrix}}$
As we know, the characteristic polynomial is given by
f(λ) = det(A – λI2)
A – λI2 = ${\begin{pmatrix} 4-\lambda & 1 \\ 2 & 3-\lambda \end{pmatrix}}$
The determinant is f(λ) = ${\det \begin{pmatrix} 4-\lambda & 1 \\ 2 & 3-\lambda \end{pmatrix}}$
= (4 – λ)(3 – λ) – (2)(1)
= 12 – 4λ – 3λ + λ2 – 2
= λ2 – 7λ + 10
Solve the quadratic equation λ2 – 7λ + 10 = 0 by the quadratic formula, we get
${\lambda =\dfrac{-\left( -7\right) \pm \sqrt{\left( -7\right) ^{2}-4\times 1\times 10}}{2\times 1}}$
= ${\dfrac{7\pm \sqrt{49-40}}{2}}$
= ${\dfrac{7\pm 3}{2}}$
Thus, the eigenvalues are λ1= 5 and λ2= 2
Now, for λ1= 5, (A – 5I2)x = 0
⇒ ${\begin{pmatrix} 4-5 & 1 \\ 2 & 3-5 \end{pmatrix}\begin{pmatrix} x_{1} \\ x_{2} \end{pmatrix}=0}$
⇒ ${\begin{pmatrix} -1 & 1 \\ 2 & -2 \end{pmatrix}\begin{pmatrix} x_{1} \\ x_{2} \end{pmatrix}=0}$
⇒ The system of equations are: -x1 + x2= 0 and 2x1 – 2x2 = 0
Thus, the eigenvector is: x1 = ${\begin{pmatrix} 1 \\ 1 \end{pmatrix}}$
Similarly, for λ2= 2, (A – 2I2)x = 0
⇒ ${\begin{pmatrix} 4-2 & 1 \\ 2 & 3-2 \end{pmatrix}\begin{pmatrix} x_{1} \\ x_{2} \end{pmatrix}=0}$
⇒ ${\begin{pmatrix} 2 & 1 \\ 2 & 1 \end{pmatrix}\begin{pmatrix} x_{1} \\ x_{2} \end{pmatrix}=0}$
⇒ The system of equations are: 2x1 + x2= 0 and 2x1 + x2 = 0
Thus, the eigenvector is: x2 = ${\begin{pmatrix} 1 \\ -2 \end{pmatrix}}$
Hence, the eigenvalues are λ1= 5 and λ2= 2 and their corresponding eigenvectors are x1 = ${\begin{pmatrix} 1 \\ 1 \end{pmatrix}}$ and x2 = ${\begin{pmatrix} 1 \\ -2 \end{pmatrix}}$ respectively.
To solve higher-degree characteristic polynomials, such as 3 × 3 or 4 × 4 Matrices, we use the rational root theorem.
Rational Root Theorem
If A is an n × n matrix having integer entries, then any rational root of the characteristic polynomial f(λ) must be an integer divisor of det(A).
This theorem provides a way to test for potential eigenvalues manually. Once a root is found, the polynomial can be reduced using polynomial long division to find other roots.
Let us find the eigenvalues of the matrix:
${B=\begin{pmatrix} 6 & 2 & 1 \\ -2 & 3 & 0 \\ 1 & 0 & -1 \end{pmatrix}}$
By expanding cofactors along the first row, the characteristic polynomial is
f(λ) = det(B – λI) = ${\det \begin{pmatrix} 6 & 2 & 1 \\ -2 & 3 & 0 \\ 1 & 0 & -1 \end{pmatrix}}$
Expanding along the first row,
f(λ) = ${\left( 6-\lambda \right) \det \begin{pmatrix} 3-\lambda & 0 \\ 0 & -1-\lambda \end{pmatrix}-2\det \begin{pmatrix} -2 & 0 \\ 1 & -1-\lambda \end{pmatrix}+1\det \begin{pmatrix} -2 & 3-\lambda \\ 1 & 0 \end{pmatrix}}$
= (6 – λ)[(3 – λ)(3 – λ) – (0)(0)] – 2[(-2)(-1 – λ) – (1)(0)] + 1[(-2)(0) – (1)(3 – λ)]
= (6 – λ)(λ2 – 2λ – 3) – 2(2 + 2λ) + (λ – 3)
= 6λ2 – 12λ – 18 – λ3 + 2λ2 + 3λ – 4 – 4λ + λ – 3
= -λ3 + 6λ2 + 2λ2 – 12λ + 3λ – 4λ + λ – 18 – 3
= -λ3 + 8λ2 – 14λ – 25
Now, the constant term of f(λ) is -25, and the leading coefficient is -1
By the rational root theorem, the possible integer roots are ±1, ±5, and ±25
At λ = -1, f(-1) = -(-1)3 + 8(-1)2 – 14(-1) – 25 = 1 + 8 + 14 – 25 = -2, not a root
At λ = 5, f(5) = (5)3 + 8(5)2 – 14(5) – 25 = -125 + 200 – 70 – 25 = 0
Thus, λ = 5 is a root.
By using polynomial long division to divide f(λ) = -λ3 + 8λ2 – 14λ – 25 by (λ – 5), we get
f(λ) = (λ – 5)(-λ2 + 3λ + 5)
Solving the quadratic -λ2 + 3λ + 5 = 0 using the quadratic formula, we get
${\lambda =\dfrac{-b\pm \sqrt{b^{2}-4ac}}{2a}}$
Here, a = -1, b = 3, and c = 5
Thus, ${\lambda =\dfrac{-3\pm \sqrt{3^{2}-4\left( -1\right) \left( 5\right) }}{2\left( -1\right) }}$
= ${\lambda =\dfrac{-3\pm \sqrt{9+20}}{-2}}$
= ${\lambda =\dfrac{-3\pm \sqrt{29}}{-2}}$
⇒ the eigenvalues are λ1 = ${\lambda =\dfrac{3-\sqrt{29}}{2}}$ and λ2 = ${\lambda =\dfrac{3+\sqrt{29}}{2}}$
To find the degree and coefficient of characteristic polynomials of a square matrix, we follow the theorem given below:
Statement
If A is an n × n matrix, and f(λ) = det(A – λIn) is its characteristic polynomial, then the degree of f(λ) is n and f(λ) takes the form of
f(λ) = (-1)nλn + (-1)n – 1Tr(A)λn – 1 + … + det(A)
This means,
The other coefficients are just numbers without names.
The trace of a square matrix A, denoted by Tr(A), is the number obtained by summing the diagonal entries of A:
${Tr\begin{pmatrix} a_{11} & a_{12} & \ldots & a_{1n} \\ a_{21} & a_{22} & \ldots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \ldots & a_{nn} \end{pmatrix}}$
= a11 + a22 + … + ann
Proof
Since f(0) = det(A – 0In) = det(A)
Thus, the constant term is always det(A)
Let us consider a 2 × 2 matrix as A = ${\begin{pmatrix} a & b \\ c & d \end{pmatrix}}$
Then, f(λ) = det(A – λI2)
⇒ f(λ) = ${\begin{pmatrix} a-\lambda & b \\ c & d-\lambda \end{pmatrix}}$
⇒ f(λ) = (a – λ)(d – λ) – bc
⇒ f(λ) = λ2 – (a+d)λ + (ad – bc)
⇒ f(λ) = λ2 – Tr(A)λ + det(A)
Thus, for n = 2, the characteristic polynomial can be obtained by
f(λ) = λ2 – Tr(A)λ + det(A)
⇒ f(λ) = (-1)nλn + (-1)n – 1Tr(A)λn – 1 + det(A), proved.
Problem: Finding the characteristic polynomial with TRACE and DETERMINANT
Find the characteristic polynomial of the matrix ${A=\begin{pmatrix} 3 & 2 \\ 4 & 5 \end{pmatrix}}$
As we know, the formula for the characteristic polynomial is
f(λ) = λ2 – Tr(A)λ + det(A)
Given,
${A=\begin{pmatrix} 3 & 2 \\ 4 & 5 \end{pmatrix}}$
Here,
Tr(A) = 3 + 5 = 8
det(A) = (3)(5) – (2)(4) = 15 – 8 = 7
Substituting into the formula,
f(λ) = λ2 – 8λ + 7
Statement
For a n × n matrix A, it satisfies its characteristic equation.
Mathematically, if the characteristic polynomial of A is:
f(λ) = (-1)n[λn + c1λn – 1 + c2λn – 2 + … + cn – 1λ + cn], then
f(A) = (-1)n[An + c1An – 1 + c2An – 2 + … + cn – 1A + cnIn] = 0
Here,
Proof
To prove this, we use the adjoint and determinant properties.
The cofactor expansion of a determinant is M ⋅ adj(M) = det(M) ⋅ I, where adj(M) is the transpose of the cofactor matrix.
By substituting M = A – λI,
(A – λI) ⋅ adj(A – λI) = det(A – λI) ⋅ I …..(i)
As we know, the characteristic polynomial is f(λ) = det(A – λI) …..(ii)
Then, (A – λI) ⋅ adj(A – λI) = p(λ) ⋅ I
Now, let us expand the adjoint matrix.
Since adj(A – λI) is a matrix polynomial of degree n – 1, it can be written as:
adj(A – λI) = B0 + B1λ + B2λ2 + ⋯ + Bn – 1λn – 1, where B0, B1, … , Bn – 1 are n × n matrices
Substituting adj(A – λI) into the equation, we get
(A – λI)(B0 + B1λ + B2λ2 + ⋯ + Bn – 1λn – 1) = f(λ) ⋅ I
Comparing the coefficients of powers of λ, we derive n + 1 equations for Bk, we get
From the above equations, substituting λ = A into f(λ), we get
f(A) = An + c1An – 1 + c2An – 2 + … + cn – 1A + cnIn = 0
Note: If we substitute λ = A directly into p(λ) = det(A – λI), this approach does not work, as P(A) is a matrix, not a scalar.
Verify the Cayley-Hamilton theorem for the matrix ${A=\begin{pmatrix} 2 & 1 \\ 3 & 4 \end{pmatrix}}$
Given,
${A=\begin{pmatrix} 2 & 1 \\ 3 & 4 \end{pmatrix}}$
The characteristic polynomial is f(λ) = λ2 – Tr(A)λ + det(A)
Here,
Tr(A) = 2 + 4 = 6
det(A) = (2)(4) – (1)(3) = 8 – 3 = 5
By substituting into the formula,
f(λ) = λ2 – 6λ + 5
By substituting A into the Characteristic Polynomial,
f(A) = A2 – 6A + 5I = 0, where I is the identity matrix
Computing A2, we get
A2 = A ⋅ A = ${\begin{pmatrix} 2 & 1 \\ 3 & 4 \end{pmatrix}\cdot \begin{pmatrix} 2 & 1 \\ 3 & 4 \end{pmatrix}}$ = ${\begin{pmatrix} 7 & 6 \\ 18 & 19 \end{pmatrix}}$
Computing 6A, we get
6A = ${6\cdot \begin{pmatrix} 2 & 1 \\ 3 & 4 \end{pmatrix}}$ = ${\begin{pmatrix} 12 & 6 \\ 18 & 24 \end{pmatrix}}$
Computing 5I, we get
5I = ${5\cdot \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}}$ = ${\begin{pmatrix} 5 & 0 \\ 0 & 5 \end{pmatrix}}$
Now, f(A) = ${\begin{pmatrix} 7 & 6 \\ 18 & 19 \end{pmatrix}-\begin{pmatrix} 12 & 6 \\ 18 & 24 \end{pmatrix}+\begin{pmatrix} 5 & 0 \\ 0 & 5 \end{pmatrix}}$
= ${\begin{pmatrix} -5 & 0 \\ 0 & -5 \end{pmatrix}+\begin{pmatrix} 5 & 0 \\ 0 & 5 \end{pmatrix}}$
= ${\begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}}$
Since f(A) = 0 (the zero matrix), the matrix A satisfies its characteristic equation.
Thus, the Cayley-Hamilton theorem for the given matrix is verified.
Last modified on December 9th, 2024