Hermitian matrices can be characterized in a number of equivalent ways, some of which are listed below:
A square matrix A {\displaystyle A} is Hermitian if and only if it is equal to its conjugate transpose, that is, it satisfies ⟨ w , A v ⟩ = ⟨ A w , v ⟩ , {\displaystyle \langle \mathbf {w} ,A\mathbf {v} \rangle =\langle A\mathbf {w} ,\mathbf {v} \rangle ,} for any pair of vectors v , w , {\displaystyle \mathbf {v} ,\mathbf {w} ,} where ⟨ ⋅ , ⋅ ⟩ {\displaystyle \langle \cdot ,\cdot \rangle } denotes the inner product operation.
This is also the way that the more general concept of self-adjoint operator is defined.
An n × n {\displaystyle n\times {}n} matrix A {\displaystyle A} is Hermitian if and only if ⟨ v , A v ⟩ ∈ R , for all v ∈ C n . {\displaystyle \langle \mathbf {v} ,A\mathbf {v} \rangle \in \mathbb {R} ,\quad {\text{for all }}\mathbf {v} \in \mathbb {C} ^{n}.}
A square matrix A {\displaystyle A} is Hermitian if and only if it is unitarily diagonalizable with real eigenvalues.
Hermitian matrices are fundamental to quantum mechanics because they describe operators with necessarily real eigenvalues. An eigenvalue a {\displaystyle a} of an operator A ^ {\displaystyle {\hat {A}}} on some quantum state | ψ ⟩ {\displaystyle |\psi \rangle } is one of the possible measurement outcomes of the operator, which requires the operators to have real eigenvalues.
In signal processing, Hermitian matrices are utilized in tasks like Fourier analysis and signal representation.2 The eigenvalues and eigenvectors of Hermitian matrices play a crucial role in analyzing signals and extracting meaningful information.
Hermitian matrices are extensively studied in linear algebra and numerical analysis. They have well-defined spectral properties, and many numerical algorithms, such as the Lanczos algorithm, exploit these properties for efficient computations. Hermitian matrices also appear in techniques like singular value decomposition (SVD) and eigenvalue decomposition.
In statistics and machine learning, Hermitian matrices are used in covariance matrices, where they represent the relationships between different variables. The positive definiteness of a Hermitian covariance matrix ensures the well-definedness of multivariate distributions.3
Hermitian matrices are applied in the design and analysis of communications system, especially in the field of multiple-input multiple-output (MIMO) systems. Channel matrices in MIMO systems often exhibit Hermitian properties.
In graph theory, Hermitian matrices are used to study the spectra of graphs. The Hermitian Laplacian matrix is a key tool in this context, as it is used to analyze the spectra of mixed graphs.4 The Hermitian-adjacency matrix of a mixed graph is another important concept, as it is a Hermitian matrix that plays a role in studying the energies of mixed graphs.5
In this section, the conjugate transpose of matrix A {\displaystyle A} is denoted as A H , {\displaystyle A^{\mathsf {H}},} the transpose of matrix A {\displaystyle A} is denoted as A T {\displaystyle A^{\mathsf {T}}} and conjugate of matrix A {\displaystyle A} is denoted as A ¯ . {\displaystyle {\overline {A}}.}
See the following example:
[ 0 a − i b c − i d a + i b 1 m − i n c + i d m + i n 2 ] {\displaystyle {\begin{bmatrix}0&a-ib&c-id\\a+ib&1&m-in\\c+id&m+in&2\end{bmatrix}}}
The diagonal elements must be real, as they must be their own complex conjugate.
Well-known families of Hermitian matrices include the Pauli matrices, the Gell-Mann matrices and their generalizations. In theoretical physics such Hermitian matrices are often multiplied by imaginary coefficients,67 which results in skew-Hermitian matrices.
Here, we offer another useful Hermitian matrix using an abstract example. If a square matrix A {\displaystyle A} equals the product of a matrix with its conjugate transpose, that is, A = B B H , {\displaystyle A=BB^{\mathsf {H}},} then A {\displaystyle A} is a Hermitian positive semi-definite matrix. Furthermore, if B {\displaystyle B} is row full-rank, then A {\displaystyle A} is positive definite.
The entries on the main diagonal (top left to bottom right) of any Hermitian matrix are real.
By definition of the Hermitian matrix H i j = H ¯ j i {\displaystyle H_{ij}={\overline {H}}_{ji}} so for i = j the above follows, as a number can equal its complex conjugate only if the imaginary parts are zero.
Only the main diagonal entries are necessarily real; Hermitian matrices can have arbitrary complex-valued entries in their off-diagonal elements, as long as diagonally-opposite entries are complex conjugates.
A matrix that has only real entries is symmetric if and only if it is a Hermitian matrix. A real and symmetric matrix is simply a special case of a Hermitian matrix.
H i j = H ¯ j i {\displaystyle H_{ij}={\overline {H}}_{ji}} by definition. Thus H i j = H j i {\displaystyle H_{ij}=H_{ji}} (matrix symmetry) if and only if H i j = H ¯ i j {\displaystyle H_{ij}={\overline {H}}_{ij}} ( H i j {\displaystyle H_{ij}} is real).
So, if a real anti-symmetric matrix is multiplied by a real multiple of the imaginary unit i , {\displaystyle i,} then it becomes Hermitian.
Every Hermitian matrix is a normal matrix. That is to say, A A H = A H A . {\displaystyle AA^{\mathsf {H}}=A^{\mathsf {H}}A.}
A = A H , {\displaystyle A=A^{\mathsf {H}},} so A A H = A A = A H A . {\displaystyle AA^{\mathsf {H}}=AA=A^{\mathsf {H}}A.}
The finite-dimensional spectral theorem says that any Hermitian matrix can be diagonalized by a unitary matrix, and that the resulting diagonal matrix has only real entries. This implies that all eigenvalues of a Hermitian matrix A with dimension n are real, and that A has n linearly independent eigenvectors. Moreover, a Hermitian matrix has orthogonal eigenvectors for distinct eigenvalues. Even if there are degenerate eigenvalues, it is always possible to find an orthogonal basis of Cn consisting of n eigenvectors of A.
The sum of any two Hermitian matrices is Hermitian.
( A + B ) i j = A i j + B i j = A ¯ j i + B ¯ j i = ( A + B ) ¯ j i , {\displaystyle (A+B)_{ij}=A_{ij}+B_{ij}={\overline {A}}_{ji}+{\overline {B}}_{ji}={\overline {(A+B)}}_{ji},} as claimed.
The inverse of an invertible Hermitian matrix is Hermitian as well.
If A − 1 A = I , {\displaystyle A^{-1}A=I,} then I = I H = ( A − 1 A ) H = A H ( A − 1 ) H = A ( A − 1 ) H , {\displaystyle I=I^{\mathsf {H}}=\left(A^{-1}A\right)^{\mathsf {H}}=A^{\mathsf {H}}\left(A^{-1}\right)^{\mathsf {H}}=A\left(A^{-1}\right)^{\mathsf {H}},} so A − 1 = ( A − 1 ) H {\displaystyle A^{-1}=\left(A^{-1}\right)^{\mathsf {H}}} as claimed.
The product of two Hermitian matrices A and B is Hermitian if and only if AB = BA.
( A B ) H = ( A B ) T ¯ = B T A T ¯ = B T ¯ A T ¯ = B H A H = B A . {\displaystyle (AB)^{\mathsf {H}}={\overline {(AB)^{\mathsf {T}}}}={\overline {B^{\mathsf {T}}A^{\mathsf {T}}}}={\overline {B^{\mathsf {T}}}}\ {\overline {A^{\mathsf {T}}}}=B^{\mathsf {H}}A^{\mathsf {H}}=BA.} Thus ( A B ) H = A B {\displaystyle (AB)^{\mathsf {H}}=AB} if and only if A B = B A . {\displaystyle AB=BA.}
Thus An is Hermitian if A is Hermitian and n is an integer.
If A and B are Hermitian, then ABA is also Hermitian.
( A B A ) H = ( A ( B A ) ) H = ( B A ) H A H = A H B H A H = A B A {\displaystyle (ABA)^{\mathsf {H}}=(A(BA))^{\mathsf {H}}=(BA)^{\mathsf {H}}A^{\mathsf {H}}=A^{\mathsf {H}}B^{\mathsf {H}}A^{\mathsf {H}}=ABA}
For an arbitrary complex valued vector v the product v H A v {\displaystyle \mathbf {v} ^{\mathsf {H}}A\mathbf {v} } is real because of v H A v = ( v H A v ) H . {\displaystyle \mathbf {v} ^{\mathsf {H}}A\mathbf {v} =\left(\mathbf {v} ^{\mathsf {H}}A\mathbf {v} \right)^{\mathsf {H}}.} This is especially important in quantum physics where Hermitian matrices are operators that measure properties of a system, e.g. total spin, which have to be real.
The Hermitian complex n-by-n matrices do not form a vector space over the complex numbers, ℂ, since the identity matrix In is Hermitian, but i In is not. However the complex Hermitian matrices do form a vector space over the real numbers ℝ. In the 2n2-dimensional vector space of complex n × n matrices over ℝ, the complex Hermitian matrices form a subspace of dimension n2. If Ejk denotes the n-by-n matrix with a 1 in the j,k position and zeros elsewhere, a basis (orthonormal with respect to the Frobenius inner product) can be described as follows: E j j for 1 ≤ j ≤ n ( n matrices ) {\displaystyle E_{jj}{\text{ for }}1\leq j\leq n\quad (n{\text{ matrices}})}
together with the set of matrices of the form 1 2 ( E j k + E k j ) for 1 ≤ j < k ≤ n ( n 2 − n 2 matrices ) {\displaystyle {\frac {1}{\sqrt {2}}}\left(E_{jk}+E_{kj}\right){\text{ for }}1\leq j<k\leq n\quad \left({\frac {n^{2}-n}{2}}{\text{ matrices}}\right)}
and the matrices i 2 ( E j k − E k j ) for 1 ≤ j < k ≤ n ( n 2 − n 2 matrices ) {\displaystyle {\frac {i}{\sqrt {2}}}\left(E_{jk}-E_{kj}\right){\text{ for }}1\leq j<k\leq n\quad \left({\frac {n^{2}-n}{2}}{\text{ matrices}}\right)}
where i {\displaystyle i} denotes the imaginary unit, i = − 1 . {\displaystyle i={\sqrt {-1}}~.}
An example is that the four Pauli matrices form a complete basis for the vector space of all complex 2-by-2 Hermitian matrices over ℝ.
If n orthonormal eigenvectors u 1 , … , u n {\displaystyle \mathbf {u} _{1},\dots ,\mathbf {u} _{n}} of a Hermitian matrix are chosen and written as the columns of the matrix U, then one eigendecomposition of A is A = U Λ U H {\displaystyle A=U\Lambda U^{\mathsf {H}}} where U U H = I = U H U {\displaystyle UU^{\mathsf {H}}=I=U^{\mathsf {H}}U} and therefore A = ∑ j λ j u j u j H , {\displaystyle A=\sum _{j}\lambda _{j}\mathbf {u} _{j}\mathbf {u} _{j}^{\mathsf {H}},} where λ j {\displaystyle \lambda _{j}} are the eigenvalues on the diagonal of the diagonal matrix Λ . {\displaystyle \Lambda .}
The singular values of A {\displaystyle A} are the absolute values of its eigenvalues:
Since A {\displaystyle A} has an eigendecomposition A = U Λ U H {\displaystyle A=U\Lambda U^{H}} , where U {\displaystyle U} is a unitary matrix (its columns are orthonormal vectors; see above), a singular value decomposition of A {\displaystyle A} is A = U | Λ | sgn ( Λ ) U H {\displaystyle A=U|\Lambda |{\text{sgn}}(\Lambda )U^{H}} , where | Λ | {\displaystyle |\Lambda |} and sgn ( Λ ) {\displaystyle {\text{sgn}}(\Lambda )} are diagonal matrices containing the absolute values | λ | {\displaystyle |\lambda |} and signs sgn ( λ ) {\displaystyle {\text{sgn}}(\lambda )} of A {\displaystyle A} 's eigenvalues, respectively. sgn ( Λ ) U H {\displaystyle \operatorname {sgn}(\Lambda )U^{H}} is unitary, since the columns of U H {\displaystyle U^{H}} are only getting multiplied by ± 1 {\displaystyle \pm 1} . | Λ | {\displaystyle |\Lambda |} contains the singular values of A {\displaystyle A} , namely, the absolute values of its eigenvalues.8
The determinant of a Hermitian matrix is real:
det ( A ) = det ( A T ) ⇒ det ( A H ) = det ( A ) ¯ {\displaystyle \det(A)=\det \left(A^{\mathsf {T}}\right)\quad \Rightarrow \quad \det \left(A^{\mathsf {H}}\right)={\overline {\det(A)}}} Therefore if A = A H ⇒ det ( A ) = det ( A ) ¯ . {\displaystyle A=A^{\mathsf {H}}\quad \Rightarrow \quad \det(A)={\overline {\det(A)}}.}
(Alternatively, the determinant is the product of the matrix's eigenvalues, and as mentioned before, the eigenvalues of a Hermitian matrix are real.)
Additional facts related to Hermitian matrices include:
Main article: Rayleigh quotient
In mathematics, for a given complex Hermitian matrix M and nonzero vector x, the Rayleigh quotient10 R ( M , x ) , {\displaystyle R(M,\mathbf {x} ),} is defined as:11: p. 234 12 R ( M , x ) := x H M x x H x . {\displaystyle R(M,\mathbf {x} ):={\frac {\mathbf {x} ^{\mathsf {H}}M\mathbf {x} }{\mathbf {x} ^{\mathsf {H}}\mathbf {x} }}.}
For real matrices and vectors, the condition of being Hermitian reduces to that of being symmetric, and the conjugate transpose x H {\displaystyle \mathbf {x} ^{\mathsf {H}}} to the usual transpose x T . {\displaystyle \mathbf {x} ^{\mathsf {T}}.} R ( M , c x ) = R ( M , x ) {\displaystyle R(M,c\mathbf {x} )=R(M,\mathbf {x} )} for any non-zero real scalar c . {\displaystyle c.} Also, recall that a Hermitian (or real symmetric) matrix has real eigenvalues.
It can be shown13 that, for a given matrix, the Rayleigh quotient reaches its minimum value λ min {\displaystyle \lambda _{\min }} (the smallest eigenvalue of M) when x {\displaystyle \mathbf {x} } is v min {\displaystyle \mathbf {v} _{\min }} (the corresponding eigenvector). Similarly, R ( M , x ) ≤ λ max {\displaystyle R(M,\mathbf {x} )\leq \lambda _{\max }} and R ( M , v max ) = λ max . {\displaystyle R(M,\mathbf {v} _{\max })=\lambda _{\max }.}
The Rayleigh quotient is used in the min-max theorem to get exact values of all eigenvalues. It is also used in eigenvalue algorithms to obtain an eigenvalue approximation from an eigenvector approximation. Specifically, this is the basis for Rayleigh quotient iteration.
The range of the Rayleigh quotient (for matrix that is not necessarily Hermitian) is called a numerical range (or spectrum in functional analysis). When the matrix is Hermitian, the numerical range is equal to the spectral norm. Still in functional analysis, λ max {\displaystyle \lambda _{\max }} is known as the spectral radius. In the context of C*-algebras or algebraic quantum mechanics, the function that to M associates the Rayleigh quotient R(M, x) for a fixed x and M varying through the algebra would be referred to as "vector state" of the algebra.
Archibald, Tom (2010-12-31), Gowers, Timothy; Barrow-Green, June; Leader, Imre (eds.), "VI.47 Charles Hermite", The Princeton Companion to Mathematics, Princeton University Press, p. 773, doi:10.1515/9781400830398.773a, ISBN 978-1-4008-3039-8, retrieved 2023-11-15 978-1-4008-3039-8 ↩
Ribeiro, Alejandro. "Signal and Information Processing" (PDF). https://www.seas.upenn.edu/~ese2240/wiki/Lecture%20Notes/sip_PCA.pdf ↩
"MULTIVARIATE NORMAL DISTRIBUTIONS" (PDF). https://dspace.mit.edu/bitstream/handle/1721.1/121170/6-436j-fall-2008/contents/lecture-notes/MIT6_436JF08_lec15.pdf ↩
Lau, Ivan. "Hermitian Spectral Theory of Mixed Graphs" (PDF). https://www.sfu.ca/~iplau/Edinburgh_CS_Project.pdf ↩
Liu, Jianxi; Li, Xueliang (February 2015). "Hermitian-adjacency matrices and Hermitian energies of mixed graphs". Linear Algebra and Its Applications. 466: 182–207. doi:10.1016/j.laa.2014.10.028. https://doi.org/10.1016%2Fj.laa.2014.10.028 ↩
Frankel, Theodore (2004). The Geometry of Physics: an introduction. Cambridge University Press. p. 652. ISBN 0-521-53927-7. 0-521-53927-7 ↩
Physics 125 Course Notes Archived 2022-03-07 at the Wayback Machine at California Institute of Technology http://www.hep.caltech.edu/~fcp/physics/quantumMechanics/angularMomentum/angularMomentum.pdf ↩
Trefethan, Lloyd N.; Bau, III, David (1997). Numerical linear algebra. Philadelphia, PA, USA: SIAM. p. 34. ISBN 0-89871-361-7. OCLC 1348374386. 0-89871-361-7 ↩
Horn, Roger A.; Johnson, Charles R. (2013). Matrix Analysis, second edition. Cambridge University Press. ISBN 9780521839402. 9780521839402 ↩
Also known as the Rayleigh–Ritz ratio; named after Walther Ritz and Lord Rayleigh. /wiki/Walther_Ritz ↩
Parlet B. N. The symmetric eigenvalue problem, SIAM, Classics in Applied Mathematics,1998 ↩