There are even and odd Zernike polynomials. The even Zernike polynomials are defined as
(even function over the azimuthal angle φ {\displaystyle \varphi } ), and the odd Zernike polynomials are defined as
(odd function over the azimuthal angle φ {\displaystyle \varphi } ) where m and n are nonnegative integers with n ≥ m ≥ 0 (m = 0 for spherical Zernike polynomials), φ {\displaystyle \varphi } is the azimuthal angle, ρ is the radial distance 0 ≤ ρ ≤ 1 {\displaystyle 0\leq \rho \leq 1} , and R n m {\displaystyle R_{n}^{m}} are the radial polynomials defined below. Zernike polynomials have the property of being limited to a range of −1 to +1, i.e. | Z n m ( ρ , φ ) | ≤ 1 {\displaystyle |Z_{n}^{m}(\rho ,\varphi )|\leq 1} . The radial polynomials R n m {\displaystyle R_{n}^{m}} are defined as
for even n − m, while it is 0 for odd n − m. A special value is
Rewriting the ratios of factorials in the radial part as products of binomials shows that the coefficients are integer numbers:
A notation as terminating Gaussian hypergeometric functions is useful to reveal recurrences, to demonstrate that they are special cases of Jacobi polynomials, to write down the differential equations, etc.:
for n − m even.
The inverse relation expands ρ j {\displaystyle \rho ^{j}} for fixed m ≤ j {\displaystyle m\leq j} into R n m ( ρ ) {\displaystyle R_{n}^{m}(\rho )}
with rational coefficients h j , n , m {\displaystyle h_{j,n,m}} 3
for even j − m = 0 , 2 , 4 , … {\displaystyle j-m=0,2,4,\ldots } .
The factor ρ n − 2 k {\displaystyle \rho ^{n-2k}} in the radial polynomial R n m ( ρ ) {\displaystyle R_{n}^{m}(\rho )} may be expanded in a Bernstein basis of b s , n / 2 ( ρ 2 ) {\displaystyle b_{s,n/2}(\rho ^{2})} for even n {\displaystyle n} or ρ {\displaystyle \rho } times a function of b s , ( n − 1 ) / 2 ( ρ 2 ) {\displaystyle b_{s,(n-1)/2}(\rho ^{2})} for odd n {\displaystyle n} in the range ⌊ n / 2 ⌋ − k ≤ s ≤ ⌊ n / 2 ⌋ {\displaystyle \lfloor n/2\rfloor -k\leq s\leq \lfloor n/2\rfloor } . The radial polynomial may therefore be expressed by a finite number of Bernstein Polynomials with rational coefficients:
Applications often involve linear algebra, where an integral over a product of Zernike polynomials and some other factor builds a matrix elements. To enumerate the rows and columns of these matrices by a single index, a conventional mapping of the two indices n and m to a single index j has been introduced by Noll.4 The table of this association Z n m → Z j {\displaystyle Z_{n}^{m}\rightarrow Z_{j}} starts as follows (sequence A176988 in the OEIS). j = n ( n + 1 ) 2 + | m | + { 0 , m > 0 ∧ n ≡ { 0 , 1 } ( mod 4 ) ; 0 , m < 0 ∧ n ≡ { 2 , 3 } ( mod 4 ) ; 1 , m ≥ 0 ∧ n ≡ { 2 , 3 } ( mod 4 ) ; 1 , m ≤ 0 ∧ n ≡ { 0 , 1 } ( mod 4 ) . {\displaystyle j={\frac {n(n+1)}{2}}+|m|+\left\{{\begin{array}{ll}0,&m>0\land n\equiv \{0,1\}{\pmod {4}};\\0,&m<0\land n\equiv \{2,3\}{\pmod {4}};\\1,&m\geq 0\land n\equiv \{2,3\}{\pmod {4}};\\1,&m\leq 0\land n\equiv \{0,1\}{\pmod {4}}.\end{array}}\right.}
The rule is the following.
OSA 5 and ANSI single-index Zernike polynomials using:
The Fringe indexing scheme is used in commercial optical design software and optical testing in, e.g., photolithography.67
j = ( 1 + n + | l | 2 ) 2 − 2 | l | + ⌊ 1 − sgn l 2 ⌋ {\displaystyle j=\left(1+{\frac {n+|l|}{2}}\right)^{2}-2|l|+\left\lfloor {\frac {1-\operatorname {sgn} l}{2}}\right\rfloor }
where sgn l {\displaystyle \operatorname {sgn} l} is the sign or signum function. The first 20 fringe numbers are listed below.
James C. Wyant uses the "Fringe" indexing scheme except it starts at 0 instead of 1 (subtract 1).8 This method is commonly used including interferogram analysis software in Zygo interferometers and the open source software DFTFringe.
They satisfy the Rodrigues' formula
and can be related to the Jacobi polynomials as
The orthogonality in the radial part reads9
or
∫ 1 0 R n m ( ρ ) R n ′ m ( ρ ) ρ d ρ = δ n , n ′ 2 n + 2 . {\displaystyle {\underset {0}{\overset {1}{\mathop {\int } }}}\,R_{n}^{m}(\rho )R_{{n}'}^{m}(\rho )\rho d\rho ={\frac {{\delta }_{n,{n}'}}{2n+2}}.}
Orthogonality in the angular part is represented by the elementary
where ϵ m {\displaystyle \epsilon _{m}} (sometimes called the Neumann factor because it frequently appears in conjunction with Bessel functions) is defined as 2 if m = 0 {\displaystyle m=0} and 1 if m ≠ 0 {\displaystyle m\neq 0} . The product of the angular and radial parts establishes the orthogonality of the Zernike functions with respect to both indices if integrated over the unit disk,
where d 2 r = ρ d ρ d φ {\displaystyle d^{2}r=\rho \,d\rho \,d\varphi } is the Jacobian of the circular coordinate system, and where n − l {\displaystyle n-l} and n ′ − l ′ {\displaystyle n'-l'} are both even.
Any sufficiently smooth real-valued phase field over the unit disk G ( ρ , φ ) {\displaystyle G(\rho ,\varphi )} can be represented in terms of its Zernike coefficients (odd and even), just as periodic functions find an orthogonal representation with the Fourier series. We have
where the coefficients can be calculated using inner products. On the space of L 2 {\displaystyle L^{2}} functions on the unit disk, there is an inner product defined by
The Zernike coefficients can then be expressed as follows:
Alternatively, one can use the known values of phase function G on the circular grid to form a system of equations. The phase function is retrieved by the unknown-coefficient weighted product with (known values) of Zernike polynomial across the unit grid. Hence, coefficients can also be found by solving a linear system, for instance by matrix inversion. Fast algorithms to calculate the forward and inverse Zernike transform use symmetry properties of trigonometric functions, separability of radial and azimuthal parts of Zernike polynomials, and their rotational symmetries.
The reflections of trigonometric functions result that the parity with respect to reflection along the x axis is
The π shifts of trigonometric functions result that the parity with respect to point reflection at the center of coordinates is
where ( − 1 ) l {\displaystyle (-1)^{l}} could as well be written ( − 1 ) n {\displaystyle (-1)^{n}} because n − l {\displaystyle n-l} as even numbers are only cases to get non-vanishing Zernike polynomials. (If n is even then l is also even. If n is odd, then l is also odd.) This property is sometimes used to categorize Zernike polynomials into even and odd polynomials in terms of their angular dependence. (it is also possible to add another category with l = 0 since it has a special property of no angular dependence.)
The radial polynomials are also either even or odd, depending on order n or m:
These equalities are easily seen since R n m ( ρ ) {\displaystyle R_{n}^{m}(\rho )} with an odd (even) m contains only odd (even) powers to ρ (see examples of R n m ( ρ ) {\displaystyle R_{n}^{m}(\rho )} below).
The periodicity of the trigonometric functions results in invariance if rotated by multiples of 2 π / l {\displaystyle 2\pi /l} radian around the center:
The Zernike polynomials are eigenfunctions of the Zernike differential operator, in modern formulation10
self-adjoint over the unit disk, with negative eigenvalues L [ Z n m ] = − n ( n + 2 ) Z n m {\displaystyle L[Z_{n}^{m}]=-n(n+2)Z_{n}^{m}} . Other self-adjoint differential operators can be constructed for which the Zernike polynomials form a spectrum, for example ∇ ⋅ ( 1 − ρ 2 ) ∇ Z n m = ( m 2 − n ( n + 2 ) ) Z n m {\displaystyle \nabla \cdot (1-\rho ^{2})\nabla Z_{n}^{m}=\left(m^{2}-n(n+2)\right)Z_{n}^{m}} (relating to rough surface BRDFs11), which differs from the above by a factor ∂ φ φ {\displaystyle \partial _{\varphi \varphi }} .
The Zernike polynomials satisfy the following recurrence relation which depends neither on the degree nor on the azimuthal order of the radial polynomials:12
From the definition of R n m {\displaystyle R_{n}^{m}} it can be seen that R m m ( ρ ) = ρ m {\displaystyle R_{m}^{m}(\rho )=\rho ^{m}} and R m + 2 m ( ρ ) = ( ( m + 2 ) ρ 2 − ( m + 1 ) ) ρ m {\displaystyle R_{m+2}^{m}(\rho )=((m+2)\rho ^{2}-(m+1))\rho ^{m}} . The following three-term recurrence relation13 then allows to calculate all other R n m ( ρ ) {\displaystyle R_{n}^{m}(\rho )} :
The above relation is especially useful since the derivative of R n m {\displaystyle R_{n}^{m}} can be calculated from two radial Zernike polynomials of adjacent degree:14
The differential equation of the Gaussian Hypergeometric Function is equivalent to
The first few radial polynomials are:
The first few Zernike modes, at various indices,15 are shown below. They are normalized such that: ∫ 0 2 π ∫ 0 1 Z 2 ⋅ ρ d ρ d ϕ = π {\displaystyle \int _{0}^{2\pi }\int _{0}^{1}Z^{2}\cdot \rho \,d\rho \,d\phi =\pi } , which is equivalent to Var ( Z ) unit circle = 1 {\displaystyle \operatorname {Var} (Z)_{\text{unit circle}}=1} .
Further information: Optical_aberration § Zernike_model_of_aberrations
The functions are a basis defined over the circular support area, typically the pupil planes in classical optical imaging at visible and infrared wavelengths through systems of lenses and mirrors of finite diameter. Their advantages are the simple analytical properties inherited from the simplicity of the radial functions and the factorization in radial and azimuthal functions; this leads, for example, to closed-form expressions of the two-dimensional Fourier transform in terms of Bessel functions.1617 Their disadvantage, in particular if high n are involved, is the unequal distribution of nodal lines over the unit disk, which introduces ringing effects near the perimeter ρ ≈ 1 {\displaystyle \rho \approx 1} , which often leads attempts to define other orthogonal functions over the circular disk.181920
In precision optical manufacturing, Zernike polynomials are used to characterize higher-order errors observed in interferometric analyses. In wavefront slope sensors like the Shack-Hartmann, Zernike coefficients of the wavefront can be obtained by fitting measured slopes with Zernike polynomial derivatives averaged over the sampling subapertures.21 In optometry and ophthalmology, Zernike polynomials are used to describe wavefront aberrations of the cornea or lens from an ideal spherical shape, which result in refraction errors. They are also commonly used in adaptive optics, where they can be used to characterize atmospheric distortion. Obvious applications for this are IR or visual astronomy and satellite imagery.
Another application of the Zernike polynomials is found in the Extended Nijboer–Zernike theory of diffraction and aberrations.
Zernike polynomials are widely used as basis functions of image moments. Since Zernike polynomials are orthogonal to each other, Zernike moments can represent properties of an image with no redundancy or overlap of information between the moments. Although Zernike moments are significantly dependent on the scaling and the translation of the object in a region of interest (ROI), their magnitudes are independent of the rotation angle of the object.22 Thus, they can be utilized to extract features from images that describe the shape characteristics of an object. For instance, Zernike moments are utilized as shape descriptors to classify benign and malignant breast masses23 or the surface of vibrating disks.24 Zernike Moments also have been used to quantify shape of osteosarcoma cancer cell lines in single cell level.25 Moreover, Zernike Moments have been used for early detection of Alzheimer's disease by extracting discriminative information from the MR images of Alzheimer's disease, Mild cognitive impairment, and Healthy groups.26
The concept translates to higher dimensions D if multinomials x 1 i x 2 j ⋯ x D k {\displaystyle x_{1}^{i}x_{2}^{j}\cdots x_{D}^{k}} in Cartesian coordinates are converted to hyperspherical coordinates, ρ s , s ≤ D {\displaystyle \rho ^{s},s\leq D} , multiplied by a product of Jacobi polynomials of the angular variables. In D = 3 {\displaystyle D=3} dimensions, the angular variables are spherical harmonics, for example. Linear combinations of the powers ρ s {\displaystyle \rho ^{s}} define an orthogonal basis R n ( l ) ( ρ ) {\displaystyle R_{n}^{(l)}(\rho )} satisfying
(Note that a factor 2 n + D {\displaystyle {\sqrt {2n+D}}} is absorbed in the definition of R here, whereas in D = 2 {\displaystyle D=2} the normalization is chosen slightly differently. This is largely a matter of taste, depending on whether one wishes to maintain an integer set of coefficients or prefers tighter formulas if the orthogonalization is involved.) The explicit representation is27
for even n − l ≥ 0 {\displaystyle n-l\geq 0} , else identical to zero, with special case R n ( n ) ( ρ ) = 2 n + D ρ n . {\displaystyle R_{n}^{(n)}(\rho )={\sqrt {2n+D}}\rho ^{n}.}
Its differential equation for the Gaussian Hypergeometric Function is equivalent to ρ 2 ( ρ 2 − 1 ) d 2 d ρ 2 R n ( l ) ( ρ ) = [ n ρ 2 ( n + D ) − l ( D − 2 + l ) ] R n ( l ) ( ρ ) + ρ [ D − 1 − ( D + 1 ) ρ 2 ] d d ρ R n ( l ) ( ρ ) . {\displaystyle \rho ^{2}(\rho ^{2}-1){\frac {d^{2}}{d\rho ^{2}}}R_{n}^{(l)}(\rho )=\left[n\rho ^{2}(n+D)-l(D-2+l)\right]R_{n}^{(l)}(\rho )+\rho \left[D-1-(D+1)\rho ^{2}\right]{\frac {d}{d\rho }}R_{n}^{(l)}(\rho ).}
Zernike, F. (1934). "Beugungstheorie des Schneidenverfahrens und Seiner Verbesserten Form, der Phasenkontrastmethode". Physica. 1 (8): 689–704. Bibcode:1934Phy.....1..689Z. doi:10.1016/S0031-8914(34)80259-5. /wiki/Bibcode_(identifier) ↩
Born, Max & Wolf, Emil (1999). Principles of Optics: Electromagnetic Theory of Propagation, Interference and Diffraction of Light (7th ed.). Cambridge, UK: Cambridge University Press. p. 986. ISBN 9780521642224. (see also at Google Books) 9780521642224 ↩
Mathar, R. J. (2009). "Zernike Basis to Cartesian Transformations". Serbian Astronomical Journal. 179 (179): 107–120. arXiv:0809.2368. Bibcode:2009SerAJ.179..107M. doi:10.2298/SAJ0979107M. S2CID 115159231. /wiki/Serbian_Astronomical_Journal ↩
Noll, R. J. (1976). "Zernike polynomials and atmospheric turbulence" (PDF). J. Opt. Soc. Am. 66 (3): 207. Bibcode:1976JOSA...66..207N. doi:10.1364/JOSA.66.000207. https://ftp.bioeng.auckland.ac.nz/pub/pub/jtur044/references/fitting/NOLL1976.pdf ↩
Thibos, L. N.; Applegate, R. A.; Schwiegerling, J. T.; Webb, R. (2002). "Standards for reporting the optical aberrations of eyes". Journal of Refractive Surgery. 18 (5): S652-60. doi:10.3928/1081-597X-20020901-30. PMID 12361175. /wiki/Doi_(identifier) ↩
Loomis, J., "A Computer Program for Analysis of Interferometric Data," Optical Interferograms, Reduction and Interpretation, ASTM STP 666, A. H. Guenther and D. H. Liebenberg, Eds., American Society for Testing and Materials, 1978, pp. 71–86. ↩
Genberg, V. L.; Michels, G. J.; Doyle, K. B. (2002). "Orthogonality of Zernike polynomials". Optomechanical design and Engineering 2002. Proc SPIE. Vol. 4771. pp. 276–286. doi:10.1117/12.482169. /wiki/Doi_(identifier) ↩
Eric P. Goodwin; James C. Wyant (2006). Field Guide to Interferometric Optical Testing. p. 25. ISBN 0-8194-6510-0. 0-8194-6510-0 ↩
Lakshminarayanan, V.; Fleck, Andre (2011). "Zernike polynomials: a guide". J. Mod. Opt. 58 (7): 545–561. Bibcode:2011JMOp...58..545L. doi:10.1080/09500340.2011.554896. S2CID 120905947. /wiki/Bibcode_(identifier) ↩
Pogosyan, George S.; Salto-Alegre, Cristina; Wolf, Kurt Bernardo; Yakhno, Alexander (2017). "Quantum superintegrable Zernike system". J. Math. Phys. 58 (7). arXiv:1702.08570. Bibcode:2017JMP....58g2101P. doi:10.1063/1.4990794. /wiki/ArXiv_(identifier) ↩
"A Unitary BRDF for Surfaces with Gaussian Deviations". https://github.com/pec27/urdf ↩
Honarvar Shakibaei, Barmak (2013). "Recursive formula to compute Zernike radial polynomials". Opt. Lett. 38 (14): 2487–2489. Bibcode:2013OptL...38.2487H. doi:10.1364/OL.38.002487. PMID 23939089. /wiki/Bibcode_(identifier) ↩
Kintner, E. C. (1976). "On the mathematical properties of the Zernike Polynomials". Opt. Acta. 23 (8): 679–680. Bibcode:1976AcOpt..23..679K. doi:10.1080/713819334. /wiki/Bibcode_(identifier) ↩
Niu, Kuo; Tian, Chao (2022). "Zernike polynomials and their applications". J. Opt. 24 (12). doi:10.1088/2040-8986/ac9e08. /wiki/Doi_(identifier) ↩
Tatulli, E. (2013). "Transformation of Zernike coefficients: a Fourier-based method for scaled, translated, and rotated wavefront apertures". J. Opt. Soc. Am. A. 30 (4): 726–32. arXiv:1302.7106. Bibcode:2013JOSAA..30..726T. doi:10.1364/JOSAA.30.000726. PMID 23595334. S2CID 23491106. /wiki/ArXiv_(identifier) ↩
Janssen, A. J. E. M. (2011). "New analytic results for the Zernike Circle Polynomials from a basic result in the Nijboer-Zernike diffraction theory". Journal of the European Optical Society: Rapid Publications. 6: 11028. Bibcode:2011JEOS....6E1028J. doi:10.2971/jeos.2011.11028. https://doi.org/10.2971%2Fjeos.2011.11028 ↩
Barakat, Richard (1980). "Optimum balanced wave-front aberrations for radially symmetric amplitude distributions: Generalizations of Zernike polynomials". J. Opt. Soc. Am. 70 (6): 739–742. Bibcode:1980JOSA...70..739B. doi:10.1364/JOSA.70.000739. /wiki/Bibcode_(identifier) ↩
Janssen, A. J. E. M. (2011). "A generalization of the Zernike circle polynomials for forward and inverse problems in diffraction theory". arXiv:1110.2369 [math-ph]. /wiki/ArXiv_(identifier) ↩
Mathar, R. J. (2018). "Orthogonal basis function over the unit circle with the minimax property". arXiv:1802.09518 [math.NA]. /wiki/ArXiv_(identifier) ↩
Akondi, Vyas; Dubra, Alfredo (22 June 2020). "Average gradient of Zernike polynomials over polygons". Optics Express. 28 (13): 18876–18886. Bibcode:2020OExpr..2818876A. doi:10.1364/OE.393223. ISSN 1094-4087. PMC 7340383. PMID 32672177. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340383 ↩
Tahmasbi, A. (2010). An Effective Breast Mass Diagnosis System using Zernike Moments. 17th Iranian Conf. on Biomedical Engineering (ICBME'2010). Isfahan, Iran: IEEE. pp. 1–4. doi:10.1109/ICBME.2010.5704941. /wiki/Isfahan ↩
Tahmasbi, A.; Saki, F.; Shokouhi, S.B. (2011). "Classification of Benign and Malignant Masses Based on Zernike Moments". Computers in Biology and Medicine. 41 (8): 726–735. doi:10.1016/j.compbiomed.2011.06.009. PMID 21722886. /wiki/Computers_in_Biology_and_Medicine ↩
Rdzanek, W. P. (2018). "Sound radiation of a vibrating elastically supported circular plate embedded into a flat screen revisited using the Zernike circle polynomials". J. Sound Vib. 434: 91–125. Bibcode:2018JSV...434...92R. doi:10.1016/j.jsv.2018.07.035. S2CID 125512636. /wiki/Bibcode_(identifier) ↩
Alizadeh, Elaheh; Lyons, Samanthe M; Castle, Jordan M; Prasad, Ashok (2016). "Measuring systematic changes in invasive cancer cell shape using Zernike moments". Integrative Biology. 8 (11): 1183–1193. doi:10.1039/C6IB00100A. PMID 27735002. /wiki/Doi_(identifier) ↩
Gorji, H. T., and J. Haddadnia. "A novel method for early diagnosis of Alzheimer’s disease based on pseudo Zernike moment from structural MRI." Neuroscience 305 (2015): 361–371. ↩