Menu
Home Explore People Places Arts History Plants & Animals Science Life & Culture Technology
On this page
Unicode subscripts and superscripts
Unicode denominator & numerator glyphs

Unicode includes subscripted and superscripted characters, covering a full set of Arabic numerals, enabling the representation of polynomials, chemical, and other equations directly in plain text without relying on markup languages like HTML or TeX. The World Wide Web Consortium and the Unicode Consortium recommend using style markup such as MathML for mathematical superscripts and subscripts, but prefer encoded superscripts and subscripts in text when semantic distinctions are important, such as in phonetic or phonemic transcription.

We don't have any images related to Unicode subscripts and superscripts yet.
We don't have any YouTube videos related to Unicode subscripts and superscripts yet.
We don't have any PDF documents related to Unicode subscripts and superscripts yet.
We don't have any Books related to Unicode subscripts and superscripts yet.
We don't have any archived web articles related to Unicode subscripts and superscripts yet.

Uses

The intended use3 when these characters were added to Unicode was to produce true superscripts and subscripts so that chemical and algebraic formulas could be written without markup. Thus "H₂O" (using a subscript 2 character) is supposed to be identical to "H2O" (with subscript markup).

In reality, many fonts that include these characters ignore the Unicode definition, and instead design the digits for mathematical numerator and denominator glyphs,45 which are aligned with the cap line and the baseline, respectively. When used with the solidus or the Fraction Slash, they produce an almost typographically correct diagonal fraction, such as ³/₄ for the ¾ glyph. Super and subscript markup does not produce a correct fraction (compare markup 3/4 with precomposed ¾). The change also makes the superscript letters useful for ordinal indicators, more closely matching the ª and º characters.

Unicode intended that diagonal fractions be rendered by a different mechanism: the fraction slash U+2044 is visually similar to the solidus, but when used with the ordinary digits (not the superscripts and subscripts), it instructs the layout system that a fraction such as ¾ is to be rendered using automatic glyph substitution.67 User-end support was quite poor for a number of years, but fonts,8 browsers,9 word processors,10 desktop publishing software11 and others increasingly support the intended Unicode behavior. This browser and your default font render it as 3⁄4. (See Slash (punctuation)#Fractions for rendering in various other fonts.)

Superscripts and subscripts block

Main article: Superscripts and Subscripts (Unicode block)

The most common superscript digits (1, 2, and 3) were included in ISO-8859-1 and were therefore carried over into those code points in the Latin-1 range of Unicode. The remainder were placed along with basic arithmetical symbols, and later some Latin subscripts, in a dedicated block at U+2070 to U+209F. The table below shows these characters together. Each superscript or subscript character is preceded by a baseline x to show the height of subscripting/superscripting.

Six code points in the "Superscripts and Subscripts" block are unassigned, and remain available for future characters. As of November 2024, three of these (209D, 209E, and 209F) were provisionally assigned to new subscript characters, namely Latin lowercase w, y, and z.1213

Unicode characters
0123456789ABCDEF
U+00Bx
U+207xx⁰xⁱx⁴x⁵x⁶x⁷x⁸x⁹x⁺x⁻x⁼x⁽x⁾xⁿ
U+208xx₀x₁x₂x₃x₄x₅x₆x₇x₈x₉x₊x₋x₌x₍x₎
U+209xxₐxₑxₒxₓxₔxₕxₖxₗxₘxₙxₚxₛxₜ
  Not yet assigned.   Other characters from Latin-1 not related to super- or sub-scripts.

Other superscript and subscript characters

Unicode also includes codepoints for subscript and superscript characters that are intended for semantic usage, in the following blocks:1415

Superscript
  • The Latin-1 Supplement block contains the feminine and masculine ordinal indicators ª and º.
  • The Latin Extended-C block contains one superscript, ⱽ.
  • The Latin Extended-D block contains six superscripts: ꝰ ꟲ ꟳ ꟴ ꟸ ꟹ.
  • The Latin Extended-E block contains five superscripts: ꭜ ꭝ ꭞ ꭟ ꭩ.
  • The Latin Extended-F block is entirely superscript IPA letters: 𐞁 𐞂 𐞃 𐞄 𐞅 𐞇 𐞈 𐞉 𐞊 𐞋 𐞌 𐞍 𐞎 𐞏 𐞐 𐞑 𐞒 𐞓 𐞔 𐞕 𐞖 𐞗 𐞘 𐞙 𐞚 𐞛 𐞜 𐞝 𐞞 𐞟 𐞠 𐞡 𐞢 𐞣 𐞤 𐞥 𐞦 𐞧 𐞨 𐞩 𐞪 𐞫 𐞬 𐞭 𐞮 𐞯 𐞰 𐞲 𐞳 𐞴 𐞵 𐞶 𐞷 𐞸 𐞹 𐞺.
  • The Spacing Modifier Letters block has superscripted letters and symbols used for phonetic transcription: ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ˀ ˁ ˠ ˡ ˢ ˣ ˤ.
  • The Phonetic Extensions block has several superscripted letters and symbols: Latin/IPA ᴬ ᴭ ᴮ ᴯ ᴰ ᴱ ᴲ ᴳ ᴴ ᴵ ᴶ ᴷ ᴸ ᴹ ᴺ ᴻ ᴼ ᴽ ᴾ ᴿ ᵀ ᵁ ᵂ ᵃ ᵄ ᵅ ᵆ ᵇ ᵈ ᵉ ᵊ ᵋ ᵌ ᵍ ᵏ ᵐ ᵑ ᵒ ᵓ ᵖ ᵗ ᵘ ᵚ ᵛ, Greek ᵝ ᵞ ᵟ ᵠ ᵡ, Cyrillic ᵸ, other ᵎ ᵔ ᵕ ᵙ ᵜ. These are intended to indicate secondary articulation.
  • The Phonetic Extensions Supplement block has several more: Latin/IPA ᶛ ᶜ ᶝ ᶞ ᶟ ᶠ ᶡ ᶢ ᶣ ᶤ ᶥ ᶦ ᶧ ᶨ ᶩ ᶪ ᶫ ᶬ ᶭ ᶮ ᶯ ᶰ ᶱ ᶲ ᶳ ᶴ ᶵ ᶶ ᶷ ᶸ ᶹ ᶺ ᶻ ᶼ ᶽ ᶾ, Greek ᶿ.
  • The Cyrillic Extended-B block contains two Cyrillic superscripts: ꚜ ꚝ.
  • The Cyrillic Extended-D block contains many Cyrillic superscripts: 𞀰 𞀱 𞀲 𞀳 𞀷 𞀵 𞀶 𞀷 𞀸 𞀹 𞀺 𞀻 𞀼 𞀽 𞀾 𞀿 𞁀 𞁁 𞁂 𞁃 𞁄 𞁅 𞁆 𞁇 𞁈 𞁉 𞁊 𞁋 𞁌 𞁍 𞁎 𞁏 𞁐 𞁫 𞁬 𞁭.
  • The Georgian block contains one superscripted Mkhedruli letter: ჼ.
  • The Kanbun block has superscripted annotation characters used in Japanese copies of Classical Chinese texts: ㆒ ㆓ ㆔ ㆕ ㆖ ㆗ ㆘ ㆙ ㆚ ㆛ ㆜ ㆝ ㆞ ㆟.
  • The Tifinagh block has one superscript letter : ⵯ.
  • The Unified Canadian Aboriginal Syllabics and its Extended blocks contain several mostly consonant-only letters to indicate syllable coda called Finals, along with some characters that indicate syllable medial known as Medials: Main block ᐜ ᐝ ᐞ ᐟ ᐠ ᐡ ᐢ ᐣ ᐤ ᐥ ᐦ ᐧ ᐨ ᐩ ᐪ ᑉ ᑊ ᑋ ᒃ ᒄ ᒡ ᒢ ᒻ ᒼ ᒽ ᒾ ᓐ ᓑ ᓒ ᓪ ᓫ ᔅ ᔆ ᔇ ᔈ ᔉ ᔊ ᔋ ᔥ ᔾ ᔿ ᕀ ᕁ ᕐ ᕑ ᕝ ᕪ ᕻ ᕯ ᕽ ᖅ ᖕ ᖖ ᖟ ᖦ ᖮ ᗮ ᘁ ᙆ ᙇ ᙚ ᙾ ᙿ; Extended block: ᣔ ᣕ ᣖ ᣗ ᣘ ᣙ ᣚ ᣛ ᣜ ᣝ ᣞ ᣟ ᣳ ᣴ ᣵ.
Combining superscript
  • The Combining Diacritical Marks block contains medieval superscript letter diacritics. These letters are written directly above other letters appearing in medieval Germanic manuscripts, and so these glyphs do not include spacing, for example uͤ. They are shown here over the dotted circle placeholder ◌: ◌ͣ ◌ͤ ◌ͥ ◌ͦ ◌ͧ ◌ͨ ◌ͩ ◌ͪ ◌ͫ ◌ͬ ◌ͭ ◌ͮ ◌ͯ.
  • The Combining Diacritical Marks Extended block contains three combining insular letters for the Middle English Ormulum, ◌ᫌ ◌ᫍ ◌ᫎ.16
  • The Combining Diacritical Marks Supplement block contains additional medieval superscript letter diacritics, enough to complete the basic lowercase Latin alphabet except for j, q and y, a few small capitals and ligatures (ae, ao, av), and additional letters: ◌᷒ ◌ᷓ ◌ᷔ ◌ᷕ ◌ᷖ ◌ᷗ ◌ᷘ ◌ᷙ ◌ᷚ ◌ᷛ ◌ᷜ ◌ᷝ ◌ᷞ ◌ᷟ ◌ᷠ ◌ᷡ ◌ᷢ ◌ᷣ ◌ᷤ ◌ᷥ ◌ᷦ ◌ᷧ ◌ᷨ ◌ᷪ ◌ᷫ ◌ᷬ ◌ᷭ ◌ᷮ ◌ᷯ ◌ᷰ ◌ᷱ ◌ᷲ ◌ᷳ ◌ᷴ, Greek ◌ᷩ.
  • The Cyrillic Extended-A and -B blocks contains multiple medieval superscript letter diacritics, enough to complete the basic lowercase Cyrillic alphabet used in Church Slavonic texts, also includes an additional ligature (ст): ◌ⷠ ◌ⷡ ◌ⷢ ◌ⷣ ◌ⷤ ◌ⷥ ◌ⷦ ◌ⷧ ◌ⷨ ◌ⷩ ◌ⷪ ◌ⷫ ◌ⷬ ◌ⷭ ◌ⷮ ◌ⷯ ◌ⷰ ◌ⷱ ◌ⷲ ◌ⷳ ◌ⷴ ◌ⷵ ◌ⷶ ◌ⷷ ◌ⷸ ◌ⷹ ◌ⷺ ◌ⷻ ◌ⷼ ◌ⷽ ◌ⷾ ◌ⷿ ◌ꙴ ◌ꙵ ◌ꙶ ◌ꙷ ◌ꙸ ◌ꙹ ◌ꙺ ◌ꙻ ◌ꚞ ◌ꚟ.
  • The Cyrillic Extended-D block has one additional combining character, that being і: ◌𞂏.
Subscript
  • The Latin Extended-C block contains one subscript, ⱼ.
  • The Phonetic Extensions block has several subscripted letters and symbols: Latin/IPA ᵢ ᵣ ᵤ ᵥ and Greek ᵦ ᵧ ᵨ ᵩ ᵪ.
  • The Cyrillic Extended-D block also contains many Cyrillic subscripts: 𞁑 𞁒 𞁓 𞁔 𞁕 𞁖 𞁗 𞁘 𞁙 𞁚 𞁛 𞁜 𞁝 𞁞 𞁟 𞁠 𞁡 𞁢 𞁣 𞁤 𞁥 𞁦 𞁧 𞁨 𞁩 𞁪.
Combining subscript

Latin, Greek, Cyrillic, and IPA tables

Consolidated, the Unicode standard contains superscript and subscript versions of a subset of Latin, Greek and Cyrillic letters. Here they are arranged in alphabetical order for comparison (or for copy and paste convenience). Since these characters appear in different Unicode ranges, they may not appear to be the same size or position due to font substitution by the browser. Shaded cells mark petite capitals that are not very distinct from minuscules in roman typeface, but they may be distinct in italic typeface, as is used in some phonetic notation.

Little punctuation is encoded. Parentheses are shown in the basic superscript block above, and the exclamation mark ⟨ꜝ⟩ is shown in the IPA table below. In a supporting font, a question mark may be created with a superscript gelded question mark and a combining dot below: ⟨ˀ̣⟩.

Latin superscript and subscript letters
ABCDEFGHIJKLMNOPQRSTUVWXYZ
Superscript capitalᴿ*
Superscript small capital*𐞄**𐞒𐞖§§*𐞪§𐞲
Superscript minusculeʰʲˡ𐞥ʳˢʷˣʸ
Overscript small capital§◌ᷛ§§◌ᷞ◌ᷟ◌ᷡ◌ᷢ§
Overscript minuscule◌ͣ◌ᷨ◌ͨ◌ͩ◌ͤ◌ᷫ◌ᷚ◌ͪ◌ͥ◌ᷜ◌ᷝ◌ͫ◌ᷠ◌ͦ◌ᷮ◌ͬ◌ᷤ◌ͭ◌ͧ◌ͮ◌ᷱ◌ͯ§◌ᷦ
Subscript minuscule§***
Underscript minuscule◌᷊◌ᪿ

*Superscript versions of S, of petite capital A, D, E and P, of ƀ, and subscript versions of w, y and z have been accepted for a future version of the Unicode Standard.17

§ Cyrillic 𞀹 𞀻 𞁀, ◌ⷡ ◌ⷩ ◌ⷦ ◌ⷮ ◌ꙷ and 𞁞 might be substituted for these letters, but the same font would have to cover both ranges for it to look right.

Additional Latin characters
ÆɃƎƏŊ
Superscript capital
Superscript minuscule𐞃*
Overscript minuscule◌ᷔ◌ᷪ◌ᷬ
Subscript minuscule

Some of these superscript capitals are small caps in the source documents in the Unicode proposals.

Shaded cells Greek letters that are indistinguishable from Latin, and so would not be expected to be supported by Unicode.

Greek superscript and subscript letters
ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
Superscript minuscule18ᶿ19**
Overscript minuscule20◌ᷩ
Subscript minusculeͺ21
Underscript minuscule◌ͅ◌̫22

*Superscript versions of Greek psi and omega have been accepted for a future version of the Unicode Standard.23

Russian superscript and subscript letters
АБВГДЕЖЗИКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯ
Superscript𞀰𞀱𞀲𞀳𞀴𞀵𞀶𞀷𞀸𞀹𞀺𞀻𞀼𞀽𞀾𞀿𞁀𞁁𞁂𞁃𞁄𞁅𞁆𞁇𞁈𞁉
Overscript◌ⷶ◌ⷠ◌ⷡ◌ⷢ◌ⷣ◌ⷷ◌ⷤ◌ⷥ◌ꙵ◌ⷦ◌ⷧ◌ⷨ◌ⷩ◌ⷪ◌ⷫ◌ⷬ◌ⷭ◌ⷮ◌ꙷ◌ꚞ◌ⷯ◌ⷰ◌ⷱ◌ⷲ◌ⷳ◌ꙸ◌ꙹ◌ꙺ◌ⷻ
Subscript𞁑𞁒𞁓𞁔𞁕𞁖𞁗𞁘𞁙𞁚𞁛𞁜𞁝𞁞𞁟𞁠𞁡𞁢𞁣𞁤𞁥𞁦
Additional modern Cyrillic characters
ӘҐЄЅІЇЈӨҪҮҰЏӀ
Superscript𞁋𞁊𞁌𞁌̈𞁍𞁎𞁫𞁏𞁭𞁐
Overscript◌ꙴ◌𞂏◌ꙶ
Subscript𞁧𞁩𞁨𞁨̈𞁪
Additional medieval Cyrillic characters
ѠѢѤѦѪѬѲ
Overscript◌ⷹ◌ꙻ◌ⷺ◌ⷼ◌ꚟ◌ⷽ◌ⷾ◌ⷿ◌ⷴ
Superscript𞁬

Superscript and subscript ё, ї, й, ў etc. are handled with diacritics, ⟨𞀵̈ 𞁌̈ 𞀸̆ 𞁁̆⟩ etc. Many of the Cyrillic characters were added to the Cyrillic Extended-D block, which was added to the free Gentium and Andika fonts with version 6.2 in February 2023.

See also small caps in Unicode.

Superscript IPA

The Latin Extended-F block was created for the remaining superscript IPA letters. They are supported by the free Gentium and Andika fonts. Additional superscript characters for historical and para-IPA letters have been accepted for future versions of the Unicode Standard.24

Consonant letters

The Unicode characters for superscript (modifier) IPA and extIPA consonant letters are as follows. The entire Latin Extended-F block is dedicated to superscript IPA. Characters for sounds with secondary articulation are set off in parentheses and placed below the base letters.

IPA and extIPA consonants, along with superscript variants and their Unicode code points
Bi­labialLabio­dentalDentalAlveolarPost­alveolarRetro­flexPalatalVelarUvularPharyn­gealGlottal
Nasalm ᵐ1D50ɱ ᶬ1DACn ⁿ207F(ᶇ)  (ȵ)ɳ ᶯ1DAFɲ ᶮ1DAEŋ ᵑ1D51ɴ ᶰ1DB0
Plosivep ᵖ1D56b ᵇ1D47t ᵗ1D57(ƫ ᶵ)1DB5d ᵈ1D48(ᶁ)  (ȶ)  (ȡ)ʈ 𐞯107AFɖ 𐞋1078Bc ᶜ1D9Cɟ ᶡ1DA1k ᵏ1D4Fɡ ᶢ/g ᵍ1DA2/1D4Dq 𐞥107A5ɢ 𐞒10792ʡ 𐞳107B3ʔ ˀ02C0
Affricateʦ 𐞬107ACʣ 𐞇10787ʧ 𐞮107AE(ʨ 𐞫)107ABʤ 𐞊1078A(ʥ 𐞉)10789ꭧ 𐞭107AD(𝼜)ꭦ 𐞈10788(𝼙)
Fricativeɸ ᶲ1DB2β ᵝ1D5Df ᶠ1DA0v ᵛ1D5Bθ ᶿ1DBFð ᶞ1D9Es ˢ02E2(ᶊ)z ᶻ1DBB(ᶎ)ʃ ᶴ1DB4(ɕ ᶝ)1D9Dʒ ᶾ1DBE(ʑ ᶽ)1DBDʂ ᶳ1DB3(ᶘ)ʐ ᶼ1DBC(ᶚ)ç ᶜ̧1D9C + 032725ʝ ᶨ1DA8x ˣ02E3(ɧ 𐞗)10797ɣ ˠ02E0χ ᵡ1D61ʁ ʶ02B6ħ 𐞕10795(ʩ 𐞐)10790ʕ ˤ02E426h ʰ02B0(ꞕ)ɦ ʱ02B1
Approximantʋ ᶹ1DB9ɹ ʴ02B4ɻ ʵ02B5j ʲ02B2(ɥ ᶣ)1DA3  (ʍ ꭩ)AB69ɰ ᶭ1DAD(w ʷ)02B7
Tap/flapⱱ 𐞰107B0ɾ 𐞩107A9ɽ 𐞨107A8
Trillʙ 𐞄10784r ʳ02B3ʀ 𐞪107AAʜ 𐞖10796ʢ 𐞴107B4
Lateral fricativeɬ 𐞛1079B(ʪ 𐞙)10799ɮ 𐞞1079E(ʫ 𐞚)1079Aꞎ 𐞝1079D𝼅 𐞟1079F𝼆 𐞡107A1𝼄 𐞜1079C
Lateral approximantl ˡ02E1(ᶅ ᶪ)1DAA  (ȴ)ɭ ᶩ1DA9ʎ 𐞠107A0ʟ ᶫ1DAB(ɫ ꭞ)27AB5E
Lateral tap/flapɺ 𐞦107A6𝼈 𐞧107A7
Implosiveƥɓ 𐞅10785ƭɗ 𐞌1078C𝼉ᶑ 𐞍1078Dƈʄ 𐞘10798ƙɠ 𐞓10793ʠʛ 𐞔10794
Click releaseʘ 𐞵107B5ɋǀ 𐞶107B6ʇǃ ꜝA71Dʗ𝼊 𐞹107B9ψǂ 𐞸107B8𝼋(ʞ)
Lateral clickreleaseǁ 𐞷107B7ʖ
Percussive¡ ꜞA71E28

The spacing diacritic for ejective consonants, U+2BC, works with superscript letters despite not being superscript itself: ⟨ᵖʼ ᵗʼ ᶜʼ ᵏˣʼ⟩. If a distinction needs to be made, the combining apostrophe U+315 may be used: ⟨ᵖ̕ ᵗ̕ ᶜ̕ ᵏˣ̕⟩. The spacing diacritic should be used for a baseline letter with a superscript release, such as [tˢʼ] or [kˣʼ], where the scope of the apostrophe includes the non-superscript letter, but the combining apostrophe U+315 might be used to indicate a weakly articulated ejective consonant like [ᵗ̕] or [ᵏ̕], where the whole consonant is written as a superscript, or together with U+2BC when separate apostrophes have scope over the base and modifier letters, as in ⟨pʼᵏˣ̕⟩.29

Spacing diacritics, as in ⟨tʲ⟩, cannot be secondarily superscripted in plain text: ⟨ᵗʲ⟩. (In this instance, the old IPA letter for [tʲ], ⟨ƫ⟩, has a superscript variant in Unicode, U+1DB5 ⟨ᶵ⟩, but that is not generally the case.)

Among older letters, the most common letters with palatal hook are supported; they are displayed in the table above. IPA once had an idiosyncratic curl on some of the palatalized letters: these are the fricative letters ⟨ʆ ʓ⟩. Their superscript forms have been accepted for a future version of the Unicode Standard. Old-style click letters and the retired letters ⟨ƞ⟩ and ⟨ɼ⟩ have also been accepted for a future version of the Unicode Standard.30 The Teuthonista letter ⟨ꜧ⟩ (U+A727) is an old graphic variant of ⟨ɮ⟩. Its superscript is supported at ⟨ꭜ⟩ (U+AB5C).

Among para-IPA letters, superscript variants of Sinological ⟨ȡ ȴ ȵ ȶ⟩, of the Bantuist labio-dental plosives ⟨ȹ⟩ and ⟨ȸ⟩, and of central semivowels ⟨ɉ⟩, ⟨ɥ̶⟩, and ⟨w̶⟩ have been accepted for a future version of the Unicode Standard.31

Vowel letters

The Unicode characters for superscript (modifier) IPA vowel letters, plus a pair of extended letters ⟨ᵻ ᵿ⟩ found in English dictionaries, are as follows. Recently retired alternative letters such as ⟨ɩ ɷ⟩ are also supported; they are set off in parentheses and placed below the standard IPA letters:

IPA vowels and superscript variants
FrontCentralBack
Closei ⁱ2071y ʸ02B8ɨ ᶤ1DA4ʉ ᶶ1DB6ɯ ᵚ1D5Au ᵘ1D58
Near-closeɪ ᶦ1DA6(ɩ ᶥ)1DA5ʏ 𐞲107B2(ᵻ ᶧ)1DA7(ᵿ)(ω)ʊ ᶷ1DB7(ɷ 𐞤)107A4
Close-mide ᵉ1D49ø 𐞢107A2ɘ 𐞎1078Eɵ ᶱ1DB1ɤ 𐞑10791o ᵒ1D52
Midə ᵊ1D4A
Open-midɛ ᵋ1D4Bœ ꟹA7F9ɜ ᶟ1D9F(ᴈ ᵌ)1D4Cɞ 𐞏1078Fʌ ᶺ1DBAɔ ᵓ1D53
Near-openæ 𐞃10783ɶ 𐞣107A3ɐ ᵄ1D44ɑ ᵅ1D45ɒ ᶛ1D9B
Opena ᵃ1D43

The precomposed Unicode rhotic vowel letters ⟨ɚ ɝ⟩ are not directly supported. The rhotic diacritic U+02DE ◌˞ should be used instead: ⟨ᵊ˞ ᶟ˞⟩.32

⟨ɜ⟩ and ⟨ᶟ⟩ are reversed ɛ. The older IPA turned ɛ, ⟨ᴈ⟩, is also supported, at U+1D4C ⟨ᵌ⟩. However, the briefly resurrected vowel letter ⟨ʚ⟩ (U+029A) is not supported, only its reversed replacement ⟨ɞ⟩ is.

Among older letters, ⟨ᴜ⟩ (U+1D1C), a graphic variant of ⟨ʊ⟩, is supported at ⟨ᶸ⟩ (U+1DB8).33

Among para-IPA letters, Sinological superscript ⟨ɿ ʅ ʮ ʯ ⟩ and ⟨ᴀ ᴇ⟩ have been accepted for a future version of the Unicode Standard.34

Length marks

The two length marks are also supported:

Length marks
LongHalf-long
ː 𐞁10781ˑ 𐞂10782

These are used to add length to another superscript, such as ⟨Cʰ𐞁⟩ or ⟨Cʰ𐞂⟩ for long aspiration.

Wildcards

Superscript wildcards (full caps) are largely supported: e.g. ᴺC (prenasalized consonant), ꟲN (prestopped nasal), Pꟳ (fricative release), NᴾF (epenthetic plosive), CVNᵀ (tone-bearing syllable), Cᴸ (liquid or lateral release), Cᴿ (rhotic or resonant release), Vᴳ (off-glide/diphthong), Cⱽ (fleeting vowel). Superscript S for sibilant release has been accepted for a future version of the Unicode Standard;35 superscript Ʞ for fleeting/epenthetic click has not. Other basic Latin superscript wildcards for tone and weak indeterminate sounds, as described in the article on the International Phonetic Alphabet, are mostly supported. (See table in previous section.)

Combining marks and subscripts

In addition, a very few IPA letters beyond the basic Latin alphabet have combining forms or are supported as subscripts:

Additional IPA characters
äɑæβçðəʃʍχʔʼ
Overscript◌ᷲ◌ᷧ◌ᷔ◌ᷩ◌ᷗ◌ᷙ◌ᷪ◌ᷯ◌̉36◌̓
Subscript
Underscript◌ᫀ◌̦

Composite characters

Primarily for compatibility with earlier character sets, Unicode contains a number of characters that compose super- and subscripts with other symbols.37 In most fonts these render much better than attempts to construct these symbols from the above characters or by using markup.

Notes

  • Writing portal

References

  1. "UCD: UnicodeData.txt". The Unicode Standard. Retrieved May 14, 2016. https://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt

  2. Martin Dürst, Asmus Freytag (May 16, 2007). "Unicode in XML and other Markup Languages". W3C. Retrieved September 13, 2010. http://www.w3.org/TR/unicode-xml/#Superscripts

  3. Martin Dürst, Asmus Freytag (May 16, 2007). "Unicode in XML and other Markup Languages". W3C. Retrieved September 13, 2010. http://www.w3.org/TR/unicode-xml/#Superscripts

  4. "fraction | Dart Package". Dart packages. December 27, 2021. Retrieved September 21, 2022. https://pub.dev/packages/fraction

  5. "MathML | General layout elements | Fractions". data2type GmbH (in German). March 30, 2021. Archived from the original on January 28, 2021. Retrieved January 13, 2022. https://web.archive.org/web/20210128015254/https://www.data2type.de/xml-xslt-xslfo/math-ml/presentation-markup/layout-elements/fractions/

  6. Martin Dürst, Asmus Freytag (May 16, 2007). "Fraction Slash". W3C. Retrieved September 13, 2010. https://www.w3.org/TR/unicode-xml/#Fraction

  7. For a general overview and technical information on glyph substitution (though not specifically for fractions), see GSUB — Glyph Substitution Table in the OpenType specification on the Microsoft Typography site. https://www.microsoft.com/typography/otspec/gsub.htm

  8. Such as Andika, Arno Pro, Brill, Brioso Pro, Calibri, Candara, Carlito, Cantarell, FiraGO, EB Garamond, Gentium, Lato, Linux Libertine, Noto Sans, Noto Serif, Open Sans and Yrsa /wiki/Andika_(typeface)

  9. Such as Chrome, Firefox and Falkon /wiki/Chrome_(browser)

  10. Such as LibreOffice Writer /wiki/LibreOffice_Writer

  11. Such as Adobe InDesign and Scribus /wiki/Adobe_InDesign

  12. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  13. "Approved Minutes of UTC Meeting 181". Unicode Consortium. January 27, 2025. Retrieved March 8, 2025. https://www.unicode.org/L2/L2024/24221.htm#181-C35

  14. "UCD: UnicodeData.txt". The Unicode Standard. Retrieved May 14, 2016. https://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt

  15. "UCD: Scripts.txt". The Unicode Standard. Retrieved September 21, 2022. https://www.unicode.org/Public/UCD/latest/ucd/Scripts.txt

  16. Everson, Michael; West, Andrew (October 5, 2020). "L2/20-268: Revised proposal to add ten characters for Middle English to the UCS" (PDF). https://www.unicode.org/L2/L2020/20268-n5145-ormulum.pdf

  17. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  18. In some fonts, Latin alpha ᵅ and upsilon ᶹ can be used as superscript Greek alpha and upsilon. ᵋ and ᶥ are also officially Latin letters, but display the same as Greek.

  19. In some fonts, Latin alpha ᵅ and upsilon ᶹ can be used as superscript Greek alpha and upsilon. ᵋ and ᶥ are also officially Latin letters, but display the same as Greek.

  20. In some fonts, Latin alpha ᵅ and upsilon ᶹ can be used as superscript Greek alpha and upsilon. ᵋ and ᶥ are also officially Latin letters, but display the same as Greek.

  21. ⟨ͺ⟩ is set lower than a normal subscript. It is equivalent to underscript ⟨◌ͅ⟩ on a space.

  22. ⟨◌̫⟩ is traditionally typeset as an omega.

  23. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  24. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  25. Superscript ⟨ç⟩ is composed of superscript c and a combining cedilla, which should display properly in a good font. Superscript c was specifically requested for this purpose in Unicode proposal L2/03-180. /wiki/Combining_character

  26. U+02E4 ˤ MODIFIER LETTER SMALL REVERSED GLOTTAL STOP is the superscript variant of U+0295 ʕ LATIN LETTER PHARYNGEAL VOICED FRICATIVE and is defined for IPA use. The similar character U+02C1 ˁ MODIFIER LETTER REVERSED GLOTTAL STOP is a reversed U+02C0 ˀ MODIFIER LETTER GLOTTAL STOP, perhaps a gelded reversed question mark. Fonts are inconsistent in whether they look different and what the difference is.

  27. In Microsoft fonts, superscript ⟨ɫ⟩ was erroneously designed as a superscript ⟨ꬸ⟩.

  28. U+A71D ⟨ꜝ⟩ and A71E ⟨ꜞ⟩ were adopted as the Africanist equivalents of the IPA characters ⟨ꜜ⟩ downstep and ⟨ꜛ⟩ upstep. The correspondence of U+A71D ⟨ꜝ⟩ to the IPA click letter ⟨ǃ⟩ is thus accidental. Coincidentally, U+A71E ⟨ꜞ⟩ serves as the superscript variant of the extIPA percussive consonant ⟨¡⟩; the other percussive letters, ⟨ʬ⟩ and ⟨ʭ⟩, do not have superscript support in Unicode. /wiki/Downstep

  29. Kirk Miller & Michael Ashby, L2/20-253R Unicode request for IPA modifier letters (b), non-pulmonic. https://www.unicode.org/L2/L2020/20253r-mod-ipa-b.pdf

  30. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  31. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  32. Kirk Miller & Michael Ashby, L2/20-252R Unicode request for IPA modifier-letters (a), pulmonic https://www.unicode.org/L2/L2020/20252r-mod-ipa-a.pdf

  33. Kirk Miller (January 30, 2024). "L2/24-081: Latin Phonetic The for Middle Tilde" (PDF). https://www.unicode.org/L2/L2024/24081-phonetic-middle-tilde.pdf

  34. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  35. "Additional draft repertoire for provisionally assigned code points for Unicode" (PDF). Unicode Consortium. November 26, 2024. https://www.unicode.org/L2/L2024/24268-n5291-post-17-chart.pdf

  36. This is actually the Vietnamese diacritic dấu hỏi, not specifically IPA, but graphically both are gelded question marks. /wiki/Hook_above

  37. "UCD: UnicodeData.txt". The Unicode Standard. Retrieved May 14, 2016. https://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt

  38. Silva, Eduardo Marín (March 1, 2017). "L2/17-066R: Proposal to encode the Marca Registrada sign" (PDF). http://www.unicode.org/L2/L2017/17066r-marca-registrada.pdf