Unicode/Versions

This page describes each version of the Unicode standard and the differences between successive versions.

Unicode 1.0

Unicode 1.0 was the first version of Unicode, released in October 1991. It encoded 7,094 characters.

“Blocks”

This version of Unicode did not formally group characters into blocks. Mapped onto the block structure of version 2.0, however, the following 57 “blocks” were available in the range U+0000-U+FFFD:

  • Basic Latin (U+0000-U+007F), containing 128 characters.
  • Latin-1 Supplement (U+0080-U+00FF), containing 128 characters.
  • Latin Extended-A (U+0100-U+017F), containing 127 characters.
  • Latin Extended-B (U+0180-U+01FF), containing 113 characters.
  • IPA Extensions (U+0250-U+02AF), containing 89 characters.
  • Spacing Modifier Letters (U+02B0-U+02FF), containing 57 characters.
  • Combining Diacritical Marks (U+0300-U+036F), containing 66 characters.
  • Greek and Coptic (U+0370-U+03FF), containing 112 characters.
  • Cyrillic (U+0400-U+04FF), containing 192 characters.
  • Armenian (U+0530-U+058F), containing 84 characters.
  • Hebrew (U+0590-U+05FF), containing 52 characters.
  • Arabic (U+0600-U+06FF), containing 169 characters.
  • Devanagari (U+0900-U+097F), containing 104 characters.
  • Bengali (U+0980-U+09FF), containing 89 characters.
  • Gurmukhi (U+0A00-U+0A7F), containing 74 characters.
  • Gujarati (U+0A80-U+0AFF), containing 75 characters.
  • Oriya (U+0B00-U+0B7F), containing 78 characters.
  • Tamil (U+0B80-U+0BFF), containing 61 characters.
  • Telugu (U+0C00-U+0C7F), containing 80 characters.
  • Kannada (U+0C80-U+0CFF), containing 80 characters.
  • Malayalam (U+0D00-U+0D7F), containing 78 characters.
  • Thai (U+0E00-U+0E7F), containing 92 characters.
  • Lao (U+0E80-U+0EFF), containing 70 characters.
  • Tibetan (U+1000-U+105F), containing 71 characters.
  • Georgian (U+10A0-U+10FF), containing 78 characters.
  • General Punctuation (U+2000-U+206F), containing 67 characters.
  • Superscripts and Subscripts (U+2070-U+209F), containing 28 characters.
  • Currency Symbols (U+20A0-U+20CF), containing 11 characters.
  • Combining Marks for Symbols (U+20D0-U+20FF), containing 18 characters.
  • Letterlike Symbols (U+2100-U+214F), containing 57 characters.
  • Number Forms (U+2150-U+218F), containing 48 characters.
  • Arrows (U+2190-U+21FF), containing 91 characters.
  • Mathematical Operators (U+2200-U+22FF), containing 242 characters.
  • Miscellaneous Technical (U+2300-U+23FF), containing 43 characters.
  • Control Pictures (U+2400-U+243F), containing 37 characters.
  • Optical Character Recognition (U+2440-U+245F), containing 11 characters.
  • Enclosed Alphanumerics (U+2460-U+24FF), containing 139 characters.
  • Box Drawing (U+2500-U+257F), containing 128 characters.
  • Block Elements (U+2580-U+259F), containing 22 characters.
  • Geometric Shapes (U+25A0-U+25FF), containing 79 characters.
  • Miscellaneous Symbols (U+2600-U+26FF), containing 106 characters.
  • Dingbats (U+2700-U+27BF), containing 160 characters.
  • CJK Symbols and Punctuation (U+3000-U+303F), containing 56 characters.
  • Hiragana (U+3040-U+309F), containing 90 characters.
  • Katakana (U+30A0-U+30FF), containing 90 characters.
  • Bopomofo (U+3100-U+312F), containing 40 characters.
  • Hangul Compatibility Jamo (U+3130-U+318F), containing 94 characters.
  • Kanbun (U+3190-U+319F), containing 16 characters.
  • Enclosed CJK Letters and Months (U+3200-U+32FF), containing 191 characters.
  • CJK Compatibility (U+3300-U+33FF), containing 187 characters.
  • Hangul (U+3400-U+3D2D), containing 2,350 characters.
  • Private Use Area (U+E800-U+FDFF), reserved for 5,632 characters.
  • CJK Compatibility Forms (U+FE30-U+FE4F), containing 28 characters.
  • Small Form Variants (U+FE50-U+FE6F), containing 26 characters.
  • Arabic Presentation Forms-B (U+FE70-U+FEFF), containing 140 characters.
  • Halfwidth and Fullwidth Forms (U+FF00-U+FFEF), containing 216 characters.
  • Specials (U+FFF0-U+FFFF), containing 1 character.
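
The ranges in the list above give each block's capacity, which can be larger than the number of characters actually assigned. A minimal Python sketch of that arithmetic follows; the helper name `block_capacity` is hypothetical, and only the ranges and counts are taken from the list above.

```python
def block_capacity(first: int, last: int) -> int:
    """Number of code points in the inclusive range first..last."""
    return last - first + 1

# (name, first, last, assigned) copied from the list above.
blocks = [
    ("Basic Latin", 0x0000, 0x007F, 128),
    ("Latin Extended-A", 0x0100, 0x017F, 127),
    ("Hangul", 0x3400, 0x3D2D, 2350),
    ("Specials", 0xFFF0, 0xFFFF, 1),
]

for name, first, last, assigned in blocks:
    capacity = block_capacity(first, last)
    print(f"{name}: {assigned} of {capacity} code points assigned")
```

Basic Latin and Hangul fill their ranges completely, while Latin Extended-A and Specials leave code points unassigned.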

Unicode 1.0.1

Unicode 1.0.1 was released June 1992. It encoded 28,292 characters, adding 21,204 new characters and removing 6 characters, for a net increase of 21,198 characters.

New blocks

  • CJK Unified Ideographs (U+4E00-U+9FFF), containing 20,902 Han Ideographs for Chinese, Japanese and Korean, was added.
  • CJK Compatibility Ideographs (U+F900-U+FAFF), containing 302 Han Ideographs for compatibility with existing character sets, was added.

Removed characters

  • Letters Ka and Kha with Ogonek (total 4 characters) were removed from Cyrillic. (U+04C5-U+04C6 and U+04C9-U+04CA)
  • APL Compose Operator and APL Out (total 2 characters) were removed from Miscellaneous Technical. (U+2300-U+2301)

Rearranged characters

  • A Japanese Industrial Standard symbol (〄) was moved from Enclosed CJK Letters and Months (U+32FF) to CJK Symbols and Punctuation. (U+3004)
  • Circled Katakana: The characters were rearranged into modern order, e.g. A, I, U, E, O, KA, KI. (U+32D0-U+32FE)
  • Basic Glyphs for Arabic Language: The character shapes were rearranged into a different order: isolated, final, initial and medial. (U+FE80-U+FEFC)

Characters with semantics changed

  • Zero Width Non-Joiner [ZWNJ] (U+200C)
  • Zero Width Joiner [ZWJ] (U+200D)

Unicode 1.1

Unicode 1.1 was released in June 1993. It encoded 34,168 characters, adding 5,969 new characters and removing 93 characters, for a net increase of 5,876 characters. It finalized the long-anticipated Han Unification.

New blocks

  • Hangul Jamo (U+1100-U+11FF), containing 240 jamo for the Hangul script, was added.
  • Latin Extended Additional (U+1E00-U+1EFF), containing 245 precomposed characters for transliteration and Vietnamese, was added.
  • Greek Extended (U+1F00-U+1FFF), containing 233 precomposed characters for polytonic Greek, was added.
  • Hangul Supplementary-A (U+3D2E-U+44B7), containing 1,930 precomposed syllables for the Hangul script, was added.
  • Hangul Supplementary-B (U+44B8-U+4DFF), containing 2,376 precomposed syllables for the Hangul script, was added.
  • Alphabetic Presentation Forms (U+FB00-U+FB4F), containing 57 precomposed characters and ligatures, was added.
  • Arabic Presentation Forms-A (U+FB50-U+FDFF), containing 593 combinations of Arabic letters, was added.
  • Combining Half Marks (U+FE20-U+FE2F), containing 4 halves of diacritical marks, was added.

Extended blocks

  • The long S (ſ) (total 1 character) was added to Latin Extended-A. (U+017F)
  • The Hungarian Dz, characters for transliteration purposes and precomposed characters with double grave and inverted breve (total 35 characters) were added to Latin Extended-B. (U+01F1-U+01F5 and U+01FA-U+0217) The block was expanded from U+0180-U+01FF to U+0180-U+024F.
  • Diacritics for polytonic Greek and double width diacritics (total 6 characters) were added to Combining Diacritical Marks. (U+0342-U+0345 and U+0360-U+0361)
  • A now-deprecated compatibility character, the Ano Teleia and other characters (total 5 characters) were added to Greek and Coptic. (U+0374-U+0375, U+037A, U+037E and U+0387)
  • Additional characters for non-Slavic languages (total 38 characters) were added to Cyrillic. (U+04D0-U+04EB, U+04EE-U+04F5 and U+04F8-U+04F9)
  • A ligature of Ech and Yiwn (և) (total 1 character) was added to Armenian. (U+0587)
  • One deprecated compatibility character and several characters for biblical texts (total 25 characters) were added to Arabic. (U+066D and U+06D6-U+06ED)
  • A sign Virama (total 1 character) was added to Gurmukhi (U+0A4D).
  • Letters Candra O and E (total 3 characters) were added to Gujarati. (U+0A8D, U+0A91 and U+0AC9)
  • An Ai Length mark (total 1 character) was added to Oriya. (U+0B56)
  • An undertie, a pair of brackets and six formatting characters now deprecated (total 9 characters) were added to General Punctuation. (U+203F, U+2045-U+2046 and U+206A-U+206F)
  • Some additional symbols and the complete set of APL functional symbols (total 79 characters) were added to Miscellaneous Technical. (U+2300 and U+232D-U+237A)
  • A large circle (◯) (total 1 character) was added to Geometric Shapes. (U+25EF)
  • The ideographic telegraph line feed separator symbol (〷) (total 1 character) was added to CJK Symbols and Punctuation. (U+3037)
  • Four Katakana letters not in use since 1945 (total 4 characters) were added to Katakana. (U+30F7-U+30FA)
  • Ideographic telegraph symbols for the twelve months (total 12 characters) were added to Enclosed CJK Letters and Months. (U+32C0-U+32CB)
  • Ideographic telegraph symbols for hours and days and six additional measure units (total 62 characters) were added to CJK Compatibility. (U+3358-U+3376 and U+33E0-U+33FE)
  • Some more space (total 2,304 characters) was added to the Private Use Area.
  • Seven halfwidth geometric shapes (total 7 characters) were added to Halfwidth and Fullwidth Forms. (U+FFE8-U+FFEE)

Removed blocks

  • Tibetan, containing 71 letters for the Tibetan script, was removed from the Unicode standard.

Removed characters

  • A total of 10 characters were removed from Greek and Coptic. (U+0370-U+0372, U+03D7-U+03D9, U+03DB, U+03DD, U+03DF, and U+03E1)
  • Point Varika (total 1 character) was removed from Hebrew. (U+05F5)
  • Phonetic Order Vowel Signs (total 5 characters) were removed from Thai. (U+0E70-U+0E74)
  • Phonetic Order Vowel Signs (total 5 characters) were removed from Lao. (U+0EF0-U+0EF4)
  • An Ideographic Ditto Mark (total 1 character) was removed from CJK Symbols and Punctuation (U+3004) and merged with CJK Unified Ideograph-4EDD.

Rearranged characters

  • Greek character U+03F3 was changed from Spacing Tonos to Letter Yot.
  • A Japanese Industrial Standard symbol (〄) was moved from Enclosed CJK Letters and Months (U+32FF) to CJK Symbols and Punctuation. (U+3004)

Unicode 2.0

Unicode 2.0 was released July 1996. It encoded 38,885 characters, adding 11,373 new characters and removing 6,656 characters, for a net increase of 4,717 characters. This was the first Unicode version to reserve blocks outside of the Basic Multilingual Plane.

New blocks

  • Hangul Syllables (U+AC00-U+D7AF), containing 11,172 precomposed syllables for the Hangul script, was added.
  • High Surrogates (U+D800-U+DB7F), containing 896 characters, was added.
  • High Private Use Surrogates (U+DB80-U+DBFF), containing 128 characters, was added.
  • Low Surrogates (U+DC00-U+DFFF), containing 1,024 characters, was added.
  • Supplementary Private Use Area-A (U+F0000-U+FFFFF), reserving 65,534 characters for private use, was added.
  • Supplementary Private Use Area-B (U+100000-U+10FFFF), reserving 65,534 characters for private use, was added.
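
The three surrogate blocks above exist so that UTF-16 can address code points beyond U+FFFF with a pair of 16-bit units. The following Python sketch applies the standard UTF-16 pairing arithmetic; the function name `to_surrogate_pair` is an illustrative assumption and does not come from this page.

```python
def to_surrogate_pair(code_point: int) -> tuple[int, int]:
    """Split a supplementary code point (U+10000-U+10FFFF) into UTF-16 surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("only code points above U+FFFF need a surrogate pair")
    offset = code_point - 0x10000      # 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits select the high surrogate
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits select the low surrogate
    return high, low

for cp in (0x10000, 0xF0000, 0x10FFFF):
    high, low = to_surrogate_pair(cp)
    print(f"U+{cp:04X} -> U+{high:04X} U+{low:04X}")
```

The first code point of Supplementary Private Use Area-A, U+F0000, maps to the high surrogate U+DB80, which is where the High Private Use Surrogates block begins.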

Reinstated blocks

  • Tibetan (U+0F00-U+0FFF), now containing 168 characters for the Tibetan script including religious signs, was re-added.

Removed blocks

  • Hangul, containing 2,350 precomposed syllables for the Hangul script, was removed from the Unicode standard.
  • Hangul Supplementary-A, containing 1,930 precomposed syllables for the Hangul script, was removed from the Unicode standard.
  • Hangul Supplementary-B, containing 2,376 precomposed syllables for the Hangul script, was removed from the Unicode standard.

Extended blocks

  • Cantillation marks for use in religious texts (total 31 characters) were added to Hebrew. (U+0591-U+05A1, U+05A3-U+05AF and U+05C4)
  • A long S with Dot Above (total 1 character) was added to Latin Extended Additional. (U+1E9B)
  • A Vietnamese Dong sign (total 1 character) was added to Currency Symbols. (U+20AB)
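
The per-block figures in this section can be cross-checked against the totals stated in the introduction to Unicode 2.0. The Python sketch below simply sums the counts listed above; the variable names are illustrative. Surrogate and private-use code points are left out, since the stated totals only match when they are not counted.

```python
# Counts copied from the Unicode 2.0 lists above.
added = {
    "Hangul Syllables": 11172,
    "Tibetan (reinstated)": 168,
    "Hebrew cantillation marks": 31,
    "Latin Extended Additional": 1,
    "Currency Symbols": 1,
}
removed = {
    "Hangul": 2350,
    "Hangul Supplementary-A": 1930,
    "Hangul Supplementary-B": 2376,
}

total_added = sum(added.values())
total_removed = sum(removed.values())
net_increase = total_added - total_removed

print(total_added, total_removed, net_increase)  # 11373 6656 4717
```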

Unicode 2.1

Unicode 2.1 was released May 1998. It encoded 38,887 characters, adding only 2 new characters.

Extended blocks

  • A Euro sign (total 1 character) was added to Currency Symbols. (U+20AC)
  • An Object Replacement Character (total 1 character) was added to Specials. (U+FFFC)

Unicode 3.0

Unicode 3.0 was released in September 1999. A major update, it encoded 49,194 characters, adding 10,307 new characters.

New blocks

  • Syriac (U+0700-U+074F), containing 71 characters used for writing in Syriac script, was added.
  • Thaana (U+0780-U+07BF), containing 49 characters used for writing in Thaana script, was added.
  • Sinhala (U+0D80-U+0DFF), containing 80 characters for the Sinhala script, was added.
  • Myanmar (U+1000-U+109F), containing 78 characters for the Burmese script, was added.
  • Ethiopic (U+1200-U+137F), containing 345 syllables and punctuation marks for the Ethiopic script, was added.
  • Cherokee (U+13A0-U+13FF), containing 85 syllables for the Cherokee script, was added.
  • Unified Canadian Aboriginal Syllabics (U+1400-U+167F), containing 630 syllables and punctuation marks for writing in aboriginal languages of Canada, was added.
  • Ogham (U+1680-U+169F), containing 29 characters for the ancient Ogham script, was added.
  • Runic (U+16A0-U+16FF), containing 81 characters for the Germanic runes, was added.
  • Khmer (U+1780-U+17FF), containing 103 characters for the Khmer script, was added.
  • Mongolian (U+1800-U+18AF), containing 155 characters for the classical Mongolian script, was added.
  • Braille Patterns (U+2800-U+28FF), containing 256 Braille letters, was added.
  • CJK Radicals Supplement (U+2E80-U+2EFF), containing 115 non-Kangxi radicals, was added.
  • Kangxi Radicals (U+2F00-U+2FDF), containing 214 radicals from the Kangxi dictionary, was added.
  • Ideographic Description Characters (U+2FF0-U+2FFF), containing 12 characters used to describe a Han ideograph not available in the font, was added.
  • Bopomofo Extended (U+31A0-U+31BF), containing 24 characters used for phonetic transcription of minority languages of Taiwan, was added.
  • CJK Unified Ideographs Extension A (U+3400-U+4DBF), containing 6,582 additional Han Ideographs, was added.
  • Yi Syllables (U+A000-U+A48F), containing 1,165 syllables of the modern Yi script, was added.
  • Yi Radicals (U+A490-U+A4CF), containing 50 radicals of Yi Syllables, was added.

Extended blocks

  • Additional precomposed characters, letters and capital counterparts of previously lowercase-only letters (total 30 characters) were added to Latin Extended-B. (U+01F6-U+01F9, U+0218-U+021F and U+0222-U+0233)
  • Extensions for disordered speech (total 5 characters) were added to IPA Extensions. (U+02A9-U+02AD)
  • Some additional modifier letters (total 6 characters) were added to Spacing Modifier Letters. (U+02DF and U+02EA-U+02EE)
  • Additional combining diacritics for IPA (total 10 characters) were added to Combining Diacritical Marks. (U+0346-U+034E and U+0362)
  • Lowercase versions of archaic letters and the Kai symbol (total 5 characters) were added to Greek and Coptic. (U+03D7, U+03DB, U+03DD, U+03DF and U+03E1)
  • Nonstandard letters for Macedonian, combining numeral signs and three letters for Kildin Sami (total 12 characters) were added to Cyrillic. (U+0400, U+040D, U+0450, U+045D, U+0488-U+0489, U+048C-U+048F and U+04EC-U+04ED)
  • A Hyphen (total 1 character) was added to Armenian. (U+058A)
  • Combining hamza and maddah and nine additional Arabic characters (total 12 characters) were added to Arabic. (U+0653-U+0655, U+06B8-U+06B9, U+06BF, U+06CF and U+06FA-U+06FE)
  • Additional letters and religious symbols (total 25 characters) were added to Tibetan. (U+0F6A, U+0F96, U+0FAE-U+0FB0, U+0FB8, U+0FBA-U+0FBC, U+0FBE-U+0FCC and U+0FCF)
  • A narrow no-break space and 6 additional punctuation marks (total 7 characters) were added to General Punctuation. (U+202F and U+2048-U+204D)
  • The Kip, Tugrik and Drachma signs (total 3 characters) were added to Currency Symbols. (U+20AD-U+20AF)
  • An enclosing screen and an enclosing key (total 2 characters) were added to Combining Diacritical Marks for Symbols. (U+20E2-U+20E3)
  • The information symbol and a rotated Q (total 2 characters) were added to Letterlike Symbols. (U+2139-U+213A)
  • A mirrored Roman capital numeral hundred (Ↄ) (total 1 character) was added to Number Forms. (U+2183)
  • Some additional arrows (total 9 characters) were added to Arrows. (U+21EB-U+21F3)
  • Some additional technical symbols, including common keys on a 101 keyboard (total 33 characters) were added to Miscellaneous Technical. (U+2301, U+237B and U+237D-U+239A)
  • Two additional control pictures (total 2 characters) were added to Control Pictures. (U+2425-U+2426)
  • Squares and circles with quadrants (total 8 characters) were added to Geometric Shapes. (U+25F0-U+25F7)
  • Two Syriac crosses and a signature mark (total 3 characters) were added to Miscellaneous Symbols. (U+2619 and U+2670-U+2671)
  • Three Hangzhou numerals and a variation indicator (total 4 characters) were added to CJK Symbols and Punctuation. (U+3038-U+303A and U+303E)
  • A ligature Yod with Hiriq (יִ) (total 1 character) was added to Alphabetic Presentation Forms. (U+FB1D)
  • Three additional control characters for ruby markup (total 3 characters) were added to Specials. (U+FFF9-U+FFFB)

Unicode 3.1

Unicode 3.1 was released March 2001. It encoded 94,140 characters, adding 44,946 new characters, and mainly focused on blocks outside of the Basic Multilingual Plane.

New blocks

  • Old Italic (U+10300-U+1032F), containing 35 letters for the Etruscan script, was added.
  • Gothic (U+10330-U+1034F), containing 27 letters for the Gothic script, was added.
  • Deseret (U+10400-U+1044F), containing 76 letters for the constructed Deseret script, was added.
  • Byzantine Musical Symbols (U+1D000-U+1D0FF), containing 246 symbols for Byzantine musical notation, was added.
  • Musical Symbols (U+1D100-U+1D1FF), containing 219 characters for current musical notation, was added.
  • Mathematical Alphanumeric Symbols (U+1D400-U+1D7FF), containing 991 Latin and Greek letters in serif, sans-serif, bold, italic, double-struck, script and Fraktur/Blackletter, was added.
  • CJK Unified Ideographs Extension B (U+20000-U+2A6DF), containing 42,711 additional Chinese Ideographs, was added.
  • CJK Compatibility Ideographs Supplement (U+2F800-U+2FA1F), containing 542 additional Chinese Ideographs for compatibility purposes, was added.
  • Tags (U+E0000-U+E007F), containing 97 language tags, was added.

Extended noncharacters

  • The noncharacter range U+FDD0-U+FDEF (32 code points) was added within Arabic Presentation Forms-A.
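
For reference, the complete set of noncharacters consists of this range plus the last two code points of each of the 17 planes, which is also why each supplementary private use plane offers 65,534 rather than 65,536 code points. A small Python sketch, using the hypothetical helper `is_noncharacter`:

```python
def is_noncharacter(code_point: int) -> bool:
    """True for U+FDD0-U+FDEF and for code points ending in FFFE or FFFF in any plane."""
    if 0xFDD0 <= code_point <= 0xFDEF:
        return True
    return (code_point & 0xFFFF) >= 0xFFFE

# Scan the whole code space U+0000-U+10FFFF.
noncharacters = [cp for cp in range(0x110000) if is_noncharacter(cp)]
print(len(noncharacters))  # 66

# Usable code points in plane 15 (Supplementary Private Use Area-A).
plane_15 = range(0xF0000, 0x100000)
print(sum(not is_noncharacter(cp) for cp in plane_15))  # 65534
```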

Extended blocks

  • The capital Theta symbol and the Lunate Epsilon symbol (total 2 characters) were added to Greek and Coptic. (U+03F4-U+03F5)

Characters and Scripts Under Investigation or Rejected

  • Khmer Sign Laak (U+17DD) was rejected from Khmer.
  • Georgian Letter U-Brjuu was rejected from Georgian.

Unicode 3.2

Unicode 3.2 was released March 2002. It encoded 95,156 characters, adding 1,016 new characters.

New blocks

  • Cyrillic Supplement (U+0500-U+052F), containing 16 characters used for the Komi language, was added.
  • Tagalog (U+1700-U+171F), containing 20 characters for the Baybayin script, was added.
  • Hanunoo (U+1720-U+173F), containing 23 characters and punctuation for the Hanunoo script, was added.
  • Buhid (U+1740-U+175F), containing 20 characters for the Buhid script, was added.
  • Tagbanwa (U+1760-U+177F), containing 18 characters for the Tagbanwa script, was added.
  • Miscellaneous Mathematical Symbols-A (U+27C0-U+27EF), containing 28 symbols used in math notation, was added.
  • Supplemental Arrows-A (U+27F0-U+27FF), containing 16 additional arrows, was added.
  • Supplemental Arrows-B (U+2900-U+297F), containing 128 special arrows, was added.
  • Miscellaneous Mathematical Symbols-B (U+2980-U+29FF), containing 128 additional mathematical symbols, was added.
  • Supplemental Mathematical Operators (U+2A00-U+2AFF), containing 256 additional mathematical operators, was added.
  • Katakana Phonetic Extensions (U+31F0-U+31FF), containing 16 Katakana letters used for Ainu, was added.
  • Variation Selectors (U+FE00-U+FE0F), containing 16 symbols used for indicating variations, was added.

Extended blocks

  • A capital letter N with Long Right Leg (total 1 character) was added to Latin Extended-B. (U+0220)
  • The combining grapheme joiner and combining Latin letters used in medieval texts (total 14 characters) were added to Combining Diacritical Marks. (U+034F and U+0363-U+036F)
  • The Qoppa and a reversed lunate epsilon symbol (total 3 characters) were added to Greek and Coptic. (U+03D8-U+03D9 and U+03F6)
  • Four additional letters used for the Kildin Sami language (total 8 characters) were added to Cyrillic. (U+048A-U+048B, U+04C5-U+04C6, U+04C9-U+04CA and U+04CD-U+04CE)
  • A dotless Beh and a dotless Qaf (total 2 characters) were added to Arabic. (U+066E-U+066F)
  • A Letter for Addu dialect (total 1 character) was added to Thaana. (U+07B1)
  • The letters Yn and Elifi (total 2 characters) were added to Georgian. (U+10F7-U+10F8)
  • Some additional punctuation marks and control characters (total 12 characters) were added to General Punctuation. (U+2047, U+204E-U+2052, U+2057 and U+205F-U+2063)
  • A superscript letter I (total 1 character) was added to Superscripts and Subscripts. (U+2071)
  • The German Penny and Peso signs (total 2 characters) were added to Currency Symbols. (U+20B0-U+20B1)
  • Some additional combining characters (total 7 characters) were added to Combining Diacritical Marks for Symbols. (U+20E4-U+20EA)
  • Some double-struck and reversed/turned letters (total 15 characters) were added to Letterlike Symbols. (U+213D-U+214B)
  • Some additional arrows (total 12 characters) were added to Arrows. (U+21F4-U+21FF)
  • Some additional mathematical operators (total 14 characters) were added to Mathematical Operators. (U+22F2-U+22FF)
  • Variable-width and additional symbols (total 53 characters) were added to Miscellaneous Technical. (U+237C and U+239B-U+23CE)
  • Black and double circled numerals (total 20 characters) were added to Enclosed Alphanumerics. (U+24EB-U+24FE)
  • Quadrant elements (total 10 characters) were added to Block Elements. (U+2596-U+259F)
  • Some additional triangles and squares (total 8 characters) were added to Geometric Shapes. (U+25F8-U+25FF)
  • Shogi pieces, recycling symbols, dice and dotted circles (total 24 characters) were added to Miscellaneous Symbols. (U+2616-U+2617, U+2672-U+267D and U+2680-U+2689)
  • Additional parentheses (total 14 characters) were added to Dingbats. (U+2768-U+2775)
  • Three additional marks (total 3 characters) were added to CJK Symbols and Punctuation. (U+303B-U+303D)
  • A digraph and two additional characters (total 3 characters) were added to Hiragana. (U+3095-U+3096 and U+309F)
  • A digraph and a double hyphen (total 2 characters) were added to Katakana. (U+30A0 and U+30FF)
  • Additional circled numerals (total 30 characters) were added to Enclosed CJK Letters and Months. (U+3251-U+325F and U+32B1-U+32BF)
  • Five missing radicals (total 5 characters) were added to Yi Radicals. (U+A4A2-U+A4A3, U+A4B4, U+A4C1, U+A4C5)
  • Additional compatibility characters (total 59 characters) were added to CJK Compatibility Ideographs. (U+FA30-U+FA6A)
  • A Rial sign (total 1 character) was added to Arabic Presentation Forms-A. (U+FDFC)
  • Two sesame dots (total 2 characters) were added to CJK Compatibility Forms. (U+FE45-U+FE46)
  • A tail fragment (total 1 character) was added to Arabic Presentation Forms-B. (U+FE73)
  • A pair of double parentheses (total 2 characters) was added to Halfwidth and Fullwidth Forms. (U+FF5F-U+FF60)

Unicode 4.0

Unicode 4.0 was released April 2003. It encoded 96,382 characters, adding 1,226 new characters.

New blocks

  • Limbu, containing 66 characters for the Limbu abugida, was added.
  • Tai Le, containing 35 letters for the Tai Le script, was added.
  • Khmer Symbols, containing 32 symbols for the lunar calendar, was added.
  • Phonetic Extensions, containing 108 letters used in phonetic transcription, was added.
  • Miscellaneous Symbols and Arrows, containing 14 additional arrows, was added.
  • Yijing Hexagram Symbols, containing 64 hexagrams, was added.
  • Linear B Syllabary, containing 88 syllables of the ancient Linear B script, was added.
  • Linear B Ideograms, containing 123 ideograms of the ancient Linear B script, was added.
  • Aegean Numbers, containing 57 numerals used in the Aegean area, was added.
  • Ugaritic, containing 31 characters used in Ugaritic cuneiform, was added.
  • Shavian, containing 48 letters used for the artificial Shavian script, was added.
  • Osmanya, containing 40 characters used in the artificial Osmanya script, was added.
  • Cypriot Syllabary, containing 55 characters formerly used on Cyprus, was added.
  • Tai Xuan Jing Symbols, containing 87 symbols of Tai Xuan Jing, was added.
  • Variation Selectors Supplement, containing 240 additional variation selectors, was added.

Extended blocks

  • Letters with curl used in Sinology (total 4 characters) were added to Latin Extended-B.
  • Former IPA letters (total 2 characters) were added to IPA Extensions.
  • Some additional characters (total 17 characters) were added to Spacing Modifier Letters.
  • Additional combining double-width diacritics and diacritics corresponding to their spacing equivalent (total 11 characters) were added to Combining Diacritical Marks.
  • The archaic letters Sho and San and the capital Lunate Sigma (total 5 characters) were added to Greek and Coptic.
  • Some additional markers, biblical signs, and letters with inverted V (total 19 characters) were added to Arabic.
  • Letters used for foreign words from Persian and Sogdian (total 6 characters) were added to Syriac.
  • The short A (ऄ) (total 1 character) was added to Devanagari.
  • The Avagraha sign (ঽ) (total 1 character) was added to Bengali.
  • The Adak Bindi and Visarga signs (total 2 characters) were added to Gurmukhi.
  • The vocalic l and ll and the Rupee sign (total 5 characters) were added to Gujarati.
  • The letters Va and Wa (total 2 characters) were added to Oriya.
  • Additional signs for date and finance environments (total 8 characters) were added to Tamil.
  • The Nukta and Avagraha signs (total 2 characters) were added to Kannada.
  • Some symbols and signs (total 11 characters) were added to Khmer.
  • An inverted undertie and a swung dash (total 2 characters) were added to General Punctuation.
  • The facsimile sign (℻) (total 1 character) was added to Letterlike Symbols.
  • The eject symbol and a vertical line (total 2 characters) were added to Miscellaneous Technical.
  • A black circled digit zero (⓿) (total 1 character) was added to Enclosed Alphanumerics.
  • Monograms and diagrams, flags, warning and weather symbols and a cup of tea (total 12 characters) were added to Miscellaneous Symbols.
  • Additional parenthesized and circled Korean characters and supplemental signs (total 9 characters) were added to Enclosed CJK Letters and Months.
  • Additional measure units (total 7 characters) were added to CJK Compatibility.
  • An additional Arabic sign (﷽) (total 1 character) was added to Arabic Presentation Forms-A.
  • A pair of vertical parentheses (total 2 characters) was added to CJK Compatibility Forms.
  • The letters Oi and Ew (total 4 characters) were added to Deseret.
  • A small script l (ℓ) (total 1 character) was added to Mathematical Alphanumeric Symbols.

Unicode 4.1

Unicode 4.1 was released March 31, 2005. It encoded 97,655 characters, adding 1,273 new characters.

New blocks

  • Arabic Supplement, containing 30 characters for various languages written with the Arabic script, was added.
  • Ethiopic Supplement, containing 26 characters and signs for Sebatbeit, was added.
  • New Tai Lue, containing 80 characters for the New Tai Lue script, was added.
  • Buginese, containing 30 characters for the Lontara script, was added.
  • Phonetic Extensions Supplement, containing 64 additional letters for phonetic transcription, was added.
  • Combining Diacritical Marks Supplement, containing 4 additional diacritics, was added.
  • Glagolitic, containing 94 characters for the Glagolitic script, was added.
  • Coptic, containing 114 characters for the Coptic script, was added.
  • Georgian Supplement, containing 38 Nuskhuri letters, was added.
  • Tifinagh, containing 55 characters for the Tifinagh script, was added.
  • Ethiopic Extended, containing 79 additional Ethiopic syllables, was added.
  • Supplemental Punctuation, containing 26 additional punctuation marks, was added.
  • CJK Strokes, containing 16 strokes for Han Ideographs, was added.
  • Modifier Tone Letters, containing 23 letters for Chinese tones, was added.
  • Syloti Nagri, containing 44 characters for the Syloti Nagri abugida, was added.
  • Vertical Forms, containing 10 punctuation marks suited for vertical text, was added.
  • Ancient Greek Numbers, containing 75 numerals and signs used in Ancient Greek, was added.
  • Old Persian, containing 50 characters for Old Persian cuneiform, was added.
  • Kharoshthi, containing 65 characters for the Kharoshthi abugida, was added.
  • Ancient Greek Musical Notation, containing 70 musical signs used in Ancient Greek, was added.

Extended blocks

  • Letters for Sencoten, digraphs, letters with swash tail and other additions (total 11 characters) were added to Latin Extended-B.
  • Additional diacritics for transliteration (total 5 characters) were added to Combining Diacritical Marks.
  • Rho with stroke, reversed and dotted Lunate Sigma (total 4 characters) were added to Greek and Coptic.
  • Ghe with descender (Ӷ) (total 2 characters) was added to Cyrillic.
  • An additional biblical mark and some punctuation marks (total 4 characters) were added to Hebrew.
  • Additional biblical marks, punctuation marks and the Afghani sign (total 8 characters) were added to Arabic.
  • A glottal stop (ॽ) (total 1 character) was added to Devanagari.
  • The Khanda Ta letter (ৎ) (total 1 character) was added to Bengali.
  • The letter Sha and the digit zero (total 2 characters) were added to Tamil.
  • Two marks used in Bhutan (total 2 characters) were added to Tibetan.
  • Two letters and a modifier letter (total 3 characters) were added to Georgian.
  • Some additional syllables (total 11 characters) were added to Ethiopic.
  • Additional phonetic symbols (total 20 characters) were added to Phonetic Extensions.
  • A flower and dot punctuation marks (total 9 characters) were added to General Punctuation.
  • Additional subscript letters (total 5 characters) were added to Superscripts and Subscripts.
  • The Guarani, Austral, Hryvnia and Cedi signs (total 4 characters) were added to Currency Symbols.
  • A combining long double solidus (⃫) (total 1 character) was added to Combining Diacritical Marks for Symbols.
  • The per sign and a double-struck letter Pi (total 2 characters) were added to Letterlike Symbols.
  • Metrical and electrical signs (total 11 characters) were added to Miscellaneous Technical.
  • Additional gender and map symbols (total 30 characters) were added to Miscellaneous Symbols.
  • Some additional mathematical symbols (total 7 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Additional arrows and squares (total 6 characters) were added to Miscellaneous Symbols and Arrows.
  • A circled Hangul character (㉾) (total 1 character) was added to Enclosed CJK Letters and Months.
  • Additional Han Ideographs (total 22 characters) were added to CJK Unified Ideographs.
  • Additional Compatibility Ideographs (total 106 characters) were added to CJK Compatibility Ideographs.
  • Italic dotless small i and j (total 2 characters) were added to Mathematical Alphanumeric Symbols.

Unicode 5.0

Unicode 5.0 was released July 14, 2006. It encoded 99,024 characters, adding 1,369 new characters.

New blocks

  • N'Ko, containing 59 characters for the N'Ko script, was added.
  • Balinese, containing 121 characters and musical signs for the Balinese abugida, was added.
  • Latin Extended-C, containing 17 letters for various languages, was added.
  • Latin Extended-D, containing 2 characters for UPA, was added.
  • Phags-pa, containing 56 characters for the Phags-pa script, was added.
  • Phoenician, containing 27 letters and numerals for the Phoenician script, was added.
  • Cuneiform, containing 879 signs for Sumero-Akkadian Cuneiform, was added.
  • Cuneiform Numbers and Punctuation, containing 103 numerals and punctuation signs for Sumero-Akkadian Cuneiform, was added.
  • Counting Rod Numerals, containing 18 numerals used with counting rods, was added.

Extended blocks

  • Various letters used mainly for aboriginal languages (total 14 characters) were added to Latin Extended-B.
  • Lowercase lunate sigma symbols (total 3 characters) were added to Greek and Coptic.
  • Lowercase palochka and 3 letters used in Nivkh (total 7 characters) were added to Cyrillic.
  • Two letters used in Khanty and other languages (total 4 characters) were added to Cyrillic Supplement.
  • A specific point meant for Vav (ֺ) (total 1 character) was added to Hebrew.
  • Four letters used in Sindhi (total 4 characters) were added to Devanagari.
  • Four letters used in Sanskrit (total 4 characters) were added to Kannada.
  • Additional IPA diacritics (total 9 characters) were added to Combining Diacritical Marks Supplement.
  • Four combining arrows (total 4 characters) were added to Combining Diacritical Marks for Symbols.
  • A Danish symbol and a lowercase turned F (total 2 characters) were added to Letterlike Symbols.
  • A lowercase reversed C (ↄ) (total 1 character) was added to Number Forms.
  • Vertical parentheses, geometric forms and electrical symbols (total 12 characters) were added to Miscellaneous Technical.
  • A neuter symbol (⚲) (total 1 character) was added to Miscellaneous Symbols.
  • Four additional mathematical symbols (total 4 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Additional squares, pentagons and hexagons (total 11 characters) were added to Miscellaneous Symbols and Arrows.
  • Four additional tone letters used in Chinantec (total 4 characters) were added to Modifier Tone Letters.
  • Bold capital and small Digamma (𝟊/𝟋) (total 2 characters) were added to Mathematical Alphanumeric Symbols.

Unicode 5.1

Unicode 5.1 was released April 4, 2008. It encoded 100,648 characters, adding 1,624 new characters.

New blocks

  • Sundanese, containing 55 letters for Sundanese script, was added.
  • Lepcha, containing 74 letters for Lepcha script, was added.
  • Ol Chiki, containing 48 letters for Ol Chiki script, was added.
  • Cyrillic Extended-A, containing 32 letters for combining Cyrillic letters, was added.
  • Vai, containing 300 letters for Vai script, was added.
  • Cyrillic Extended-B, containing 78 letters for additional Cyrillic characters, was added.
  • Saurashtra, containing 81 letters for Saurashtra script, was added.
  • Kayah Li, containing 48 letters for Kayah languages, was added.
  • Rejang, containing 37 letters for Rejang script, was added.
  • Cham, containing 83 letters for Cham script, was added.
  • Ancient Symbols, containing 12 characters for weights and measures and other Ancient symbols, was added.
  • Phaistos Disc, containing 46 hieroglyphs for Phaistos, was added.
  • Lycian, containing 29 letters for Lycian script, was added.
  • Carian, containing 49 letters for Carian script, was added.
  • Lydian, containing 27 letters for Lydian script, was added.
  • Mahjong Tiles, containing 44 mahjong tiles, was added.
  • Domino Tiles, containing 100 domino tiles, was added.

Extended blocks

  • Archaic letters and capital kai symbol (total 7 characters) were added to Greek and Coptic.
  • Combining Pokrytie (total 1 character) was added to Cyrillic.
  • Mordvin, Kurdish, Aleut and Chuvash letters (total 16 characters) were added to Cyrillic Supplement.
  • Radix symbols, letterlike symbols, punctuation, Koranic annotation signs and additions for early Persian and Azerbaijani (total 15 characters) were added to Arabic.
  • Additional letters in Torwali, Burushaski and early Persian (total 18 characters) were added to Arabic Supplement.
  • High spacing dot and candra a (total 2 characters) were added to Devanagari.
  • Udaat and yakash signs (total 2 characters) were added to Gurmukhi.
  • Vocalic rr, l and ll (total 3 characters) were added to Oriya.
  • Om symbol (ௐ) (total 1 character) was added to Tamil.
  • Avagraha, additional phonetic letters, vocalic l and ll, fractional signs and tuumu (total 13 characters) were added to Telugu.
  • Avagraha, vocalic rr, l and ll, Malayalam numerics and fractions and chillu letters (total 17 characters) were added to Malayalam.
  • Letters for Balti and various symbols (total 6 characters) were added to Tibetan.
  • Characters for various languages (total 78 characters) were added to Myanmar.
  • Manchu Ali Gali lha (ᢪ) (total 1 character) was added to Mongolian.
  • Miscellaneous combining marks (total 28 characters) were added to Combining Diacritical Marks Supplement.
  • Medievalist Latin letters and miscellaneous letters (total 10 characters) were added to Latin Extended Additional.
  • Invisible plus (+) (total 1 character) was added to General Punctuation.
  • Combining asterisk above ( ⃰) (total 1 character) was added to Combining Diacritical Marks for Symbols.
  • Symbol for Samaritan Source (⅏) (total 1 character) was added to Letterlike Symbols.
  • Archaic Roman Numerals (total 4 characters) were added to Number Forms.
  • Outlined white star and other signs (total 15 characters) were added to Miscellaneous Symbols.
  • Long division and additional mathematical brackets (total 5 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Miscellaneous signs (total 51 characters) were added to Miscellaneous Symbols and Arrows.
  • Additional Latin letters (total 12 characters) were added to Latin Extended-C.
  • Additional punctuation marks (total 23 characters) were added to Supplemental Punctuation.
  • Letter ih (ㄭ) (total 1 character) was added to Bopomofo.
  • Other strokes (total 20 characters) were added to CJK Strokes.
  • Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
  • Africanist tone letters (total 5 characters) were added to Modifier Tone Letters.
  • Miscellaneous letters and symbols (total 112 characters) were added to Latin Extended-D.
  • Continuous macrons for Coptic (total 3 characters) were added to Combining Half Marks.
  • Musical symbol multiple measure rest (𝄩) (total 1 character) was added to Musical Symbols.

Unicode 5.2

Unicode 5.2 was released on October 1, 2009. It encoded 107,296 characters, adding 6,648 new characters.

New blocks

  • Samaritan, containing 61 letters for Samaritan script, was added.
  • Unified Canadian Aboriginal Syllabics Extended, containing 70 syllables for various Cree languages, was added.
  • Tai Tham, containing 127 letters for Tai Tham script, was added.
  • Vedic Extensions, containing 35 characters for tone marks and signs, was added.
  • Lisu, containing 48 letters for Lisu script, was added.
  • Bamum, containing 88 letters for Bamum script, was added.
  • Common Indic Number Forms, containing 10 fractions and marks, was added.
  • Devanagari Extended, containing 28 additional marks, was added.
  • Hangul Jamo Extended-A, containing 29 characters for additional old initial consonants in hangul jamo, was added.
  • Javanese, containing 91 letters for Javanese script, was added.
  • Myanmar Extended-A, containing 28 letters for Khamti Shan in Myanmar, was added.
  • Tai Viet, containing 72 letters for Tai Viet script, was added.
  • Meetei Mayek, containing 56 letters for Meetei Mayek script, was added.
  • Hangul Jamo Extended-B, containing 72 characters for additional old medial vowels and final consonants in hangul jamo, was added.
  • Imperial Aramaic, containing 31 characters for Old Aramaic, was added.
  • Old South Arabian, containing 32 letters and numbers for South Arabian, was added.
  • Avestan, containing 61 characters for Avestan script, was added.
  • Inscriptional Parthian, containing 30 characters for Inscriptional Parthian script, was added.
  • Inscriptional Pahlavi, containing 27 characters for Inscriptional Pahlavi script, was added.
  • Old Turkic, containing 73 characters for Orkhon script, was added.
  • Rumi Numeral Symbols, containing 31 numeric characters used in Fez, Morocco, and elsewhere in North Africa and the Iberian peninsula, between the tenth and seventeenth centuries, was added.
  • Kaithi, containing 66 letters for Kaithi script, was added.
  • Egyptian Hieroglyphs, containing 1,071 hieroglyphs for Egyptian, was added.
  • Enclosed Alphanumeric Supplement, containing 63 additional circled, parenthesized and squared alphanumerics, was added.
  • Enclosed Ideographic Supplement, containing 44 squared and tortoise shell bracketed ideographs, was added.
  • CJK Unified Ideographs Extension C, containing 4,149 additional Chinese Ideographs, was added.

Extended blocks

  • Abkhaz letters (total 2 characters) were added to Cyrillic Supplement.
  • Inverted Candrabindu and additional signs and letters (total 5 characters) were added to Devanagari.
  • Ganda Mark (৻) (total 1 character) was added to Bengali.
  • Religious svasti signs (total 4 characters) were added to Tibetan.
  • Extensions for Khamti Shan and Aiton and Phake (total 4 characters) were added to Myanmar.
  • Additional old initial consonants, medial vowels, and old final consonants (total 16 characters) were added to Hangul Jamo.
  • Hyphen and additional syllables (total 10 characters) were added to Unified Canadian Aboriginal Syllabics.
  • Letter Sua and Tham Digit One (total 3 characters) were added to New Tai Lue.
  • Combining Almost Equal To Below ( ᷽) (total 1 character) was added to Combining Diacritical Marks Supplement.
  • The Livre Tournois, Spesmilo and Tenge signs (total 3 characters) were added to Currency Symbols.
  • Additional vulgar fractions from ARIB STD B24 (total 4 characters) were added to Number Forms.
  • Decimal exponent symbol (⏨) from ARIB STD B24 (total 1 character) was added to Miscellaneous Technical.
  • A soccer ball and symbols from ARIB STD B24 (total 59 characters) were added to Miscellaneous Symbols.
  • Heavy exclamation mark symbol (❗) from ARIB STD B24 (total 1 character) was added to Dingbats.
  • Traffic sign, dictionary and map symbols from ARIB STD B24 (total 5 characters) were added to Miscellaneous Symbols and Arrows.
  • Capital letter turned alpha and additions for Shona (total 3 characters) were added to Latin Extended-C.
  • Cryptogrammic letters and combining marks (total 7 characters) were added to Coptic.
  • Word separator middle dot used in Avestan (⸱) (total 1 character) was added to Supplemental Punctuation.
  • Circled ideographs and numbers on black squares from ARIB STD B24 (total 12 characters) were added to Enclosed CJK Letters and Months.
  • Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
  • Miscellaneous additions for compatibility (total 3 characters) were added to CJK Compatibility Ideographs.
  • Numbers two and three (total 2 characters) were added to Phoenician.

Unicode 6.0

Unicode 6.0 was released on October 11, 2010. It encoded 109,384 characters, adding 2,088 new characters.

New blocks

  • Mandaic, containing 29 letters for Mandaic script, was added.
  • Batak, containing 56 letters for Batak script, was added.
  • Ethiopic Extended-A, containing 32 letters for Gamo-Gofa-Dawro, Basketo and Gumuz Ethiopic syllables, was added.
  • Brahmi, containing 108 characters for ancient Brahmi abugida, was added.
  • Bamum Supplement, containing 761 letters for additional Bamum script, was added.
  • Kana Supplement, containing 2 characters for archaic katakana, was added.
  • Playing Cards, containing 59 playing cards, was added.
  • Miscellaneous Symbols and Pictographs, containing 529 additional symbols, was added.
  • Emoticons, containing 63 faces, cat faces and gesture symbols, was added.
  • Transport and Map Symbols, containing 70 transportation, traffic signs and other symbols, was added.
  • Alchemical Symbols, containing 116 symbols for elements, was added.
  • CJK Unified Ideographs Extension D, containing 222 miscellaneous Han ideographs, was added.

Extended blocks

  • Azerbaijani letters (total 2 characters) were added to Cyrillic Supplement.
  • Kashmiri Yeh and Wavy hamza below (total 2 characters) were added to Arabic.
  • Dependent vowel signs and letters used in Kashmiri and Bihari (total 10 characters) were added to Devanagari.
  • Fraction signs (total 6 characters) were added to Oriya.
  • Letters used only in scholarly work and the letter dot reph (total 3 characters) were added to Malayalam.
  • Leading and Trailing Mchan Rtags (total 6 characters) were added to Tibetan.
  • Additional combining marks (total 2 characters) were added to Ethiopic.
  • Combining Double Inverted Breve Below (᷼) (total 1 character) was added to Combining Diacritical Marks Supplement.
  • Miscellaneous subscript letters (total 8 characters) were added to Superscripts and Subscripts.
  • Indian Rupee Sign (₹) (total 1 character) was added to Currency Symbols.
  • Pointing double triangle and additional mechanical symbols (total 11 characters) were added to Miscellaneous Technical.
  • Ophiuchus, the astronomical symbol for Uranus and pentagrams (total 6 characters) were added to Miscellaneous Symbols.
  • Additional heavy punctuation marks, raised fist, raised hand, sparkles, heavy arithmetic symbols and curly loops (total 16 characters) were added to Dingbats.
  • Squared logicals (total 2 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Separator mark and consonant joiner (total 2 characters) were added to Tifinagh.
  • Bopomofo for Hmu and Ge (total 3 characters) were added to Bopomofo Extended.
  • Reversed Tse (total 2 characters) were added to Cyrillic Extended-B.
  • Additional letters (total 15 characters) were added to Latin Extended-D.
  • Pedagogical symbols (total 16 characters) were added to Arabic Presentation Forms-A.
  • Additional squared, black circled and squared letters and regional indicator letters (total 107 characters) were added to Enclosed Alphanumeric Supplement.
  • Squared katakana, squared ideographs and circled advantage and accept (total 13 characters) were added to Enclosed Ideographic Supplement.

Unicode 6.1

Unicode 6.1 was released on January 31, 2012. It encoded 110,116 characters, adding 732 new characters.

New blocks

  • Arabic Extended-A (U+08A0-U+08FF), containing 39 characters, was added.
  • Sundanese Supplement (U+1CC0-U+1CCF), containing 8 characters, was added.
  • Meetei Mayek Extensions (U+AAE0-U+AAFF), containing 23 characters, was added.
  • Meroitic Hieroglyphs (U+10980-U+1099F), containing 32 characters, was added.
  • Meroitic Cursive (U+109A0-U+109FF), containing 26 characters, was added.
  • Sora Sompeng (U+110D0-U+110FF), containing 35 characters, was added.
  • Chakma (U+11100-U+1114F), containing 67 characters, was added.
  • Sharada (U+11180-U+111DF), containing 83 characters, was added.
  • Takri (U+11680-U+116CF), containing 66 characters, was added.
  • Miao (U+16F00-U+16F9F), containing 133 characters, was added.
  • Arabic Mathematical Alphabetic Symbols (U+1EE00-U+1EEFF), containing 143 characters, was added.
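A block's range is usually larger than the number of characters assigned in it, because blocks reserve room for later additions. The following is a minimal sketch, not part of any official tooling, that computes the size of a range written in the notation used above; the function name `block_size` is a hypothetical helper introduced only for this illustration.

```python
def block_size(block_range: str) -> int:
    """Return the number of code points in a range such as 'U+08A0-U+08FF'."""
    # Split at the hyphen, drop the 'U+' prefix, and read the rest as hexadecimal.
    first, last = (int(part[2:], 16) for part in block_range.split("-"))
    return last - first + 1


# Ranges and assigned counts copied from the list of new blocks above.
new_blocks = {
    "Arabic Extended-A": ("U+08A0-U+08FF", 39),
    "Sundanese Supplement": ("U+1CC0-U+1CCF", 8),
    "Miao": ("U+16F00-U+16F9F", 133),
}

for name, (block_range, assigned) in new_blocks.items():
    size = block_size(block_range)
    print(f"{name}: {size} code points, {assigned} assigned, {size - assigned} unassigned")
```

For Arabic Extended-A, for example, the range spans 96 code points, of which 39 were assigned in this version.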

Extended blocks

  • An Armenian Dram sign (total 1 character) was added to Armenian. (U+058F)
  • A sign Samvat (total 1 character) was added to Arabic. (U+0604)
  • An Abbreviation mark (total 1 character) was added to Gujarati. (U+0AF0)
  • Letters for Khmu (total 2 characters) were added to Lao. (U+0EDE-U+0EDF)
  • Capital letter Yn, letter Aen, Hard and Labial sign (total 5 characters) were added to Georgian. (U+10C7, U+10CD and U+10FD-U+10FF)
  • Letters and signs for Old Sundanese (total 9 characters) were added to Sundanese. (U+1BAB-U+1BAD and U+1BBA-U+1BBF)
  • Sign Rotated Ardhavisarga, Candra Above, Jihvamuliya and Uphadhmaniya (total 4 characters) were added to Vedic Extensions. (U+1CF3-U+1CF6)
  • Mathematical diagonals (total 2 characters) were added to Miscellaneous Mathematical Symbols-A. (U+27CB and U+27CD)
  • Capital and small letter Bohairic Khei (total 2 characters) were added to Coptic. (U+2CF2-U+2CF3)
  • Small letters Yn and Aen (total 2 characters) were added to Georgian Supplement. (U+2D27 and U+2D2D)
  • Letters Ye and Yo (total 2 characters) were added to Tifinagh. (U+2D66-U+2D67)
  • Additional punctuation marks (total 10 characters) were added to Supplemental Punctuation. (U+2E32-U+2E3B)
  • An additional ideograph for Kanji (total 1 character) was added to CJK Unified Ideographs. (U+9FCC)
  • Combining letters for Slavonic (total 9 characters) were added to Cyrillic Extended-B. (U+A674-U+A67B and U+A69F)
  • Letter C with Bar, capital letter H with Hook and modifier letters for extended IPA (total 5 characters) were added to Latin Extended-D. (U+A792-U+A793, U+A7AA and U+A7F8-U+A7F9)
  • Some additional ideographs for Korea (total 2 characters) were added to CJK Compatibility Ideographs. (U+FA2E-U+FA2F)
  • Symbols for Canadian legal use (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F16A-U+1F16B)
  • Typikon symbols (total 4 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F540-U+1F543)
  • Additional faces (total 13 characters) were added to Emoticons. (U+1F600, U+1F611, U+1F615, U+1F617, U+1F619, U+1F61B, U+1F61F, U+1F626-U+1F627, U+1F62C, U+1F62E-U+1F62F and U+1F634)
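The totals quoted in the entries above can be cross-checked against the code point listings in parentheses. Below is a small sketch of one way to do that, assuming the listings always use the `U+XXXX` and `U+XXXX-U+YYYY` notation seen on this page; `count_code_points` is a hypothetical helper name, not a function from any library.

```python
import re


def count_code_points(listing: str) -> int:
    """Count code points in text such as 'U+10C7, U+10CD and U+10FD-U+10FF'."""
    total = 0
    # Each match is (first, last); 'last' is empty for a single code point.
    for first, last in re.findall(r"U\+([0-9A-F]+)(?:-U\+([0-9A-F]+))?", listing):
        start = int(first, 16)
        end = int(last, 16) if last else start
        total += end - start + 1
    return total


georgian = "U+10C7, U+10CD and U+10FD-U+10FF"
emoticons = (
    "U+1F600, U+1F611, U+1F615, U+1F617, U+1F619, U+1F61B, U+1F61F, "
    "U+1F626-U+1F627, U+1F62C, U+1F62E-U+1F62F and U+1F634"
)
print(count_code_points(georgian))
print(count_code_points(emoticons))
```

The two listings yield 5 and 13, matching the totals given for Georgian and Emoticons.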

Unicode 6.2

Unicode 6.2 was released on September 26, 2012. It encoded 110,117 characters, adding only 1 new character.

Extended blocks

  • A Turkish Lira sign (total 1 character) was added to Currency Symbols. (U+20BA)

Unicode 6.3

Unicode 6.3 was released on September 30, 2013. It encoded 110,122 characters, adding only 5 new characters.

Extended blocks

  • A Letter mark (total 1 character) was added to Arabic. (U+061C)
  • Isolate directional format characters (total 4 characters) were added to General Punctuation. (U+2066-U+2069)
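The handful of characters added in versions 6.2 and 6.3 can be inspected with Python's standard `unicodedata` module. This is only an illustrative sketch: it assumes a Python build whose Unicode database is at version 6.3 or later (older builds raise `ValueError` for unknown characters), and `describe` is a hypothetical helper name.

```python
import unicodedata


def describe(code_point: int) -> str:
    """Format a code point as 'U+XXXX NAME (General_Category)'."""
    ch = chr(code_point)
    return f"U+{code_point:04X} {unicodedata.name(ch)} ({unicodedata.category(ch)})"


# U+20BA was added in 6.2; U+061C and U+2066-U+2069 were added in 6.3.
for code_point in (0x20BA, 0x061C, 0x2066, 0x2067, 0x2068, 0x2069):
    print(describe(code_point))
```

The version of the Unicode database bundled with the interpreter is available as `unicodedata.unidata_version`.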

Unicode 7.0

Unicode 7.0 was released on June 16, 2014. It encoded 112,956 characters, adding 2,834 new characters.

New blocks

  • Combining Diacritical Marks Extended (U+1AB0-U+1AFF), containing 15 marks, was added.
  • Myanmar Extended-B (U+A9E0-U+A9FF), containing 31 letters, was added.
  • Latin Extended-E (U+AB30-U+AB6F), containing 50 letters, was added.
  • Coptic Epact Numbers (U+102E0-U+102FF), containing 28 numbers, was added.
  • Old Permic (U+10350-U+1037F), containing 43 letters, was added.
  • Elbasan (U+10500-U+1052F), containing 50 letters, was added.
  • Caucasian Albanian (U+10530-U+1056F), containing 53 letters and marks, was added.
  • Linear A (U+10600-U+1077F), containing 341 signs, was added.
  • Palmyrene (U+10860-U+1087F), containing 32 letters, was added.
  • Nabataean (U+10880-U+108AF), containing 40 letters and numbers, was added.
  • Old North Arabian (U+10A80-U+10A9F), containing 32 letters and numbers, was added.
  • Manichaean (U+10AC0-U+10AFF), containing 51 characters, was added.
  • Psalter Pahlavi (U+10B80-U+10BAF), containing 29 characters, was added.
  • Mahajani (U+11150-U+1117F), containing 39 letters and signs, was added.
  • Sinhala Archaic Numbers (U+111E0-U+111FF), containing 20 numbers, was added.
  • Khojki (U+11200-U+1124F), containing 61 characters, was added.
  • Khudawadi (U+112B0-U+112FF), containing 69 characters, was added.
  • Grantha (U+11300-U+1137F), containing 83 characters, was added.
  • Tirhuta (U+11480-U+114DF), containing 82 characters, was added.
  • Siddham (U+11580-U+115FF), containing 72 characters, was added.
  • Modi (U+11600-U+1165F), containing 79 characters, was added.
  • Warang Citi (U+118A0-U+118FF), containing 84 letters and numbers, was added.
  • Pau Cin Hau (U+11AC0-U+11AFF), containing 57 characters, was added.
  • Mro (U+16A40-U+16A6F), containing 43 characters, was added.
  • Bassa Vah (U+16AD0-U+16AFF), containing 36 characters, was added.
  • Pahawh Hmong (U+16B00-U+16B8F), containing 127 letters and signs, was added.
  • Duployan (U+1BC00-U+1BC9F), containing 143 characters, was added.
  • Shorthand Format Controls (U+1BCA0-U+1BCAF), containing 4 format characters, was added.
  • Mende Kikakui (U+1E800-U+1E8DF), containing 213 syllables and numbers, was added.
  • Ornamental Dingbats (U+1F650-U+1F67F), containing 48 pictographic characters, was added.
  • Geometric Shapes Extended (U+1F780-U+1F7FF), containing 85 pictographic characters, was added.
  • Supplemental Arrows-C (U+1F800-U+1F8FF), containing 148 pictographic characters, was added.

Extended blocks

  • A capital letter Yot (total 1 character) was added to Greek and Coptic. (U+037F)
  • Letters for Orok, Komi and Khanty (total 8 characters) were added to Cyrillic Supplement. (U+0528-U+052F)
  • Right-facing and left-facing Eternity signs (total 2 characters) were added to Armenian. (U+058D-U+058E)
  • A Number Mark Above (total 1 character) was added to Arabic. (U+0605)
  • Letters for African, Philippine, Turkic, Berber, Belarusian, Palula and Shina languages (total 8 characters) were added to Arabic Extended-A. (U+08A1, U+08AD-U+08B2 and U+08FF)
  • A letter for Marwari (total 1 character) was added to Devanagari. (U+0978)
  • A sign Anji (total 1 character) was added to Bengali. (U+0980)
  • Sign Candrabindu and letter Llla (total 2 characters) were added to Telugu. (U+0C00 and U+0C34)
  • A Sign Candrabindu (total 1 character) was added to Kannada. (U+0C81)
  • A Sign Candrabindu (total 1 character) was added to Malayalam. (U+0D01)
  • Lith Numerals (total 10 characters) were added to Sinhala. (U+0DE6-U+0DEF)
  • Additional Old English runes (total 8 characters) were added to Runic. (U+16F1-U+16F8)
  • Letters Gyan and Tra (total 2 characters) were added to Limbu. (U+191D-U+191E)
  • Signs for Jaiminiya Sama Veda (total 2 characters) were added to Vedic Extensions. (U+1CF8-U+1CF9)
  • Marks for Germanic and American lexicology (total 15 characters) were added to Combining Diacritical Marks Supplement. (U+1DE7-U+1DF5)
  • Nordic Mark, Manat and Ruble signs (total 3 characters) were added to Currency Symbols. (U+20BB-U+20BD)
  • Playback symbols from Webdings font (total 7 characters) were added to Miscellaneous Technical. (U+23F4-U+23FA)
  • A Scissors symbol from Wingdings 2 font (total 1 character) was added to Dingbats. (U+2700)
  • Arrows for Lithuanian dialectology and symbols from Wingdings 3 font (total 115 characters) were added to Miscellaneous Symbols and Arrows. (U+2B4D-U+2B4F, U+2B5A-U+2B5F, U+2B60-U+2B73, U+2B76-U+2B95, U+2B98-U+2BB9, U+2BBD-U+2BC8 and U+2BCA-U+2BD1)
  • Additional punctuation marks (total 7 characters) were added to Supplemental Punctuation. (U+2E3C-U+2E42)
  • Early Cyrillic letters and letters for Lithuanian dialectology (total 6 characters) were added to Cyrillic Extended-B. (U+A698-U+A69D)
  • Letters for European, American and African orthography (total 18 characters) were added to Latin Extended-D. (U+A794-U+A79F, U+A7AB-U+A7AD, U+A7B0-U+A7B1 and U+A7F7)
  • Tone marks for Tai Laing and letters for Shwe Palaung (total 4 characters) were added to Myanmar Extended-A. (U+AA7C-U+AA7F)
  • Combining phonetic marks (total 7 characters) were added to Combining Half Marks. (U+FE27-U+FE2D)
  • Additional mathematical symbols (total 2 characters) were added to Ancient Greek Numbers. (U+1018B-U+1018C)
  • A Greek Tau Rho symbol (total 1 character) was added to Ancient Symbols. (U+101A0)
  • A letter Ess (total 1 character) was added to Old Italic. (U+1031F)
  • A Number Joiner (total 1 character) was added to Brahmi. (U+1107F)
  • Sutra mark and sign Ekam (total 2 characters) were added to Sharada. (U+111CD and U+111DA)
  • Additional cuneiform signs (total 42 characters) were added to Cuneiform. (U+1236F-U+12398)
  • Additional numbers, vulgar fractions and a punctuation mark (total 13 characters) were added to Cuneiform Numbers and Punctuation. (U+12463-U+1246E and U+12474)
  • Red Joker, Fool and trumps (total 23 characters) were added to Playing Cards. (U+1F0BF and U+1F0E0-U+1F0F5)
  • Dingbat normal and negative sans-serif digit zero (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10B-U+1F10C)
  • Symbols from Webdings, Wingdings 1 and 2 font (total 209 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F321-U+1F32C, U+1F336, U+1F37D, U+1F394-U+1F39F, U+1F3C5, U+1F3CB-U+1F3CE, U+1F3D4-U+1F3DF, U+1F3F1-U+1F3F7, U+1F43F, U+1F441, U+1F4F8, U+1F4FD-U+1F4FE, U+1F53E-U+1F53F, U+1F544-U+1F54A, U+1F568-U+1F579, U+1F57B-U+1F5A3 and U+1F5A5-U+1F5FA)
  • Slightly frowning and smiling faces emoji (total 2 characters) were added to Emoticons. (U+1F641-U+1F642)
  • Symbols from Webdings and Wingdings 2 font (total 27 characters) were added to Transport and Map Symbols. (U+1F6C6-U+1F6CF, U+1F6E0-U+1F6EC and U+1F6F0-U+1F6F3)

Unicode 8.0

Unicode 8.0 was released on June 17, 2015. It encoded 120,672 characters, adding 7,716 new characters.

New blocks

  • Cherokee Supplement (U+AB70-U+ABBF), containing 80 lowercase letters, was added.
  • Hatran (U+108E0-U+108FF), containing 26 letters, was added.
  • Old Hungarian (U+10C80-U+10CFF), containing 108 letters, was added.
  • Multani (U+11280-U+112AF), containing 38 letters, was added.
  • Ahom (U+11700-U+1173F), containing 57 letters, was added.
  • Early Dynastic Cuneiform (U+12480-U+1254F), containing 196 characters, was added.
  • Anatolian Hieroglyphs (U+14400-U+1467F), containing 583 characters, was added.
  • Sutton SignWriting (U+1D800-U+1DAAF), containing 672 signs, was added.
  • Supplemental Symbols and Pictographs (U+1F900-U+1F9FF), containing 15 pictographic characters, was added.
  • CJK Unified Ideographs Extension E (U+2B820-U+2CEAF), containing 5,762 characters, was added.

Extended blocks

  • Letters for Arwi (total 3 characters) were added to Arabic Extended-A. (U+08B3-U+08B4 and U+08E3)
  • A letter for Avestan transliteration (total 1 character) was added to Gujarati. (U+0AF9)
  • A letter for Andhra Pradesh (total 1 character) was added to Telugu. (U+0C5A)
  • An archaic letter II (total 1 character) was added to Malayalam. (U+0D5F)
  • A letter Mv and small letters (total 7 characters) were added to Cherokee. (U+13F5 and U+13F8-U+13FD)
  • A Georgian Lari sign (total 1 character) was added to Currency Symbols. (U+20BE)
  • Turned digits (total 2 characters) were added to Number Forms. (U+218A-U+218B)
  • Two headed arrows with triangle arrowheads (total 4 characters) were added to Miscellaneous Symbols and Arrows. (U+2BEC-U+2BEF)
  • Some additional ideographs (total 9 characters) were added to CJK Unified Ideographs. (U+9FCD-U+9FD5)
  • A combining letter Ef (total 1 character) was added to Cyrillic Extended-B. (U+A69E)
  • Sinological dot, phonetic extension for African languages, letters for American and Gabonese orthography (total 7 characters) were added to Latin Extended-D. (U+A78F and U+A7B2-U+A7B7)
  • Sign Siddham and letter Jain Om (total 2 characters) were added to Devanagari Extended. (U+A8FC-U+A8FD)
  • Letters for Yakut transliteration (total 4 characters) were added to Latin Extended-E. (U+AB60-U+AB63)
  • Combining marks for Church Slavonic (total 2 characters) were added to Combining Half Marks. (U+FE2E-U+FE2F)
  • Numerals and vulgar fractions (total 64 characters) were added to Meroitic Cursive. (U+109BC-U+109BD, U+109C0-U+109CF and U+109D2-U+109FF)
  • Sandhi mark, diacritical marks for Kashmiri, sign Siddham and punctuation marks (total 9 characters) were added to Sharada. (U+111C9-U+111CC and U+111DB-U+111DF)
  • Combining Anusvara Above and letter Om (total 2 characters) were added to Grantha. (U+11300 and U+11350)
  • Section marks and alternate letters (total 20 characters) were added to Siddham. (U+115CA-U+115DD)
  • An additional sign (total 1 character) was added to Cuneiform. (U+12399)
  • East-Slavic musical symbols (total 11 characters) were added to Musical Symbols. (U+1D1DE-U+1D1E8)
  • Additional pictographs (total 24 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F32D-U+1F32F, U+1F37E-U+1F37F, U+1F3CF-U+1F3D3, U+1F3F8-U+1F3FF, U+1F4FF and U+1F54B-U+1F54F)
  • Upside Down Face and Face With Rolling Eyes emoji (total 2 characters) were added to Emoticons. (U+1F643-U+1F644)
  • A Place of Worship emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6D0)

Unicode 9.0

Unicode 9.0 was released on June 21, 2016. It encoded 128,172 characters, adding 7,500 new characters.

New blocks

  • Cyrillic Extended-C (U+1C80-U+1C8F), containing 9 letters, was added.
  • Osage (U+104B0-U+104FF), containing 72 letters, was added.
  • Newa (U+11400-U+1147F), containing 92 letters, was added.
  • Mongolian Supplement (U+11660-U+1167F), containing 13 letters, was added.
  • Bhaiksuki (U+11C00-U+11C6F), containing 97 letters, was added.
  • Marchen (U+11C70-U+11CBF), containing 68 letters, was added.
  • Ideographic Symbols and Punctuation (U+16FE0-U+16FFF), containing 1 letter, was added.
  • Tangut (U+17000-U+187FF), containing 6,125 letters, was added.
  • Tangut Components (U+18800-U+18AFF), containing 755 letters, was added.
  • Glagolitic Supplement (U+1E000-U+1E02F), containing 38 letters, was added.
  • Adlam (U+1E900-U+1E95F), containing 87 letters, was added.

Extended blocks

  • Letters for Bravanese, Warsh and Quranic marks used in Pakistan (total 23 characters) were added to Arabic Extended-A. (U+08B6-U+08BD and U+08D4-U+08E2)
  • A sign Spacing Candrabindu (total 1 character) was added to Kannada. (U+0C80)
  • Sign Para, Chillu letters and vulgar fractions (total 14 characters) were added to Malayalam. (U+0D4F, U+0D54-U+0D56, U+0D58-U+0D5E and U+0D76-U+0D78)
  • A diacritical mark for Newa (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFB)
  • Power symbols (total 4 characters) were added to Miscellaneous Technical. (U+23FB-U+23FE)
  • Punctuation marks for Church Slavonic (total 2 characters) were added to Supplemental Punctuation. (U+2E43-U+2E44)
  • A letter for Unifon (total 1 character) was added to Latin Extended-D. (U+A7AE)
  • A sign Candrabindu (total 1 character) was added to Saurashtra. (U+A8C5)
  • Indiction sign and a currency symbol (total 2 characters) were added to Ancient Greek Numbers. (U+1018D-U+1018E)
  • A sign Sukun (total 1 character) was added to Khojki. (U+1123E)
  • Japanese TV symbols (total 18 characters) were added to Enclosed Alphanumeric Supplement. (U+1F19B-U+1F1AC)
  • A Japanese TV symbol (total 1 character) was added to Enclosed Ideographic Supplement. (U+1F23B)
  • A dancing man and Black Heart emoji (total 2 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F57A and U+1F5A4)
  • Octagonal Sign, Shopping Trolley, scooters and a Canoe emoji (total 5 characters) were added to Transport and Map Symbols. (U+1F6D1-U+1F6D2 and U+1F6F4-U+1F6F6)
  • Symbols and pictographs (total 67 characters) were added to Supplemental Symbols and Pictographs. (U+1F919-U+1F91E, U+1F920-U+1F927, U+1F930, U+1F933-U+1F93E, U+1F940-U+1F94B, U+1F950-U+1F95E and U+1F985-U+1F991)
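
The "total N characters" figures in these lists can be cross-checked against the code point ranges given in parentheses. The following is a minimal Python sketch of that arithmetic; the helper name `count_code_points` is hypothetical and only handles the `U+XXXX` and `U+XXXX-U+YYYY` notation used on this page.

```python
import re

def count_code_points(ranges: str) -> int:
    """Count the code points covered by a list such as 'U+0D4F, U+0D54-U+0D56'."""
    total = 0
    # Each match is a single code point, optionally followed by '-U+END'.
    for start, end in re.findall(r"U\+([0-9A-F]+)(?:-U\+([0-9A-F]+))?", ranges):
        first = int(start, 16)
        last = int(end, 16) if end else first
        total += last - first + 1  # ranges are inclusive at both ends
    return total

supplemental = ("U+1F919-U+1F91E, U+1F920-U+1F927, U+1F930, U+1F933-U+1F93E, "
                "U+1F940-U+1F94B, U+1F950-U+1F95E and U+1F985-U+1F991")
print(count_code_points(supplemental))  # 67
```

The same helper gives 14 for the Malayalam entry and 23 for the Arabic Extended-A entry above.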

Variation Sequences

Here is a table with new standardized variation sequences:

Character Sequence Context Description of Variation Appearance
0030 FE00 short diagonal stroke form # DIGIT ZERO
1000 FE00 dotted form # MYANMAR LETTER KA
1002 FE00 dotted form # MYANMAR LETTER GA
1004 FE00 dotted form # MYANMAR LETTER NGA
1010 FE00 dotted form # MYANMAR LETTER TA
1011 FE00 dotted form # MYANMAR LETTER THA
1015 FE00 dotted form # MYANMAR LETTER PA
1019 FE00 dotted form # MYANMAR LETTER MA
101A FE00 dotted form # MYANMAR LETTER YA
101C FE00 dotted form # MYANMAR LETTER LA
101D FE00 dotted form # MYANMAR LETTER WA
1022 FE00 dotted form # MYANMAR LETTER SHAN A
1031 FE00 dotted form # MYANMAR VOWEL SIGN E
1075 FE00 dotted form # MYANMAR LETTER SHAN KA
1078 FE00 dotted form # MYANMAR LETTER SHAN CA
107A FE00 dotted form # MYANMAR LETTER SHAN NYA
1080 FE00 dotted form # MYANMAR LETTER SHAN THA
2205 FE00 zero with long diagonal stroke overlay form # EMPTY SET
AA60 FE00 dotted form # MYANMAR LETTER KHAMTI GA
AA61 FE00 dotted form # MYANMAR LETTER KHAMTI CA
AA62 FE00 dotted form # MYANMAR LETTER KHAMTI CHA
AA63 FE00 dotted form # MYANMAR LETTER KHAMTI JA
AA64 FE00 dotted form # MYANMAR LETTER KHAMTI JHA
AA65 FE00 dotted form # MYANMAR LETTER KHAMTI NYA
AA66 FE00 dotted form # MYANMAR LETTER KHAMTI TTA
AA6B FE00 dotted form # MYANMAR LETTER KHAMTI NA
AA6C FE00 dotted form # MYANMAR LETTER KHAMTI SA
AA6F FE00 dotted form # MYANMAR LETTER KHAMTI FA
AA7A FE00 dotted form # MYANMAR LETTER AITON RA
278 additional emoji variation sequences
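
A variation sequence is simply the base character followed by a variation selector. As a rough Python illustration (the function name `variation_sequence` is made up for this sketch), the first row of the table can be built like this. Whether the slashed form is actually displayed depends on the font and the rendering engine.

```python
VS1 = "\uFE00"  # VARIATION SELECTOR-1, the selector used in every row above

def variation_sequence(base: int, selector: str = VS1) -> str:
    """Return the base character followed by a variation selector."""
    return chr(base) + selector

slashed_zero = variation_sequence(0x0030)  # DIGIT ZERO, short diagonal stroke form
print([f"{ord(c):04X}" for c in slashed_zero])  # ['0030', 'FE00']
print(len(slashed_zero))  # 2
```

The sequence is two code points long, so software that counts code points will not treat it as a single character.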

Unicode 10.0

Unicode 10.0 was released on June 20, 2017. It encoded 136,690 characters, adding 8,518 new characters.

New blocks

  • Syriac Supplement (U+0860-U+086F), containing 11 characters, was added.
  • Zanabazar Square (U+11A00-U+11A4F), containing 72 characters, was added.
  • Soyombo (U+11A50-U+11AAF), containing 80 characters, was added.
  • Masaram Gondi (U+11D00-U+11D5F), containing 75 characters, was added.
  • Kana Extended-A (U+1B100-U+1B12F), containing 31 characters, was added.
  • Nushu (U+1B170-U+1B2FF), containing 396 characters, was added.
  • CJK Unified Ideographs Extension F (U+2CEB0-U+2EBEF), containing 7,473 characters, was added.

Extended blocks

  • A Vedic Anusvara and Abbreviation mark (total 2 characters) were added to Bengali. (U+09FC-U+09FD)
  • Letters for Arabic transliteration (total 6 characters) were added to Gujarati. (U+0AFA-U+0AFF)
  • A combining Anusvara Above and Viramas (total 3 characters) were added to Malayalam. (U+0D00 and U+0D3B-U+0D3C)
  • A sign Atikrama (total 1 character) was added to Vedic Extensions. (U+1CF7)
  • Combining diacritical marks for Church Slavonic (total 4 characters) were added to Combining Diacritical Marks Supplement. (U+1DF6-U+1DF9)
  • A Bitcoin sign (total 1 character) was added to Currency Symbols. (U+20BF)
  • An Observer Eye symbol (total 1 character) was added to Miscellaneous Technical. (U+23FF)
  • A Group mark (total 1 character) was added to Miscellaneous Symbols and Arrows. (U+2BD2)
  • Medieval punctuation marks (total 5 characters) were added to Supplemental Punctuation. (U+2E45-U+2E49)
  • A letter O with Dot Above (total 1 character) was added to Bopomofo. (U+312E)
  • Ideographs for Slavonic transliteration (total 21 characters) were added to CJK Unified Ideographs. (U+9FD6-U+9FEA)
  • Letters for North Italic (total 3 characters) were added to Old Italic. (U+1032D-U+1032F)
  • An Iteration mark for Nushu (total 1 character) was added to Ideographic Symbols and Punctuation. (U+16FE1)
  • Letters for Hentaigana (total 254 characters) were added to Kana Supplement. (U+1B002-U+1B0FF)
  • Symbols for Chinese Folk religion (total 6 characters) were added to Enclosed Ideographic Supplement. (U+1F260-U+1F265)
  • Stupa, Pagoda, Sled and Flying Saucer emoji (total 4 characters) were added to Transport and Map Symbols. (U+1F6D3-U+1F6D4 and U+1F6F7-U+1F6F8)
  • Symbols and pictographs (total 66 characters) were added to Supplemental Symbols and Pictographs. (U+1F900-U+1F90B, U+1F91F, U+1F928-U+1F92F, U+1F931-U+1F932, U+1F94C, U+1F95F-U+1F96B, U+1F992-U+1F997 and U+1F9D0-U+1F9E6)
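
Whether a program recognises these additions depends on the version of the Unicode Character Database it ships with. As a small sketch, Python's standard `unicodedata` module reports its database version and can look up a few of the code points listed above; the helper `name_or_placeholder` and its placeholder string are assumptions of this example.

```python
import unicodedata

PLACEHOLDER = "<unassigned in this database>"

def name_or_placeholder(code_point: int) -> str:
    """Return the character name, or a placeholder if the database predates it."""
    return unicodedata.name(chr(code_point), PLACEHOLDER)

# Version of the Unicode Character Database bundled with this Python build.
print(unicodedata.unidata_version)

# Code points taken from the Unicode 10.0 list above.
for code_point in (0x20BF, 0x23FF, 0x312E):
    print(f"U+{code_point:04X}", name_or_placeholder(code_point))
```

On an interpreter whose database is older than 10.0, the three lookups fall back to the placeholder instead of raising an error.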

Unicode 11.0

Unicode 11.0 was released on June 5, 2018. It encoded 137,374 characters, adding 684 new characters.

New blocks

  • Georgian Extended (U+1C90-U+1CBF), containing 46 characters, was added.
  • Hanifi Rohingya (U+10D00-U+10D3F), containing 50 characters, was added.
  • Old Sogdian (U+10F00-U+10F2F), containing 40 characters, was added.
  • Sogdian (U+10F30-U+10F6F), containing 42 characters, was added.
  • Dogra (U+11800-U+1184F), containing 60 characters, was added.
  • Gunjala Gondi (U+11D60-U+11DAF), containing 63 characters, was added.
  • Makasar (U+11EE0-U+11EFF), containing 25 characters, was added.
  • Medefaidrin (U+16E40-U+16E9F), containing 91 characters, was added.
  • Mayan Numerals (U+1D2E0-U+1D2FF), containing 20 characters, was added.
  • Indic Siyaq Numbers (U+1EC70-U+1ECBF), containing 68 characters, was added.
  • Chess Symbols (U+1FA00-U+1FA6F), containing 14 characters, was added.

Extended blocks

  • Small letters Turned Ayb and Yi with Stroke (total 2 characters) were added to Armenian. (U+0560 and U+0588)
  • A triangle Yod (total 1 character) was added to Hebrew. (U+05EF)
  • A Dantayalan and currency symbols (total 3 characters) were added to N'Ko. (U+07FD-U+07FF)
  • A Small Low Waw (total 1 character) was added to Arabic Extended-A. (U+08D3)
  • A Sandhi mark (total 1 character) was added to Bengali. (U+09FE)
  • An Abbreviation mark (total 1 character) was added to Gurmukhi. (U+0A76)
  • A combining Anusvara Above (total 1 character) was added to Telugu. (U+0C04)
  • A sign Siddham (total 1 character) was added to Kannada. (U+0C84)
  • A letter for Buryat (total 1 character) was added to Mongolian. (U+1878)
  • Symbols for chess notation, astrological and half star symbols (total 43 characters) were added to Miscellaneous Symbols and Arrows. (U+2BBA-U+2BBC, U+2BD3-U+2BEB and U+2BF0-U+2BFE)
  • Medieval punctuation marks (total 5 characters) were added to Supplemental Punctuation. (U+2E4A-U+2E4E)
  • A letter NN (total 1 character) was added to Bopomofo. (U+312F)
  • Some ideographs for Kanji (total 5 characters) were added to CJK Unified Ideographs. (U+9FEB-U+9FEF)
  • A small capital Q and a letter for Mazahua (total 3 characters) were added to Latin Extended-D. (U+A7AF and U+A7B8-U+A7B9)
  • Letter and vowel sign Ay (total 2 characters) were added to Devanagari Extended. (U+A8FE-U+A8FF)
  • Letters Ttta, Vha and a vulgar fraction (total 3 characters) were added to Kharoshthi. (U+10A34-U+10A35 and U+10A48)
  • A Number Sign Above (total 1 character) was added to Kaithi. (U+110CD)
  • Letter Lhaa, vowel sign Aa and Ei (total 3 characters) were added to Chakma. (U+11144-U+11146)
  • A combining Bindu Below (total 1 character) was added to Grantha. (U+1133B)
  • A Sandhi mark (total 1 character) was added to Newa. (U+1145E)
  • An alternate letter Ba (total 1 character) was added to Ahom. (U+1171A)
  • A mark Pluta (total 1 character) was added to Soyombo. (U+11A9D)
  • Additional ideographs (total 5 characters) were added to Tangut. (U+187ED-U+187F1)
  • Tally marks (total 7 characters) were added to Counting Rod Numerals. (U+1D372-U+1D378)
  • A Copyleft symbol (total 1 character) was added to Enclosed Alphanumeric Supplement. (U+1F12F)
  • A Skateboard emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6F9)
  • Normal and negative circled shapes (total 4 characters) were added to Geometric Shapes Extended. (U+1F7D5-U+1F7D8)
  • Symbols and pictographs (total 65 characters) were added to Supplemental Symbols and Pictographs. (U+1F94D-U+1F94F, U+1F96C-U+1F970, U+1F973-U+1F976, U+1F97A, U+1F97C-U+1F97F, U+1F998-U+1F99F, U+1F9A0-U+1F9A2, U+1F9B0-U+1F9B9, U+1F9C1-U+1F9C2 and U+1F9E7-U+1F9FF)

Variation Sequences

Here is a table with new standardized variation sequences:

Character Sequence Context Description of Variation Appearance
FF10 FE00 short diagonal stroke form # FULLWIDTH DIGIT ZERO

Unicode 12.0

Unicode 12.0 was released on March 5, 2019. It encoded 137,928 characters, adding 554 new characters.

New blocks

  • Elymaic (U+10FE0-U+10FFF), containing 23 characters, was added.
  • Nandinagari (U+119A0-U+119FF), containing 65 characters, was added.
  • Tamil Supplement (U+11FC0-U+11FFF), containing 51 characters, was added.
  • Egyptian Hieroglyph Format Controls (U+13430-U+1343F), containing 9 characters, was added.
  • Small Kana Extension (U+1B130-U+1B16F), containing 7 characters, was added.
  • Nyiakeng Puachue Hmong (U+1E100-U+1E14F), containing 71 characters, was added.
  • Wancho (U+1E2C0-U+1E2FF), containing 59 characters, was added.
  • Ottoman Siyaq Numbers (U+1ED00-U+1ED4F), containing 61 characters, was added.
  • Symbols and Pictographs Extended-A (U+1FA70-U+1FAFF), containing 16 characters, was added.

Extended blocks

  • A sign Siddham (total 1 character) was added to Telugu. (U+0C77)
  • Letters for Pali and Sanskrit (total 15 characters) were added to Lao. (U+0E86, U+0E89, U+0E8C, U+0E8E-U+0E93, U+0E98, U+0EA0, U+0EA8-U+0EA9, U+0EAC and U+0EBA)
  • A sign Double Anusvara Antargomukha (total 1 character) was added to Vedic Extensions. (U+1CFA)
  • An astrological symbol and Hellschreiber Pause symbol (total 2 characters) were added to Miscellaneous Symbols and Arrows. (U+2BC9 and U+2BFF)
  • A Cornish Verse Divider (total 1 character) was added to Supplemental Punctuation. (U+2E4F)
  • Egyptological letters, Anglicana W and letters for early Pinyin (total 11 characters) were added to Latin Extended-D. (U+A7BA-U+A7BF and U+A7C2-U+A7C6)
  • Sinological phonetic letters (total 2 characters) were added to Latin Extended-E. (U+AB66-U+AB67)
  • A Vedic Anusvara (total 1 character) was added to Newa. (U+1145F)
  • An archaic letter Kha (total 1 character) was added to Takri. (U+116B8)
  • Signs Jihvamuliya and Upadhmaniya (total 2 characters) were added to Soyombo. (U+11A84-U+11A85)
  • Letters for various Yi and Miao languages (total 16 characters) were added to Miao. (U+16F45-U+16F4A, U+16F4F and U+16F7F-U+16F87)
  • Marks for Ancient Chinese texts (total 2 characters) were added to Ideographic Symbols and Punctuation. (U+16FE2-U+16FE3)
  • Some additional ideographs (total 6 characters) were added to Tangut. (U+187F2-U+187F7)
  • A Nasalization mark (total 1 character) was added to Adlam. (U+1E94B)
  • A Spanish and Portuguese register mark (total 1 character) was added to Enclosed Alphanumeric Supplement. (U+1F16C)
  • Hindu Temple and Auto Rickshaw emoji (total 2 characters) were added to Transport and Map Symbols. (U+1F6D5 and U+1F6FA)
  • Large colored circles and boxes (total 12 characters) were added to Geometric Shapes Extended. (U+1F7E0-U+1F7EB)
  • Symbols and pictographs (total 31 characters) were added to Supplemental Symbols and Pictographs. (U+1F90D-U+1F90F, U+1F93F, U+1F971, U+1F97B, U+1F9A5-U+1F9AA, U+1F9AE-U+1F9AF, U+1F9BA-U+1F9BF, U+1F9C3-U+1F9CA and U+1F9CD-U+1F9CF)
  • Heterodox chess symbols (total 84 characters) were added to Chess Symbols. (U+1FA00-U+1FA53)

Glyph Changes

Here is a table with glyph changes:

Block Name Code Points Count
Spacing Modifier Letters 02EA, 02EB 2
Vedic Extensions 1CF2..1CF3 2
Currency Symbols 20A9 1
CJK Symbols and Punctuation 3001, 3002 2
Bopomofo 3105..312F 43
Bopomofo Extended 31A0..31BA 27
CJK Unified Ideographs Extension A 37C3, 3B9D, 3CFD, 3FE0, 44EC, 4A76 6
CJK Unified Ideographs 5344, 55B9, 6ABC, 6FF9, 809E, 80BC, 80E9, 8132, 8159, 841C, 891D, 8C6C, 915E, 9FD4 14
Phags-pa A840..A877 56
Halfwidth and Fullwidth Forms FF01, FF0C, FF0E, FF1A, FF1B, FF1F 6
CJK Unified Ideographs Extension B 200DD, 20164, 20BBF, 20C02, 20CED, 21D4C, 2278B, 23AB8, 2459B, 24A7D, 24FB9, 25ED7, 2677C, 26B4C, 26C21, 26CBE, 26E3D, 28834, 289A1, 289C0, 28A0F, 28B46 22
CJK Unified Ideographs Extension C 2A8FB, 2A917, 2AA30 3
CJK Unified Ideographs Extension E 2BA52, 2BD77, 2C494, 2C72F, 2C734, 2CB38 6
CJK Unified Ideographs Extension F 2D23B, 2E83A 2
Total 192
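
The Count column follows directly from the Code Points column, where `..` marks an inclusive range and commas separate individual entries. The sketch below (Python, with a hypothetical helper `count_listed`) recomputes a few rows of the table above.

```python
def count_listed(code_points: str) -> int:
    """Count code points in a list such as '3105..312F' or 'FF01, FF0C, FF0E'."""
    total = 0
    for item in code_points.split(","):
        # partition() leaves `last` empty when the item is a single code point.
        first, _, last = item.strip().partition("..")
        total += int(last or first, 16) - int(first, 16) + 1
    return total

# (code points, count) pairs copied from the table above.
rows = {
    "Bopomofo": ("3105..312F", 43),
    "Bopomofo Extended": ("31A0..31BA", 27),
    "Phags-pa": ("A840..A877", 56),
    "Halfwidth and Fullwidth Forms": ("FF01, FF0C, FF0E, FF1A, FF1B, FF1F", 6),
}
for block, (listed, expected) in rows.items():
    assert count_listed(listed) == expected, block
```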

Variation Sequences

Here is a table with new standardized variation sequences:

Character Sequence Context Description of Variation Appearance
3001 FE00 corner-justified form # IDEOGRAPHIC COMMA
3001 FE01 centered form # IDEOGRAPHIC COMMA
3002 FE00 corner-justified form # IDEOGRAPHIC FULL STOP
3002 FE01 centered form # IDEOGRAPHIC FULL STOP
FF01 FE00 corner-justified form # FULLWIDTH EXCLAMATION MARK
FF01 FE01 centered form # FULLWIDTH EXCLAMATION MARK
FF0C FE00 corner-justified form # FULLWIDTH COMMA
FF0C FE01 centered form # FULLWIDTH COMMA
FF0E FE00 corner-justified form # FULLWIDTH FULL STOP
FF0E FE01 centered form # FULLWIDTH FULL STOP
FF1A FE00 corner-justified form # FULLWIDTH COLON
FF1A FE01 centered form # FULLWIDTH COLON
FF1B FE00 corner-justified form # FULLWIDTH SEMICOLON
FF1B FE01 centered form # FULLWIDTH SEMICOLON
FF1F FE00 corner-justified form # FULLWIDTH QUESTION MARK
FF1F FE01 centered form # FULLWIDTH QUESTION MARK

Unicode 12.1

Unicode 12.1 was released on May 7, 2019. It encoded 137,929 characters, adding only 1 new character.

Extended blocks

  • A square era name Reiwa (total 1 character) was added to Enclosed CJK Letters and Months. (U+32FF)

Unicode 13.0

Unicode 13.0 was released on March 10, 2020. It encoded 143,859 characters, adding 5,930 new characters.

New blocks

  • Yezidi (U+10E80-U+10EBF), containing 47 characters, was added.
  • Chorasmian (U+10FB0-U+10FDF), containing 28 characters, was added.
  • Dives Akuru (U+11900-U+1195F), containing 72 characters, was added.
  • Lisu Supplement (U+11FB0-U+11FBF), containing 1 character, was added.
  • Khitan Small Script (U+18B00-U+18CFF), containing 470 characters, was added.
  • Tangut Supplement (U+18D00-U+18D8F), containing 9 characters, was added.
  • Symbols for Legacy Computing (U+1FB00-U+1FBFF), containing 212 characters, was added.
  • CJK Unified Ideographs Extension G (U+30000-U+3134F), containing 4,939 characters, was added.

Extended blocks

  • Letters for African languages and Punjabi (total 10 characters) were added to Arabic Extended-A. (U+08BE-U+08C7)
  • A sign Overline (total 1 character) was added to Oriya. (U+0B55)
  • A Vedic Anusvara (total 1 character) was added to Malayalam. (U+0D04)
  • A sign Candrabindu (total 1 character) was added to Sinhala. (U+0D81)
  • Combining diacritical marks for Scottish phonology (total 2 characters) were added to Combining Diacritical Marks Extended. (U+1ABF-U+1AC0)
  • A Japanese symbol for Type A Electronics (total 1 character) was added to Miscellaneous Symbols and Arrows. (U+2B97)
  • Cross patties and a Tironian sign Capital Et (total 3 characters) were added to Supplemental Punctuation. (U+2E50-U+2E52)
  • Letters for Taiwan and Cantonese language (total 5 characters) were added to Bopomofo Extended. (U+31BB-U+31BF)
  • Some disunified ideographs (total 10 characters) were added to CJK Unified Ideographs Extension A. (U+4DB6-U+4DBF)
  • Some ideographs for China (total 13 characters) were added to CJK Unified Ideographs. (U+9FF0-U+9FFC)
  • Letters for Gaulish (total 6 characters) were added to Latin Extended-D. (U+A7C7-U+A7CA and U+A7F5-U+A7F6)
  • An alternate sign Nasanta (total 1 character) was added to Syloti Nagri. (U+A82C)
  • Letter R With Middle Tilde and modifier letters for Scottish phonology (total 4 characters) were added to Latin Extended-E. (U+AB68-U+AB6B)
  • A symbol Ascia (total 1 character) was added to Ancient Symbols. (U+1019C)
  • A letter for Pali (total 1 character) was added to Chakma. (U+11147)
  • A vowel sign Prishthamatra E and Inverted Candrabindu (total 2 characters) were added to Sharada. (U+111CE and U+111CF)
  • Double comma, signs Jihvamuliya and Upadhmaniya (total 3 characters) were added to Newa. (U+1145A and U+11460-U+11461)
  • Khitan Small Script Filler and reading marks for Vietnamese (total 3 characters) were added to Ideographic Symbols and Punctuation. (U+16FE4 and U+16FF0-U+16FF1)
  • Some additional components (total 13 characters) were added to Tangut Components. (U+18AF3-U+18AFF)
  • Creative Commons license symbols and Mask Work symbol (total 7 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10D-U+1F10F, U+1F16D-U+1F16F and U+1F1AD)
  • Hut, Elevator, Pickup Truck and Roller Skate emoji (total 4 characters) were added to Transport and Map Symbols. (U+1F6D6-U+1F6D7 and U+1F6FB-U+1F6FC)
  • Arrows for legacy computing (total 2 characters) were added to Supplemental Arrows-C. (U+1F8B0-U+1F8B1)
  • Symbols and pictographs (total 10 characters) were added to Supplemental Symbols and Pictographs. (U+1F90C, U+1F972, U+1F977-U+1F978, U+1F9A3-U+1F9A4, U+1F9AB-U+1F9AD and U+1F9CB)
  • Symbols and pictographs (total 41 characters) were added to Symbols and Pictographs Extended-A. (U+1FA74, U+1FA83-U+1FA86, U+1FA96-U+1FAA8, U+1FAB0-U+1FAB6, U+1FAC0-U+1FAC2 and U+1FAD0-U+1FAD6)
  • Gongche characters for Kunqu Opera (total 7 characters) were added to CJK Unified Ideographs Extension B. (U+2A6D7-U+2A6DD)

Glyph Changes

Here is a table with glyph changes:

Block Name Code Points Count
Tagalog 1700..170C, 170E..1714 20
Mongolian 1834, 1871, 1878 3
Sundanese 1BAB 1
Currency Symbols 20BF 1
CJK Radicals Supplement 2E80..2E99, 2E9B..2EF3 115
Kangxi Radicals 2F00..2FD5 214
CJK Unified Ideographs Extension A 3472, 38C7, 3DB8, 3FE0, 440B, 46E9 6
CJK Unified Ideographs 53FD, 6146, 6711, 671C, 6721, 6725, 6BD2, 7B9A, 87CE, 8956, 93BF, 9B97 12
Latin Extended-D A764..A765 2
Phags-pa A86D 1
Tangut 175F6, 17F0D, 17F8A, 17FA5, 180D6, 18139, 18147, 184F1, 18736 9
Tangut Components 18843, 18856, 1888C, 1890A, 18915, 1893B 6
Adlam 1E900..1E94A, 1E950..1E959, 1E95E..1E95F 71
Miscellaneous Symbols and Pictographs 1F3B1 1
Supplemental Symbols and Pictographs 1F995..1F998, 1F99B..1F99E, 1F9B0..1F9B3, 1F9E7 13
CJK Unified Ideographs Extension B 20219, 21249, 21827, 22C3A, 2327B, 23496, 2355E, 2363B, 236ED, 23839, 23FD5, 24261, 24726, 248F2, 2548E, 26657, 26C9E, 26FE1, 27334, 27C0E, 27CEF, 2A38C 22
CJK Unified Ideographs Extension C 2AED5, 2AEF3, 2AF76, 2B09F, 2B1C3, 2B1E5 6
CJK Unified Ideographs Extension E 2B83C, 2B8D9..2B8DA, 2B96F, 2BBD7, 2BD61, 2BE4A, 2BF1D, 2BF9D, 2C0B8, 2C142, 2C176, 2C316, 2C3FB, 2C402, 2C7AC, 2C82C, 2C83A, 2C9A1, 2CC88, 2CD68 21
CJK Unified Ideographs Extension F 2DC09, 2DE4A, 2EB7E, 2EB89 4
CJK Compatibility Ideographs Supplement 2F83B, 2F878, 2F8D6..2F8D7, 2F8DA, 2F8F0, 2F984, 2FA02 8
Total 536

Unicode 14.0

Unicode 14.0 was released on September 14, 2021. It encoded 144,697 characters, adding 838 new characters.

New blocks

  • Arabic Extended-B (U+0870-U+089F), containing 41 characters, was added.
  • Vithkuqi (U+10570-U+105BF), containing 70 characters, was added.
  • Latin Extended-F (U+10780-U+107BF), containing 57 characters, was added.
  • Old Uyghur (U+10F70-U+10FAF), containing 26 characters, was added.
  • Unified Canadian Aboriginal Syllabics Extended-A (U+11AB0-U+11ABF), containing 16 characters, was added.
  • Cypro-Minoan (U+12F90-U+12FFF), containing 99 characters, was added.
  • Tangsa (U+16A70-U+16ACF), containing 89 characters, was added.
  • Kana Extended-B (U+1AFF0-U+1AFFF), containing 13 characters, was added.
  • Znamenny Musical Notation (U+1CF00-U+1CFCF), containing 185 characters, was added.
  • Latin Extended-G (U+1DF00-U+1DFFF), containing 31 characters, was added.
  • Toto (U+1E290-U+1E2BF), containing 31 characters, was added.
  • Ethiopic Extended-B (U+1E7E0-U+1E7FF), containing 28 characters, was added.

Extended blocks

  • An End of Text punctuation mark (total 1 character) was added to Arabic. (U+061D)
  • Letters for Balti and Quranic orthography (total 12 characters) were added to Arabic Extended-A. (U+08B5 and U+08C8-U+08D2)
  • A sign Nukta and letter Nakaara Pollu (total 2 characters) were added to Telugu. (U+0C3C and U+0C5D)
  • A letter Nakaara Pollu (total 1 character) was added to Kannada. (U+0CDD)
  • A letter Ra, sign Pamudpod and archaic letter Ra (total 3 characters) were added to Tagalog. (U+170D, U+1715 and U+171F)
  • A fourth Free variation selector (total 1 character) was added to Mongolian. (U+180F)
  • Combining diacritical marks for extended IPA (total 14 characters) were added to Combining Diacritical Marks Extended. (U+1AC1-U+1ACE)
  • An archaic ligature Jnya and punctuation marks (total 3 characters) were added to Balinese. (U+1B4C and U+1B7D-U+1B7E)
  • A combining Dot Below Left (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFA)
  • A Kyrgyz Som sign (total 1 character) was added to Currency Symbols. (U+20C0)
  • Capital and small letter Caudate Chrivi (total 2 characters) were added to Glagolitic. (U+2C2F and U+2C5F)
  • Medieval and phonetic punctuation marks (total 11 characters) were added to Supplemental Punctuation. (U+2E53-U+2E5D)
  • Some ideographs for Macao (total 3 characters) were added to CJK Unified Ideographs. (U+9FFD-U+9FFF)
  • Archaic European letters, modifier letters for Sokuon and Chatino orthography (total 13 characters) were added to Latin Extended-D. (U+A7C0-U+A7C1, U+A7D0-U+A7D1, U+A7D3, U+A7D5, U+A7D6-U+A7D9 and U+A7F2-U+A7F4)
  • A modifier letter Wasla Above and honorifics (total 20 characters) were added to Arabic Presentation Forms-A. (U+FBC2, U+FD40-U+FD4F, U+FDCF and U+FDFE-U+FDFF)
  • Letters for Old Tamil (total 6 characters) were added to Brahmi. (U+11070-U+11075)
  • A vowel sign Vocalic R (total 1 character) was added to Kaithi. (U+110C2)
  • An Abbreviation sign (total 1 character) was added to Takri. (U+116B9)
  • Letters for Tai Ahom (total 7 characters) were added to Ahom. (U+11740-U+11746) The block was expanded from (U+11700-U+1173F) to (U+11700-U+1174F).
  • Kana archaic letters (total 4 characters) were added to Kana Extended-A. (U+1B11F-U+1B122)
  • Accidental symbols for Iranian classical music (total 2 characters) were added to Musical Symbols. (U+1D1E9-U+1D1EA)
  • Playground Slide, Wheel and Ring Buoy emoji (total 3 characters) were added to Transport and Map Symbols. (U+1F6DD-U+1F6DF)
  • A Heavy Equals Sign emoji (total 1 character) was added to Geometric Shapes Extended. (U+1F7F0)
  • A Troll and Face Holding Back Tears emoji (total 2 characters) were added to Supplemental Symbols and Pictographs. (U+1F979 and U+1F9CC)
  • Symbols and pictographs (total 31 characters) were added to Symbols and Pictographs Extended-A. (U+1FA7B-U+1FA7C, U+1FAA9-U+1FAAC, U+1FAB7-U+1FABA, U+1FAC3-U+1FAC5, U+1FAD7-U+1FAD9, U+1FAE0-U+1FAE7 and U+1FAF0-U+1FAF6)
  • Some ideographs for Macao (total 2 characters) were added to CJK Unified Ideographs Extension B. (U+2A6DE-U+2A6DF)
  • Disunified ideographs and a G source ideograph for China, Hong Kong and Vietnam (total 4 characters) were added to CJK Unified Ideographs Extension C. (U+2B735-U+2B738)

Glyph Changes

Here is a table with glyph changes:

Block Name Code Points Count
Latin Extended-B 0184..0185 2
Arabic 0674..0678, 06C5, 06C7, 06FE 8
Letterlike Symbols 210B, 2110, 2112, 211B, 212C, 2130..2131, 2133 8
Enclosed Alphanumerics 2460..24FF 160
Dingbats 2776..2793 30
CJK Symbols and Punctuation 3001..3029, 3030..303D, 303F 56
CJK Strokes 31C0..31E3 36
Katakana Phonetic Extensions 31F0..31FF 16
Enclosed CJK Letters and Months 3200..321E, 3220..32FF 255
CJK Compatibility 3300..33FF 256
CJK Unified Ideographs Extension A 3777, 3B3F 2
CJK Unified Ideographs 5DD5, 652C, 6AC0 3
Arabic Presentation Forms-A FBD7..FBD8, FBDD, FBE0..FBE1 5
Vertical Forms FE10..FE19 10
CJK Compatibility Forms FE30..FE4F 32
Small Form Variants FE50..FE52, FE54..FE66, FE68..FE6B 26
Halfwidth and Fullwidth Forms FF01..FF9F, FFA1..FFBE, FFC2..FFC7, FFCA..FFCF, FFD2..FFD7, FFDA..FFDC, FFE0..FFE6, FFE8..FFEE 225
Egyptian Hieroglyphs 1300A, 13017, 1302D, 13032, 13034..13035, 13037..13038, 1303A..1303E, 1304E..1304F, 13055, 13057, 13068, 1309A, 130D2, 130D5, 130F6, 130FE, 13192, 1325F, 13267, 1326A, 13281, 13297, 1329E, 132B4, 132C1, 132E6, 13304, 1331F, 13378..1337B, 1337D..1337E, 133F3, 133FA..13403, 1340D, 13417, 1342B 55
Mathematical Alphanumeric Symbols 1D49C, 1D49E..1D49F, 1D4A2, 1D4A5..1D4A6, 1D4A9..1D4AC, 1D4AE..1D4B5 18
Enclosed Alphanumeric Supplement 1F100..1F1AD, 1F1E6..1F1FF 200
Enclosed Ideographic Supplement 1F200..1F202, 1F210..1F23B, 1F240..1F248, 1F250..1F251, 1F260..1F265 64
Supplemental Symbols and Pictographs 1F930 1
CJK Unified Ideographs Extension B 22ADC, 230F2, 25B27, 26F28 4
Total 1472

Variation Sequences

Here is a table with new standardized variation sequences:

Character Sequence Context Description of Variation Appearance
1D49C FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL A
212C FE00 chancery style # SCRIPT CAPITAL B
1D49E FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL C
1D49F FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL D
2130 FE00 chancery style # SCRIPT CAPITAL E
2131 FE00 chancery style # SCRIPT CAPITAL F
1D4A2 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL G
210B FE00 chancery style # SCRIPT CAPITAL H
2110 FE00 chancery style # SCRIPT CAPITAL I
1D4A5 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL J
1D4A6 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL K
2112 FE00 chancery style # SCRIPT CAPITAL L
2133 FE00 chancery style # SCRIPT CAPITAL M
1D4A9 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL N
1D4AA FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL O
1D4AB FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL P
1D4AC FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL Q
211B FE00 chancery style # SCRIPT CAPITAL R
1D4AE FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL S
1D4AF FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL T
1D4B0 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL U
1D4B1 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL V
1D4B2 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL W
1D4B3 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL X
1D4B4 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL Y
1D4B5 FE00 chancery style # MATHEMATICAL SCRIPT CAPITAL Z
1D49C FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL A
212C FE01 roundhand style # SCRIPT CAPITAL B
1D49E FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL C
1D49F FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL D
2130 FE01 roundhand style # SCRIPT CAPITAL E
2131 FE01 roundhand style # SCRIPT CAPITAL F
1D4A2 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL G
210B FE01 roundhand style # SCRIPT CAPITAL H
2110 FE01 roundhand style # SCRIPT CAPITAL I
1D4A5 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL J
1D4A6 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL K
2112 FE01 roundhand style # SCRIPT CAPITAL L
2133 FE01 roundhand style # SCRIPT CAPITAL M
1D4A9 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL N
1D4AA FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL O
1D4AB FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL P
1D4AC FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL Q
211B FE01 roundhand style # SCRIPT CAPITAL R
1D4AE FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL S
1D4AF FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL T
1D4B0 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL U
1D4B1 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL V
1D4B2 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL W
1D4B3 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL X
1D4B4 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL Y
1D4B5 FE01 roundhand style # MATHEMATICAL SCRIPT CAPITAL Z
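
The table mixes two blocks because eight script capitals (B, E, F, H, I, L, M and R) were already encoded in Letterlike Symbols, leaving gaps at the corresponding positions in Mathematical Alphanumeric Symbols. A short Python sketch of that mapping, with all function and constant names invented for illustration and every code point copied from the table above:

```python
# Script capitals encoded in Letterlike Symbols (values from the table above).
LETTERLIKE = {"B": 0x212C, "E": 0x2130, "F": 0x2131, "H": 0x210B,
              "I": 0x2110, "L": 0x2112, "M": 0x2133, "R": 0x211B}
SCRIPT_CAPITAL_A = 0x1D49C
CHANCERY = 0xFE00   # VARIATION SELECTOR-1
ROUNDHAND = 0xFE01  # VARIATION SELECTOR-2

def script_capital(letter: str) -> int:
    """Return the code point of the script capital for an ASCII letter A-Z."""
    return LETTERLIKE.get(letter, SCRIPT_CAPITAL_A + ord(letter) - ord("A"))

def styled(letter: str, selector: int) -> str:
    """Return the script capital followed by the chosen variation selector."""
    return chr(script_capital(letter)) + chr(selector)

for letter in "ABC":
    print(f"{script_capital(letter):04X} {CHANCERY:04X}")
# 1D49C FE00
# 212C FE00
# 1D49E FE00
```

`styled("G", ROUNDHAND)` then yields the two code points 1D4A2 FE01 from the second half of the table.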

Named Sequences

Here is a table with new named character sequences:

Character Sequence Name
0915 093C DEVANAGARI SEQUENCE FOR LETTER QA
0916 093C DEVANAGARI SEQUENCE FOR LETTER KHHA
0917 093C DEVANAGARI SEQUENCE FOR LETTER GHHA
091C 093C DEVANAGARI SEQUENCE FOR LETTER ZA
0921 093C DEVANAGARI SEQUENCE FOR LETTER DDDHA
0922 093C DEVANAGARI SEQUENCE FOR LETTER RHA
092B 093C DEVANAGARI SEQUENCE FOR LETTER FA
092F 093C DEVANAGARI SEQUENCE FOR LETTER YYA
09A1 09BC BENGALI SEQUENCE FOR LETTER RRA
09A2 09BC BENGALI SEQUENCE FOR LETTER RHA
09AF 09BC BENGALI SEQUENCE FOR LETTER YYA
0A32 0A3C GURMUKHI SEQUENCE FOR LETTER LLA
0A38 0A3C GURMUKHI SEQUENCE FOR LETTER SHA
0A16 0A3C GURMUKHI SEQUENCE FOR LETTER KHHA
0A17 0A3C GURMUKHI SEQUENCE FOR LETTER GHHA
0A1C 0A3C GURMUKHI SEQUENCE FOR LETTER ZA
0A2B 0A3C GURMUKHI SEQUENCE FOR LETTER FA
0B21 0B3C ORIYA SEQUENCE FOR LETTER RRA
0B22 0B3C ORIYA SEQUENCE FOR LETTER RHA
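
Each sequence in the table is a base letter followed by the script's nukta sign. As an illustrative aside, and assuming the sequence listed for Devanagari QA is canonically equivalent to the precomposed letter U+0958, Python's `unicodedata` module shows that normalization produces the two-code-point form in both NFD and NFC:

```python
import unicodedata

qa_sequence = "\u0915\u093C"  # KA + NUKTA, the first row of the table above
qa_precomposed = "\u0958"     # DEVANAGARI LETTER QA

# Canonical decomposition splits the precomposed letter into the sequence.
assert unicodedata.normalize("NFD", qa_precomposed) == qa_sequence
# NFC does not recompose it, because U+0958 is excluded from composition.
assert unicodedata.normalize("NFC", qa_precomposed) == qa_sequence

print([f"{ord(c):04X}" for c in unicodedata.normalize("NFC", qa_precomposed)])
# ['0915', '093C']
```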

Unicode 15.0

Unicode 15.0 was released on September 13, 2022. It encoded 149,186 characters, adding 4,489 new characters.
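
The totals quoted for versions 10.0 through 15.0 on this page form a running sum: each total should equal the previous total plus the number of newly added characters. A brief Python check of that arithmetic, using only the figures stated in the version introductions:

```python
# (version, total characters, characters added), as quoted on this page.
releases = [
    ("10.0", 136_690, 8_518),
    ("11.0", 137_374, 684),
    ("12.0", 137_928, 554),
    ("12.1", 137_929, 1),
    ("13.0", 143_859, 5_930),
    ("14.0", 144_697, 838),
    ("15.0", 149_186, 4_489),
]

# Pair each release with its successor and compare the running totals.
for (_, previous_total, _), (version, total, added) in zip(releases, releases[1:]):
    assert previous_total + added == total, version

print("totals are consistent")
```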

New blocks

  • Arabic Extended-C (U+10EC0-U+10EFF), containing 3 characters, was added.
  • Devanagari Extended-A (U+11B00-U+11B5F), containing 10 characters, was added.
  • Kawi (U+11F00-U+11F5F), containing 86 characters, was added.
  • Kaktovik Numerals (U+1D2C0-U+1D2DF), containing 20 characters, was added.
  • Cyrillic Extended-D (U+1E030-U+1E08F), containing 63 characters, was added.
  • Nag Mundari (U+1E4D0-U+1E4FF), containing 42 characters, was added.
  • CJK Unified Ideographs Extension H (U+31350-U+323AF), containing 4,192 characters, was added.
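
A block's range reserves code points, and the "containing N characters" figure says how many of them were assigned at the time. The Python sketch below compares the two numbers for the blocks listed above; the tuple layout and the name `block_size` are choices made for this example only.

```python
# (block name, first code point, last code point, assigned characters),
# copied from the Unicode 15.0 list above.
new_blocks = [
    ("Arabic Extended-C", 0x10EC0, 0x10EFF, 3),
    ("Devanagari Extended-A", 0x11B00, 0x11B5F, 10),
    ("Kawi", 0x11F00, 0x11F5F, 86),
    ("Kaktovik Numerals", 0x1D2C0, 0x1D2DF, 20),
    ("Cyrillic Extended-D", 0x1E030, 0x1E08F, 63),
    ("Nag Mundari", 0x1E4D0, 0x1E4FF, 42),
    ("CJK Unified Ideographs Extension H", 0x31350, 0x323AF, 4192),
]

def block_size(first: int, last: int) -> int:
    """Number of code points in an inclusive range."""
    return last - first + 1

for name, first, last, assigned in new_blocks:
    print(f"{name}: {assigned} of {block_size(first, last)} code points assigned")
```

Arabic Extended-C, for instance, uses 3 of its 64 code points, while CJK Unified Ideographs Extension H fills all 4,192.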

Extended blocks

  • A Yamakkan (total 1 character) was added to Lao. (U+0ECE)
  • A combining Anusvara Above Right (total 1 character) was added to Kannada. (U+0CF3)
  • Letters Qa, Short I and Vocalic R (total 3 characters) were added to Khojki. (U+1123F-U+11241)
  • An additional hieroglyph in Group V (total 1 character) was added to Egyptian Hieroglyphs. (U+1342F)
  • Extended format controls (total 29 characters) were added to Egyptian Hieroglyph Format Controls. (U+13439-U+13455) The block was expanded from (U+13430-U+1343F) to (U+13430-U+1345F).
  • Hiragana and Katakana Small Ko (total 2 characters) were added to Small Kana Extension. (U+1B132 and U+1B155)
  • Letters for Malayalam transliteration (total 6 characters) were added to Latin Extended-G. (U+1DF25-U+1DF2A)
  • A Wireless emoji (total 1 character) was added to Transport and Map Symbols. (U+1F6DC)
  • A Nine Pointed White Star (total 1 character) was added to Geometric Shapes Extended. (U+1F7D9)
  • A Lot of Fortune, eclipse symbols and symbols for dwarf planets (total 8 characters) were added to Alchemical Symbols. (U+1F774-U+1F776 and U+1F77B-U+1F77F)
  • Symbols and pictographs (total 19 characters) were added to Symbols and Pictographs Extended-A. (U+1FA75-U+1FA77, U+1FA87-U+1FA88, U+1FAAD-U+1FAAF, U+1FABB-U+1FABD, U+1FABF, U+1FACE-U+1FACF, U+1FADA-U+1FADB, U+1FAE8 and U+1FAF7-U+1FAF8)
  • A disunified ideograph for Macao (total 1 character) was added to CJK Unified Ideographs Extension C. (U+2B739)

Glyph Changes

Here is a table with glyph changes:

Block Name Code Points Count
IPA Extensions 025E, 029A 2
Unified Canadian Aboriginal Syllabics 144B, 14D1, 1506, 15C0..15C3, 15E8..15EE, 1601, 1604..1607, 160A..160D, 1614..162D, 1630..163F, 1646..1647, 165A 66
Unified Canadian Aboriginal Syllabics Extended 18DB, 18EC, 18F1..18F2, 18F5 5
Sundanese 1BBF 1
Optical Character Recognition 2447 1
CJK Unified Ideographs Extension A 34DC, 3BF6, 3C43, 48B4, 4DBE 5
CJK Unified Ideographs 585F, 5F50, 6BC0, 7BC9, 833E 5
Cyrillic Extended-B A66E 1
Old Turkic 10C47 1
Egyptian Hieroglyphs various (new standardized variation sequences) 94
Khitan Small Script 18CCA 1
Wancho (font update) 1E2C0..1E2F9, 1E2FF 59
Alchemical Symbols (font update) 1F700..1F773 116
CJK Unified Ideographs Extension B 20048, 20A1C, 2143F, 21A5F, 21C08, 21FBA, 22ACF, 23392, 238A7, 23D8F, 23F4E, 25D20, 26E30, 27B48, 27C4F, 28633, 28B02, 28E9A, 29760, 2A60F 20
CJK Unified Ideographs Extension C 2B249 1
CJK Unified Ideographs Extension E 2BB37, 2BD7D, 2C151, 2C1E0, 2C2D6, 2C5CA, 2C810, 2CD34 8
CJK Unified Ideographs Extension F 2CF4E, 2D25D, 2D3EC, 2D6A7, 2D7BA, 2D979, 2DA74, 2DA97, 2DC13, 2DDC0, 2DF10, 2DF78, 2E05A, 2E0AE, 2E516, 2E640, 2E680, 2EA63 18
CJK Compatibility Ideographs Supplement 2F804, 2F805, 2F833, 2F835, 2F84C, 2F84F, 2F852, 2F855, 2F887, 2F88B, 2F899, 2F8A0, 2F8A6, 2F8A7, 2F8AD, 2F8B1, 2F8B4, 2F8B7, 2F8BA, 2F8D0, 2F8E0..2F8E2, 2F8E5, 2F8E6, 2F8FE, 2F900, 2F901, 2F907, 2F912, 2F922, 2F926, 2F936, 2F938, 2F94E, 2F959, 2F95F, 2F96C, 2F99F, 2F9B8, 2F9BA, 2F9D3, 2F9DB, 2F9DC, 2F9E8, 2F9EA, 2F9EE, 2FA00, 2FA0D, 2FA1B 50
CJK Unified Ideographs Extension G 302FC, 30723, 30A6D, 30CF7, 30DBF, 31006, 3105D 7
Total 461
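
The Count column can be checked against the Code Points column. The following is a minimal sketch in Python, assuming the notation used in the table (hexadecimal code points separated by commas, with `..` marking an inclusive range). The function name `count_code_points` is only an illustration, not part of any Unicode tooling.

```python
def count_code_points(listing: str) -> int:
    """Count the code points in a listing such as '18DB, 18EC, 18F1..18F2'."""
    total = 0
    for item in listing.split(","):
        item = item.strip()
        if not item:
            continue  # tolerate a stray trailing comma
        if ".." in item:
            first, last = item.split("..")
            # Ranges are inclusive at both ends.
            total += int(last, 16) - int(first, 16) + 1
        else:
            int(item, 16)  # raises ValueError if the entry is not hexadecimal
            total += 1
    return total


# Two rows taken from the table above.
print(count_code_points("18DB, 18EC, 18F1..18F2, 18F5"))  # 5
print(count_code_points("1E2C0..1E2F9, 1E2FF"))           # 59
```

Rows described as "various" (such as Egyptian Hieroglyphs) cannot be checked this way, because their code points are not listed individually.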

Variation Sequences

Here is a table with new standardized variation sequences:

Character Sequence Context Description of Variation Appearance
13091 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH D027
13092 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH D027A
13093 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH D028
130A9 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH D047
1310F FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH F016
13117 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH F023
1311C FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH F028
13121 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH F032
13127 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH F037A
13139 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH F051
13139 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH F051
13183 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH H005
13187 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH H008
131A0 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH K006
131A0 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH K006
131B1 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH M003
131B1 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH M003
131B8 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH M009
131B9 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH M010
131BA FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH M010A
131CB FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH M017
131EE FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH M044
131EE FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH M044
131F8 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH N010
131F9 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH N011
131F9 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH N011
131FA FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH N012
131FA FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH N012
13216 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH N035
13257 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH O006
1327B FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH O029
1327F FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH O031
1327F FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH O031
13285 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH O036
1328C FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH O039
132A4 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH P008
132A4 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH P008
132AA FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH Q003
132CB FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH R024
132DC FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH S010
132E7 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH S018
132E7 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH S018
132E9 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH S020
132F8 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH S033
132FD FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH S037
13302 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH S042
13303 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH S043
13307 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH T001
13308 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T002
13310 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T008
13311 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T008A
13312 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T009
13312 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T009
13313 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T009A
13313 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T009A
13314 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T010
13314 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T010
1331B FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH T016
1331B FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T016
1331C FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T016A
13321 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T021
13321 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T021
13322 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH T022
13322 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T022
13331 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH T035
13331 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH T035
1333B FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH U007
1333C FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH U008
1334A FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH U022
13361 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH U042
13373 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH V007A
13377 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH V010
13378 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH V011
1337D FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH V012A
13385 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH V019
13399 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH V026
1339A FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH V027
133AF FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH W001
133B0 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH W002
133BF FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH W014
133D3 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH X004A
133DD FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH Y002
133F2 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH Z007
133F5 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH Z010
133F6 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH Z011
13403 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH Z015I
13416 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH AA008
13419 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH AA011
13419 FE01 rotated 180 degrees # EGYPTIAN HIEROGLYPH AA011
13419 FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH AA011
1341A FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH AA012
13423 FE00 rotated 90 degrees # EGYPTIAN HIEROGLYPH AA021
1342C FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH AA030
1342E FE02 rotated 270 degrees # EGYPTIAN HIEROGLYPH AA032
13443 FE00 expanded # EGYPTIAN HIEROGLYPH LOST SIGN
13444 FE00 expanded # EGYPTIAN HIEROGLYPH HALF LOST SIGN
13445 FE00 expanded # EGYPTIAN HIEROGLYPH TALL LOST SIGN
13446 FE00 expanded # EGYPTIAN HIEROGLYPH WIDE LOST SIGN
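
In text, each sequence in the table is simply the base character followed immediately by the variation selector. As a rough sketch in Python (the helper name `variation_sequence` is hypothetical, and whether the rotated form is actually displayed depends on font and renderer support), the first row can be built like this:

```python
def variation_sequence(base_hex: str, selector_hex: str) -> str:
    """Return the base character followed by its variation selector."""
    return chr(int(base_hex, 16)) + chr(int(selector_hex, 16))


# First row of the table: EGYPTIAN HIEROGLYPH D027, rotated 90 degrees.
seq = variation_sequence("13091", "FE00")

print([f"U+{ord(c):04X}" for c in seq])  # ['U+13091', 'U+FE00']
print(len(seq.encode("utf-8")))          # 7 (4 bytes for the base + 3 for the selector)
```

A renderer that does not support the sequence is expected to show the base character unchanged, so the selector can be kept in the text without harm.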

Unicode 15.1

Unicode 15.1 was released on September 12th, 2023. It encoded 149,813 characters, adding 627 new characters.

New Blocks

  • CJK Unified Ideographs Extension I (U+2EBF0-U+2EE5F), containing 622 characters, was added.
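
The size of a block and the number of characters encoded in it need not match. As a small worked check in Python (the helper name `block_capacity` is an illustration only), the range given above spans 624 code points, of which 622 are assigned:

```python
def block_capacity(first: int, last: int) -> int:
    """Number of code points in the inclusive range first..last."""
    return last - first + 1


capacity = block_capacity(0x2EBF0, 0x2EE5F)
assigned = 622  # character count stated above

print(capacity)             # 624
print(capacity - assigned)  # 2 code points in the block remain unassigned
```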

Extended Blocks

  • Four ideographic description characters (total 4 characters) were added to Ideographic Description Characters. (U+2FFC-U+2FFF)
  • An ideographic description character for subtraction (total 1 character) was added to CJK Strokes. (U+31EF)
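
Whether software recognises these additions depends on the version of the Unicode Character Database it ships with. A minimal sketch using Python's standard `unicodedata` module is shown below. The helper name `known_name` is an assumption for illustration, and the result varies with the Python release, so no output is shown.

```python
import unicodedata
from typing import Optional


def known_name(code_point: int) -> Optional[str]:
    """Return the character name, or None if this runtime's database has no name for it."""
    return unicodedata.name(chr(code_point), None)


# Version of the Unicode Character Database bundled with this Python build.
print("Unicode database version:", unicodedata.unidata_version)

# Code points from the ranges listed above; older databases will not know them.
for cp in (0x2FFC, 0x31EF):
    print(f"U+{cp:04X}", known_name(cp) or "(not in this database version)")
```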

Glyph Changes

Here is a table with glyph changes:

Block Name Code Points Count
CJK Unified Ideographs Extension A 357E, 358B..358E, 3599..359D, 35AF..35B0, 35B2..35B3, 35DF..35E1, 35EF, 360F, 3612, 3F94, 44D5, 48EE 23
CJK Unified Ideographs 5098, 512D, 517A, 5391, 54DB, 551C, 551F, 55B8, 55ED, 56AB, 591E, 594A, 5B2E, 5DFC..5DFD, 5EE4, 609E, 65B0, 65B3, 65D5, 65F2, 67B2, 6AB6, 6AEC, 6C69, 6FC2, 6FD3, 7019, 7361, 74BD, 7934, 820B, 826E, 83BB, 8412, 8456, 848A, 896F, 8E34, 8FD7, 9166, 9855, 985E, 9C4D 44
Latin Extended-D A798 1
Latin Extended-E AB5A 1
Tangut 17105, 172A4, 17BD1..17BD3, 17EF9, 18136 7
Alchemical Symbols 1F741, 1F747, 1F74C, 1F74F, 1F756, 1F758, 1F763, 1F768, 1F76D, 1F76E 10
CJK Unified Ideographs Extension B 20302, 2087A, 20C00, 230B7, 2339E, 236EF, 237C3, 23B87, 23CC0, 23CD9, 23E5E, 2486F, 249D6, 249E8, 24D6A, 2585E, 25D89, 26A5A..26A5B, 26A73, 26A82..26A83, 26A90, 26AA6, 26AA8, 26AD8, 27350, 279F8, 284A3, 28BBA, 29516, 29530 32
CJK Unified Ideographs Extension C 2A741, 2AB63, 2ACD8, 2AF6F, 2B173, 2B490 6
CJK Unified Ideographs Extension E 2BC2E, 2BF45, 2C04C, 2C13A, 2C43C, 2C43E, 2C816 7
CJK Unified Ideographs Extension F 2D1CC..2D1CD, 2D1DD, 2D1E4, 2D1F7, 2D203, 2D256, 2D266, 2D2A2, 2D2AC, 2D2DA 11
CJK Unified Ideographs Extension G 301D4, 301D9, 301E4, 301E8, 301FF..30200, 30205, 3020C, 30211, 30215..30217, 30220, 30234..30235, 30237 16
CJK Unified Ideographs Extension H 314B7, 31542, 31569, 31C7F, 31D5A, 31F68 6
Total 164

Unicode 16.0

Unicode 16.0 will be released in September 2024.[1] It will encode 189,192 characters, adding 33,941 new characters.

New Blocks

  • Todhri (U+105C0-U+105FF), containing 52 characters, will be added.
  • Garay (U+10D40-U+10D8F), containing 69 characters, will be added.
  • Tulu-Tigalari (U+11380-U+113FF), containing 80 characters, will be added.
  • Myanmar Extended-C (U+116D0-U+116FF), containing 20 characters, will be added.
  • Sunuwar (U+11BC0-U+11BFF), containing 44 characters, will be added.
  • Egyptian Hieroglyphs Extended-A (U+13460-U+143FF), containing 3995 characters, will be added.
  • Tibetan Supplement (U+15200-U+154FF), containing 490 characters, will be added.
  • Gurung Khema (U+16100-U+1613F), containing 58 characters, will be added.
  • Kirat Rai (U+16D40-U+16D7F), containing 58 characters, will be added.
  • Tangut Components Supplement (U+18D80-U+18DFF), containing 116 characters, will be added.
  • Symbols for Legacy Computing Supplement (U+1CC00-U+1CEBF), containing 695 characters, will be added.
  • Miscellaneous Symbols Supplement (U+1CEC0-U+1CEFF), containing 34 characters, will be added.
  • Ol Onal (U+1E5D0-U+1E5FF), containing 44 characters, will be added.
  • Hangul Syllables Extended-A (U+60000-U+62FFF), containing 28,502 characters, will be added.

Extended Blocks

  • A combining diacritical mark for Jawi (total 1 character) will be added to Arabic Extended-B. (U+0897)
  • An archaic ligature Shrii (total 1 character) will be added to Telugu. (U+0C5C)
  • An archaic ligature Shrii (total 1 character) will be added to Kannada. (U+0CDC)
  • Mongolian Letter Manchu Alternative Ue (total 1 character) will be added to Mongolian. (U+1879)
  • Inverted letters and a punctuation mark (total 3 characters) will be added to Balinese. (U+1B4E-U+1B4F and U+1B7F)
  • Capital and small letter Tje (total 2 characters) will be added to Cyrillic Extended-C. (U+1C89-U+1C8A)
  • A Bhutanese ngultrum sign (total 1 character) will be added to Currency Symbols. (U+20C1)
  • Legacy computing symbols for Delete (total 3 characters) will be added to Control Pictures. (U+2427-U+2429)
  • Equal Sign with Infinity Above (total 1 character) will be added to Miscellaneous Symbols and Arrows. (U+2B96)
  • CJK strokes Hzxg and Szp (total 2 characters) will be added to CJK Strokes. (U+31E4-U+31E5)
  • A capital Rams Horn, an S with Diagonal Stroke, Lambda letters, and letters for Wakashan and Salishan languages (total 6 characters) will be added to Latin Extended-D. (U+A7CB-U+A7CD, U+A7DA-U+A7DC)
  • A combining Alef overlay and letters with two dots vertically below (total 4 characters) will be added to Arabic Extended-C. (U+10EC2-U+10EC4 and U+10EFC)
  • A sign Nukta (total 1 character) will be added to Kawi. (U+11F5A)
  • Chinese Simplified and Traditional Er (total 2 characters) will be added to Ideographic Symbols and Punctuation. (U+16FF2-U+16FF3)
  • Some additional ideographs (total 8 characters) will be added to Tangut. (U+187F8-U+187FF)
  • A blank character (total 1 character) will be added to Khitan Small Script. (U+18CFF)
  • Additional ideographs (total 20 characters) will be added to Tangut Supplement. (U+18D09-U+18D1C)
  • Stein Zimmerman Symbols, Digit Slash Symbols, and other Symbols (total 23 characters) will be added to Musical Symbols. (U+1D127-U+1D128, U+1D1EB-U+1D1F6, U+1D1F7-U+1D1FF)
  • Historical asteroid symbols (total 4 characters) will be added to Alchemical Symbols. (U+1F777-U+1F77A)
  • Containing and Up-Pointing symbols (total 6 characters) will be added to Geometric Shapes Extended. (U+1F7DA-U+1F7DF)
  • A rightwards arrow with hook and arrows for mathematical symbols, legacy computing, Egyptology, chemical symbols and Dingbats (total 34 characters) will be added to Supplemental Arrows-C. (U+1F8B2-U+1F8BB, U+1F8C0-U+1F8C1, U+1F8D0-U+1F8D8, U+1F8E0-U+1F8E4)
  • White and Black Chess Ferz and Alfil (total 4 characters) will be added to Chess Symbols. (U+1FA54-U+1FA57)
  • A Harp, Shovel, Leafless Tree, Fingerprint, Root Vegetable, Splatter, and Face with Bags Under Eyes (total 7 characters) will be added to Symbols and Pictographs Extended-A. (U+1FA89, U+1FA8F, U+1FABE, U+1FAC6, U+1FADC, U+1FADF, and U+1FAE9)
  • Graphic shapes for legacy computing and an alarm bell symbol (total 38 characters) will be added to Symbols for Legacy Computing. (U+1FBCB-U+1FBEF and U+1FBFA)
  • 1 extra blank character will be added to Hangul Syllables (U+D7AF-U+D7B0)

Glyph Changes

Here is a table with glyph changes:

Block Name Code Points Count
Latin Extended-B 0182, 0183, 0184, 0185, 0186, 018E, 018F, 0190, 0195 9
Greek and Coptic 0394, 03A9, 03B2, 03C0 4
Cyrillic 0460, 0462, 0463 3
Samaritan 0807, 0808, 0809, 080E, 0812 5
Cherokee 13D2, 13D4, 13D5, 13D6, 13D7, 13DC, 13DF, 13E4 8
Unified Canadian Aboriginal Syllabics 1401, 14EC, 150D, 1515, 1517, 1521, 1528 7
Cyrillic Extended-C 1C83, 1C84 2
Phonetic Extensions 1D0E, 1D10, 1D11, 1D12, 1D1F, 1D28, 1D29, 1D2A, 1D2F, 1D34 10
Currency Symbols 20A8 1
Arrows 219A, 219B, 21AE, 21AF, 21B0, 21B1, 21B2, 21B3, 21C0, 21C1, 21C6 11
Mathematical Operators 226E, 226F 2
Miscellaneous Technical 236A, 232C, 233F, 2340, 23E2, 23E3, 23E5 7
Geometric Shapes 25AC, 25AD, 25B2, 25B3 4
Miscellaneous Symbols 269E, 269F 2
Dingbats 275B, 275C, 4
Miscellaneous Mathematical Symbols-A 27C1, 27C2, 27C3, 27C4, 27C7, 27C8, 27C9 7
Supplemental Arrows-A 27F0, 27F1, 27F2, 27F3, 27F4 5
Supplemental Arrows-B 296E, 296F 2
Miscellaneous Mathematical Symbols-B 29B5, 29BA, 29D4, 29D5 4
Supplemental Mathematical Operators 2A68, 2A69, 2A76, 2AED, 2AEF 5
Miscellaneous Symbols and Arrows 2BB9, 2BBA, 2BBB, 2BBC, 2BBD, 2BBE, 2BBF 7
Coptic 2CB6, 2CB8 2
Ethiopic Extended 2DB8, 2DB9, 2DBA 3
Supplemental Punctuation 2E26, 2E27 2
CJK Symbols and Punctuation 3001..3029, 3030..303D, 303F 56
CJK Strokes 31C0..31E3 36
Katakana Phonetic Extensions 31F0..31FF 16
Enclosed CJK Letters and Months 3200..321E, 3220..32FF 255
CJK Compatibility 3300..33FF 256
CJK Unified Ideographs Extension A 3777, 3B3F 2
CJK Unified Ideographs 5DD5, 652C, 6AC0 3
Cyrillic Extended-B A64C, A64D, A64E, A64F, A650, A651 6
Latin Extended-D A74A, A74B, A74E, A74F 4
Myanmar Extended-B A9E2, A9E3, A9E4, A9E5, A9E7, A9E8, A9EE, A9F0 8
Ethiopic Extended-A AB20, AB21, AB22, AB23, AB24, AB25, AB26, AB28, AB29 9
Myanmar Extended-A AA60, AA61, AA74, AA75, AA76, AA77 7
Latin Extended-E AB57, AB58, AB59 3
Arabic Presentation Forms-A FBD7..FBD8, FBDD, FBE0..FBE1 5
Vertical Forms FE10..FE19 10
CJK Compatibility Forms FE30..FE4F 32
Small Form Variants FE50..FE52, FE54..FE66, FE68..FE6B 26
Mathematical Alphanumeric Symbols 1D4D0, 1D4D1, 1D4D2, 1D4D3, 1D4D4, 1D4D5, 1D4D6, 1D4D7 8
Playing Cards 1F0A1, 1F0B1, 1F0C1, 1F0D1 4
CJK Unified Ideographs Extension B 200DD, 20164, 20BBF, 20C02, 20CED, 21D4C, 2278B, 23AB8, 2459B, 24A7D, 24FB9, 25ED7, 2677C, 26B4C, 26C21, 26CBE, 26E3D, 28834, 289A1, 289C0, 28A0F, 28B46 22
CJK Unified Ideographs Extension C 2A8FB, 2A917, 2AA30 3
CJK Unified Ideographs Extension E 2BA52, 2BD77, 2C494, 2C72F, 2C734, 2CB38 6
CJK Unified Ideographs Extension F 2D23B, 2E83A 2
Total 895

Code Points Provisionally Assigned

This section lists upcoming Unicode characters whose code points have been provisionally assigned to mature proposals (but not yet accepted) for a future version of the Unicode Standard.

New Blocks

  • Sidetic (U+10940-U+1095F), containing 29 characters, will be added.
  • Sharada Supplement (U+11B60-U+11B7F), containing 8 characters, will be added.
  • Tolong Siki (U+11DB0-U+11DEF), containing 54 characters, will be added.
  • Mandombe (U+15B80-U+15FFF), containing 1041 characters, will be added.
  • Chisoi (U+16D80-U+16DAF), containing 40 characters, will be added.
  • Beria Erfe (U+16EA0-U+16EDF), containing 48 characters, will be added.
  • Rejang Supplement (U+1A760-U+1A77F), containing 39 characters, will be added.
  • Kana Extended-C (U+1AFD0-U+1AFEF), containing 20 characters, will be added.
  • Mathematical Alphanumeric Symbols Supplement (U+1D380-U+1D3FF), containing 60 characters, will be added.
  • Tai Yo (U+1E6C0-U+1E6FF), containing 55 characters, will be added.

Extended Blocks

  • An alternate letter Ba (total 1 character) will be added to Bengali. (U+09FF)
  • Compound tone diacritics (total 14 characters) will be added to Combining Diacritical Marks Extended. (U+1AD0-U+1ADD)
  • Two capital letters for Middle English and Latin pharyngeal voiced fricative letters (total 4 characters) will be added to Latin Extended-D. (U+A7CE-U+A7CF, U+A7D2, U+A7D4)
  • Arabic Ligature Rahmatu Allaahi Alayh and Arabic Honorifics (total 8 characters) will be added to Arabic Presentation Forms-A. (U+FD90, U+FDC8-U+FDCE)
  • A Small Yeh Barree with Two Dots Below, Thin Noon, Biblical End of Verse, Double Vertical Bar Below, and Small Low Noon (total 5 characters) will be added to Arabic Extended-C. (U+10EC5-U+10EC6, U+10ED0 and U+10EFA-U+10EFB)
  • Very Early Dynastic Cuneiform Numbers for counting (total 12 characters) will be added to Cuneiform Numbers and Punctuation. (U+1246F, U+12475-U+1247F)

Roadmap Blocks

This section lists blocks from the roadmaps, which present proportional maps of proposed allocations to Unicode and ISO/IEC 10646. On the roadmaps, italics indicate scripts for which detailed proposals have not yet been written.[2]

Blocks

  • Northern Palaeohispanic (U+10200-U+1023F)
  • Southern Palaeohispanic (U+10240-U+1027F)
  • Shavian Quikscript (U+103E0-U+103FF)
  • Proto-Sinaitic (U+108B0-U+108DF)
  • Numidian (U+10960-U+1097F)
  • Balti-A (U+10AA0-U+10ABF)
  • Book Pahlavi (U+10BB0-U+10BDF)
  • Baburi (U+10BE0-U+10BFF)
  • Arabic Extended-D (U+10D90-U+10E5F)
  • Landa (U+11250-U+1127F)
  • Tani Lipi (U+114E0-U+114FF)
  • Ranjana (U+11500-U+1157F)
  • Zou (U+11750-U+117AF)
  • Pyu (U+117B0-U+117FF)
  • Sirmauri (U+11850-U+1188F)
  • Vateluttu (U+11960-U+1199F)
  • Leke (U+11B80-U+11BBF)
  • Balti-B (U+11CC0-U+11CFF)
  • Tocharian (U+11E00-U+11E6F)
  • Khotanese (U+11E70-U+11ECF)
  • Pallava (U+11F60-U+11FAF)
  • Proto-Cuneiform (U+12580-U+12ECF)
  • Egyptian Hieroglyphs Extended-B (U+14680-U+151FF)
  • Mayan Hieroglyphs (U+15500-U+15AFF)
  • Cirth (U+16000-U+1607F)
  • Tengwar (U+16080-U+160FF)
  • Kurux Banna (U+16140-U+1618F)
  • Moon (U+161A0-U+161FF)
  • Blissymbols (U+16200-U+167FF)
  • Woleai (U+16B90-U+16BFF)
  • Kpelle (U+16C00-U+16C7F)
  • Afaka (U+16C80-U+16CCF)
  • Khimhun Tangsa (U+16CD0-U+16CFF)
  • Tikamuli (U+16D00-U+16D3F)
  • Kulitan (U+16DD0-U+16DFF)
  • Mwangwego (U+16E00-U+16E3F)
  • Bopomofo Extended-A (U+16FA0-U+16FAF)
  • Kanbun Extended-A (U+16FB0-U+16FDF)
  • Khitan Ideographs (U+18E00-U+195FF)
  • Jurchen (U+19600-U+19B9F)
  • Kanbun Extended-B (U+19BE0-U+19BEF)
  • Pau Cin Hau Syllabary (U+19E00-U+1A2FF)
  • Eskaya (U+1A300-U+1A75F)
  • Kaida (U+1A780-U+1A7FF)
  • Naxi Dongba (U+1A800-U+1ACFF)
  • Naxi Geba (U+1AD00-U+1AFCF)
  • Shuishu Logograms (U+1B300-U+1B5FF)
  • Lisu Syllabic Script (U+1B600-U+1B9FF)
  • Indus (U+1BA00-U+1BB8F)
  • Pitman Shorthands (U+1BCB0-U+1BCFF)
  • Proto-Elamite (U+1BD00-U+1C37F)
  • Linear-Elamite (U+1C380-U+1C4FF)
  • Old Chinese Musical Symbols (U+1D250-U+1D2AF)
  • Jianzi Format Controls (U+1DAE0-U+1DAFF)
  • Jianzi Musical Symbols (U+1DB00-U+1DC8F)
  • Eebee Hmong (U+1E150-U+1E1FF)
  • Western Cham (U+1E200-U+1E26F)
  • Loma (U+1E300-U+1E41F)
  • Bagam (U+1E420-U+1E4CF)
  • Pungchen (U+1E500-U+1E52F)
  • Pungchung (U+1E530-U+1E55F)
  • Marchung (U+1E560-U+1E59F)
  • Brusha (U+1E5A0-U+1E5CF)
  • Chola (U+1E600-U+1E65F)
  • Chalukya (Box-Headed) (U+1E660-U+1E6BF)
  • Lampung (U+1E700-U+1E73F)
  • Kerinci (U+1E740-U+1E76F)
  • Buginese Supplement (U+1E770-U+1E7BF)
  • Lontara Bilang-Bilang (U+1E7C0-U+1E7DF)
  • Byblos (U+1EB90-U+1EBFF)
  • Persian Siyaq Numbers (U+1EC00-U+1EC7F)
  • Diwani Siyaq Numbers (U+1ECC0-U+1ECFF)
  • Arabic Supplemental Symbols (U+1EF00-U+1EF3F)
  • Extended Pictographic Characters (U+1FC00-U+1FFFF)
  • Seal Script (U+38000-U+3AB9F)

References

  1. Invalid <ref> tag; no text was provided for refs named 2023release
  2. Roadmaps to Unicode®