Unicode/Versions: Difference between revisions

From Wikibooks, open books for an open world
Jump to navigation Jump to search
[checked revision][checked revision]
Content deleted Content added
No edit summary
Tags: Mobile edit Mobile web edit
Line 1,452: Line 1,452:
* Kannada and Telugu Archaic Shrii (total 2 characters) will be added to '''Kannada''' and '''Telugu'''. (U+0C5C, U+0CDC).
* Kannada and Telugu Archaic Shrii (total 2 characters) will be added to '''Kannada''' and '''Telugu'''. (U+0C5C, U+0CDC).
* 3 diacritics (total 3 characters) will be added to '''Balinese'''. (U+1B4E-U+1B4F, U+1B7F)
* 3 diacritics (total 3 characters) will be added to '''Balinese'''. (U+1B4E-U+1B4F, U+1B7F)
* Cyrillic letter Tje and zhe with stroke (total 4 characters) will be added to '''Cyrillic Extended-C'''. (U+1C89-U+1C8A, U+1C8D-U+1C8E)
* Cyrillic letter Tje and Zhe with stroke (total 4 characters) will be added to '''Cyrillic Extended-C'''. (U+1C89-U+1C8A, U+1C8D-U+1C8E)
* 3 special symbols for delete (total 3 characters) will be added to '''Control Pictures'''. (U+2427-U+2429)
* 3 special symbols for delete (total 3 characters) will be added to '''Control Pictures'''. (U+2427-U+2429)
* 5 additional ideographic description characters will be added to '''Ideographic Description Characters'''. (U+2FE0-U+2FE4). The block will be expanded from (U+2FF0-U+2FFF) to (U+2FE0-U+2FFF)
* 5 additional ideographic description characters will be added to '''Ideographic Description Characters'''. (U+2FE0-U+2FE4). The block will be expanded from (U+2FF0-U+2FFF) to (U+2FE0-U+2FFF)

Revision as of 23:05, 12 October 2022

(edit template)

This page is about each version specification, and the differences between the versions.

Unicode 1.0

Unicode 1.0 was the first version of Unicode, released October 1991. It encoded 7,161 new characters.

“Blocks”

This version of Unicode did not formally group characters in blocks. But in comparison with version 2.0, the following “blocks” were available: U+0000-U+FFFD 51 Blocks

  • Basic Latin, containing 128 characters.
  • Latin-1 Supplement, containing 128 characters.
  • Latin Extended-A, containing 127 characters.
  • Latin Extended-B, containing 113 characters.
  • IPA Extensions, containing 89 characters.
  • Spacing Modifier Letters, containing 57 characters.
  • Combining Diacritical Marks, containing 66 characters.
  • Greek and Coptic, containing 112 characters.
  • Cyrillic, containing 192 characters.
  • Armenian, containing 84 characters.
  • Hebrew, containing 52 characters.
  • Arabic, containing 169 characters.
  • Devanagari, containing 104 characters.
  • Bengali, containing 89 characters.
  • Gurmukhi, containing 74 characters.
  • Gujarati, containing 75 characters.
  • Oriya, containing 78 characters.
  • Tamil, containing 61 characters.
  • Telugu, containing 80 characters.
  • Kannada, containing 80 characters.
  • Malayalam, containing 78 characters.
  • Thai, containing 92 characters.
  • Lao, containing 70 characters.
  • Tibetan, containing 71 characters.
  • Georgian, containing 78 characters.
  • General Punctuation, containing 67 characters.
  • Superscripts and Subscripts, containing 28 characters.
  • Currency Symbols, containing 11 characters.
  • Combining Marks for Symbols, containing 18 characters.
  • Letterlike Symbols, containing 57 characters.
  • Number Forms, containing 48 characters.
  • Arrows, containing 91 characters.
  • Mathematical Operators, containing 242 characters.
  • Miscellaneous Technical, containing 43 characters.
  • Control Pictures, containing 37 characters.
  • Optical Character Recognition, containing 11 characters.
  • Enclosed Alphanumerics, containing 139 characters.
  • Forms, containing 128 characters.
  • Block Elements, containing 22 characters.
  • Geometric Shapes, containing 79 characters.
  • Miscellaneous Symbols, containing 106 characters.
  • Dingbats, containing 160 characters.
  • CJK Symbols and Punctuation, containing 56 characters.
  • Hiragana, containing 90 characters.
  • Katakana, containing 90 characters.
  • Bopomofo, containing 40 characters.
  • Hangul Compatibility Jamo, containing 94 characters.
  • Kanbun, containing 16 characters.
  • Enclosed CJK Letters and Months, containing 191 characters.
  • CJK Compatibility, containing 187 characters.
  • Hangul Syllables, containing 2,350 characters.
  • Private Use Area, reserved for 5,632 characters.
  • CJK Compatibility Forms, containing 28 characters.
  • Small Form Variants, containing 26 characters.
  • Arabic Presentation Forms-B, containing 140 characters.
  • Halfwidth and Fullwidth Forms, containing 216 characters.
  • Specials, containing 1 character.

Unicode 1.0.1

Unicode 1.0.1 was released June 1992. It encoded 28,365 characters, adding 21,204 new characters, removing 96 characters.

New blocks

  • CJK Unified Ideographs, containing 20,902 Han Ideographs for Chinese, Japanese and Korean was added.
  • CJK Compatibility Ideographs, containing 302 Han Ideographs for compatibility with existing character sets was added.

Removed Blocks

  • Tibetan, containing 71 letters for the Tibetan script, was removed from the Unicode standard.

Removed Characters

  • Thai, (total 5 characters) were Removed From Thai Script. (U+0E70-U+0E74) ( Thai Phonetic Order Vowel Sign เ แ โ ใ And ไ)
  • Lao, (total 5 characters) were Removed From Lao Script. (U+0EF0-U+0EF4)
  • Miscellaneous Technical, (total 2 characters) were Removed From Miscellaneous Technical. (U+2300,U+2301)
  • Greek and Coptic, (total 7 characters) were Removed From Greek Script. (U+03DB,U+03DD,U+03DF,U+03E1,U+0371-U+0372,U+0374)
  • Cyrillic, (total 4 characters) Cyrillic Letter Ka With Ogonek And Cyrillic Letter Kha With Ogonek were Removed From Cyrillic Script. (U+04C5-U+04C6,U+04C9-U+04CA)
  • CJK Symbols and Punctuation, (total 1 character) Ideographic Ditto Mark were Removed From CJK Symbols and Punctuation. (U+3004)

Rearranged Characters

  • Circled Katakana: The characters well be arranged in morden order: e.g.,A, I, U, E, O, KA, KI, (U+32D0-U+32FE)
  • Basic Glyphs For Arabic Language: The character shapes will be arranged in different order:Isolate,Final,Initial,Medial (U+FE80-FEFC)

Characters with semantics changed

  • Zero Width Non-Joiner [ZWNJ] (U+20DC)
  • Zero Width Joiner [ZWJ] (U+20DD)

Unicode 1.1

Unicode 1.1 was released June 1993. It encoded 34,233 characters, adding 5,939 new characters and removing 71 characters. It finalized the long anticipated Han Unification.

New blocks

  • Hangul Jamo, containing 240 jamo for the Hangul script, was added.
  • Latin Extended Additional, containing 245 precomposed characters for transliteration and Vietnamese, was added.
  • Greek Extended, containing 233 precomposed characters for polytonic Greek, was added.
  • Alphabetic Presentation Forms, containing 57 precomposed characters and ligatures, was added.
  • Arabic Presentation Forms-A, containing 593 combinations of Arabic letters, was added.
  • Combining Half Marks, containing 4 halves of diacritical marks, was added.

Extended blocks

  • The long S (ſ) (total 1 character) was added to Latin Extended-A.
  • The Hungarian Dz, characters for transliteration purposes and precomposed characters with double grave and inverted breve (total 35 characters) were added to Latin Extended-B.
  • Diacritics for polytonic Greek and double width diacritics (total 6 characters) were added to Combining Diacritical Marks.
  • Compatibility characters now deprecated (total 5 characters) were added to Greek and Coptic.
  • Additional characters for non-Slavic languages (total 38 characters) were added to Cyrillic.
  • A ligature of Ech and Yiwn (և) (total 1 character) was added to Armenian.
  • One deprecated compatibility character and several characters for biblical texts (total 25 characters) were added to Arabic.
  • The virama (o੍) (total 1 character) was added to Gurmukhi.
  • The candra O and candra E vowels (total 3 characters) were added to Gujarati.
  • The Ai length mark (oୖ) (total 1 character) was added to Oriya.
  • An undertie, a pair of brackets and six formatting characters (total 9 characters) were added to General Punctuation.
  • Some additional symbols and the complete set of APL functional symbols (total 79 characters) were added to Miscellaneous Technical.
  • A large circle () (total 1 character) was added to Geometric Shapes.
  • The ideographic telegraph line feed separator symbol () (total 1 character) was added to CJK Symbols and Punctuation.
  • Four Katakana letters not in use since 1945 (total 4 characters) were added to Katakana.
  • Ideographic telegraph symbols for the twelve months (total 12 characters) were added to Enclosed CJK Letters and Months.
  • Ideographic telegraph symbols for hours and days and six additional measure units (total 62 characters) were added to CJK Compatibility.
  • Some more space (total 2,304 characters) was added to the Private Use Area.
  • Seven halfwidth geometric shapes (total 7 characters) were added to Halfwidth and Fullwidth Forms.

Unicode 2.0

Unicode 2.0 was released July 1996. It encoded 38,950 characters, adding 4,717 new characters, and was the first Unicode version to reserve blocks outside of the Basic Multilingual Plane.

New blocks

  • Hangul Syllables, containing 11,172 precomposed syllables for the Hangul script, was added.
  • Supplementary Private Use Area-A and Supplementary Private Use Area-B, reserving a total of 131,068 characters for private use, was added.

Reinstated blocks

  • Tibetan, now containing 168 characters for the Tibetan script including religious signs, was readded.

Extended blocks

  • Cantillation marks for use in religious texts (total 31 characters) were added to Hebrew.
  • The long S with dot above (ẛ) (total 1 character) was added to Latin Extended Additional.
  • The Vietnamese dong (₫) (total 1 character) was added to Currency Symbols.

Unicode 2.1

Unicode 2.1 was released May 1998. It encoded 38,952 characters, adding only 2 new characters.

Extended blocks

  • The euro sign (€) (total 1 character) was added to Currency Symbols.
  • The object replacement character () (total 1 character) was added to Specials.

Unicode 3.0

Unicode 3.0 was released September 1999. It was a big update and encoded 49,259 characters, adding 10,307 new characters.

New blocks

  • Syriac, containing 71 characters used for writing in Syriac script, was added.
  • Thaana, containing 49 characters used for writing in Thaana script, was added.
  • Sinhala, containing 80 characters for the Sinhala script, was added.
  • Myanmar, containing 78 characters for the Burmese script, was added.
  • Ethiopic, containing 345 syllables and punctuation marks for the Ethiopic script, was added.
  • Cherokee, containing 85 syllables for the Cherokee script, was added.
  • Unified Canadian Aboriginal Syllabics, containing 630 syllables and punctuation marks for writing in aboriginal languages of Canada, was added.
  • Ogham, containing 29 characters for the ancient Ogham script, was added.
  • Runic, containing 81 characters for the Germanic runes, was added.
  • Khmer, containing 103 characters for the Khmer script, was added.
  • Mongolian, containing 155 characters for the classical Mongolian script, was added.
  • Braille Patterns, containing 256 Braille letters, was added.
  • CJK Radicals Supplement, containing 115 non-Kangxi radicals, was added.
  • Kangxi Radicals, containing 214 radicals from the Kangxi dictionary, was added.
  • Ideographic Description characters, used to describe a Han ideograph not available in the font, was added.
  • Bopomofo Extended, containing 24 characters used for phonetic transcription of minority languages of Taiwan, was added.
  • CJK Unified Ideographs Extension A, containing 6,582 additional Han Ideographs, was added.
  • Yi Syllables, containing 1,165 syllables of the modern Yi script, was added.
  • Yi Radicals, containing 50 radicals of Yi Syllables, was added.

Extended blocks

  • Additional precomposed characters, letters and capital letters of lowercase-only letters (total 30 characters) were added to Latin Extended-B.
  • Extensions for disordered speech (total 5 characters) were added to IPA Extensions.
  • Some additional modifier letters (total 6 characters) were added to Spacing Modifier Letters.
  • Additional diacritics for IPA notation (total 10 characters) were added to Combining Diacritical Marks.
  • Lowercase versions of archaic letters and the Kai symbol (total 5 characters) were added to Greek and Coptic.
  • Nonstandard letters for Macedonian, combining numeral signs and three letters for Kildin Sami (total 12 characters) were added to Cyrillic.
  • The hyphen (֊) (total 1 character) was added to Armenian.
  • Combining hamza and maddah and nine additional Arabic characters (total 12 characters) were added to Arabic.
  • Additional letters and religious symbols (total 25 characters) were added to Tibetan.
  • A narrow no-break space and 6 additional punctuation marks (total 7 characters) were added to General Punctuation.
  • The Kip, Tugrik and Drachma sign (total 3 characters) were added to Currency Symbols.
  • An enclosing screen and an enclosing key (total 2 characters) were added to Combining Diacritical Marks for Symbols.
  • The information symbol and a rotated Q (total 2 characters) were added to Letterlike Symbols.
  • A mirrored Roman capital numeral hundred (Ↄ) (total 1 character) was added to Number Forms.
  • Some additional arrows (total 9 characters) were added to Arrows.
  • Some additional technical symbols, including common keys on a 101 keyboard (total 33 characters) were added to Miscellaneous Technical.
  • Two additional control pictures (total 2 characters) were added to Control Pictures.
  • Squares and circles with quadrants (total 8 characters) were added to Geometric Shapes.
  • Two Syriac crosses and a signature mark (total 3 characters) were added to Miscellaneous Symbols.
  • Three Hangzhou numerals and a variation indicator (total 4 characters) were added to CJK Symbols and Punctuation.
  • An additional Hebrew ligature (יִ) (total 1 character) was added to Alphabetic Presentation Forms.
  • Three additional control characters for ruby markup (total 3 characters) were added to Specials.

Unicode 3.1

Unicode 3.1 was released March 2001. It encoded 94,205 characters, adding 44,946 new characters, and mainly focused on blocks outside of the Basic Multilingual Plane.

New blocks

  • Old Italic, containing 35 letters for the Etruscan script, was added.
  • Gothic, containing 27 letters for the Gothic script, was added.
  • Deseret, containing 76 letters for the constructed Deseret script, was added.
  • Byzantine Musical Symbols, containing 246 symbols for musical notation in Byzantine, was added.
  • Musical Symbols, containing 219 characters for current musical notation, was added.
  • Mathematical Alphanumeric Symbols, containing 991 Latin and Greek letters in serif, sans-serif, bold, italic, double-struck, script and Fraktur/Blackletter, was added.
  • CJK Unified Ideographs Extension B, containing 42,711 additional Chinese Ideographs, was added.
  • CJK Compatibility Ideographs Supplement, containing 542 additional Chinese Ideographs for compatibility purposes, was added.
  • Tags, containing 97 language tags, was added.

Extended noncharacters

  • The Noncharacters range: U+FDD0..U+FDEF were added to Arabic Presentation Forms-A.

Extended blocks

  • The capital Theta symbol and the Lunate Epsilon symbol (total 2 characters) were added to Greek and Coptic.

Characters and Scripts Under Investigation or Rejected

  • Khmer Sign Laak Was Rejected. (U+17DD) From Khmer.
  • Georgian Letter U-Brjuu Was Rejected. From Georgian.

Unicode 3.2

Unicode 3.2 was released March 2002. It encoded 95,221 characters, adding 1,016 new characters.

New blocks

  • Cyrillic Supplement, containing 16 characters used for the Komi language, was added.
  • Tagalog, containing 20 characters for the Baybayin script, was added.
  • Hanunoo, containing 23 characters and punctuation for the Hanunoo script, was added.
  • Buhid, containing 20 characters for the Buhid script, was added.
  • Tagbanwa, containing 18 characters for the Tagbanwa script, was added.
  • Miscellaneous Mathematical Symbols-A, containing 28 symbols used in math notation, was added.
  • Supplemental Arrows-A, containing 16 additional arrows, was added.
  • Supplemental Arrows-B, containing 128 special arrows, was added.
  • Miscellaneous Mathematical Symbols-B, containing 128 additional mathematical symbols, was added.
  • Supplemental Mathematical Operators, containing 256 additional mathematical operators, was added.
  • Katakana Phonetic Extensions, containing 16 Katakana letters used for Ainu, was added.
  • Variation Selectors, containing 16 symbols used for indicating variations, was added.

Extended blocks

  • The capital letter N with long right leg (Ƞ) (total 1 character) was added to Latin Extended-B.
  • The combining grapheme joiner and combining Latin letters used in medieval texts (total 14 characters) were added to Combining Diacritical Marks.
  • The Qoppa and a reversed lunate epsilon symbol (total 3 characters) were added to Greek and Coptic.
  • Four additional letters used for the Kildin Sami language (total 8 characters) were added to Cyrillic.
  • A dotless Beh and a dotless Qaf (total 2 characters) were added to Arabic.
  • The letter Naa (ޱ) (total 1 character) was added to Thaana.
  • The letters Yn and Elifi (total 2 characters) were added to Georgian.
  • Some additional punctuation marks and control characters (total 12 characters) were added to General Punctuation.
  • A superscript i (ⁱ) (total 1 character) was added to Superscripts and Subscripts.
  • The old penny sign and the peso sign (total 2 characters) were added to Currency Symbols.
  • Some additional combining characters (total 7 characters) were added to Combining Diacritical Marks for Symbols.
  • Some double-struck and reversed/turned letters (total 15 characters) were added to Letterlike Symbols.
  • Some additional arrows (total 12 characters) were added to Arrows.
  • Some additional mathematical operators (total 14 characters) were added to Mathematical Operators.
  • Variable-width and additional symbols (total 53 characters) were added to Miscellaneous Technical.
  • Black and double circled numerals (total 20 characters) were added to Enclosed Alphanumerics.
  • Quadrant elements (total 10 characters) were added to Block Elements.
  • Some additional triangles and squares (total 8 characters) were added to Geometric Shapes.
  • Shogi pieces ,recycling symbols, dices and dotted circles (total 24 characters) were added to Miscellaneous Symbols.
  • Additional parenthesis (total 14 characters) were added to Dingbats.
  • Three additional marks (total 3 characters) were added to CJK Symbols and Punctuation.
  • A digraph and two additional characters (total 3 characters) were added to Hiragana.
  • A digraph and a double hyphen (total 2 characters) were added to Katakana.
  • Additional circled numerals (total 30 characters) were added to Enclosed CJK Letters and Months.
  • Five missing radicals (total 5 characters) were added to Yi Radicals.
  • Additional compatibility characters (total 59 characters) were added to CJK Compatibility Ideographs.
  • The rial sign (﷼) (total 1 character) was added to Arabic Presentation Forms-A.
  • Two sesame dots (total 2 characters) were added to CJK Compatibility Forms.
  • A tail fragment (ﹳ) (total 1 character) was added to Arabic Presentation Forms-B.
  • A pair of double parenthesis (total 2 characters) was added to Halfwidth and Fullwidth Forms.

Unicode 4.0

Unicode 4.0 was released April 2003. It encoded 96,447 characters, adding 1,226 new characters.

New blocks

  • Limbu, containing 66 characters for the Limbu abugida, was added.
  • Tai Le, containing 35 letters for the Tai Le script, was added.
  • Khmer Symbols, containing 32 symbols for the lunar calendar, was added.
  • Phonetic Extensions, containing 108 letters used in phonetic transcription, was added.
  • Miscellaneous Symbols and Arrows, containing 14 additional arrows, was added.
  • Yijing Hexagram Symbols, containing 64 hexagrams, was added.
  • Linear B Syllabary, containing 88 syllables of the ancient Linear B script, was added.
  • Linear B Ideograms, containing 123 ideograms of the ancient Linear B script, was added.
  • Aegean Numbers, containing 57 numerals used in the Aegean area, was added.
  • Ugaritic, containing 31 characters used in Ugaritic cuneiform, was added.
  • Shavian, containing 48 letters used for the artificial Shavian script, was added.
  • Osmanya, containing 40 characters used in the artificial Osmanya script, was added.
  • Cypriot Syllabary, containing 55 characters formerly used on Cyprus, was added.
  • Tai Xuan Jing Symbols, containing 87 symbols of Tai Xuan Jing, was added.
  • Variation Selectors Supplement, containing 240 additional variation selectors, was added.

Extended blocks

  • Letters with curl used in Sinology (total 4 characters) were added to Latin Extended-B.
  • Former IPA letters (total 2 characters) were added to IPA Extensions.
  • Some additional characters (total 17 characters) were added to Spacing Modifier Letters.
  • Additional combining double-width diacritics and diacritics corresponding to their spacing equivalent (total 11 characters) were added to Combining Diacritical Marks.
  • The archaic letters Sho and San and the capital Lunate Sigma (total 5 characters) were added to Greek and Coptic.
  • Some additional markers, biblical signs, and letters with inverted V (total 19 characters) were added to Arabic.
  • Letters used for foreign words from Persian and Sogdian (total 6 characters) were added to Syriac.
  • The short A (ऄ) (total 1 character) was added to Devanagari.
  • The Avagraha sign (ঽ) (total 1 character) was added to Bengali.
  • The Adak Bindi and Visarga signs (total 2 characters) were added to Gurmukhi.
  • The vocalic l and ll and the Rupee sign (total 5 characters) were added to Gujarati.
  • The letters Va and Wa (total 2 characters) were added to Oriya.
  • Additional signs for date and finance environments (total 8 characters) were added to Tamil.
  • The Nukta and Avagraha signs (total 2 characters) were added to Kannada.
  • Some symbols and signs (total 11 characters) were added to Khmer.
  • An inverted undertie and a swung dash (total 2 characters) were added to General Punctuation.
  • The facsimile sign (℻) (total 1 character) was added to Letterlike Symbols.
  • The eject symbol and a vertical line (total 2 characters) were added to Miscellaneous Technical.
  • A black circled digit zero (⓿) (total 1 character) was added to Enclosed Alphanumerics.
  • Monograms and diagrams, flags, warning and weather symbols and a cup of tea (total 12 characters) were added to Miscellaneous Symbols.
  • Additional parenthesized and circled Korean characters and supplemental signs (total 9 characters) were added to Enclosed CJK Letters and Months.
  • Additional measure units (total 7 characters) were added to CJK Compatibility.
  • An additional Arabic sign (﷽) (total 1 character) was added to Arabic Presentation Forms-A.
  • A pair of vertical parenthesis (total 2 characters) was added to CJK Compatibility Forms.
  • The letters Oi and Ew (total 4 characters) were added to Deseret.
  • A small script l (ℓ) (total 1 character) was added to Mathematical Alphanumeric Symbols.

Unicode 4.1

Unicode 4.1 was released March 31, 2005. It encoded 97,720 characters, adding 1,273 new characters.

New blocks

  • Arabic Supplement, containing 30 characters for various languages written with the Arabic script, was added.
  • Ethiopic Supplement, containing 26 characters and signs for Sebatbeit, was added.
  • New Tai Lue, containing 80 characters for the New Tai Lue script, was added.
  • Buginese, containing 30 characters for the Lontara script, was added.
  • Phonetic Extensions Supplement, containing 64 additional letters for phonetic transcription, was added.
  • Combining Diacritical Marks Supplement, containing 4 additional diacritics, was added.
  • Glagolitic, containing 94 characters for the Glagolitic script, was added.
  • Coptic, containing 114 characters for the Coptic script, was added.
  • Georgian Supplement, containing 38 Nuskhuri letters, was added.
  • Tifinagh, containing 55 characters for the Tifinagh script, was added.
  • Ethiopic Extended, containing 79 additional Ethiopic syllables, was added.
  • Supplemental Punctuation, containing 26 additional punctuation marks, was added.
  • CJK Strokes, containing 16 strokes for Han Ideographs, was added.
  • Modifier Tone Letters, containing 23 letters for Chinese tones, was added.
  • Syloti Nagri, containing 44 characters for the Syloti Nagri abugida, was added.
  • Vertical Forms, containing 10 punctuation marks suited for vertical text, was added.
  • Ancient Greek Numbers, containing 75 numerals and signs used in Ancient Greek, was added.
  • Old Persian, containing 50 characters for Old Persian cuneiform, was added.
  • Kharoshthi, containing 65 characters for the Kharoshthi abugida, was added.
  • Ancient Greek Musical Notation, containing 70 musical signs used in Ancient Greek, was added.

Extended blocks

  • Letters for Sencoten, digraphs, letters with swash tail and other additions (total 11 characters) were added to Latin Extended-B.
  • Additional diacritics for transliteration (total 5 characters) were added to Combining Diacritical Marks.
  • Rho with stroke, reversed and dotted Lunate Sigma (total 4 characters) were added to Greek and Coptic.
  • Ghe with descender (Ӷ) (total 2 characters) was added to Cyrillic.
  • An additional biblical mark and some punctuation marks (total 4 characters) were added to Hebrew.
  • Additional biblical marks, punctuation marks and the Afghani sign (total 8 characters) were added to Arabic.
  • A glottal stop (ॽ) (total 1 character) was added to Devanagari.
  • The Khanda Ta letter (ৎ) (total 1 character) was added to Bengali.
  • The letter Sha and the digit zero (total 2 characters) were added to Tamil.
  • Two marks used in Bhutan (total 2 characters) were added to Tibetan.
  • Two letters and a modifier letter (total 3 characters) were added to Georgian.
  • Some additional syllables (total 11 characters) were added to Ethiopic.
  • Additional phonetic symbols (total 20 characters) were added to Phonetic Extensions.
  • A flower and dot punctuation marks (total 9 characters) were added to General Punctuation.
  • Additional subscript letters (total 5 characters) were added to Superscripts and Subscripts.
  • The Guarani, Austral, Hryvnia and Cedi signs (total 4 characters) were added to Currency Symbols.
  • A combining long double solidus (⃫) (total 1 character) was added to Combining Diacritical Marks for Symbols.
  • The per sign and a double-struck letter Pi (total 2 characters) were added to Letterlike Symbols.
  • Metrical and electrical signs (total 11 characters) were added to Miscellaneous Technical.
  • Additional gender and map symbols (total 30 characters) were added to Miscellaneous Symbols.
  • Some additional mathematical symbols (total 7 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Additional arrows and squares (total 6 characters) were added to Miscellaneous Symbols and Arrows.
  • A circled Hangul character (㉾) (total 1 character) was added to Enclosed CJK Letters and Months.
  • Additional Han Ideographs (total 22 characters) were added to CJK Unified Ideographs.
  • Additional Compatibility Ideographs (total 106 characters) were added to CJK Compatibility Ideographs.
  • Italic dotless small i and j (total 2 characters) were added to Mathematical Alphanumeric Symbols.

Unicode 5.0

Unicode 5.0 was released July 14, 2006. It encoded 99,089 characters, adding 1,369 new characters.

New blocks

  • N'Ko, containing 59 characters for the N'Ko script, was added.
  • Balinese, containing 121 characters and musical signs for the Balinese abugida, was added.
  • Latin Extended-C, containing 17 letters for various languages, was added.
  • Latin Extended-D, containing 2 characters for UPA, was added.
  • Phags-pa, containing 56 characters for the Phags-pa script, was added.
  • Phoenician, containing 27 letters and numerals for the Phoenician script, was added.
  • Cuneiform, containing 879 signs for Sumero-Akkadian Cuneiform, was added.
  • Cuneiform Numbers and Punctuation, containing 103 numerals and punctuation signs for Sumero-Akkadian Cuneiform, was added.
  • Counting Rod Numerals, containing 18 numerals used with counting rods, was added.

Extended blocks

  • Various letters used mainly for aboriginal languages (total 14 characters) were added to Latin Extended-B.
  • Lowercase lunate sigma symbols (total 3 characters) were added to Greek and Coptic.
  • Lowercase palochka and 3 letters used in Nivkh (total 7 characters) were added to Cyrillic.
  • Two letters used in Khanty and other languages (total 4 characters) were added to Cyrillic Supplement.
  • A specific point meant for Vav (ֺ) (total 1 character) was added to Hebrew.
  • Four letters used in Sindhi (total 4 characters) were added to Devanagari.
  • Four letters used in Sanskrit (total 4 characters) were added to Kannada.
  • Additional IPA diacritics (total 9 characters) were added to Combining Diacritical Marks Supplement.
  • Four combining arrows (total 4 characters) were added to Combining Diacritical Marks for Symbols.
  • A danish symbol and a lowercase turned F (total 2 characters) were added to Letterlike Symbols.
  • A lowercase reversed C (ↄ) (total 1 character) was added to Number Forms.
  • Vertical parenthesis, geometric forms and electrical symbols (total 12 characters) were added to Miscellaneous Technical.
  • A neuter symbol (⚲) (total 1 character) was added to Miscellaneous Symbols.
  • Four additional mathematical symbols (total 4 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Additional squares, pentagons and hexagons (total 11 characters) were added to Miscellaneous Symbols and Arrows.
  • Four additional tone letters used in Chinantec (total 4 characters) were added to Modifier Tone Letters.
  • Bold Digamma (𝟊/Ϝ) (total 2 characters) was added to Mathematical Alphanumeric Symbols.

Unicode 5.1

Unicode 5.1 was released April 4, 2008. It encoded 100,713 characters, adding 1,624 new characters.

New blocks

  • Sundanese, containing 55 letters for Sundanese script, was added.
  • Lepcha, containing 74 letters for Lepcha script, was added.
  • Ol Chiki, containing 48 letters for Ol Chiki script, was added.
  • Cyrillic Extended-A, containing 32 letters for combining Cyrillic letters, was added.
  • Vai, containing 300 letters for Vai script, was added.
  • Cyrillic Extended-B, containing 78 letters for additional Cyrillic characters, was added.
  • Saurashtra, containing 81 letters for Saurashtra script, was added.
  • Kayah Li, containing 48 letters for Kayah languages, was added.
  • Rejang, containing 37 letters for Rejang script, was added.
  • Cham, containing 83 letters for Cham script, was added.
  • Ancient Symbols, containing 12 characters for weights and measures and other Ancient symbols, was added.
  • Phaistos Disc, containing 46 hieroglyphs for Phaistos, was added.
  • Lycian, containing 29 letters for Lycian script, was added.
  • Carian, containing 49 letters for Carian script, was added.
  • Lydian, containing 27 letters for Lydian script, was added.
  • Mahjong Tiles, containing 44 mahjong tiles, was added.
  • Domino Tiles, containing 100 domino tiles, was added.

Extended blocks

  • Archaic letters and capital kai symbol (total 7 characters) were added to Greek and Coptic.
  • Combining Pokrytie (total 1 character) was added to Cyrillic.
  • Mordvin, Kurdish, Aleut and Chuvash letters (total 16 characters) were added to Cyrillic Supplement.
  • Radix symbols, Letterlike, punctuation, Koranic annotation signs and additions for early Persian and Azerbaijani (total 15 characters) were added to Arabic.
  • Additional letters in Torwali, Burushaski and early Persian (total 18 characters) were added to Arabic Supplement.
  • High spacing dot and candra a (total 2 characters) were added to Devanagari.
  • Udaat and yakash signs (total 2 characters) were added to Gurmukhi.
  • Vocalic rr, l and ll (total 3 characters) were added to Oriya.
  • Om symbol (ௐ) (total 1 character) was added to Tamil.
  • Avagraha, additional phonetic letters, vocalic l and ll, fractional signs and tuumu (total 13 characters) were added to Telugu.
  • Avagraha, vocalic rr, l and ll, Malayalam numerics and fractions and chillu letters (total 17 characters) were added to Malayalam.
  • Letters for Balti and various symbols (total 6 characters) were added to Tibetan.
  • Characters for various languages (total 78 characters) were added to Myanmar.
  • Manchu Ali Gali lha (ᢪ) (total 1 character) was added to Mongolian.
  • Miscellaneous combining marks (total 28 characters) were added to Combining Diacritical Marks Supplement.
  • Medievalist latin letters and miscellaneous letters (total 10 characters) were added to Latin Extended Additional.
  • Invisible plus (+) (total 1 character) was added to General Punctuation.
  • Combining asterisk above ( ⃰)(total 1 character) was added to Combining Diacritical Marks for Symbols.
  • Symbol for Samaritan Source (⅏) (total 1 character) was added to Letterlike Symbols.
  • Archaic Roman Numerals (total 4 characters) were added to Number Forms.
  • Outlined white star and other signs (total 15 characters) were added to Miscellaneous Symbols.
  • Long division and additional mathematical brackets (total 5 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Miscellaneous signs (total 51 characters) were added to Miscellaneous Symbols and Arrows.
  • Additional latin letters (total 12 characters) were added to Latin Extended-C.
  • Additional punctuation (total 23 characters) were added to Supplemental Punctuation.
  • Letter ih (ㄭ) (total 1 character) was added to Bopomofo.
  • Other strokes (total 20 characters) were added to CJK Strokes.
  • Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
  • Africanist tone letters (total 5 characters) were added to Modifier Tone Letters.
  • Miscellaneous letters and symbols (total 112 characters) were added to Latin Extended-D.
  • Continuous macrons for Coptic (total 3 characters) were added to Combining Half Marks.
  • Musical symbol multiple measure rest (𝄩) (total 1 character) was added to Musical Symbols.

Unicode 5.2

Unicode 5.2 was released in October 1, 2009. It encoded 107,361 characters, adding 6,648 new characters.

New blocks

  • Samaritan, containing 61 letters for Samaritan script, was added.
  • Unified Canadian Aboriginal Syllabics Extended, containing 70 syllables for various cree languages, was added.
  • Tai Tham, containing 127 letters for Tai Tham script, was added.
  • Vedic Extensions, containing 35 characters for tone marks and signs, was added.
  • Lisu, containing 48 letters for Lisu script, was added.
  • Bamum, containing 88 letters for Bamum script, was added.
  • Common Indic Number Forms, containing 10 fractions and marks, was added.
  • Devanagari Extended, containing 28 additional marks, was added.
  • Hangul Jamo Extended-A, containing 29 characters for additional old initial consonants in hangul jamo, was added.
  • Javanese, containing 91 letters for Javanese script, was added.
  • Myanmar Extended-A, containing 28 letters for Khamti Shan in Myanmar, was added.
  • Tai Viet, containing 72 letters for Tai Viet script, was added.
  • Meetei Mayek, containing 56 letters for Meetei Mayek script, was added.
  • Hangul Jamo Extended-B, containing 72 characters for additional old medieval vowels and final consonants in hangul jamo, was added.
  • Imperial Aramaic, containing 31 characters for Old Aramaic, was added.
  • Old South Arabian, containing 32 letters and numbers for South Arabian, was added.
  • Avestan, containing 61 characters for Avestan script, was added.
  • Inscriptional Parthian, containing 30 characters for Inscriptional Parthian script, was added.
  • Inscriptional Pahlavi, containing 27 characters for Inscriptional Pahlavi script, was added.
  • Old Turkic, containing 73 characters for Orkhon script, was added.
  • Rumi Numeral Symbols, containing 31 numeric characters used in Fez, Morocco, and elsewhere in North Africa and the Iberian peninsula, between the tenth and seventeenth centuries, was added.
  • Kaithi, containing 66 letters for Khaiti script, was added.
  • Egyptian Hieroglyphs, containing 1,071 hieroglyphs for Egyptian, was added.
  • Enclosed Alphanumeric Supplement, containing 63 additional circled, parenthesized and squared alphanumerics, was added.
  • Enclosed Ideographic Supplement, containing 44 squared and tortoised shell bracketed ideographs, was added.
  • CJK Unified Ideographs Extension C, containing 4,149 additional Chinese Ideographs, was added.

Extended blocks

  • Abhaz letters (total 2 characters) were added to Cyrillic Supplement.
  • Inverted Candrabinbu and additional signs and letters (total 5 characters) were added to Devanagari.
  • Ganda Mark (৻) (total 1 character) was added to Bengali.
  • Religious svasti signs (total 4 characters) were added to Tibetan.
  • Extensions for Khamti Shan and Alton and Phake (total 4 characters) were added to Myanmar.
  • Additional old initial consonants, medival vowels, and old final consonants (total 16 characters) were added to Hangul Jamo.
  • Hyphen and additional syllables (total 10 characters) were added to Unified Canadian Aboriginal Syllabics.
  • Letter Sua and Tham Digit One (total 3 characters) were added to New Tai Lue.
  • Combing Almost Equal to Below ( ᷽) (total 1 character) was added to Combining Diacritical Marks Supplement.
  • The Live Tournosis, Spesmillo and Tenge signs (total 3 characters) were added to Currency Symbols.
  • Additional vulgar fractions (total 4 characters) were added to Number Forms.
  • Decimal exponent symbol (⏨) (total 1 characters) was added to Miscellaneous Technical.
  • Additional weather, game and map symbols, traffic signs, sport symbols, closed captioning and draught and checkers (total 59 characters) were added to Miscellaneous Symbols.
  • Heavy exclamation mark symbol (❗) (total 1 character) was added to Dingbats.
  • Traffic sign, dictionary and map symbols (total 5 characters) were added to Miscellaneous Symbols and Arrows.
  • Capital letter turned alpha and additions for shona (total 3 characters) were added to Latin Extended-C.
  • Cryptogrammic letters and combining marks (total 7 characters) were added to Coptic.
  • Word separator middle dot used in Avestan (⸱) (total 1 character) was added to Supplemental Punctuation.
  • Circled ideographs and numbers on black squares (total 12 characters) were added to Enclosed CJK Letters and Months.
  • Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
  • Miscellaneous additions for compatibility (total 3 characters) were added to CJK Compatibility Ideographs.
  • Number two and three (total 2 characters) were added to Phoenician.

Unicode 6.0

Unicode 6.0 was released in October 11, 2010. It encoded 109,449 characters, adding 2,088 new characters.

New blocks

  • Mandaic, containing 29 letters for Mandaic script, was added.
  • Batak, containing 56 letters for Batak script, was added.
  • Ethiopic Extended-A, containing 32 letters for Gamo-Gofa-Dawro, Basketo and Gumuz Ethiophic syllables, was added.
  • Brahmi, containing 108 characters for ancient Brahmi abugida, was added.
  • Bamum Supplement, containing 761 letters for additional Bamum script, was added.
  • Kana Supplement, containing 2 characters for archaic katakana, was added.
  • Playing Cards, containing 59 playing cards, was added.
  • Miscellaneous Symbols and Pictographs, containing 529 additional symbols, was added.
  • Emoticons, containing 63 faces, cat faces and gesture symbols, was added.
  • Transport and Map Symbols, containing 70 transportation, traffic signs and other symbols, was added.
  • Alchemical Symbols, containing 116 symbols for elements, was added.
  • CJK Unified Ideographs Extension D, containing 222 miscellaneous Han ideographs, was added.

Extended blocks

  • Azerbaijani letters (total 2 characters) were added to Cyrillic Supplement.
  • Kashmiri Yeh and Wavy hamza below (total 2 characters) were added to Arabic.
  • Dependent vowel signs and letters used in Kashmiri and Bihari (total 10 characters) were added to Devanagari.
  • Fraction signs (total 6 characters) were added to Oriya.
  • Letters used in scholarly only and letter dot reph (total 3 characters) were added to Malayalam.
  • Leading and Trailing Mchan Rtags (total 6 characters) were added to Tibetan.
  • Additional combining marks (total 2 characters) were added to Ethiopic.
  • Combining Double Inverted Breve Below (᷼) (total 1 character) was added to Combining Diacritical Marks Supplement.
  • Miscellaneous subscript letters (total 8 characters) were added to Superscripts and Subscripts.
  • Indian Rupee Sign (₹) (total 1 character) was added to Currency Symbols.
  • Pointing double triangle and additional mechanical symbols (total 11 characters) were added to Miscellaneous Technical.
  • Ophiucisus, astronomical symbol for uranus and pentagrams (total 6 characters) were added to Miscellaneous Symbols.
  • Additional heavy punctation marks, raised fist, raised hand, sparkles, heavy arithmetic symbols and curly loops (total 16 characters) were added to Dingbats.
  • Squared logicals (total 2 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Separator mark and consonant joiner (total 2 characters) were added to Tifinagh.
  • Bopomofo for Hmu and Ge (total 3 characters) were added to Bopomofo Extended.
  • Reversed Tse (total 2 characters) were added to Cyrillic Extended-B.
  • Additional letters (total 15 characters) were added to Latin Extended-D.
  • Pedagogical symbols (total 16 characters) were added to Arabic Presentation Forms-A.
  • Additional squared, black circled and squared letters and regional indicator letters (total 107 characters) were added to Enclosed Alphanumeric Supplement.
  • Squared katakana, squared ideographs and circled advantage and accept (total 13 characters) were added to Enclosed Ideographic Supplement.

Unicode 6.1

Unicode 6.1 was released in January 31, 2012. It encoded 110,181 characters, adding 732 new characters.

New blocks

  • Arabic Extended-A (U+08A0-U+08FF), containing 39 characters, was added.
  • Sundanese Supplement (U+1CC0-U+1CCF), containing 8 characters, was added.
  • Meetei Mayek Extensions (U+AAE0-U+AAFF), containing 23 characters, was added.
  • Meroitic Hieroglyphs (U+10980-U+1099F), containing 32 characters, was added.
  • Meroitic Cursive (U+109A0-U+109FF), containing 26 characters, was added.
  • Sora Sompeng (U+110D0-U+110FF), containing 35 characters, was added.
  • Chakma (U+11100-U+1114F), containing 67 characters, was added.
  • Sharada (U+11180-U+111DF), containing 83 characters, was added.
  • Takri (U+11680-U+116CF), containing 66 characters, was added.
  • Miao (U+16F00-U+16F9F), containing 133 characters, was added.
  • Arabic Mathematical Alphabetic Symbols (U+1EE00-U+1EEFF), containing 143 characters, was added.

Extended blocks

  • (total 1 character) was added to Armenian. (U+058F)
  • (total 1 character) was added to Arabic. (U+0604)
  • (total 1 character) was added to Gujarati. (U+0AF0)
  • (total 2 characters) were added to Lao. (U+0EDE-U+0EDF)
  • (total 5 characters) were added to Georgian. (U+10C7, U+10CD and U+10FD-U+10FF)
  • (total 9 characters) were added to Sundanese. (U+1BAB-U+1BAD and U+1BBA-U+1BBF)
  • (total 4 characters) were added to Vedic Extensions. (U+1CF3-U+1CF6)
  • (total 2 characters) were added to Miscellaneous Mathematical Symbols-A. (U+27CB and U+27CD)
  • (total 2 characters) were added to Coptic. (U+2CF2-U+2CF3)
  • (total 2 characters) were added to Georgian Supplement. (U+2D27 and U+2D2D)
  • (total 2 characters) were added to Tifinagh. (U+2D66-U+2D67)
  • (total 10 characters) were added to Supplemental Punctuation. (U+2E32-U+2E3B)
  • (total 1 character) was added to CJK Unified Ideographs. (U+9FCC)
  • (total 9 characters) were added to Cyrillic Extended-B. (U+A674-U+A67B and U+A69F)
  • (total 5 characters) were added to Latin Extended-D. (U+A792-U+A793, U+A7AA and U+A7F8-U+A7F9)
  • (total 2 characters) were added to CJK Compatibility Ideographs. (U+FA2E-U+FA2F)
  • (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F16A-U+1F16B)
  • (total 4 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F540-U+1F543)
  • (total 13 characters) were added to Emoticons. (U+1F600, U+1F611, U+1F615, U+1F617, U+1F619, U+1F61B, U+1F61F, U+1F626-U+1F627, U+1F62C, U+1F62E-U+1F62F and U+1F634)

Unicode 6.2

Unicode 6.2 was released in September 26, 2012. It encoded 110,182 characters, adding only 1 new character.

Extended blocks

  • (total 1 character) was added to Currency Symbols. (U+20BA)

Unicode 6.3

Unicode 6.3 was released in September 30, 2013. It encoded 110,187 characters, adding only 5 new characters.

Extended blocks

  • (total 1 character) was added to Arabic. (U+061C)
  • (total 4 characters) were added to General Punctuation. (U+2066-U+2069)

Unicode 7.0

Unicode 7.0 was released in June 16, 2014. It encodes 113,021 characters, adding 2,834 new characters.

New blocks

  • Combining Diacritical Marks Extended (U+1AB0-U+1AFF), containing 15 marks, was added.
  • Myanmar Extended-B (U+A9E0-U+A9FF), containing 31 letters, was added.
  • Latin Extended-E (U+AB30-U+AB6F), containing 50 letters, was added.
  • Coptic Epact Numbers (U+102E0-U+102FF), containing 28 numbers, was added.
  • Old Permic (U+10350-U+1037F), containing 43 letters, was added.
  • Elbasan (U+10500-U+1052F), containing 50 letters, was added.
  • Caucasian Albanian (U+10530-U+1056F), containing 53 letters and marks, was added.
  • Linear A (U+10600-U+1077F), containing 341 signs, was added.
  • Palmyrene (U+10860-U+1087F), containing 32 letters, was added.
  • Nabataean (U+10880-U+108AF), containing 40 letters and numbers, was added.
  • Old North Arabian (U+10A80-U+10A9F), containing 32 letters and numbers, was added.
  • Manichaean (U+10AC0-U+10AFF), containing 51 characters, was added.
  • Psalter Pahlavi (U+10B80-U+10BAF), containing 29 characters, was added.
  • Mahajani (U+11150-U+1117F), containing 39 letters and signs, was added.
  • Sinhala Archaic Numbers (U+111E0-U+111FF), containing 20 numbers, was added.
  • Khojki (U+11200-U+1124F), containing 61 characters, was added.
  • Khudawadi (U+112B0-U+112FF), containing 69 characters, was added.
  • Grantha (U+11300-U+1137F), containing 83 characters, was added.
  • Tirhuta (U+11480-U+114DF), containing 82 characters, was added.
  • Siddham (U+11580-U+115FF), containing 72 characters, was added.
  • Modi (U+11600-U+1165F), containing 79 characters, was added.
  • Warang Citi (U+118A0-U+118FF), containing 84 letters and numbers, was added.
  • Pau Cin Hau (U+11AC0-U+11AFF), containing 57 characters, was added.
  • Mro (U+16A40-U+16A6F), containing 43 characters, was added.
  • Bassa Vah (U+16AD0-U+16AFF), containing 36 characters, was added.
  • Pahawh Hmong (U+16B00-U+16B8F), containing 127 letters and signs, was added.
  • Duployan (U+1BC00-U+1BC9F), containing 143 characters, was added.
  • Shorthand Format Controls (U+1BCA0-U+1BCAF), containing 4 format characters, was added.
  • Mende Kikakui (U+1E800-U+1E8DF), containing 213 syllables and numbers, was added.
  • Ornamental Dingbats (U+1F650-U+1F67F), containing 48 pictographic characters, was added.
  • Geometric Shapes Extended (U+1F780-U+1F7FF), containing 85 pictographic characters, was added.
  • Supplemental Arrows-C (U+1F800-U+1F8FF), containing 148 pictographic characters, was added.

Extended blocks

  • (total 1 character) was added to Greek and Coptic. (U+037F)
  • (total 8 characters) were added to Cyrillic Supplement. (U+0528-U+052F)
  • (total 2 characters) were added to Armenian. (U+058D-U+058E)
  • (total 1 character) was added to Arabic. (U+0605)
  • (total 8 characters) were added to Arabic Extended-A. (U+08A1, U+08AD-U+08B2 and U+08FF)
  • (total 1 character) was added to Devanagari. (U+0978)
  • (total 1 character) was added to Bengali. (U+0980)
  • (total 2 characters) were added to Telugu. (U+0C00 and U+0C34)
  • (total 1 character) was added to Kannada. (U+0C81)
  • (total 1 character) was added to Malayalam. (U+0D01)
  • (total 10 digits) were added to Sinhala. (U+0DE6-U+0DEF)
  • (total 8 characters) were added to Runic. (U+16F1-U+16F8)
  • (total 2 characters) were added to Limbu. (U+191D-U+191E)
  • (total 2 characters) were added to Vedic Extensions. (U+1CF8-U+1CF9)
  • (total 15 characters) were added to Combining Diacritical Marks Supplement. (U+1DE7-U+1DF5)
  • (total 3 characters) were added to Currency Symbols. (U+20BB-U+20BD)
  • (total 7 characters) were added to Miscellaneous Technical. (U+23F4-U+23FA)
  • (total 1 character) was added to Dingbats. (U+2700)
  • (total 115 characters) were added to Miscellaneous Symbols and Arrows. (U+2B4D-U+2B4F, U+2B5A-U+2B5F, U+2B60-U+2B73, U+2B76-U+2B95, U+2B98-U+2BB9, U+2BBD-U+2BC8 and U+2BCA-U+2BD1)
  • (total 7 characters) were added to Supplemental Punctuation. (U+2E3C-U+2E42)
  • (total 6 characters) were added to Cyrillic Extended-B. (U+A698-U+A69D)
  • (total 18 characters) were added to Latin Extended-D. (U+A794-U+A79F, U+A7AB-U+A7AD, U+A7B0-U+A7B1 and U+A7F7)
  • (total 4 characters) were added to Myanmar Extended-A. (U+AA7C-U+AA7F)
  • (total 7 characters) were added to Combining Half Marks. (U+FE27-U+FE2D)
  • (total 2 characters) were added to Ancient Greek Numbers. (U+1018B-U+1018C)
  • (total 1 character) was added to Ancient Symbols. (U+101A0)
  • (total 1 character) was added to Old Italic. (U+1031F)
  • (total 1 character) was added to Brahmi. (U+1107F)
  • (total 2 characters) were added to Sharada. (U+111CD and U+111DA)
  • (total 42 characters) were added to Cuneiform. (U+1236F-U+12398)
  • (total 13 characters) were added to Cuneiform Numbers and Punctuation. (U+12463-U+1246E and U+12474)
  • (total 23 characters) were added to Playing Cards. (U+1F0BF and U+1F0E0-U+1F0F5)
  • (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10B-U+1F10C)
  • (total 209 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F321-U+1F32C, U+1F336, U+1F37D, U+1F394-U+1F39F, U+1F3C5, U+1F3CB-U+1F3CE, U+1F3D4-U+1F3DF, U+1F3F1-U+1F3F7, U+1F43F, U+1F441, U+1F4F8, U+1F4FD-U+1F4FE, U+1F53E-U+1F53F, U+1F544-U+1F54A, U+1F568-U+1F579, U+1F57B-U+1F5A3 and U+1F5A5-U+1F5FA)
  • (total 2 characters) were added to Emoticons. (U+1F641-U+1F642)
  • (total 27 characters) were added to Transport and Map Symbols. (U+1F6C6-U+1F6CF, U+1F6E0-U+1F6EC and U+1F6F0-U+1F6F3)

Unicode 8.0

Unicode 8.0 was released in June 17, 2015. It encoded 120,737 characters, adding 7,716 new characters.

New blocks

  • Cherokee Supplement (U+AB70-U+ABBF), containing 80 lowercase letters, was added.
  • Hatran (U+108E0-U+108FF), containing 26 letters, was added.
  • Old Hungarian (U+10C80-U+10CFF), containing 108 letters, was added.
  • Multani (U+11280-U+112AF), containing 38 letters, was added.
  • Ahom (U+11700-U+1173F), containing 57 letters, was added.
  • Early Dynastic Cuneiform (U+12480-U+1254F), containing 196 characters, was added.
  • Anatolian Hieroglyphs (U+14400-U+1467F), containing 583 characters, was added.
  • Sutton SignWriting (U+1D800-U+1DAAF), containing 672 signs, was added.
  • Supplemental Symbols and Pictographs (U+1F900-U+1F9FF), containing 15 pictographic characters, was added.
  • CJK Unified Ideographs Extension E (U+2B820-U+2CEAF), containing 5762 characters, was added.

Extended blocks

  • (total 3 characters) were added to Arabic Extended-A. (U+08B3-U+08B4 and U+08E3)
  • (total 1 character) were added to Gujarati. (U+0AF9)
  • (total 1 character) were added to Telugu. (U+0C5A)
  • (total 1 character) were added to Malayalam. (U+0D5F)
  • (total 7 characters) were added to Cherokee. (U+13F5 and U+13F8-U+13FD)
  • (total 1 character) were added to Currency Symbols. (U+20BE)
  • (total 2 characters) were added to Number Forms. (U+218A-U+218B)
  • (total 4 characters) were added to Miscellaneous Symbols and Arrows. (U+2BEC-U+2BEF)
  • (total 9 characters) were added to CJK Unified Ideographs. (U+9FCD-U+9FD5)
  • (total 1 character) were added to Cyrillic Extended-B. (U+A69E)
  • (total 7 characters) were added to Latin Extended-D. (U+A78F and U+A7B2-U+A7B7)
  • (total 2 characters) were added to Devanagari Extended. (U+A8FC-U+A8FD)
  • (total 4 characters) were added to Latin Extended-E. (U+AB60-U+AB63)
  • (total 2 characters) were added to Combining Half Marks. (U+FE2E-U+FE2F)
  • (total 64 characters) were added to Meroitic Cursive. (U+109BC-U+109BD, U+109C0-U+109CF and U+109D2-U+109FF)
  • (total 9 characters) were added to Sharada. (U+111C9-U+111CC and U+111DB-U+111DF)
  • (total 2 characters) were added to Grantha. (U+11300 and U+11350)
  • (total 20 characters) were added to Siddham. (U+115CA-U+115DD)
  • (total 1 character) were added to Cuneiform. (U+12399)
  • (total 11 characters) were added to Musical Symbols. (U+1D1DE-U+1D1E8)
  • (total 24 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F32D-U+1F32F, U+1F37E-U+1F37F, U+1F3CF-U+1F3D3, U+1F3F8-U+1F3FF, U+1F4FF and U+1F54B-U+1F54F)
  • (total 2 characters) were added to Emoticons. (U+1F643-U+1F644)
  • (total 1 character) were added to Transport and Map Symbols. (U+1F6D0)

Unicode 9.0

Unicode 9.0, was released in June 21, 2016. It encoded 128,237 characters, adding 7,500 new characters.

New blocks

  • Cyrillic Extended-C (U+1C80-U+1C8F), containing 9 letters, was added.
  • Osage (U+104B0-U+104FF), containing 72 letters, was added.
  • Newa (U+11400-U+1147F), containing 92 letters, was added.
  • Mongolian Supplement (U+11660-U+1167F), containing 13 letters, was added.
  • Bhaiksuki (U+11C00-U+11C6F), containing 97 letters, was added.
  • Marchen (U+11C70-U+11CBF), containing 68 letters, was added.
  • Ideographic Symbols and Punctuation (U+16FE0-U+16FFF), containing 1 letter, was added.
  • Tangut (U+17000-U+187FF), containing 6125 letters, was added.
  • Tangut Components (U+18800-U+18AFF), containing 755 letters, was added.
  • Glagolitic Supplement (U+1E000-U+1E02F), containing 38 letters, was added.
  • Adlam (U+1E900-U+1E95F), containing 87 letters, was added.

Extended blocks

  • (total 23 characters) were added to Arabic Extended-A. (U+08B6-U+08BD and U+08D4-U+08E2)
  • (total 1 character) were added to Kannada. (U+0C80)
  • (total 14 characters) were added to Malayalam. (U+0D4F, U+0D54-U+0D56, U+0D58-U+0D5E and U+0D76-U+0D78)
  • (total 1 character) were added to Combining Diacritical Marks Supplement. (U+1DFB)
  • (total 4 characters) were added to Miscellaneous Technical. (U+23FB-U+23FE)
  • (total 2 characters) were added to Supplemental Punctuation. (U+2E43-U+2E44)
  • (total 1 character) were added to Latin Extended-D. (U+A7AE)
  • (total 1 character) were added to Saurashtra. (U+A8C5)
  • (total 2 characters) were added to Ancient Greek Numbers. (U+1018D-U+1018E)
  • (total 1 character) were added to Khojki. (U+1123E)
  • (total 18 characters) were added to Enclosed Alphanumeric Supplement. (U+1F19B-U+1F1AC)
  • (total 1 character) were added to Enclosed Ideographic Supplement. (U+1F23B)
  • (total 2 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F57A and U+1F5A4)
  • (total 5 characters) were added to Transport and Map Symbols. (U+1F6D1-U+1F6D2 and U+1F6F4-U+1F6F6)
  • (total 67 characters) were added to Supplemental Symbols and Pictographs. (U+1F919-U+1F91E, U+1F920-U+1F927, U+1F930, U+1F933-U+1F93E, U+1F940-U+1F94B, U+1F950-U+1F95E and U+1F985-U+1F991)

Unicode 10.0

Unicode 10.0, was released in June 20, 2017. It encoded 136,690 characters, adding 8,453 new characters.

New blocks

  • Syriac Supplement (U+0860-U+086F), containing 11 characters, was added.
  • Zanabazar Square (U+11A00-U+11A4F), containing 72 characters, was added.
  • Soyombo (U+11A50-U+11AAF), containing 80 characters, was added.
  • Masaram Gondi (U+11D00-U+11D5F), containing 75 characters, was added.
  • Kana Extended-A (U+1B100-U+1B12F), containing 31 characters, was added.
  • Nushu (U+1B170-U+1B2FF), containing 396 characters, was added.
  • CJK Unified Ideographs Extension F (U+2CEB0-U+2EBEF), containing 7,473 characters, was added.

Extended blocks

  • (total 2 characters) were added to Bengali. (U+09FC-U+09FD)
  • (total 6 characters) were added to Gujarati. (U+0AFA-U+0AFF)
  • (total 3 characters) were added to Malayalam. (U+0D00 and U+0D3B-U+0D3C)
  • (total 1 character) were added to Vedic Extensions. (U+1CF7)
  • (total 4 characters) were added to Combining Diacritical Marks Supplement. (U+1DF6-U+1DF9)
  • (total 1 character) were added to Currency Symbols. (U+20BF)
  • (total 1 character) were added to Miscellaneous Technical. (U+23FF)
  • (total 1 character) were added to Miscellaneous Symbols and Arrows. (U+2BD2)
  • (total 5 characters) were added to Supplemental Punctuation. (U+2E45-U+2E49)
  • (total 1 character) were added to Bopomofo. (U+312E)
  • (total 21 characters) were added to CJK Unified Ideographs. (U+9FD6-U+9FEA)
  • (total 3 characters) were added to Old Italic. (U+1032D-U+1032F)
  • (total 1 character) were added to Ideographic Symbols and Punctuation. (U+16FE1)
  • (total 254 characters) were added to Kana Supplement. (U+1B002-U+1B0FF)
  • (total 6 characters) were added to Enclosed Ideographic Supplement. (U+1F260-U+1F265)
  • (total 4 characters) were added to Transport and Map Symbols. (U+1F6D3-U+1F6D4 and U+1F6F7-U+1F6F8)
  • (total 66 characters) were added to Supplemental Symbols and Pictographs. (U+1F900-U+1F90B, U+1F91F, U+1F928-U+1F92F, U+1F931-U+1F932, U+1F94C, U+1F95F-U+1F96B, U+1F992-U+1F997 and U+1F9D0-U+1F9E6)

Unicode 11.0

Unicode 11.0, was released in June 5, 2018. It encoded 137,374 characters, adding 684 new characters.

New blocks

  • Georgian Extended (U+1C90-U+1CBF), containing 46 characters, was added.
  • Hanifi Rohingya (U+10D00-U+10D3F), containing 50 characters, was added.
  • Old Sogdian (U+10F00-U+10F2F), containing 40 characters, was added.
  • Sogdian (U+10F30-U+10F6F), containing 42 characters, was added.
  • Dogra (U+11800-U+1184F), containing 60 characters, was added.
  • Gunjala Gondi (U+11D60-U+11DAF), containing 63 characters, was added.
  • Makasar (U+11EE0-U+11EFF), containing 25 characters, was added.
  • Medefaidrin (U+16E40-U+16E9F), containing 91 characters, was added.
  • Mayan Numerals (U+1D2E0-U+1D2FF), containing 20 characters, was added.
  • Indic Siyaq Numbers (U+1EC70-U+1ECBF), containing 68 characters, was added.
  • Chess Symbols (U+1FA00-U+1FA6F), containing 14 characters, was added.

Extended blocks

  • (total 2 characters) were added to Armenian. (U+0560 and U+0588)
  • (total 1 character) were added to Hebrew. (U+05EF)
  • (total 3 characters) were added to N'Ko. (U+07FD-U+07FF)
  • (total 1 character) were added to Arabic Extended-A. (U+08D3)
  • (total 1 character) were added to Bengali. (U+09FE)
  • (total 1 character) were added to Gurmukhi. (U+0A76)
  • (total 1 character) were added to Telugu. (U+0C04)
  • (total 1 character) were added to Kannada. (U+0C84)
  • (total 1 character) were added to Mongolian. (U+1878)
  • (total 43 characters) were added to Miscellaneous Symbols and Arrows. (U+2BBA-U+2BBC, U+2BD3-U+2BEB and 2BF0-U+2BFE)
  • (total 5 characters) were added to Supplemental Punctuation. (U+2E4A-U+2E4E)
  • (total 1 character) were added to Bopomofo. (U+312F)
  • (total 5 characters) were added to CJK Unified Ideographs. (U+9FEB-U+9FEF)
  • (total 3 characters) were added to Latin Extended-D. (U+A7AF and U+A7B8-U+A7B9)
  • (total 2 characters) were added to Devanagari Extended. (U+A8FE-U+A8FF)
  • (total 3 characters) were added to Kharoshthi. (U+10A34-U+10A35 and U+10A48)
  • (total 1 character) were added to Kaithi. (U+110CD)
  • (total 3 characters) were added to Chakma. (U+11144-U+11146)
  • (total 1 character) were added to Grantha. (U+1133B)
  • (total 1 character) were added to Newa. (U+1145E)
  • (total 1 character) were added to Ahom. (U+1171A)
  • (total 1 character) were added to Soyombo. (U+11A9D)
  • (total 5 characters) were added to Tangut. (U+187ED-U+187F1)
  • (total 7 characters) were added to Counting Rod Numerals. (U+1D372-U+1D378)
  • (total 1 character) were added to Enclosed Alphanumeric Supplement. (U+1F12F)
  • (total 1 character) were added to Transport and Map Symbols. (U+1F6F9)
  • (total 4 characters) were added to Geometric Shapes Extended. (U+1F7D5-U+1F7D8)
  • (total 65 characters) were added to Supplemental Symbols and Pictographs. (U+1F94D-U+1F94F, U+1F96C-U+1F970, U+1F973-U+1F976, U+1F97A, U+1F97C-U+1F97F, U+1F998-U+1F99F, U+1F9A0-U+1F9A2, U+1F9B0-U+1F9B9, U+1F9C1-U+1F9C2 and U+1F9E7-U+1F9FF)

Unicode 12.0

Unicode 12.0 was released on March 5, 2019. It encoded 137,928 characters, adding 555 new characters.

New blocks

  • Elymaic (U+10FE0-U+10FFF), containing 23 characters, was added.
  • Nandinagari (U+119A0-U+119FF), containing 65 characters, was added.
  • Tamil Supplement (U+11FC0-U+11FFF), containing 51 characters, was added.
  • Egyptian Hieroglyph Format Controls (U+13430-U+1343F), containing 9 characters, was added.
  • Small Kana Extension (U+1B130-U+1B16F), containing 7 characters, was added.
  • Nyiakeng Puachue Hmong (U+1E100-U+1E14F), containing 71 characters, was added.
  • Wancho (U+1E2C0-U+1E2FF), containing 59 characters, was added.
  • Ottoman Siyaq Numbers (U+1ED00-U+1ED4F), containing 61 characters, was added.
  • Symbols and Pictographs Extended-A (U+1FA70-U+1FAFF), containing 16 characters, was added.

Extended blocks

  • (total 1 character) was added to Telugu. (U+0C77)
  • (total 15 characters) were added to Lao. (U+0E86, U+0E89, U+0E8C, U+0E8E-U+0E93, U+0E98, U+0EA0, U+0EA8-U+0EA9, U+0EAC and U+0EBA)
  • (total 1 character) was added to Vedic Extensions. (U+1CFA)
  • (total 2 characters) were added to Miscellaneous Symbols and Arrows. (U+2BC9 and U+2BFF)
  • (total 1 character) was added to Supplemental Punctuation. (U+2E4F)
  • (total 11 characters) were added to Latin Extended-D. (U+A7BA-U+A7BF and U+A7C2-U+A7C6)
  • (total 2 characters) were added to Latin Extended-E. (U+AB66-U+AB67)
  • (total 1 character) was added to Newa. (U+1145F)
  • (total 1 character) was added to Takri. (U+116B8)
  • (total 2 characters) were added to Soyombo. (U+11A84-U+11A85)
  • (total 16 characters) were added to Miao. (U+16F45-U+16F4A, U+16F4F and U+16F7F-U+16F87)
  • (total 2 characters) were added to Ideographic Symbols and Punctuation. (U+16FE2-U+16FE3)
  • (total 6 characters) were added to Tangut. (U+187F2-U+187F7)
  • (total 1 character) was added to Adlam. (U+1E94B)
  • (total 1 character) was added to Enclosed Alphanumeric Supplement. (U+1F16C)
  • (total 2 characters) were added to Transport and Map Symbols. (U+1F6D5 and U+1F6FA)
  • (total 12 characters) were added to Geometric Shapes Extended. (U+1F7E0-U+1F7EB)
  • (total 31 characters) were added to Supplemental Symbols and Pictographs. (U+1F90D-U+1F90F, U+1F93F, U+1F971, U+1F97B, U+1F9A5-U+1F9AA, U+1F9AE-U+1F9AF, U+1F9BA-U+1F9BF, U+1F9C3-U+1F9CA and U+1F9CD-U+1F9CF)
  • (total 84 characters) were added to Chess Symbols. (U+1FA00-U+1FA53)

Glyph Changes

  • The Won Symbol (U+2089) Is Now Wider.
  • Both Extended Bopomofo Tone Marks (026A-026B) Is Now Bigger.

And Here Are List of Other Changes:

Block Name Code Points Count
Spacing Modifier Letters 02EA, 02EB 2
Vedic Extensions 1CF2..1CF3 2
Currency Symbols 20A9 1
CJK Symbols and Punctuation 3001, 3002 2
Bopomofo 3105..312F 43
Bopomofo Extended 31A0..31BA 27
CJK Unified Ideographs Extension A 37C3, 3B9D, 3CFD, 3FE0, 44EC, 4A76 6
CJK Unified Ideographs 5344, 55B9, 6ABC, 6FF9, 809E, 80BC, 80E9, 8132, 8159, 841C, 891D, 8C6C, 915E, 9FD4 14
Phags-pa A840..A877 56
Halfwidth and Fullwidth Forms FF01, FF0C, FF0E, FF1A, FF1B, FF1F 6
CJK Unified Ideographs Extension B 200DD, 20164, 20BBF, 20C02, 20CED, 21D4C, 2278B, 23AB8, 2459B, 24A7D, 24FB9, 25ED7, 2677C, 26B4C, 26C21, 26CBE, 26E3D, 28834, 289A1, 289C0, 28A0F, 28B46 22
CJK Unified Ideographs Extension C 2A8FB, 2A917, 2AA30 3
CJK Unified Ideographs Extension E 2BA52, 2BD77, 2C494, 2C72F, 2C734, 2CB38 6
CJK Unified Ideographs Extension F 2D23B, 2E83A 2
Total 192

Unicode 12.1

Unicode 12.1 was released on May 7, 2019. It encoded 137,929 characters, adding only 1 new character.

Extended blocks

  • (total 1 character) was added to Enclosed CJK Letters and Months. (U+32FF)

Unicode 13.0

Unicode 13.0 was released on March 10, 2020. It encoded 143,859 characters, adding 5930 new characters.

New blocks

  • Yezidi (U+10E80-U+10EBF), containing 47 characters, was added.
  • Chorasmian (U+10FB0-U+10FDF), containing 28 characters, was added.
  • Dives Akuru (U+11900-U+1195F), containing 72 characters, was added.
  • Lisu Supplement (U+11FB0-U+11FBF), containing 1 character, was added.
  • Khitan Small Script (U+18B00-U+18CFF), containing 470 characters, was added.
  • Tangut Supplement (U+18D00-U+18D08), containing 9 characters, was added.
  • Symbols for Legacy Computing (U+1FB00-U+1FBFF), containing 212 characters, was added.
  • CJK Unified Ideographs Extension G (U+30000-U+3134F), containing 4939 characters, was added.

Extended blocks

  • (total 10 characters) were added to Arabic Extended-A. (U+08BE-U+08C7)
  • (total 1 character) was added to Oriya. (U+0B55)
  • (total 1 character) was added to Malayalam. (U+0D04)
  • (total 1 character) was added to Sinhala. (U+0D81)
  • (total 2 characters) were added to Combining Diacritical Marks Extended. (U+1ABF-U+1AC0)
  • (total 1 character) was added to Miscellaneous Symbols and Arrows. (U+2B97)
  • (total 3 characters) were added to Supplemental Punctuation. (U+2E50-U+2E52)
  • (total 5 characters) were added to Bopomofo Extended. (U+31BB-U+31BF)
  • (total 10 characters) were added to CJK Unified Ideographs Extension A. (U+4DB6-4DBF)
  • (total 13 characters) were added to CJK Unified Ideographs. (U+9FF0-U+9FFC)
  • (total 6 characters) were added to Latin Extended-D. (U+A7C7-U+A7CA and U+A7F5-U+A7F6)
  • (total 1 character) was added to Syloti Nagri. (U+A82C)
  • (total 4 characters) were added to Latin Extended-E. (U+AB68-U+AB6B)
  • (total 1 character) was added to Ancient Symbols. (U+1019C)
  • (total 1 character) was added to Chakma. (U+11147)
  • (total 2 characters) were added to Sharada. (U+111CE and U+111CF)
  • (total 3 characters) were added to Newa. (U+1145A and U+11460-U+11461)
  • (total 3 characters) were added to Ideographic Symbols and Punctuation. (U+16FE4 and U+16FF0-U+16FF1)
  • (total 13 characters) were added to Tangut Components. (U+18AF3-U+18AFF)
  • (total 7 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10D-U+1F10F, U+1F16D-1F16F and U+1F1AD)
  • (total 4 characters) were added to Transportation and Map Symbols. (U+1F6D6-U+1F6D7 and U+1F6FB-U+1F6FC)
  • (total 2 characters) were added to Supplemental Arrows-C. (U+1F8B0-U+1F8B1)
  • (total 10 characters) were added to Supplemental Symbols and Pictographs. (U+1F90C, U+1F972, U+1F977-U+1F978, U+1F9A3-U+1F9A4, U+1F9AB-U+1F9AD and U+1F9CB)
  • (total 41 characters) were added to Symbols and Pictographs Extended-A. (U+1FA74, U+1FA83-U+1FA86, U+1FA96-U+1FAA8, U+1FAB0-U+1FAB6, U+1FAC0-U+1FAC2 and U+1FAD0-U+1FAD6)
  • (total 7 characters) were added to CJK Unified Ideographs Extension B. (U+2A6D7-U+2A6DD)

Glyph Changes

Block Name Code Points Count
Tagalog 1700..170C, 170E..1714 20
Mongolian 1834, 1871, 1878 3
Sundanese 1BAB 1
Currency Symbols 20BF 1
CJK Radicals Supplement 2E80..2E99, 2E9B..2EF3 115
Kangxi Radicals 2F00..2FD5 214
CJK Unified Ideographs Extension A 3472, 38C7, 3DB8, 3FE0, 440B, 46E9 6
CJK Unified Ideographs 53FD, 6146, 6711, 671C, 6721, 6725, 6BD2, 7B9A, 87CE, 8956, 93BF, 9B97 12
Latin Extended-D A764..A765 2
Phags-pa A86D 1
Tangut 175F6, 17F0D, 17F8A, 17FA5, 180D6, 18139, 18147, 184F1, 18736 9
Tangut Components 18843, 18856, 1888C, 1890A, 18915, 1893B 6
Adlam 1E900..1E94A, 1E950..1E959, 1E95E..1E95F 71
Miscellaneous Symbols and Pictographs 1F3B1 1
Supplemental Symbols and Pictographs 1F995..1F998, 1F99B..1F99E, 1F9B0..1F9B3, 1F9E7 13
CJK Unified Ideographs Extension B 20219, 21249, 21827, 22C3A, 2327B, 23496, 2355E, 2363B, 236ED, 23839, 23FD5, 24261, 24726, 248F2, 2548E, 26657, 26C9E, 26FE1, 27334, 27C0E, 27CEF, 2A38C 22
CJK Unified Ideographs Extension C 2AED5, 2AEF3, 2AF76, 2B09F, 2B1C3, 2B1E5 6
CJK Unified Ideographs Extension E 2B83C, 2B8D9..2B8DA, 2B96F, 2BBD7, 2BD61, 2BE4A, 2BF1D, 2BF9D, 2C0B8, 2C142, 2C176, 2C316, 2C3FB, 2C402, 2C7AC, 2C82C, 2C83A, 2C9A1, 2CC88, 2CD68 21
CJK Unified Ideographs Extension F 2DC09, 2DE4A, 2EB7E, 2EB89 4
CJK Compatibility Ideographs Supplement 2F83B, 2F878, 2F8D6..2F8D7, 2F8DA, 2F8F0, 2F984, 2FA02 8
Total 536

Unicode 14.0

Unicode 14.0 was released on September 14, 2021. It encoded 144,697 characters, added 838 new characters.

New blocks

  • Arabic Extended-B (U+0870-U+089F), containing 41 characters, was added.
  • Vithkuqi (U+10570-U+105BF), containing 70 characters, was added.
  • Latin Extended-F (U+10780-U+107BF), containing 57 characters, was added.
  • Old Uyghur (U+10F70-U+10FAF), containing 26 characters, was added.
  • Unified Canadian Aboriginal Syllabics Extended-A (U+11AB0-U+11ABF), containing 16 characters, was added.
  • Cypro-Minoan (U+12F90-U+12FFF), containing 99 characters, was added.
  • Tangsa (U+16A70-U+16ACF), containing 89 characters, was added.
  • Kana Extended-B (U+1AFF0-U+1AFFF), containing 13 characters, was added.
  • Znamenny Musical Symbols (U+1CF00-U+1CFFF), containing 185 characters, was added.
  • Latin Extended-G (U+1DF00-U+1DFFF), containing 31 characters, was added.
  • Toto (U+1E290-U+1E2BF), containing 31 characters, was added.
  • Ethiopic Extended-B (U+1E7E0-U+1E7FF), containing 28 characters, was added.

Extended blocks

  • (total 1 character) was added to Arabic. (U+061D)
  • (total 12 characters) were added to Arabic Extended-A. (U+08B5 and U+08C8-U+08D2)
  • (total 2 characters) were added to Telugu. (U+0C3C and U+0C5D)
  • (total 1 character) was added to Kannada. (U+0CDD)
  • (total 3 characters) were added to Tagalog. (U+170D, U+1715 and U+171F)
  • (total 1 character) was added to Mongolian. (U+180F)
  • (total 14 characters) were added to Combining Diacritical Marks Extended. (U+1AC1-U+1ACE)
  • (total 3 characters) were added to Balinese. (U+1B4C and U+1B7D-U+1B7E)
  • (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFA)
  • (total 1 character) was added to Currency Symbols. (U+20C0)
  • (total 2 characters) were added to Glagolitic. (U+2C2F and U+2C5F)
  • (total 11 characters) were added to Supplemental Punctuation. (U+2E53-U+2E5D)
  • (total 3 characters) were added to CJK Unified Ideographs. (U+9FFD-U+9FFF)
  • (total 13 characters) were added to Latin Extended-D. (U+A7C0-U+A7C1, U+A7D0-U+A7D1, U+A7D3, U+A7D5, U+A7D6-U+A7D9 and U+A7F2-U+A7F4)
  • (total 20 characters) were added to Arabic Presentation Forms-A. (U+FBC2, U+FD40-U+FD4F, U+FDCF and U+FDFE-U+FDFF)
  • (total 6 characters) were added to Brahmi. (U+11070-U+11075)
  • (total 1 character) was added to Khaiti. (U+110C2)
  • (total 1 character) was added to Takri. (U+116B9)
  • (total 7 characters) were added to Ahom. (U+11740-U+11746) The block was expanded from (U+11700-U+1173F) to (U+11700-U+1174F)
  • (total 4 characters) were added to Kana Extended-A. (U+1B11F-U+1B122)
  • (total 2 characters) were added to Musical Symbols. (U+1D1E9-U+1D1EA)
  • (total 3 characters) were added to Transportation and Map Symbols. (U+1F6DD-U+1F6DF)
  • (total 1 character) was added to Geometric Shapes Extended. (U+1F7F0)
  • (total 2 characters) were added to Supplemental Symbols and Pictographs. (U+1F979 and U+1F9CC)
  • (total 31 characters) were added to Symbols and Pictographs Extended-A. (U+1FA7B-U+1FA7C, U+1FAA9-U+1FAAC, U+1FAB7-U+1FABA, U+1FAC3-U+1FAC5, U+1FAD7-U+1FAD9, U+1FAE0-U+1FAE7 and U+1FAF0-U+1FAF6)
  • (total 2 characters) were added to CJK Unified Ideographs Extension B. (U+2A6DE-U+2A6DF)
  • (total 4 characters) were added to CJK Unified Ideographs Extension C. (U+2B735-U+2B738)

Glyph Changes

  • The Tone Six Glyph (U+0184-U+0185 Ƅƅ) was changed the top to a diagonal stroke downwards.

And Here Are List of Other Changes:

Block Name Code Points Count
Latin Extended-B 0184..0185 2
Arabic 0674..0678, 06C5, 06C7, 06FE 8
Letterlike Symbols 210B, 2110, 2112, 211B, 212C, 2130..2131, 2133 8
Enclosed Alphanumerics 2460..24FF 160
Dingbats 2776..2793 30
CJK Symbols and Punctuation 3001..3029, 3030..303D, 303F 56
CJK Strokes 31C0..31E3 36
Katakana Phonetic Extensions 31F0..31FF 16
Enclosed CJK Letters and Months 3200..321E, 3220..32FF 255
CJK Compatibiity 3300..33FF 256
CJK Unified Ideographs Extension A 3777, 3B3F 2
CJK Unified Ideographs 5DD5, 652C, 6AC0 3
Arabic Presentation Forms-A FBD7..FBD8, FBDD, FBE0..FBE1 5
Vertical Forms FE10..FE19 10
CJK Compatibiity Forms FE30..FE4F 32
Small Form Variants FE50..FE52, FE54..FE66, FE68..FE6B 26
Halfwidth and Fullwidth Forms FF01..FF9F, FFA1..FFBE, FFC2..FFC7, FFCA..FFCF, FFD2..FFD7, FFDA..FFDC, FFE0..FFE6, FFE8..FFEE 225
Egyptian Hieroglyphs 1300A, 13017, 1302D, 13032, 13034..13035, 13037..13038, 1303A..1303E, 1304E..1304F, 13055, 13057, 13068, 1309A, 130D2, 130D5, 130F6, 130FE, 13192, 1325F, 13267, 1326A, 13281, 13297, 1329E, 132B4, 132C1, 132E6, 13304, 1331F, 13378..1337B, 1337D..1337E, 133F3, 133FA..13403, 1340D, 13417, 1342B 55
Mathematical Alphanumeric Symbols 1D49C, 1D49E..1D49F, 1D4A2, 1D4A5..1D4A6, 1D4A9..1D4AC, 1D4AE..1D4B5 18
Enclosed Alphanumeric Supplement 1F100..1F1AD, 1F1E6..1F1FF 200
Enclosed Ideographic Supplement 1F200..1F202, 1F210..1F23B, 1F240..1F248, 1F250..1F251, 1F260..1F265 64
Supplemental Symbols and Pictographs 1F930 1
CJK Unified Ideographs Extension B 22ADC, 230F2, 25B27, 26F28 4
Total 1472

Unicode 15.0

Unicode 15.0 was released on September 13, 2022. It encoded 149,186 characters, added 4489 new characters.

New blocks

  • Arabic Extended-C (U+10EC0-U+10EFF), containing 3 characters, was added
  • Devanagari Extended-A (U+11B00-U+11B5F), containing 10 characters, was added
  • Kawi (U+11F00-U+11F5F), containing 86 characters, was added
  • Kaktovik Numerals (U+1D2C0-U+1D2DF), containing 20 characters, was added
  • Cyrillic Extended-D (U+1E030-U+1E08F), containing 63 characters, was added
  • Nag Mundari (U+1E4D0-U+1E4FF), containing 42 characters, was added
  • CJK Unified Ideographs Extension H (U+31350-U+323AF), containing 4192 characters, was added

Extended blocks

  • (total 1 character) was added to Lao. (U+0ECE)
  • (total 1 character) was added to Kannada. (U+0CF3)
  • (total 3 characters) were added to Khojki. (U+1123F-U+11241)
  • (total 1 character) was added to Egyptian Hieroglyphs
  • (total 29 characters) were added to Egyptian Hieroglyph Format Controls. (U+13439-U+13455). The block was expanded from (U+13430-U+1343F) to (U+13430-U+1345F).
  • (total 2 characters) were added to Small Kana Extension. (U+1B132, U+1B155)
  • (total 6 characters) were added to Latin Extended-G. (U+1DF25-U+1DF2A)
  • (total 1 character) was added to Transport and Map Symbols. (U+1F6DC)
  • (total 1 character) was be added to Geometric Shapes Extended. (U+1F7D9)
  • (total 8 characters) were added to Alchemical symbols. (U+1F774-U+1F776, U+1F77B-U+1F77F)
  • (total 20 characters) were added to Symbols and Pictographs Extended-A. (U+1FA75-U+1FA77, U+1FA87-U+1FA88, U+1FAAD-U+1FAAF, U+1FABB-U+1FABF, U+1FACE-U+1FACF, U+1FADA-U+1FADB, U+1FAE8 and U+1FAF7-U+1FAF8)
  • (total 1 character) was added to CJK Unified Ideographs Extension C

Glyph Changes

  • The closed open E and the reversed closed open E (U+25E, U+29A) got a more Latin-like glyph rather than the Greek-like one.
  • The Carrier Dlu and Carrier Tlo got new glyphs, as well as the Ojibway Sh. (U+1628, U+163B, U+18DB)
  • The Sundanese Final M got a new glyph. (U+1BBF)
  • The lowercase Insular G (U+1D79) got the s-shaped glyph to match the capital Insular G (U+A77D)
  • The OCR amount of check got a new glyph (U+2447)
  • The Cyrillic Multiocular O (U+A66E ꙮ) got a 10-eyed glyph rather than previous 7-eyed glyph.
  • The Old Turkic Orkhon Ot (U+10C47) got a slightly smaller glyph.
  • Several Egyptian hieroglyphs got different glyphs.
  • A small Khitan character (U+18CCA) got a slightly different glyph.
  • Wancho and Alchemical Symbols got a new font updated.

And Here Are List of Other Changes:

Block Name Code Points Count
IPA Extensions 025E, 029A 2
United Canadian Aboriginal Syllabics 144B, 14D1, 1506, 15C0..15C3, 15E8..15EE, 1601, 1604..1607, 160A..160D, 1614..162D, 1630..163F, 1646..1647, 165A 66
United Canadian Aboriginal Syllabics Extended 18DB, 18EC, 18F1..18F2, 18F5 5
Sundanese 1BBF 1
Optical Character Recognition 2447 1
CJK Unified Ideographs Extension A 34DC, 3BF6, 3C43, 48B4, 4DBE 5
CJK Unified Ideographs 585F, 5F50, 6BC0, 7BC9, 833E 5
Cyrillic Extended-B A66E 1
Old Turkic 10C47 1
Egyptian Hieroglyphs various (new standardized variation sequences) 94
Khitan Small Script 18CCA 1
Wancho (font update) 1E2C0..1E2F9, 1E2FF 59
Alchemical Symbols (font update) 1F700..1F773 116
CJK Unified Ideographs Extension B 20048, 20A1C, 2143F, 21A5F, 21C08, 21FBA, 22ACF, 23392, 238A7, 23D8F, 23F4E, 25D20, 26E30, 27B48, 27C4F, 28633, 28B02, 28E9A, 29760, 2A60F 20
CJK Unified Ideographs Extension C 2B249 1
CJK Unified Ideographs Extension E 2BB37, 2BD7D, 2C151, 2C1E0, 2C2D6, 2C5CA, 2C810, 2CD34 8
CJK Unified Ideographs Extension F 2CF4E, 2D25D, 2D3EC, 2D6A7, 2D7BA, 2D979, 2DA74, 2DA97, 2DC13, 2DDC0, 2DF10, 2DF78, 2E05A, 2E0AE, 2E516, 2E640, 2E680, 2EA63 18
CJK Compatibility Ideographs Supplement 2F804, 2F805, 2F833, 2F835, 2F84C, 2F84F, 2F852, 2F855, 2F887, 2F88B, 2F899, 2F8A0, 2F8A6, 2F8A7, 2F8AD, 2F8B1, 2F8B4, 2F8B7, 2F8BA, 2F8D0, 2F8E0..2F8E2, 2F8E5, 2F8E6, 2F8FE, 2F900, 2F901, 2F907, 2F912, 2F922, 2F926, 2F936, 2F938, 2F94E, 2F959, 2F95F, 2F96C, 2F99F, 2F9B8, 2F9BA, 2F9D3, 2F9DB, 2F9DC, 2F9E8, 2F9EA, 2F9EE, 2FA00, 2FA0D, 2FA1B 50
CJK Unified Ideographs Extension G 302FC, 30723, 30A6D, 30CF7, 30DBF, 31006, 3105D 7
Total 461

Note

Edit the entire page to have the "Glyph Changes" so we know which characters had changed their appearance over the years and describe what changes the character has received.

Future Versions

This is a section where you can add any upcoming Unicode characters that have been confirmed to be in a future update of The Unicode Standard. Do not add any false information as this will confuse people into thinking that they are official.

New blocks

  • Northen Palaeohispanic (U+10200-U+1023F), containing 58 characters will be added
  • Southern Palaeohispanic (U+10240-U+1027F), containing 55 characters will be added
  • Todhri (U+105C0-U+105FF), containing 52 characters will be added
  • Garay (U+10D40-U+10D8F), containing 69 characters will be added
  • Tulu-Tigalari (U+11380-U+113FF), containing 78 characters will be added
  • Myanmar Extended-C (U+116D0-U+116FF), containing 20 characters will be added
  • Sunuwar (U+11BC0-U+11BFF), containing 44 characters will be added
  • Gurung Khema (U+16100-U+1613F), containing 58 characters will be added
  • Rma (U+16140-U+1617F), containing 61 characters will be added
  • Kirat Rai (U+16D40-U+16D7F), containing 58 characters will be added
  • Buginese Supplement (U+16EA0-U+16EFF), containing 8 characters will be added
  • Symbols for Legacy Computing Supplement (U+1CC00-U+1CEBF), containing 686 characters will be added.
  • Kodo Incense Linear Patterns (U+1DAB0-U+1DABF), containing 12 characters will be added.
  • Western Cham (U+1E200-U+1E26F), containing 105 characters will be added.
  • Yo Lai Tay (U+1E6C0-U+1E6FF), containing 55 characters will be added.
  • Symbols and Pictographs Extended-B (U+1FD00-U+1FDFF), containing unknown characters will be added.

Extended blocks

  • An Arabic Pepet (total 1 character) will be added to Arabic Extended-B. (U+0897)
  • Kannada and Telugu Archaic Shrii (total 2 characters) will be added to Kannada and Telugu. (U+0C5C, U+0CDC).
  • 3 diacritics (total 3 characters) will be added to Balinese. (U+1B4E-U+1B4F, U+1B7F)
  • Cyrillic letter Tje and Zhe with stroke (total 4 characters) will be added to Cyrillic Extended-C. (U+1C89-U+1C8A, U+1C8D-U+1C8E)
  • 3 special symbols for delete (total 3 characters) will be added to Control Pictures. (U+2427-U+2429)
  • 5 additional ideographic description characters will be added to Ideographic Description Characters. (U+2FE0-U+2FE4). The block will be expanded from (U+2FF0-U+2FFF) to (U+2FE0-U+2FFF)
  • Latin capital Rams Horn, S with Diagonal Stroke, I with Bowl and letters for adyghian (total 18 characters) will be added to Latin Extended-D. (U+A7CB-U+A7CF, U+A7E1-U+A7E6, U+A7E8-U+A7EA, U+A7EC-U+A7ED, U+A7EF, U+A7F1)
  • Tai Don letters (total 22 characters) will be added to Tai Viet. (U+AAC3-U+AADA)
  • Letters used in Benjamin Franklin's phonetic alphabet (total 4 characters) will be added to Latin Extended-E. (U+AB6C-U+AB6F)
  • Ligature Ve (total 1 character) will be added to Alphabetic Presentation Forms. (U+FE07)
  • 3 letters with two dots vertically below, Raised Small Alef, Small Low Noon, Small Yeh Barree with Two Dots Below and Combining Alef Overlay (total 7 characters) will be added to Arabic Extended-C. (U+10EC2-U+10EC4, U+10EC9-U+10EFC)
  • Letters for old Bashkir (total 18 characters) will be added to Cyrillic Extended-D. (U+1E06E-U+1E07F)
  • A rightwards arrow with hook and arrows for legacy computing (total 10 characters) will be added to Supplemental Arrows-C. (U+1F8B2-U+1F8BB)
  • Graphic shapes for legacy computing and hexadecimal digits for Chinese legacy (total 43 characters) will be added to Symbols for Legacy Computing. (U+1FBCB-U+1FBEF, U+1FBFA-U+1FBFF)

Characters with no assigned codepoints

  • Half Face of the moon will be added.
  • Man and Woman suit levitating will be added. (For other platforms, not just Facebook and Twitter)
  • Six new faces emoji will be added. (Clever Face, Third Eye Face, Daydreaming Face, Baffled Face, Empathetic Face and Scolding Face)
  • Two new hands emoji will be added. (Gun Fingers and Loser Hand)
  • A train conductor emoji will be added.
  • Seven animals emoji will be added. (Black Swan, Dragonfly, Firefly, Axolotl, Wombat, Sea Urchin and Starfish)
  • Seven fruits and vegetables emoji will be added. (Dragon Fruit, Leek, Artichoke, Lychee, Beet, Fig and Guava)
  • Five foods emoji will be added. (Bag of Chips, Cotton Candy, Popsicle, Nachos and Cinnamon Rolls)
  • New wine emoji variants will be added. (White)
  • A pinecone emoji will be added.
  • A seaweed emoji will be added.
  • Aurora and Earthquake Icon emoji will be added.
  • Food Cart will be added.
  • Two colored hearts emoji will be added. (Indigo and Rainbow)
  • A stool bar emoji will be added.
  • Barrel will be added.
  • A calculator emoji will be added.
  • A marigold and a pot of gold will be added.
  • Pan African Flag emoji will be added.
  • Two new weapons emoji will be added. (Mace and Whip)
  • Three body organs emoji will be added. (Stomach, Liver and Kidney)
  • No cars and park ranger emoji will be added.

Roadmap Blocks

This is a section where present proportional maps of a proposed allocations to Unicode and ISO/IEC 10646. Italic indicates scripts for which detailed proposals have not yet been written.

Blocks

  • Shavian Quikscript (U+103E0-U+103FF)
  • Rejang Extended (U+107C0-U+107FF)
  • Proto-Sinaitic (U+108B0-U+108DF)
  • Sidetic (U+10940-U+1095F)
  • Numidian (U+10960-U+1097F)
  • Balti (U+10AA0-U+10ABF)
  • Book Pahlavi (U+10BB0-U+10BDF)
  • Baburi (U+10BE0-U+10BFF)
  • Byblos (U+10D80-U+10DFF)
  • Landa (U+11250-U+1127F)
  • Tani Lipi (U+114E0-U+114FF)
  • Ranjana (U+11500-U+1157F)
  • Zou (U+11750-U+117AF)
  • Pyu (U+117B0-U+117FF)
  • Sirmauri (U+11850-U+1188F)
  • Vateluttu (U+11960-U+1199F)
  • Sharada Extended (U+11B60-U+11B7F)
  • Tolong Siki (U+11B80-U+11BBF)
  • Balti-B (U+11CC0-U+11CFF)
  • Leke (U+11DB0-U+11DEF)
  • Tocharian (U+11E00-U+11E6F)
  • Khotanese (U+11E70-U+11ECF)
  • Pallava (U+11F60-U+11FAF)
  • Proto-Cuneiform (U+12580-U+12DFF)
  • Indus (U+12E00-U+12F8F)
  • Egyptian Hieroglyphs Extended-A (U+13480-U+143FF)
  • Egyptian Hieroglyphs Extended-B (U+14680-U+151FF)
  • Mayan Hieroglyphs (U+15500-U+15AFF)
  • Lampung (U+15B00-U+15B3F)
  • Kerinci (U+15B40-U+15B6F)
  • Mandombe (U+15B80-U+15FFF)
  • Cirth (U+16000-U+1607F)
  • Tengwar (U+16080-U+160FF)
  • Moon (U+161A0-U+161FF)
  • BlIssymbols (U+16200-U+167FF)
  • Woleai (U+16B90-U+16BFF)
  • Kpelle (U+16C00-U+16C7F)
  • Afaka (U+16C80-U+16CCF)
  • Tangsa-Khimhun (U+16CD0-U+16CFF)
  • Tikamuli (U+16D00-U+16D3F)
  • Lontara Bilang-Bilang (U+16D80-U+16DBF)
  • Kulitan (U+16DD0-U+16DFF)
  • Mwangwego (U+16E00-U+16E3F)
  • Bopomofo Extended-A (U+16FA0-U+16FAF)
  • Kanbun Extended-A (U+16FB0-U+16DFF)
  • Khitan Ideographs (U+18D80-U+195FF)
  • Jurchen (U+19600-U+19B9F)
  • Pau Cin Hau Syllabary (U+19E00-U+1A75F)
  • Kaida (U+1A780-U+1A7FF)
  • Naxi Dongba (U+1A800-U+1ACFF)
  • Naxi Geba (U+1AD00-U+1AFCF)
  • Kana Extended-C (U+1AFD0-U+1AFEF)
  • Shuishu Logograms (U+1B300-U+1B5FF)
  • Lisu Syllabic Script (U+1B600-U+1B9FF)
  • Pitman Shorthands (U+1BCB0-U+1BCFF)
  • Proto-Elamite (U+1BD00-U+1C37F)
  • Linear-Elamite (U+1C380-U+1C4FF)
  • Chinese Musical Symbols (U+1D250-U+1D2BF)
  • Mathematical Alphanumeric Symbols Supplement (U+1D380-U+1D3FF)
  • Jianzi Format Controls (U+1DAE0-U+1DAFF)
  • Jianzi Musical Symbols (U+1DB00-U+1DC8F)
  • Eebee Hmong (U+1E150-U+1E1FF)
  • Loma (U+1E300-U+1E41F)
  • Bagam (U+1E420-U+1E4CF)
  • Pungchen (U+1E500-U+1E52F)
  • Pungchung (U+1E530-U+1E55F)
  • Marchung (U+1E560-U+1E59F)
  • Brusha (U+1E5A0-U+1E5CF)
  • Ol Onal (U+1E5D0-U+1E5FF)
  • Chola (U+1E600-U+1E65F)
  • Chalukya (Box-Headed) (U+1E660-U+1E6BF)
  • Beria (U+1E700-U+1E72F)
  • Persian Siyaq Numbers (U+1EC00-U+1EC7F)
  • Diwani Siyaq Numbers (U+1ECC0-U+1ECFF)
  • Arabic Supplemental Symbols (U+1EF00-U+1EF3F)
  • Extended Pictographic Characters (U+1FC00-U+1FFFF)
  • Seal Script (U+32400-U+352FF)
  • Oracle Bone Script (U+35400-36BFF)