Unicode/Versions

From Wikibooks, open books for an open world
< Unicode
Jump to navigation Jump to search
Unicode Standard
Discussion Character Reference
(edit template)</small=$>

This page is about each version specification, and the differences between the versions.

Unicode 1.0[edit]

Unicode 1.0 was the first version of Unicode, released October 1991. It encoded 7,161 characters.

“Blocks”[edit]

This version of Unicode did not formally group characters in blocks. But in comparison with version 2.0, the following “blocks” were available: U+0000-U+FFFD 51 Blocks

  • Basic Latin, containing 128 characters.
  • Latin-1 Supplement, containing 128 characters.
  • Latin Extended-A, containing 127 characters.
  • Latin Extended-B, containing 113 characters.
  • IPA Extensions, containing 89 characters.
  • Spacing Modifier Letters, containing 57 characters.
  • Combining Diacritical Marks, containing 66 characters.
  • Greek and Coptic, containing 112 characters.
  • Cyrillic, containing 192 characters.
  • Armenian, containing 84 characters.
  • Hebrew, containing 52 characters.
  • Arabic, containing 169 characters.
  • Devanagari, containing 104 characters.
  • Bengali, containing 89 characters.
  • Gurmukhi, containing 74 characters.
  • Gujarati, containing 75 characters.
  • Oriya, containing 78 characters.
  • Tamil, containing 61 characters.
  • Telugu, containing 80 characters.
  • Kannada, containing 80 characters.
  • Malayalam, containing 78 characters.
  • Thai, containing 92 characters.
  • Lao, containing 70 characters.
  • Tibetan, containing 71 characters.
  • Georgian, containing 78 characters.
  • General Punctuation, containing 67 characters.
  • Superscripts and Subscripts, containing 28 characters.
  • Currency Symbols, containing 11 characters.
  • Combining Marks for Symbols, containing 18 characters.
  • Letterlike Symbols, containing 57 characters.
  • Number Forms, containing 48 characters.
  • Arrows, containing 91 characters.
  • Mathematical Operators, containing 242 characters.
  • Miscellaneous Technical, containing 43 characters.
  • Control Pictures, containing 37 characters.
  • Optical Character Recognition, containing 11 characters.
  • Enclosed Alphanumerics, containing 139 characters.
  • Forms, containing 128 characters.
  • Block Elements, containing 22 characters.
  • Geometric Shapes, containing 79 characters.
  • Miscellaneous Symbols, containing 106 characters.
  • Dingbats, containing 160 characters.
  • CJK Symbols and Punctuation, containing 56 characters.
  • Hiragana, containing 90 characters.
  • Katakana, containing 90 characters.
  • Bopomofo, containing 40 characters.
  • Hangul Compatibility Jamo, containing 94 characters.
  • Kanbun, containing 16 characters.
  • Enclosed CJK Letters and Months, containing 191 characters.
  • CJK Compatibility, containing 187 characters.
  • Hangul Syllables, containing 2,350 characters.
  • Private Use Area, reserved for 5,632 characters.
  • CJK Compatibility Forms, containing 28 characters.
  • Small Form Variants, containing 26 characters.
  • Arabic Presentation Forms-B, containing 140 characters.
  • Halfwidth and Fullwidth Forms, containing 216 characters.
  • Specials, containing 1 character.

Unicode 1.0.1[edit]

Unicode 1.0.1 was released June 1992. It encoded 28,365 characters.

New blocks[edit]

  • CJK Unified Ideographs, containing 20,902 Han Ideographs for Chinese, Japanese and Korean was added.
  • CJK Compatibility Ideographs, containing 302 Han Ideographs for compatibility with existing character sets was added.

Unicode 1.1[edit]

Unicode 1.1 was released June 1993. It encoded 34,233 characters, and finalized the long anticipated Han Unification.

New blocks[edit]

  • Hangul Jamo, containing 240 jamos for the Hangul script, was added.
  • Latin Extended Additional, containing 245 precomposed characters for transliteration and Vietnamese, was added.
  • Greek Extended, containing 233 precomposed characters for polytonic Greek, was added.
  • Alphabetic Presentation Forms, containing 57 precomposed characters and ligatures, was added.
  • Arabic Presentation Forms-A, containing 593 combinations of Arabic letters, was added.
  • Combining Half Marks, containing 4 halves of diacritical marks, was added.

Removed blocks[edit]

  • Tibetan, containing 71 letters for the Tibetan script, was removed from the Unicode standard.

Extended blocks[edit]

  • The long S (ſ) (total 1 character) was added to Latin Extended-A.
  • The Hungarian Dz, characters for transliteration purposes and precomposed characters with double grave and inverted breve (total 35 characters) were added to Latin Extended-B.
  • Diacritics for polytonic Greek and double width diacritics (total 6 characters) were added to Combining Diacritical Marks.
  • Compatibility characters now deprecated (total 5 characters) were added to Greek and Coptic.
  • Additional characters for non-Slavic languages (total 38 characters) were added to Cyrillic.
  • A ligature of Ech and Yiwn (և) (total 1 character) was added to Armenian.
  • One deprecated compatibility character and several characters for biblical texts (total 25 characters) were added to Arabic.
  • The virama (o੍) (total 1 character) was added to Gurmukhi.
  • The candra O and candra E vowels (total 3 characters) were added to Gujarati.
  • The Ai length mark (oୖ) (total 1 character) was added to Oriya.
  • An undertie, a pair of brackets and six formatting characters (total 9 characters) were added to General Punctuation.
  • Some additional symbols and the complete set of APL functional symbols (total 79 characters) were added to Miscellaneous Technical.
  • A large circle () (total 1 character) was added to Geometric Shapes.
  • The ideographic telegraph line feed separator symbol () (total 1 character) was added to CJK Symbols and Punctuation.
  • Four Katakana letters not in use since 1945 (total 4 characters) were added to Katakana.
  • Ideographic telegraph symbols for the twelve months (total 12 characters) were added to Enclosed CJK Letters and Months.
  • Ideographic telegraph symbols for hours and days and six additional measure units (total 62 characters) were added to CJK Compatibility.
  • Some more space (total 2,304 characters) was added to the Private Use Area.
  • Seven halfwidth geometric shapes (total 7 characters) were added to Halfwidth and Fullwidth Forms.

Unicode 2.0[edit]

Unicode 2.0 was released July 1996. It encoded 38,950 characters, and was the first Unicode version to reserve blocks outside of the Basic Multilingual Plane.

New blocks[edit]

  • Hangul Syllables, containing 11,172 precomposed syllables for the Hangul script, was added.
  • Supplementary Private Use Area-A and Supplementary Private Use Area-B, reserving a total of 131,068 characters for private use, was added.

Reinstated blocks[edit]

  • Tibetan, containing 168 characters for the Tibetan script including religious signs, was readded.

Extended blocks[edit]

  • Cantillation marks for use in religious texts (total 31 characters) were added to Hebrew.
  • The long S with dot above (ẛ) (total 1 character) was added to Latin Extended Additional.
  • The Vietnamese dong (₫) (total 1 character) was added to Currency Symbols.

Unicode 2.1[edit]

Unicode 2.1 was released May 1998. It encoded 38,952 characters, only 2 characters more than the last version.

Extended blocks[edit]

  • The euro sign (€) (total 1 character) was added to Currency Symbols.
  • The object replacement character () (total 1 character) was added to Specials.

Unicode 3.0[edit]

Unicode 3.0 was released September 1999. It was a big update and encoded 49,259 characters.

New blocks[edit]

  • Syriac, containing 71 characters used for writing in Syriac script, was added.
  • Thaana, containing 49 characters used for writing in Thaana script, was added.
  • Sinhala, containing 80 characters for the Sinhala script, was added.
  • Myanmar, containing 78 characters for the Burmese script, was added.
  • Ethiopic, containing 345 syllables and punctuation marks for the Ethiopic script, was added.
  • Cherokee, containing 85 syllables for the Cherokee script, was added.
  • Unified Canadian Aboriginal Syllabics, containing 630 syllables and punctuation marks for writing in aboriginal languages of Canada, was added.
  • Ogham, containing 29 characters for the ancient Ogham script, was added.
  • Runic, containing 81 characters for the Germanic runes, was added.
  • Khmer, containing 103 characters for the Khmer script, was added.
  • Mongolian, containing 155 characters for the classical Mongolian script, was added.
  • Braille Patterns, containing 256 Braille letters, was added.
  • CJK Radicals Supplement, containing 115 non-Kangxi radicals, was added.
  • Kangxi Radicals, containing 214 radicals from the Kangxi dictionary, was added.
  • Ideographic Description characters, used to describe a Han ideograph not available in the font, was added.
  • Bopomofo Extended, containing 24 characters used for phonetic transcription of minority languages of Taiwan, was added.
  • CJK Unified Ideographs Extension A, containing 6,582 additional Han Ideographs, was added.
  • Yi Syllables, containing 1,165 syllables of the modern Yi script, was added.
  • Yi Radicals, containing 50 radicals of Yi Syllables, was added.

Extended blocks[edit]

  • Additional precomposed characters, letters and capital letters of lowercase-only letters (total 30 characters) were added to Latin Extended-B.
  • Extensions for disordered speech (total 5 characters) were added to IPA Extensions.
  • Some additional modifier letters (total 6 characters) were added to Spacing Modifier Letters.
  • Additional diacritics for IPA notation (total 10 characters) were added to Combining Diacritical Marks.
  • Lowercase versions of archaic letters and the Kai symbol (total 5 characters) were added to Greek and Coptic.
  • Nonstandard letters for Macedonian, combining numeral signs and three letters for Kildin Sami (total 12 characters) were added to Cyrillic.
  • The hyphen (֊) (total 1 character) was added to Armenian.
  • Combining hamza and maddah and nine additional Arabic characters (total 12 characters) were added to Arabic.
  • Additional letters and religious symbols (total 25 characters) were added to Tibetan.
  • A narrow no-break space and 6 additional punctuation marks (total 7 characters) were added to General Punctuation.
  • The Kip, Tugrik and Drachma sign (total 3 characters) were added to Currency Symbols.
  • An enclosing screen and an enclosing key (total 2 characters) were added to Combining Diacritical Marks for Symbols.
  • The information symbol and a rotated Q (total 2 characters) were added to Letterlike Symbols.
  • A mirrored Roman capital numeral hundred (Ↄ) (total 1 character) was added to Number Forms.
  • Some additional arrows (total 9 characters) were added to Arrows.
  • Some additional technical symbols, including common keys on a 101 keyboard (total 33 characters) were added to Miscellaneous Technical.
  • Two additional control pictures (total 2 characters) were added to Control Pictures.
  • Squares and circles with quadrants (total 8 characters) were added to Geometric Shapes.
  • Two Syriac crosses and a signature mark (total 3 characters) were added to Miscellaneous Symbols.
  • Three Hangzhou numerals and a variation indicator (total 4 characters) were added to CJK Symbols and Punctuation.
  • An additional Hebrew ligature (יִ) (total 1 character) was added to Alphabetic Presentation Forms.
  • Three additional control characters for ruby markup (total 3 characters) were added to Specials.

Unicode 3.1[edit]

Unicode 3.1 was released March 2001. It encoded 94,205 characters and mainly focused on blocks outside of the Basic Multilingual Plane.

New blocks[edit]

  • Old Italic, containing 35 letters for the Etruscan script, was added.
  • Gothic, containing 27 letters for the Gothic script, was added.
  • Deseret, containing 76 letters for the constructed Deseret script, was added.
  • Byzantine Musical Symbols, containing 246 symbols for musical notation in Byzantine, was added.
  • Musical Symbols, containing 219 characters for current musical notation, was added.
  • Mathematical Alphanumeric Symbols, containing 991 Latin and Greek letters in serif, sans-serif, bold, italic, double-struck, script and Fraktur/Blackletter, was added.
  • CJK Unified Ideographs Extension B, containing 42,711 additional Chinese Ideographs, was added.
  • CJK Compatibility Ideographs Supplement, containing 542 additional Chinese Ideographs for compatibility purposes, was added.
  • Tags, containing 97 language tags, was added.

Extended noncharacters[edit]

  • The Noncharacters range: U+FDD0..U+FDEF were added to Arabic Presentation Forms-A.

Extended blocks[edit]

  • The capital Theta symbol and the Lunate Epsilon symbol (total 2 characters) were added to Greek and Coptic.

Unicode 3.2[edit]

Unicode 3.2 was released March 2002. It encoded 95,221 characters.

New blocks[edit]

  • Cyrillic Supplement, containing 16 characters used for the Komi language, was added.
  • Tagalog, containing 20 characters for the Baybayin script, was added.
  • Hanunoo, containing 23 characters and punctuation for the Hanunoo script, was added.
  • Buhid, containing 20 characters for the Buhid script, was added.
  • Tagbanwa, containing 18 characters for the Tagbanwa script, was added.
  • Miscellaneous Mathematical Symbols-A, containing 28 symbols used in math notation, was added.
  • Supplemental Arrows-A, containing 16 additional arrows, was added.
  • Supplemental Arrows-B, containing 128 special arrows, was added.
  • Miscellaneous Mathematical Symbols-B, containing 128 additional mathematical symbols, was added.
  • Supplemental Mathematical Operators, containing 256 additional mathematical operators, was added.
  • Katakana Phonetic Extensions, containing 16 Katakana letters used for Ainu, was added.
  • Variation Selectors, containing 16 symbols used for indicating variations, was added.

Extended blocks[edit]

  • The capital letter N with long right leg (Ƞ) (total 1 character) was added to Latin Extended-B.
  • The combining grapheme joiner and combining Latin letters used in medieval texts (total 14 characters) were added to Combining Diacritical Marks.
  • The Qoppa and a reversed lunate epsilon symbol (total 3 characters) were added to Greek and Coptic.
  • Four additional letters used for the Kildin Sami language (total 8 characters) were added to Cyrillic.
  • A dotless Beh and a dotless Qaf (total 2 characters) were added to Arabic.
  • The letter Naa (ޱ) (total 1 character) was added to Thaana.
  • The letters Yn and Elifi (total 2 characters) were added to Georgian.
  • Some additional punctuation marks and control characters (total 12 characters) were added to General Punctuation.
  • A superscript i (ⁱ) (total 1 character) was added to Superscripts and Subscripts.
  • The old penny sign and the peso sign (total 2 characters) were added to Currency Symbols.
  • Some additional combining characters (total 7 characters) were added to Combining Diacritical Marks for Symbols.
  • Some double-struck and reversed/turned letters (total 15 characters) were added to Letterlike Symbols.
  • Some additional arrows (total 12 characters) were added to Arrows.
  • Some additional mathematical operators (total 14 characters) were added to Mathematical Operators.
  • Variable-width and additional symbols (total 53 characters) were added to Miscellaneous Technical.
  • Black and double circled numerals (total 20 characters) were added to Enclosed Alphanumerics.
  • Quadrant elements (total 10 characters) were added to Block Elements.
  • Some additional triangles and squares (total 8 characters) were added to Geometric Shapes.
  • Shogi pieces ,recycling symbols, dices and dotted circles (total 24 characters) were added to Miscellaneous Symbols.
  • Additional parenthesis (total 14 characters) were added to Dingbats.
  • Three additional marks (total 3 characters) were added to CJK Symbols and Punctuation.
  • A digraph and two additional characters (total 3 characters) were added to Hiragana.
  • A digraph and a double hyphen (total 2 characters) were added to Katakana.
  • Additional circled numerals (total 30 characters) were added to Enclosed CJK Letters and Months.
  • Five missing radicals (total 5 characters) were added to Yi Radicals.
  • Additional compatibility characters (total 59 characters) were added to CJK Compatibility Ideographs.
  • The rial sign (﷼) (total 1 character) was added to Arabic Presentation Forms-A.
  • Two sesame dots (total 2 characters) were added to CJK Compatibility Forms.
  • A tail fragment (ﹳ) (total 1 character) was added to Arabic Presentation Forms-B.
  • A pair of double parenthesis (total 2 characters) was added to Halfwidth and Fullwidth Forms.

Unicode 4.0[edit]

Unicode 4.0 was released April 2003. It encoded 96,447 characters.

New blocks[edit]

  • Limbu, containing 66 characters for the Limbu abugida, was added.
  • Tai Le, containing 35 letters for the Tai Le script, was added.
  • Khmer Symbols, containing 32 symbols for the lunar calendar, was added.
  • Phonetic Extensions, containing 108 letters used in phonetic transcription, was added.
  • Miscellaneous Symbols and Arrows, containing 14 additional arrows, was added.
  • Yijing Hexagram Symbols, containing 64 hexagrams, was added.
  • Linear B Syllabary, containing 88 syllables of the ancient Linear B script, was added.
  • Linear B Ideograms, containing 123 ideograms of the ancient Linear B script, was added.
  • Aegean Numbers, containing 57 numerals used in the Aegean area, was added.
  • Ugaritic, containing 31 characters used in Ugaritic cuneiform, was added.
  • Shavian, containing 48 letters used for the artificial Shavian script, was added.
  • Osmanya, containing 40 characters used in the artificial Osmanya script, was added.
  • Cypriot Syllabary, containing 55 characters formerly used on Cyprus, was added.
  • Tai Xuan Jing Symbols, containing 87 symbols of Tai Xuan Jing, was added.
  • Variation Selectors Supplement, containing 240 additional variation selectors, was added.

Extended blocks[edit]

  • Letters with curl used in Sinology (total 4 characters) were added to Latin Extended-B.
  • Former IPA letters (total 2 characters) were added to IPA Extensions.
  • Some additional characters (total 17 characters) were added to Spacing Modifier Letters.
  • Additional combining double-width diacritics and diacritics corresponding to their spacing equivalent (total 11 characters) were added to Combining Diacritical Marks.
  • The archaic letters Sho and San and the capital Lunate Sigma (total 5 characters) were added to Greek and Coptic.
  • Some additional markers, biblical signs, and letters with inverted V (total 19 characters) were added to Arabic.
  • Letters used for foreign words from Persian and Sogdian (total 6 characters) were added to Syriac.
  • The short A (ऄ) (total 1 character) was added to Devanagari.
  • The Avagraha sign (ঽ) (total 1 character) was added to Bengali.
  • The Adak Bindi and Visarga signs (total 2 characters) were added to Gurmukhi.
  • The vocalic l and ll and the Rupee sign (total 5 characters) were added to Gujarati.
  • The letters Va and Wa (total 2 characters) were added to Oriya.
  • Additional signs for date and finance environments (total 8 characters) were added to Tamil.
  • The Nukta and Avagraha signs (total 2 characters) were added to Kannada.
  • Some symbols and signs (total 11 characters) were added to Khmer.
  • An inverted undertie and a swung dash (total 2 characters) were added to General Punctuation.
  • The facsimile sign (℻) (total 1 character) was added to Letterlike Symbols.
  • The eject symbol and a vertical line (total 2 characters) were added to Miscellaneous Technical.
  • A black circled digit zero (⓿) (total 1 character) was added to Enclosed Alphanumerics.
  • Monograms and diagrams, flags, warning and weather symbols and a cup of tea (total 12 characters) were added to Miscellaneous Symbols.
  • Additional parenthesized and circled Korean characters and supplemental signs (total 9 characters) were added to Enclosed CJK Letters and Months.
  • Additional measure units (total 7 characters) were added to CJK Compatibility.
  • An additional Arabic sign (﷽) (total 1 character) was added to Arabic Presentation Forms-A.
  • A pair of vertical parenthesis (total 2 characters) was added to CJK Compatibility Forms.
  • The letters Oi and Ew (total 4 characters) were added to Deseret.
  • A small script l (ℓ) (total 1 character) was added to Mathematical Alphanumeric Symbols.

Unicode 4.1[edit]

Unicode 4.1 was released March 31, 2005. It encoded 97,720 characters.

New blocks[edit]

  • Arabic Supplement, containing 30 characters for various languages written with the Arabic script, was added.
  • Ethiopic Supplement, containing 26 characters and signs for Sebatbeit, was added.
  • New Tai Lue, containing 80 characters for the New Tai Lue script, was added.
  • Buginese, containing 30 characters for the Lontara script, was added.
  • Phonetic Extensions Supplement, containing 64 additional letters for phonetic transcription, was added.
  • Combining Diacritical Marks Supplement, containing 4 additional diacritics, was added.
  • Glagolitic, containing 94 characters for the Glagolitic script, was added.
  • Coptic, containing 114 characters for the Coptic script, was added.
  • Georgian Supplement, containing 38 Nuskhuri letters, was added.
  • Tifinagh, containing 55 characters for the Tifinagh script, was added.
  • Ethiopic Extended, containing 79 additional Ethiopic syllables, was added.
  • Supplemental Punctuation, containing 26 additional punctuation marks, was added.
  • CJK Strokes, containing 16 strokes for Han Ideographs, was added.
  • Modifier Tone Letters, containing 23 letters for Chinese tones, was added.
  • Syloti Nagri, containing 44 characters for the Syloti Nagri abugida, was added.
  • Vertical Forms, containing 10 punctuation marks suited for vertical text, was added.
  • Ancient Greek Numbers, containing 75 numerals and signs used in Ancient Greek, was added.
  • Old Persian, containing 50 characters for Old Persian cuneiform, was added.
  • Kharoshthi, containing 65 characters for the Kharoshthi abugida, was added.
  • Ancient Greek Musical Notation, containing 70 musical signs used in Ancient Greek, was added.

Extended blocks[edit]

  • Letters for Sencoten, digraphs, letters with swash tail and other additions (total 11 characters) were added to Latin Extended-B.
  • Additional diacritics for transliteration (total 5 characters) were added to Combining Diacritical Marks.
  • Rho with stroke, reversed and dotted Lunate Sigma (total 4 characters) were added to Greek and Coptic.
  • Ghe with descender (Ӷ) (total 2 characters) was added to Cyrillic.
  • An additional biblical mark and some punctuation marks (total 4 characters) were added to Hebrew.
  • Additional biblical marks, punctuation marks and the Afghani sign (total 8 characters) were added to Arabic.
  • A glottal stop (ॽ) (total 1 character) was added to Devanagari.
  • The Khanda Ta letter (ৎ) (total 1 character) was added to Bengali.
  • The letter Sha and the digit zero (total 2 characters) were added to Tamil.
  • Two marks used in Bhutan (total 2 characters) were added to Tibetan.
  • Two letters and a modifier letter (total 3 characters) were added to Georgian.
  • Some additional syllables (total 11 characters) were added to Ethiopic.
  • Additional phonetic symbols (total 20 characters) were added to Phonetic Extensions.
  • A flower and dot punctuation marks (total 9 characters) were added to General Punctuation.
  • Additional subscript letters (total 5 characters) were added to Superscripts and Subscripts.
  • The Guarani, Austral, Hryvnia and Cedi signs (total 4 characters) were added to Currency Symbols.
  • A combining long double solidus (⃫) (total 1 character) was added to Combining Diacritical Marks for Symbols.
  • The per sign and a double-struck letter Pi (total 2 characters) were added to Letterlike Symbols.
  • Metrical and electrical signs (total 11 characters) were added to Miscellaneous Technical.
  • Additional gender and map symbols (total 30 characters) were added to Miscellaneous Symbols.
  • Some additional mathematical symbols (total 7 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Additional arrows and squares (total 6 characters) were added to Miscellaneous Symbols and Arrows.
  • A circled Hangul character (㉾) (total 1 character) was added to Enclosed CJK Letters and Months.
  • Additional Han Ideographs (total 22 characters) were added to CJK Unified Ideographs.
  • Additional Compatibility Ideographs (total 106 characters) were added to CJK Compatibility Ideographs.
  • Italic dotless small i and j (total 2 characters) were added to Mathematical Alphanumeric Symbols.

Unicode 5.0[edit]

Unicode 5.0 was released July 14, 2006. It encoded 99,089 characters.

New blocks[edit]

  • N'Ko, containing 59 characters for the N'Ko script, was added.
  • Balinese, containing 121 characters and musical signs for the Balinese abugida, was added.
  • Latin Extended-C, containing 17 letters for various languages, was added.
  • Latin Extended-D, containing 2 characters for UPA, was added.
  • Phags-pa, containing 56 characters for the Phags-pa script, was added.
  • Phoenician, containing 27 letters and numerals for the Phoenician script, was added.
  • Cuneiform, containing 879 signs for Sumero-Akkadian Cuneiform, was added.
  • Cuneiform Numbers and Punctuation, containing 103 numerals and punctuation signs for Sumero-Akkadian Cuneiform, was added.
  • Counting Rod Numerals, containing 18 numerals used with counting rods, was added.

Extended blocks[edit]

  • Various letters used mainly for aboriginal languages (total 14 characters) were added to Latin Extended-B.
  • Lowercase lunate sigma symbols (total 3 characters) were added to Greek and Coptic.
  • Lowercase palochka and 3 letters used in Nivkh (total 7 characters) were added to Cyrillic.
  • Two letters used in Khanty and other languages (total 4 characters) were added to Cyrillic Supplement.
  • A specific point meant for Vav (ֺ) (total 1 character) was added to Hebrew.
  • Four letters used in Sindhi (total 4 characters) were added to Devanagari.
  • Four letters used in Sanskrit (total 4 characters) were added to Kannada.
  • Additional IPA diacritics (total 9 characters) were added to Combining Diacritical Marks Supplement.
  • Four combining arrows (total 4 characters) were added to Combining Diacritical Marks for Symbols.
  • A danish symbol and a lowercase turned F (total 2 characters) were added to Letterlike Symbols.
  • A lowercase reversed C (ↄ) (total 1 character) was added to Number Forms.
  • Vertical parenthesis, geometric forms and electrical symbols (total 12 characters) were added to Miscellaneous Technical.
  • A neuter symbol (⚲) (total 1 character) was added to Miscellaneous Symbols.
  • Four additional mathematical symbols (total 4 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Additional squares, pentagons and hexagons (total 11 characters) were added to Miscellaneous Symbols and Arrows.
  • Four additional tone letters used in Chinantec (total 4 characters) were added to Modifier Tone Letters.
  • Bold Digamma (𝟊/Ϝ) (total 2 characters) was added to Mathematical Alphanumeric Symbols.

Unicode 5.1[edit]

Unicode 5.1 was released April 4, 2008. It encoded 100,713 characters.

New blocks[edit]

  • Sundanese, containing 55 characters for Sundanese script, was added.
  • Lepcha, containing 74 characters for Lepcha script, was added.
  • Ol Chiki, containing 48 characters for Ol Chiki script, was added.
  • Cyrillic Extended-A, containing 32 characters for combining Cyrillic letters, was added.
  • Vai, containing 300 characters for Vai script, was added.
  • Cyrillic Extended-B, containing 78 characters for additional Cyrillic characters, was added.
  • Saurashtra, containing 81 characters for Saurashtra script, was added.
  • Kayah Li, containing 48 characters for Kayah languages, was added.
  • Rejang, containing 37 characters for Rejang script, was added.
  • Cham, containing 83 characters for Cham script, was added.
  • Ancient Symbols, containing 12 characters for weights and measures and other Ancient symbols, was added.
  • Phaistos Disc, containing 46 characters for Phaistos hieroglyphs, was added.
  • Lycian, containing 29 characters for Lycian script, was added.
  • Carian, containing 49 characters for Carian script, was added.
  • Lydian, containing 27 characters for Lydian script, was added.
  • Mahjong Tiles, containing 44 characters for Mahjong tiles, was added.
  • Domino Tiles, containing 100 characters for Domino tiles, was added.

Extended blocks[edit]

  • Archaic letters and capital kai symbol (total 7 characters) were added to Greek and Coptic.
  • Combining Pokrytie (total 1 character) was added to Cyrillic.
  • Mordvin, Kurdish, Aleut and Chuvash letters (total 16 characters) were added to Cyrillic Supplement.
  • Radix symbols, Letterlike, punctuation, Koranic annotation signs and additions for early Persian and Azerbaijani (total 15 characters) were added to Arabic.
  • Additional letters in Torwali, Burushaski and early Persian (total 18 characters) were added to Arabic Supplement.
  • High spacing dot and candra a (total 2 characters) were added to Devanagari.
  • Udaat and yakash signs (total 2 characters) were added to Gurmukhi.
  • Vocalic rr, l and ll (total 3 characters) were added to Oriya.
  • Om symbol (ௐ) (total 1 character) was added to Tamil.
  • Avagraha, additional phonetic letters, vocalic l and ll, fractional signs and tuumu (total 13 characters) were added to Telugu.
  • Avagraha, vocalic rr, l and ll, Malayalam numerics and fractions and chillu letters (total 17 characters) were added to Malayalam.
  • Letters for Balti and various symbols (total 6 characters) were added to Tibetan.
  • Characters for various languages (total 78 characters) were added to Myanmar.
  • Manchu Ali Gali lha (ᢪ) (total 1 character) was added to Mongolian.
  • Miscellaneous combining marks (total 28 characters) were added to Combining Diacritical Marks Supplement.
  • Medievalist latin letters and miscellaneous letters (total 10 characters) were added to Latin Extended Additional.
  • Invisible plus (+) (total 1 character) was added to General Punctuation.
  • Combining asterisk above ( ⃰)(total 1 character) was added to Combining Diacritical Marks for Symbols.
  • Symbol for Samaritan Source (⅏) (total 1 character) was added to Letterlike Symbols.
  • Archaic Roman Numerals (total 4 characters) were added to Number Forms.
  • Outlined white star and other signs (total 15 characters) were added to Miscellaneous Symbols.
  • Long division and additional mathematical brackets (total 5 characters) were added to Miscellaneous Mathematical Symbols-A.
  • Miscellaneous signs (total 51 characters) were added to Miscellaneous Symbols and Arrows.
  • Additional latin letters (total 12 characters) were added to Latin Extended-C.
  • Additional punctuation (total 23 characters) were added to Supplemental Punctuation.
  • Letter ih (ㄭ) (total 1 character) was added to Bopomofo.
  • Other strokes (total 20 characters) were added to CJK Strokes.
  • Miscellaneous additions (total 8 characters) were added to CJK Unified Ideographs.
  • Africanist tone letters (total 5 characters) were added to Modifier Tone Letters.
  • Miscellaneous letters and symbols (total 112 characters) were added to Latin Extended-D.
  • Continuous macrons for Coptic (total 3 characters) were added to Combining Half Marks.
  • Musical symbol multiple measure rest (𝄩) (total 1 character) was added to Musical Symbols.

Unicode 5.2[edit]

Unicode 5.2 was released in October 1, 2009. It encoded 107,361 characters.

New blocks[edit]

  • Samaritan (U+0800-U+083F), containing 61 characters, was added.
  • Unified Canadian Aboriginal Syllabics Extended (U+18B0-U+18FF), containing 70 characters, was added.
  • Tai Tham (U+1A20-U+1AAF), containing 127 characters, was added.
  • Vedic Extensions (U+1CD0-U+1CFF), containing 35 characters, was added.
  • Lisu (U+A4D0-U+A4FF), containing 48 characters, was added.
  • Bamum (U+A6A0-U+A6FF), containing 88 characters, was added.
  • Common Indic Number Forms (U+A830-U+A83F), containing 10 characters, was added.
  • Devanagari Extended (U+A8E0-U+A8FF), containing 28 characters, was added.
  • Hangul Jamo Extended-A (U+A960-U+A97F), containing 29 characters, was added.
  • Javanese (U+A980-U+A9DF), containing 91 characters, was added.
  • Myanmar Extended-A (U+AA60-U+AA7F), containing 28 characters, was added.
  • Tai Viet (U+AA80-U+AADF), containing 72 characters, was added.
  • Meetei Mayek (U+ABC0-U+ABFF), containing 56 characters, was added.
  • Hangul Jamo Extended-B (U+D7B0-U+D7FF), containing 72 characters, was added.
  • Imperial Aramaic (U+10840-U+1085F), containing 31 characters, was added.
  • Old South Arabian (U+10A60-U+10A7F), containing 32 characters, was added.
  • Avestan (U+10B00-U+10B3F), containing 61 characters, was added.
  • Inscriptional Parthian (U+10B40-U+10B5F), containing 30 characters, was added.
  • Inscriptional Pahlavi (U+10B60-U+10B7F), containing 27 characters, was added.
  • Old Turkic (U+10C00-U+10C4F), containing 73 characters, was added.
  • Rumi Numeral Symbols (U+10E60-U+10E7F), containing 31 characters, was added.
  • Kaithi (U+11080-U+110CF), containing 66 characters, was added.
  • Egyptian Hieroglyphs (U+13000-U+1342F), containing 1,071 characters, was added.
  • Enclosed Alphanumeric Supplement (U+1F100-U+1F1FF), containing 63 characters, was added.
  • Enclosed Ideographic Supplement (U+1F200-U+1F2FF), containing 44 characters, was added.
  • CJK Unified Ideographs Extension C (U+2A700-U+2B73F), containing 4,149 characters, was added.

Extended blocks[edit]

  • (total 2 characters) were added to Cyrillic Supplement. (U+0524-U+0525)
  • (total 5 characters) were added to Devanagari. (U+0900, U+094E, U+0955 and U+0979-U+097A)
  • (total 1 character) was added to Bengali. (U+09FB)
  • (total 4 characters) were added to Tibetan. (U+0FD5-U+0FD8)
  • (total 4 characters) were added to Myanmar. (U+109A-U+109D)
  • (total 16 characters) were added to Hangul Jamo. (U+115A-U+115E, U+11A3-U+11A7 and U+11FA-U+11FF)
  • (total 10 characters) were added to Unified Canadian Aboriginal Syllabics. (U+1400 and U+1677-U+167F)
  • (total 3 characters) were added to New Tai Lue. (U+19AA-U+19AB and U+19DA)
  • (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFD)
  • (total 3 characters) were added to Currency Symbols. (U+20B6-U+20B8)
  • (total 4 characters) were added to Number Forms. (U+2150-U+2152 and U+2189)
  • (total 1 characters) was added to Miscellaneous Technical. (U+23E8)
  • (total 59 characters) were added to Miscellaneous Symbols. (U+U+269E-U+269F, U+26BD-U+26BF, U+26C4-U+26CD, U+26CF-U+26E1, U+26E3 and U+26E8-U+26FF)
  • (total 1 character) was added to Dingbats. (U+2757)
  • (total 5 characters) were added to Miscellaneous Symbols and Arrows. (U+2B55-U+2B59)
  • (total 3 characters) were added to Latin Extended-C. (U+2C70 and U+2C7E-U+2C7F)
  • (total 7 characters) were added to Coptic. (U+2CEB-U+2CF1)
  • (total 1 character) was added to Supplemental Punctuation. (U+2E31)
  • (total 12 characters) were added to Enclosed CJK Letters and Months. (U+3244-U+324F)
  • (total 8 characters) were added to CJK Unified Ideographs. (U+9FC4-U+9FCB)
  • (total 3 characters) were added to CJK Compatibility Ideographs. (U+FA6B-U+FA6D)
  • (total 2 characters) were added to Phoenician. (U+1091A-U+1091B)

Unicode 6.0[edit]

Unicode 6.0 was released in October 11, 2010. It encoded 109,449 characters.

New blocks[edit]

  • Mandaic (U+0840-U+085F), containing 29 characters, was added.
  • Batak (U+1BC0-U+1BFF), containing 56 characters, was added.
  • Ethiopic Extended-A (U+AB00-U+AB2F), containing 32 characters, was added.
  • Brahmi (U+11000-U+1107F), containing 108 characters, was added.
  • Bamum Supplement (U+16800-U+16A3F), containing 761 characters, was added.
  • Kana Supplement (U+1B000-U+1B0FF), containing 2 characters, was added.
  • Playing Cards (U+1F0A0-U+1F0FF), containing 59 characters, was added.
  • Miscellaneous Symbols and Pictographs (U+1F300-U1F5FF), containing 529 characters, was added.
  • Emoticons (U+1F600-U+1F64F), containing 63 characters, was added.
  • Transport and Map Symbols (U+1F680-U+1F6FF), containing 70 characters, was added.
  • Alchemical Symbols (U+1F700-U+1F77F), containing 116 characters, was added.
  • CJK Unified Ideographs Extension D (U+2B740-U+2B81F), containing 222 characters, was added.

Extended blocks[edit]

  • (total 2 characters) were added to Cyrillic Supplement. (U+0526-U+0527)
  • (total 2 characters) were added to Arabic. (U+0620 and U+065F)
  • (total 10 characters) were added to Devanagari. (U+093A-U+093B, U+094F, U+0956-U+0957 and U+0973-U+0977)
  • (total 6 characters) were added to Oriya. (U+0B72-U+0B77)
  • (total 3 characters) were added to Malayalam. (U+0D29, U+0D3A and U+0D4E)
  • (total 6 characters) were added to Tibetan. (U+0FD9-U+0FDA)
  • (total 2 characters) were added to Ethiopic. (U+135D-U+135E)
  • (total 1 character) was added to Combining Diacritical Marks Supplement. (U+1DFC)
  • (total 8 characters) were added to Superscripts and Subscripts. (U+2095-U+209C)
  • (total 1 character) was added to Currency Symbols. (U+20B9)
  • (total 11 characters) were added to Miscellaneous Technical. (U+23E9-U+23F3)
  • (total 6 characters) were added to Miscellaneous Symbols. (U+26CE, U+26E2 and U+26E4-U+26E7)
  • (total 16 characters) were added to Dingbats. (U+2705, U+270A-U+270B, U+2728, U+274C, U+274E, U+2753-U+2755, U+275F-U+2760, U+2795-U+2797, U+27B0 and U+27BF)
  • (total 2 characters) were added to Miscellaneous Mathematical Symbols-A. (U+27CE-U+27CF)
  • (total 2 characters) were added to Tifinagh. (U+2D70 and U+2D7F)
  • (total 3 characters) were added to Bopomofo Extended. (U+31B8-U+31BA)
  • (total 2 characters) were added to Cyrillic Extended-B. (U+A660-U+A661)
  • (total 15 characters) were added to Latin Extended-D. (U+A78D-U+A78E, U+A790-U+A791, U+A7A0-U+A7A9 and U+A7FA)
  • (total 16 characters) were added to Arabic Presentation Forms-A. (U+FBB2-U+FBC1)
  • (total 107 characters) were added to Enclosed Alphanumeric Supplement. (U+1F130, U+1F132-U+1F13C, U+1F13E, U+1F140-U+1F141, U+1F143-U+1F145, U+1F147-U+1F149, U+1F14F-U+1F156, U+1F158-U+1F15E, U+1F160-U+1F169, U+1F170-U+1F178, U+1F17A, U+1F17D-U+1F17E, U+1F180-U+1F189, U+1F18E-U+1F18F, U+1F191-U+1F19A and U+1F1E6-U+1F1FF)
  • (total 13 characters) were added to Enclosed Ideographic Supplement.(U+1F201-U+1F202, U+1F232-U+1F23A and U+1F250-U+1F251)

Unicode 6.1[edit]

Unicode 6.1 was released in January 31, 2012. It encoded 110,181 characters.

New blocks[edit]

  • Arabic Extended-A (U+08A0-U+08FF), containing 39 characters, was added.
  • Sundanese Supplement (U+1CC0-U+1CCF), containing 8 characters, was added.
  • Meetei Mayek Extensions (U+AAE0-U+AAFF), containing 23 characters, was added.
  • Meroitic Hieroglyphs (U+10980-U+1099F), containing 32 characters, was added.
  • Meroitic Cursive (U+109A0-U+109FF), containing 26 characters, was added.
  • Sora Sompeng (U+110D0-U+110FF), containing 35 characters, was added.
  • Chakma (U+11100-U+1114F), containing 67 characters, was added.
  • Sharada (U+11180-U+111DF), containing 83 characters, was added.
  • Takri (U+11680-U+116CF), containing 66 characters, was added.
  • Miao (U+16F00-U+16F9F), containing 133 characters, was added.
  • Arabic Mathematical Alphabetic Symbols (U+1EE00-U+1EEFF), containing 143 characters, was added.

Extended blocks[edit]

  • (total 1 character) was added to Armenian. (U+058F)
  • (total 1 character) was added to Arabic. (U+0604)
  • (total 1 character) was added to Gujarati. (U+0AF0)
  • (total 2 characters) were added to Lao. (U+0EDE-U+0EDF)
  • (total 5 characters) were added to Georgian. (U+10C7, U+10CD and U+10FD-U+10FF)
  • (total 9 characters) were added to Sundanese. (U+1BAB-U+1BAD and U+1BBA-U+1BBF)
  • (total 4 characters) were added to Vedic Extensions. (U+1CF3-U+1CF6)
  • (total 2 characters) were added to Miscellaneous Mathematical Symbols-A. (U+27CB and U+27CD)
  • (total 2 characters) were added to Coptic. (U+2CF2-U+2CF3)
  • (total 2 characters) were added to Georgian Supplement. (U+2D27 and U+2D2D)
  • (total 2 characters) were added to Tifinagh. (U+2D66-U+2D67)
  • (total 10 characters) were added to Supplemental Punctuation. (U+2E32-U+2E3B)
  • (total 1 character) was added to CJK Unified Ideographs. (U+9FCC)
  • (total 9 characters) were added to Cyrillic Extended-B. (U+A674-U+A67B and U+A69F)
  • (total 5 characters) were added to Latin Extended-D. (U+A792-U+A793, U+A7AA and U+A7F8-U+A7F9)
  • (total 2 characters) were added to CJK Compatibility Ideographs. (U+FA2E-U+FA2F)
  • (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F16A-U+1F16B)
  • (total 4 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F540-U+1F543)
  • (total 13 characters) were added to Emoticons. (U+1F600, U+1F611, U+1F615, U+1F617, U+1F619, U+1F61B, U+1F61F, U+1F626-U+1F627, U+1F62C, U+1F62E-U+1F62F and U+1F634)

Unicode 6.2[edit]

Unicode 6.2 was released in September 26, 2012. It encoded 110,182 characters.

Extended blocks[edit]

  • (total 1 character) was added to Currency Symbols. (U+20BA)

Unicode 6.3[edit]

Unicode 6.3 was released in September 30, 2013. It encoded 110,187 characters.

Extended blocks[edit]

  • (total 1 character) was added to Arabic. (U+061C)
  • (total 4 characters) were added to General Punctuation. (U+2066-U+2069)

Unicode 7.0[edit]

Unicode 7.0 was released in June 16, 2014. It encodes 113,021 characters.

New blocks[edit]

  • Combining Diacritical Marks Extended (U+1AB0-U+1AFF), containing 15 marks, was added.
  • Myanmar Extended-B (U+A9E0-U+A9FF), containing 31 letters, was added.
  • Latin Extended-E (U+AB30-U+AB6F), containing 50 letters, was added.
  • Coptic Epact Numbers (U+102E0-U+102FF), containing 28 numbers, was added.
  • Old Permic (U+10350-U+1037F), containing 43 letters, was added.
  • Elbasan (U+10500-U+1052F), containing 50 letters, was added.
  • Caucasian Albanian (U+10530-U+1056F), containing 53 letters and marks, was added.
  • Linear A (U+10600-U+1077F), containing 341 signs, was added.
  • Palmyrene (U+10860-U+1087F), containing 32 letters, was added.
  • Nabataean (U+10880-U+108AF), containing 40 letters and numbers, was added.
  • Old North Arabian (U+10A80-U+10A9F), containing 32 letters and numbers, was added.
  • Manichaean (U+10AC0-U+10AFF), containing 51 characters, was added.
  • Psalter Pahlavi (U+10B80-U+10BAF), containing 29 characters, was added.
  • Mahajani (U+11150-U+1117F), containing 39 letters and signs, was added.
  • Sinhala Archaic Numbers (U+111E0-U+111FF), containing 20 numbers, was added.
  • Khojki (U+11200-U+1124F), containing 61 characters, was added.
  • Khudawadi (U+112B0-U+112FF), containing 69 characters, was added.
  • Grantha (U+11300-U+1137F), containing 83 characters, was added.
  • Tirhuta (U+11480-U+114DF), containing 82 characters, was added.
  • Siddham (U+11580-U+115FF), containing 72 characters, was added.
  • Modi (U+11600-U+1165F), containing 79 characters, was added.
  • Warang Citi (U+118A0-U+118FF), containing 84 letters and numbers, was added.
  • Pau Cin Hau (U+11AC0-U+11AFF), containing 57 characters, was added.
  • Mro (U+16A40-U+16A6F), containing 43 characters, was added.
  • Bassa Vah (U+16AD0-U+16AFF), containing 36 characters, was added.
  • Pahawh Hmong (U+16B00-U+16B8F), containing 127 letters and signs, was added.
  • Duployan (U+1BC00-U+1BC9F), containing 143 characters, was added.
  • Shorthand Format Controls (U+1BCA0-U+1BCAF), containing 4 format characters, was added.
  • Mende Kikakui (U+1E800-U+1E8DF), containing 213 syllables and numbers, was added.
  • Ornamental Dingbats (U+1F650-U+1F67F), containing 48 pictographic characters, was added.
  • Geometric Shapes Extended (U+1F780-U+1F7FF), containing 85 pictographic characters, was added.
  • Supplemental Arrows-C (U+1F800-U+1F8FF), containing 148 pictographic characters, was added.

Extended blocks[edit]

  • (total 1 character) was added to Greek and Coptic. (U+037F)
  • (total 8 characters) were added to Cyrillic Supplement. (U+0528-U+052F)
  • (total 2 characters) were added to Armenian. (U+058D-U+058E)
  • (total 1 character) was added to Arabic. (U+0605)
  • (total 8 characters) were added to Arabic Extended-A. (U+08A1, U+08AD-U+08B2 and U+08FF)
  • (total 1 character) was added to Devanagari. (U+0978)
  • (total 1 character) was added to Bengali. (U+0980)
  • (total 2 characters) were added to Telugu. (U+0C00 and U+0C34)
  • (total 1 character) was added to Kannada. (U+0C81)
  • (total 1 character) was added to Malayalam. (U+0D01)
  • (total 10 digits) were added to Sinhala. (U+0DE6-U+0DEF)
  • (total 8 characters) were added to Runic. (U+16F1-U+16F8)
  • (total 2 characters) were added to Limbu. (U+191D-U+191E)
  • (total 2 characters) were added to Vedic Extensions. (U+1CF8-U+1CF9)
  • (total 15 characters) were added to Combining Diacritical Marks Supplement. (U+1DE7-U+1DF5)
  • (total 3 characters) were added to Currency Symbols. (U+20BB-U+20BD)
  • (total 7 characters) were added to Miscellaneous Technical. (U+23F4-U+23FA)
  • (total 1 character) was added to Dingbats. (U+2700)
  • (total 115 characters) were added to Miscellaneous Symbols and Arrows. (U+2B4D-U+2B4F, U+2B5A-U+2B5F, U+2B60-U+2B73, U+2B76-U+2B95, U+2B98-U+2BB9, U+2BBD-U+2BC8 and U+2BCA-U+2BD1)
  • (total 7 characters) were added to Supplemental Punctuation. (U+2E3C-U+2E42)
  • (total 6 characters) were added to Cyrillic Extended-B. (U+A698-U+A69D)
  • (total 18 characters) were added to Latin Extended-D. (U+A794-U+A79F, U+A7AB-U+A7AD, U+A7B0-U+A7B1 and U+A7F7)
  • (total 4 characters) were added to Myanmar Extended-A. (U+AA7C-U+AA7F)
  • (total 7 characters) were added to Combining Half Marks. (U+FE27-U+FE2D)
  • (total 2 characters) were added to Ancient Greek Numbers. (U+1018B-U+1018C)
  • (total 1 character) was added to Ancient Symbols. (U+101A0)
  • (total 1 character) was added to Old Italic. (U+1031F)
  • (total 1 character) was added to Brahmi. (U+1107F)
  • (total 2 characters) were added to Sharada. (U+111CD and U+111DA)
  • (total 42 characters) were added to Cuneiform. (U+1236F-U+12398)
  • (total 13 characters) were added to Cuneiform Numbers and Punctuation. (U+12463-U+1246E and U+12474)
  • (total 23 characters) were added to Playing Cards. (U+1F0BF and U+1F0E0-U+1F0F5)
  • (total 2 characters) were added to Enclosed Alphanumeric Supplement. (U+1F10B-U+1F10C)
  • (total 209 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F321-U+1F32C, U+1F336, U+1F37D, U+1F394-U+1F39F, U+1F3C5, U+1F3CB-U+1F3CE, U+1F3D4-U+1F3DF, U+1F3F1-U+1F3F7, U+1F43F, U+1F441, U+1F4F8, U+1F4FD-U+1F4FE, U+1F53E-U+1F53F, U+1F544-U+1F54A, U+1F568-U+1F579, U+1F57B-U+1F5A3 and U+1F5A5-U+1F5FA)
  • (total 2 characters) were added to Emoticons. (U+1F641-U+1F642)
  • (total 27 characters) were added to Transport and Map Symbols. (U+1F6C6-U+1F6CF, U+1F6E0-U+1F6EC and U+1F6F0-U+1F6F3)

Unicode 8.0[edit]

Unicode 8.0 was released in June 17, 2015. It encoded 120,737 characters.

New blocks[edit]

  • Cherokee Supplement (U+AB70-U+ABBF), containing 80 lowercase letters, was added.
  • Hatran (U+108E0-U+108FF), containing 26 letters, was added.
  • Old Hungarian (U+10C80-U+10CFF), containing 108 letters, was added.
  • Multani (U+11280-U+112AF), containing 38 letters, was added.
  • Ahom (U+11700-U+1173F), containing 57 letters, was added.
  • Early Dynastic Cuneiform (U+12480-U+1254F), containing 196 characters, was added.
  • Anatolian Hieroglyphs (U+14400-U+1467F), containing 583 characters, was added.
  • Sutton SignWriting (U+1D800-U+1DAAF), containing 672 signs, was added.
  • Supplemental Symbols and Pictographs (U+1F900-U+1F9FF), containing 15 pictographic characters, was added.
  • CJK Unified Ideographs Extension E (U+2B820-U+2CEAF), containing 5762 characters, was added.

Extended blocks[edit]

  • (total 3 characters) were added to Arabic Extended-A. (U+08B3-U+08B4 and U+08E3)
  • (total 1 character) were added to Gujarati. (U+0AF9)
  • (total 1 character) were added to Telugu. (U+0C5A)
  • (total 1 character) were added to Malayalam. (U+0D5F)
  • (total 7 characters) were added to Cherokee. (U+13F5 and U+13F8-U+13FD)
  • (total 1 character) were added to Currency Symbols. (U+20BE)
  • (total 2 characters) were added to Number Forms. (U+218A-U+218B)
  • (total 4 characters) were added to Miscellaneous Symbols and Arrows. (U+2BEC-U+2BEF)
  • (total 9 characters) were added to CJK Unified Ideographs. (U+9FCD-U+9FD5)
  • (total 1 character) were added to Cyrillic Extended-B. (U+A69E)
  • (total 7 characters) were added to Latin Extended-D. (U+A78F and U+A7B2-U+A7B7)
  • (total 2 characters) were added to Devanagari Extended. (U+A8FC-U+A8FD)
  • (total 4 characters) were added to Latin Extended-E. (U+AB60-U+AB63)
  • (total 2 characters) were added to Combining Half Marks. (U+FE2E-U+FE2F)
  • (total 64 characters) were added to Meroitic Cursive. (U+109BC-U+109BD, U+109C0-U+109CF and U+109D2-U+109FF)
  • (total 9 characters) were added to Sharada. (U+111C9-U+111CC and U+111DB-U+111DF)
  • (total 2 characters) were added to Grantha. (U+11300 and U+11350)
  • (total 20 characters) were added to Siddham. (U+115CA-U+115DD)
  • (total 1 character) were added to Cuneiform. (U+12399)
  • (total 11 characters) were added to Musical Symbols. (U+1D1DE-U+1D1E8)
  • (total 24 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F32D-U+1F32F, U+1F37E-U+1F37F, U+1F3CF-U+1F3D3, U+1F3F8-U+1F3FF, U+1F4FF and U+1F54B-U+1F54F)
  • (total 2 characters) were added to Emoticons. (U+1F643-U+1F644)
  • (total 1 character) were added to Transport and Map Symbols. (U+1F6D0)

Unicode 9.0[edit]

Unicode 9.0, was released in June 21, 2016. It encoded 128,237 characters.

New blocks[edit]

  • Cyrillic Extended-C (U+1C80-U+1C8F), containing 9 letters, was added.
  • Osage (U+104B0-U+104FF), containing 72 letters, was added.
  • Newa (U+11400-U+1147F), containing 92 letters, was added.
  • Mongolian Supplement (U+11660-U+1167F), containing 13 letters, was added.
  • Bhaiksuki (U+11C00-U+11C6F), containing 97 letters, was added.
  • Marchen (U+11C70-U+11CBF), containing 68 letters, was added.
  • Ideographic Symbols and Punctuation (U+16FE0-U+16FFF), containing 1 letter, was added.
  • Tangut (U+17000-U+187FF), containing 6125 letters, was added.
  • Tangut Components (U+18800-U+18AFF), containing 755 letters, was added.
  • Glagolitic Supplement (U+1E000-U+1E02F), containing 38 letters, was added.
  • Adlam (U+1E900-U+1E95F), containing 87 letters, was added.

Extended blocks[edit]

  • (total 23 characters) were added to Arabic Extended-A. (U+08B6-U+08BD and U+08D4-U+08E2)
  • (total 1 character) were added to Kannada. (U+0C80)
  • (total 14 characters) were added to Malayalam. (U+0D4F, U+0D54-U+0D56, U+0D58-U+0D5E and U+0D76-U+0D78)
  • (total 1 character) were added to Combining Diacritical Marks Supplement. (U+1DFB)
  • (total 4 characters) were added to Miscellaneous Technical. (U+23FB-U+23FE)
  • (total 2 characters) were added to Supplemental Punctuation. (U+2E43-U+2E44)
  • (total 1 character) were added to Latin Extended-D. (U+A7AE)
  • (total 1 character) were added to Saurashtra. (U+A8C5)
  • (total 2 characters) were added to Ancient Greek Numbers. (U+1018D-U+1018E)
  • (total 1 character) were added to Khojki. (U+1123E)
  • (total 18 characters) were added to Enclosed Alphanumeric Supplement. (U+1F19B-U+1F1AC)
  • (total 1 character) were added to Enclosed Ideographic Supplement. (U+1F23B)
  • (total 2 characters) were added to Miscellaneous Symbols and Pictographs. (U+1F57A and U+1F5A4)
  • (total 5 characters) were added to Transport and Map Symbols. (U+1F6D1-U+1F6D2 and U+1F6F4-U+1F6F6)
  • (total 67 characters) were added to Supplemental Symbols and Pictographs. (U+1F919-U+1F91E, U+1F920-U+1F927, U+1F930, U+1F933-U+1F93E, U+1F940-U+1F94B, U+1F950-U+1F95E and U+1F985-U+1F991)

Unicode 10.0[edit]

Unicode 10.0, was released in June 20, 2017. It encoded 136,690 characters.

New blocks[edit]

  • Syriac Supplement (U+0860-U+086F), containing 11 characters, was added.
  • Zanabazar Square (U+11A00-U+11A4F), containing 72 characters, was added.
  • Soyombo (U+11A50-U+11AAF), containing 80 characters, was added.
  • Masaram Gondi (U+11D00-U+11D5F), containing 75 characters, was added.
  • Kana Extended-A (U+1B100-U+1B12F), containing 31 characters, was added.
  • Nushu (U+1B170-U+1B2FF), containing 396 characters, was added.
  • CJK Unified Ideographs Extension F (U+2CEB0-U+2EBEF), containing 7,473 characters, was added.

Extended blocks[edit]

  • (total 2 characters) were added to Bengali. (U+09FC-U+09FD)
  • (total 6 characters) were added to Gujarati. (U+0AFA-U+0AFF)
  • (total 3 characters) were added to Malayalam. (U+0D00 and U+0D3B-U+0D3C)
  • (total 1 character) were added to Vedic Extensions. (U+1CF7)
  • (total 4 characters) were added to Combining Diacritical Marks Supplement. (U+1DF6-U+1DF9)
  • (total 1 character) were added to Currency Symbols. (U+20BF)
  • (total 1 character) were added to Miscellaneous Technical. (U+23FF)
  • (total 1 character) were added to Miscellaneous Symbols and Arrows. (U+2BD2)
  • (total 5 characters) were added to Supplemental Punctuation. (U+2E45-U+2E49)
  • (total 1 character) were added to Bopomofo. (U+312E)
  • (total 21 characters) were added to CJK Unified Ideographs. (U+9FD6-U+9FEA)
  • (total 3 characters) were added to Old Italic. (U+1032D-U+1032F)
  • (total 1 character) were added to Ideographic Symbols and Punctuation. (U+16FE1)
  • (total 254 characters) were added to Kana Supplement. (U+1B002-U+1B0FF)
  • (total 6 characters) were added to Enclosed Ideographic Supplement. (U+1F260-U+1F265)
  • (total 4 characters) were added to Transport and Map Symbols. (U+1F6D3-U+1F6D4 and U+1F6F7-U+1F6F8)
  • (total 66 characters) were added to Supplemental Symbols and Pictographs. (U+1F900-U+1F90B, U+1F91F, U+1F928-U+1F92F, U+1F931-U+1F932, U+1F94C, U+1F95F-U+1F96B, U+1F992-U+1F997 and U+1F9D0-U+1F9E6)

Unicode 11.0[edit]

Unicode 11.0, was released in June 5, 2018. It encoded 137,374 characters.

New blocks[edit]

  • Georgian Extended (U+1C90-U+1CBF), containing 46 characters, was added.
  • Hanifi Rohingya (U+10D00-U+10D3F), containing 50 characters, was added.
  • Old Sogdian (U+10F00-U+10F2F), containing 40 characters, was added.
  • Sogdian (U+10F30-U+10F6F), containing 42 characters, was added.
  • Dogra (U+11800-U+1184F), containing 60 characters, was added.
  • Gunjala Gondi (U+11D60-U+11DAF), containing 63 characters, was added.
  • Makasar (U+11EE0-U+11EFF), containing 25 characters, was added.
  • Medefaidrin (U+16E40-U+16E9F), containing 91 characters, was added.
  • Mayan Numerals (U+1D2E0-U+1D2FF), containing 20 characters, was added.
  • Indic Siyaq Numbers (U+1EC70-U+1ECBF), containing 68 characters, was added.
  • Chess Symbols (U+1FA00-U+1FA6F), containing 14 characters, was added.

Extended blocks[edit]

  • (total 2 characters) were added to Armenian. (U+0560 and U+0588)
  • (total 1 character) were added to Hebrew. (U+05EF)
  • (total 3 characters) were added to N'Ko. (U+07FD-U+07FF)
  • (total 1 character) were added to Arabic Extended-A. (U+08D3)
  • (total 1 character) were added to Bengali. (U+09FE)
  • (total 1 character) were added to Gurmukhi. (U+0A76)
  • (total 1 character) were added to Telugu. (U+0C04)
  • (total 1 character) were added to Kannada. (U+0C84)
  • (total 1 character) were added to Mongolian. (U+1878)
  • (total 43 characters) were added to Miscellaneous Symbols and Arrows. (U+2BBA-U+2BBC, U+2BD3-U+2BEB and 2BF0-U+2BFE)
  • (total 5 characters) were added to Supplemental Punctuation. (U+2E4A-U+2E4E)
  • (total 1 character) were added to Bopomofo. (U+312F)
  • (total 5 characters) were added to CJK Unified Ideographs. (U+9FEB-U+9FEF)
  • (total 3 characters) were added to Latin Extended-D. (U+A7AF and U+A7B8-U+A7B9)
  • (total 2 characters) were added to Devanagari Extended. (U+A8FE-U+A8FF)
  • (total 3 characters) were added to Kharoshthi. (U+10A34-U+10A35 and U+10A48)
  • (total 1 character) were added to Kaithi. (U+110CD)
  • (total 3 characters) were added to Chakma. (U+11144-U+11146)
  • (total 1 character) were added to Grantha. (U+1133B)
  • (total 1 character) were added to Newa. (U+1145E)
  • (total 1 character) were added to Ahom. (U+1171A)
  • (total 1 character) were added to Soyombo. (U+11A9D)
  • (total 5 characters) were added to Tangut. (U+187ED-U+187F1)
  • (total 7 characters) were added to Counting Rod Numerals. (U+1D372-U+1D378)
  • (total 1 character) were added to Enclosed Alphanumeric Supplement. (U+1F12F)
  • (total 1 character) were added to Transport and Map Symbols. (U+1F6F9)
  • (total 4 characters) were added to Geometric Shapes Extended. (U+1F7D5-U+1F7D8)
  • (total 65 characters) were added to Supplemental Symbols and Pictographs. (U+1F94D-U+1F94F, U+1F96C-U+1F970, U+1F973-U+1F976, U+1F97A, U+1F97C-U+1F97F, U+1F998-U+1F99F, U+1F9A0-U+1F9A2, U+1F9B0-U+1F9B9, U+1F9C1-U+1F9C2 and U+1F9E7-U+1F9FF)

Unicode 12.0[edit]

Unicode 12.0 is scheduled to be released in March 5, 2019. It is planned to encode 137,929 new characters.

New blocks[edit]

  • Elymaic (U+10FE0-U+10FFF), containing 23 characters, will be added.
  • Nandinagari (U+119A0-U+119FF), containing 65 characters, will be added.
  • Tamil Supplement (U+11FC0-U+11FFF), containing 51 characters, will be added.
  • Egyptian Hieroglyph Format Controls (U+13430-U+1343F), containing 9 characters, will be added.
  • Small Kana Extension (U+1B130-U+1B16F), containing 7 characters, will be added.
  • Nyiakeng Puachue Hmong (U+1E100-U+1E14F), containing 71 characters, will be added.
  • Wancho (U+1E2C0-U+1E2FF), containing 59 characters, will be added.
  • Ottoman Siyaq Numbers (U+1ED00-U+1ED4F), containing 61 characters, will be added.
  • Symbols and Pictographs Extended-A (U+1FA70-U+1FAFF), containing 16 characters, will be added.

Extended blocks[edit]

  • (total 1 character) will be added to Telugu. (U+0C77)
  • (total 15 characters) will be added to Lao. (U+0E86, U+0E89, U+0E8C, U+0E8E-U+0E93, U+0E98, U+0EA0, U+0EA8-U+0EA9, U+0EAC and U+0EBA)
  • (total 1 character) will be added to Vedic Extensions. (U+1CFA)
  • (total 2 characters) will be added to Miscellaneous Symbols and Arrows. (U+2BC9 and U+2BFF)
  • (total 1 character) will be added to Supplemental Punctuation. (U+2E4F)
  • (total 11 characters) will be added to Latin Extended-D. (U+A7BA-U+A7BF and U+A7C2-U+A7C6)
  • (total 2 characters) will be added to Latin Extended-E. (U+AB66-U+AB67)
  • (total 1 character) will be added to Newa. (U+1145F)
  • (total 1 character) will be added to Takri. (U+116B8)
  • (total 2 characters) will be added to Soyombo. (U+11A84-U+11A85)
  • (total 16 characters) will be added to Miao. (U+16F45-U+16F4A, U+16F4F and U+16F7F-U+16F87)
  • (total 2 characters) will be added to Ideographic Symbols and Punctuation. (U+16FE2-U+16FE3)
  • (total 6 characters) will be added to Tangut. (U+187F2-U+187F7)
  • (total 1 character) will be added to Adlam. (U+1E94B)
  • (total 1 character) will be added to Enclosed Alphanumeric Supplement. (U+1F16C)
  • (total 2 characters) will be added to Transport and Map Symbols. (U+1F6D5 and U+1F6FA)
  • (total 12 characters) will be added to Geometric Shapes Extended. (U+1F7E0-U+1F7EB)
  • (total 31 characters) will be added to Supplemental Symbols and Pictographs. (U+1F90D-U+1F90F, U+1F93F, U+1F971, U+1F97B, U+1F9A5-U+1F9AA, U+1F9AE-U+1F9AF, U+1F9BA-U+1F9BF, U+1F9C3-U+1F9CA and U+1F9CD-U+1F9CF)
  • (total 84 characters) will be added to Chess Symbols. (U+1FA00-U+1FA53)