L2/09-058 R2Mark Davis, 2009-01-30 Live doc: http://www.macchiato.com/unicode/historic-unicode-characters The following is a draft list of historic characters, meaning characters that are no longer customarily used in current modern languages in typical publications (corner newspapers, magazines, etc.). For example, you wouldn't expect to see words written in Cuneiform in the NY Times. They may of course still be used in in technical journals, especially those dealing with archaic languages, or have occasional decorative use, or be in academic documents, or be quoted in modern works, or be used in liturgical works. For example, in modern Tibetan usage,the Phags-pa script is restricted to calligraphic, decorative and other special purposes, and is not used for writing extensive texts (as it used to be in the 13th and 14th centuries). Thus Phags-pa is an archaic script (in the same way that in the West, blackletter is an archaic script, even though it is still used for calligraphic and decorative purposes). In many cases in the Unicode Standard, we mark characters (such as with subheadings or block names) as being not in modern use, using terms like Obsolete, Old, Ancient, Archaic, and so on. But there are characters that are not clearly marked as such. In UAX31 we include information about blocks or scripts that are unsuitable for identifiers - this is often, but not always, based on whether they are archaic scripts or not. In UTS39 we have a tag for 'archaic' - and I have an action to update that UTS. This does not yet include historic CJK characters. It is useful for people to know which characters are in customary modern use, and which are not, so I set out to try to get a clearer picture of the situation. This is not intended as a proposal for a standard Unicode property; it is simply some information about Unicode characters that may be useful to some people in some contexts. For example, someone building a character picker could put historic characters into an 'extended' bucket for more advanced users. (See, for example, the mockup at http://macchiato.com/picker/MyApplication.html.) For right now, this is targeted at an update of UTS39, plus possible improvements to NamesList annotations. Here is the data. The lines marked with yellow are those that should be removed, based on feedback on the email lists. Additions from that feedback are listed at the end. Current count: 4,441 Code Points = 3,867 Code Points (if unassigned characters are removed from the blocks).
Both include the yellow items that should be removed, below, so the end figures would be somewhat smaller. (1) Data from UAX31This is broadened from scripts to blocks where possible.
[:blk=Ancient_Greek_Musical_Notation:] [:blk=Buginese:] [:blk=Buhid:] [:blk=Carian:] [:blk=Coptic:] [:blk=Cuneiform:] [:blk=Cuneiform_Numbers_And_Punctuation:] [:blk=Cypriot_Syllabary:] [:blk=Deseret:] [:blk=Glagolitic:] [:blk=Gothic:] [:blk=Hanunoo:] [:blk=Kharoshthi:] [:blk=Linear_B_Ideograms:] [:blk=Linear_B_Syllabary:] [:blk=Lycian:] [:blk=Lydian:] [:blk=Ogham:] [:blk=Old_Italic:] [:blk=Old_Persian:] [:blk=Osmanya:] [:blk=Phags_Pa:] [:blk=Phaistos_Disc:] [:blk=Phoenician:] [:blk=Rejang:] [:blk=Runic:] [:blk=Shavian:] [:blk=Sundanese:] [:blk=Syloti_Nagri:] [:blk=Syriac:] [:blk=Tagalog:] [:blk=Tagbanwa:] [:blk=Ugaritic:] [:sc=Copt:] // Note that there is a revival effort for Coptic, that may move it out of this section over time. (2) Additional data from UTS39
Note
that the Korean Jamo are listed because they would normally not appear
in NFC. Also broadened to blocks if possible, and subtracting data
above.
[:blk=Balinese:] // this needs to be removed
[:blk=Ancient_Greek_Numbers:] [:Block=Hangul_Jamo:] [:Block=Hangul_Compatibility_Jamo:] + Latin Extended B - Non-European and historic LatinU+018D ( ƍ ) LATIN SMALL LETTER TURNED DELTAU+01AA ( ƪ ) LATIN LETTER REVERSED ESH LOOPU+01AB ( ƫ ) LATIN SMALL LETTER T WITH PALATAL HOOKU+01B9 ( ƹ ) LATIN SMALL LETTER EZH REVERSEDU+01BA ( ƺ ) LATIN SMALL LETTER EZH WITH TAILU+01BB ( ƻ ) LATIN LETTER TWO WITH STROKEU+01BE ( ƾ ) LATIN LETTER INVERTED GLOTTAL STOP WITH STROKEU+01BF ( ƿ ) LATIN LETTER WYNNLatin Extended B - Miscellaneous additionsU+021C ( Ȝ ) LATIN CAPITAL LETTER YOGHU+021D ( ȝ ) LATIN SMALL LETTER YOGHIPA Extensions - IPA extensionsU+025F ( ɟ ) LATIN SMALL LETTER DOTLESS J WITH STROKEU+0277 ( ɷ ) LATIN SMALL LETTER CLOSED OMEGAU+027C ( ɼ ) LATIN SMALL LETTER R WITH LONG LEGU+029E ( ʞ ) LATIN SMALL LETTER TURNED KCombining Diacritical Marks - Additions for GreekU+0343 ( ̓ ) COMBINING GREEK KORONISGreek And Coptic - Variant letterformsU+03D0 ( ϐ ) GREEK BETA SYMBOLU+03D1 ( ϑ ) GREEK THETA SYMBOLU+03D5 ( ϕ ) GREEK PHI SYMBOLU+03D6 ( ϖ ) GREEK PI SYMBOLU+03D7 ( ϗ ) GREEK KAI SYMBOLGreek And Coptic - Archaic lettersU+03D8 ( Ϙ ) GREEK LETTER ARCHAIC KOPPAU+03D9 ( ϙ ) GREEK SMALL LETTER ARCHAIC KOPPAU+03DA ( Ϛ ) GREEK LETTER STIGMAU+03DB ( ϛ ) GREEK SMALL LETTER STIGMAU+03DC ( Ϝ ) GREEK LETTER DIGAMMAU+03DD ( ϝ ) GREEK SMALL LETTER DIGAMMAU+03DE ( Ϟ ) GREEK LETTER KOPPAU+03DF ( ϟ ) GREEK SMALL LETTER KOPPAU+03E0 ( Ϡ ) GREEK LETTER SAMPIU+03E1 ( ϡ ) GREEK SMALL LETTER SAMPIGreek And Coptic - Additional archaic letters for BactrianU+03F7 ( Ϸ ) GREEK CAPITAL LETTER SHOU+03F8 ( ϸ ) GREEK SMALL LETTER SHOGreek And Coptic - Variant letterformU+03F9 ( Ϲ ) GREEK CAPITAL LUNATE SIGMA SYMBOLGreek And Coptic - Archaic lettersU+03FA ( Ϻ ) GREEK CAPITAL LETTER SANU+03FB ( ϻ ) GREEK SMALL LETTER SANCyrillic - Historic miscellaneousU+0483 ( ҃ ) COMBINING CYRILLIC TITLOU+0484 ( ҄ ) COMBINING CYRILLIC PALATALIZATIONU+0485 ( ҅ ) COMBINING CYRILLIC DASIA PNEUMATAU+0486 ( ҆ ) COMBINING CYRILLIC PSILI PNEUMATAHebrew - Cantillation marksU+05A2 ( ֢ ) HEBREW ACCENT ATNAH HAFUKHHebrew - Puncta extraordinariaU+05C5 ( ׅ ) HEBREW MARK LOWER DOTHebrew - Points and punctuationU+05C6 ( ׆ ) HEBREW PUNCTUATION NUN HAFUKHAU+05C7 ( ׇ ) HEBREW POINT QAMATS QATANArabic - Archaic lettersU+066E ( ٮ ) ARABIC LETTER DOTLESS BEHU+066F ( ٯ ) ARABIC LETTER DOTLESS QAFArabic - Extended Arabic lettersU+068E ( ڎ ) ARABIC LETTER DULKannada - Additional consonantsU+0CDE ( ೞ ) KANNADA LETTER FAGeorgian - Archaic lettersU+10F1 ( ჱ ) GEORGIAN LETTER HEU+10F2 ( ჲ ) GEORGIAN LETTER HIEU+10F3 ( ჳ ) GEORGIAN LETTER WEU+10F4 ( ჴ ) GEORGIAN LETTER HARU+10F5 ( ჵ ) GEORGIAN LETTER HOEU+10F6 ( ჶ ) GEORGIAN LETTER FIKhmer - Independent vowelsU+17A8 ( ឨ ) KHMER INDEPENDENT VOWEL QUKKhmer - Various signsU+17D1 ( ៑ ) KHMER SIGN VIRIAMU+17DD ( ៝ ) KHMER SIGN ATTHACANCombining Diacritical Marks Supplement - Used for Ancient GreekU+1DC0 ( ᷀ ) COMBINING DOTTED GRAVE ACCENTU+1DC1 ( ᷁ ) COMBINING DOTTED ACUTE ACCENTCombining Diacritical Marks Supplement - Miscellaneous marksU+1DC2 ( ᷂ ) COMBINING SNAKE BELOWU+1DC3 ( ᷃ ) COMBINING SUSPENSION MARKModifier Tone Letters - Corner tone marks for ChineseU+A700 ( ꜀ ) MODIFIER LETTER CHINESE TONE YIN PINGU+A701 ( ꜁ ) MODIFIER LETTER CHINESE TONE YANG PINGU+A702 ( ꜂ ) MODIFIER LETTER CHINESE TONE YIN SHANGU+A703 ( ꜃ ) MODIFIER LETTER CHINESE TONE YANG SHANGU+A704 ( ꜄ ) MODIFIER LETTER CHINESE TONE YIN QUU+A705 ( ꜅ ) MODIFIER LETTER CHINESE TONE YANG QUU+A706 ( ꜆ ) MODIFIER LETTER CHINESE TONE YIN RUU+A707 ( ꜇ ) MODIFIER LETTER CHINESE TONE YANG RU(3) Heuristically-Derived DataThe Unicode character with subheads, block names, or character names in the standard that contain any of the words:
Obsolete|Ancient|Archaic|Medieval|New Testament|UPA|Old|Early
minus the above (and with some small hand-editing for exceptional cases) Errors in these may indicate that we want to change some of the subheaders. Also broadened to blocks if possible, and subtracting data above. [:blk=Ancient_Symbols:] [:blk=Ancient_Greek_Musical_Notation:] [:blk=Cyrillic_Extended_A:] [:blk=Cyrillic_Extended_B:] + Spacing Modifier Letters - UPA modifiersU+02EF ( ˯ ) MODIFIER LETTER LOW DOWN ARROWHEADU+02F0 ( ˰ ) MODIFIER LETTER LOW UP ARROWHEADU+02F1 ( ˱ ) MODIFIER LETTER LOW LEFT ARROWHEADU+02F2 ( ˲ ) MODIFIER LETTER LOW RIGHT ARROWHEADU+02F3 ( ˳ ) MODIFIER LETTER LOW RINGU+02F4 ( ˴ ) MODIFIER LETTER MIDDLE GRAVE ACCENTU+02F5 ( ˵ ) MODIFIER LETTER MIDDLE DOUBLE GRAVE ACCENTU+02F6 ( ˶ ) MODIFIER LETTER MIDDLE DOUBLE ACUTE ACCENTU+02F7 ( ˷ ) MODIFIER LETTER LOW TILDEU+02F8 ( ˸ ) MODIFIER LETTER RAISED COLONU+02F9 ( ˹ ) MODIFIER LETTER BEGIN HIGH TONEU+02FA ( ˺ ) MODIFIER LETTER END HIGH TONEU+02FB ( ˻ ) MODIFIER LETTER BEGIN LOW TONEU+02FC ( ˼ ) MODIFIER LETTER END LOW TONEU+02FD ( ˽ ) MODIFIER LETTER SHELFU+02FE ( ˾ ) MODIFIER LETTER OPEN SHELFU+02FF ( ˿ ) MODIFIER LETTER LOW LEFT ARROWCombining Diacritical Marks - Medieval superscript letter diacriticsU+0363 ( ͣ ) COMBINING LATIN SMALL LETTER AU+0364 ( ͤ ) COMBINING LATIN SMALL LETTER EU+0365 ( ͥ ) COMBINING LATIN SMALL LETTER IU+0366 ( ͦ ) COMBINING LATIN SMALL LETTER OU+0367 ( ͧ ) COMBINING LATIN SMALL LETTER UU+0368 ( ͨ ) COMBINING LATIN SMALL LETTER CU+0369 ( ͩ ) COMBINING LATIN SMALL LETTER DU+036A ( ͪ ) COMBINING LATIN SMALL LETTER HU+036B ( ͫ ) COMBINING LATIN SMALL LETTER MU+036C ( ͬ ) COMBINING LATIN SMALL LETTER RU+036D ( ͭ ) COMBINING LATIN SMALL LETTER T
U+036F ( ͯ ) COMBINING LATIN SMALL LETTER XGreek And Coptic - Archaic lettersU+0370 ( Ͱ ) GREEK CAPITAL LETTER HETAU+0371 ( ͱ ) GREEK SMALL LETTER HETAU+0372 ( Ͳ ) GREEK CAPITAL LETTER ARCHAIC SAMPIU+0373 ( ͳ ) GREEK SMALL LETTER ARCHAIC SAMPIU+0376 ( Ͷ ) GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMAU+0377 ( ͷ ) GREEK SMALL LETTER PAMPHYLIAN DIGAMMAArabic - Additions for early Persian and AzerbaijaniU+063B ( ػ ) ARABIC LETTER KEHEH WITH TWO DOTS ABOVEU+063C ( ؼ ) ARABIC LETTER KEHEH WITH THREE DOTS BELOWU+063E ( ؾ ) ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVEU+063F ( ؿ ) ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVEArabic Supplement - Additions for early PersianU+077E ( ݾ ) ARABIC LETTER SEEN WITH INVERTED VU+077F ( ݿ ) ARABIC LETTER KAF WITH TWO DOTS ABOVE
NKo - Archaic lettersU+07E8 ( ߨ ) NKO LETTER JONA JAU+07E9 ( ߩ ) NKO LETTER JONA CHAU+07EA ( ߪ ) NKO LETTER JONA RACombining Diacritical Marks Supplement - Medievalist additionsU+1DCE ( ᷎ ) COMBINING OGONEK ABOVEU+1DCF ( ᷏ ) COMBINING ZIGZAG BELOWU+1DD0 ( ᷐ ) COMBINING IS BELOWU+1DD1 ( ᷑ ) COMBINING UR ABOVEU+1DD2 ( ᷒ ) COMBINING US ABOVECombining Diacritical Marks Supplement - Medieval superscript letter diacriticsU+1DD3 ( ᷓ ) COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVEU+1DD4 ( ᷔ ) COMBINING LATIN SMALL LETTER AEU+1DD5 ( ᷕ ) COMBINING LATIN SMALL LETTER AOU+1DD6 ( ᷖ ) COMBINING LATIN SMALL LETTER AVU+1DD7 ( ᷗ ) COMBINING LATIN SMALL LETTER C CEDILLAU+1DD8 ( ᷘ ) COMBINING LATIN SMALL LETTER INSULAR DU+1DD9 ( ᷙ ) COMBINING LATIN SMALL LETTER ETHU+1DDA ( ᷚ ) COMBINING LATIN SMALL LETTER GU+1DDB ( ᷛ ) COMBINING LATIN LETTER SMALL CAPITAL GU+1DDC ( ᷜ ) COMBINING LATIN SMALL LETTER KU+1DDD ( ᷝ ) COMBINING LATIN SMALL LETTER LU+1DDE ( ᷞ ) COMBINING LATIN LETTER SMALL CAPITAL LU+1DDF ( ᷟ ) COMBINING LATIN LETTER SMALL CAPITAL MU+1DE0 ( ᷠ ) COMBINING LATIN SMALL LETTER NU+1DE1 ( ᷡ ) COMBINING LATIN LETTER SMALL CAPITAL NU+1DE2 ( ᷢ ) COMBINING LATIN LETTER SMALL CAPITAL RU+1DE3 ( ᷣ ) COMBINING LATIN SMALL LETTER R ROTUNDAU+1DE4 ( ᷤ ) COMBINING LATIN SMALL LETTER SU+1DE5 ( ᷥ ) COMBINING LATIN SMALL LETTER LONG SU+1DE6 ( ᷦ ) COMBINING LATIN SMALL LETTER ZCombining Diacritical Marks Supplement - Additional marks for UPAU+1DFE ( ᷾ ) COMBINING LEFT ARROWHEAD ABOVEU+1DFF ( ᷿ ) COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOWLatin Extended Additional - Medievalist additionsU+1E9C ( ẜ ) LATIN SMALL LETTER LONG S WITH DIAGONAL STROKEU+1E9D ( ẝ ) LATIN SMALL LETTER LONG S WITH HIGH STROKELatin Extended Additional - Medievalist additionU+1E9F ( ẟ ) LATIN SMALL LETTER DELTALatin Extended Additional - Medievalist additionsU+1EFA ( Ỻ ) LATIN CAPITAL LETTER MIDDLE-WELSH LLU+1EFB ( ỻ ) LATIN SMALL LETTER MIDDLE-WELSH LLU+1EFC ( Ỽ ) LATIN CAPITAL LETTER MIDDLE-WELSH VU+1EFD ( ỽ ) LATIN SMALL LETTER MIDDLE-WELSH VU+1EFE ( Ỿ ) LATIN CAPITAL LETTER Y WITH LOOPU+1EFF ( ỿ ) LATIN SMALL LETTER Y WITH LOOPGeneral Punctuation - Archaic punctuationU+2056 ( ⁖ ) THREE DOT PUNCTUATIONU+2058 ( ⁘ ) FOUR DOT PUNCTUATIONU+2059 ( ⁙ ) FIVE DOT PUNCTUATIONU+205A ( ⁚ ) TWO DOT PUNCTUATIONU+205B ( ⁛ ) FOUR DOT MARKU+205C ( ⁜ ) DOTTED CROSSU+205D ( ⁝ ) TRICOLONU+205E ( ⁞ ) VERTICAL FOUR DOTSNumber Forms - Archaic Roman numeralsU+2180 ( ↀ ) ROMAN NUMERAL ONE THOUSAND C DU+2181 ( ↁ ) ROMAN NUMERAL FIVE THOUSANDU+2182 ( ↂ ) ROMAN NUMERAL TEN THOUSANDU+2183 ( Ↄ ) ROMAN NUMERAL REVERSED ONE HUNDREDU+2185 ( ↅ ) ROMAN NUMERAL SIX LATE FORMU+2186 ( ↆ ) ROMAN NUMERAL FIFTY EARLY FORMU+2187 ( ↇ ) ROMAN NUMERAL FIFTY THOUSANDU+2188 ( ↈ ) ROMAN NUMERAL ONE HUNDRED THOUSANDLatin Extended C - Additions for UPAU+2C77 ( ⱷ ) LATIN SMALL LETTER TAILLESS PHIU+2C78 ( ⱸ ) LATIN SMALL LETTER E WITH NOTCHU+2C79 ( ⱹ ) LATIN SMALL LETTER TURNED R WITH TAILU+2C7A ( ⱺ ) LATIN SMALL LETTER O WITH LOW RING INSIDEU+2C7B ( ⱻ ) LATIN LETTER SMALL CAPITAL TURNED EU+2C7C ( ⱼ ) LATIN SUBSCRIPT SMALL LETTER JU+2C7D ( ⱽ ) MODIFIER LETTER CAPITAL VSupplemental Punctuation - New Testament editorial symbolsU+2E00 ( ⸀ ) RIGHT ANGLE SUBSTITUTION MARKERU+2E01 ( ⸁ ) RIGHT ANGLE DOTTED SUBSTITUTION MARKERU+2E02 ( ⸂ ) LEFT SUBSTITUTION BRACKETU+2E03 ( ⸃ ) RIGHT SUBSTITUTION BRACKETU+2E04 ( ⸄ ) LEFT DOTTED SUBSTITUTION BRACKETU+2E05 ( ⸅ ) RIGHT DOTTED SUBSTITUTION BRACKETU+2E06 ( ⸆ ) RAISED INTERPOLATION MARKERU+2E07 ( ⸇ ) RAISED DOTTED INTERPOLATION MARKERU+2E08 ( ⸈ ) DOTTED TRANSPOSITION MARKERU+2E09 ( ⸉ ) LEFT TRANSPOSITION BRACKETU+2E0A ( ⸊ ) RIGHT TRANSPOSITION BRACKETU+2E0B ( ⸋ ) RAISED SQUAREU+2E0C ( ⸌ ) LEFT RAISED OMISSION BRACKETU+2E0D ( ⸍ ) RIGHT RAISED OMISSION BRACKETSupplemental Punctuation - Ancient Greek textual symbolsU+2E0E ( ⸎ ) EDITORIAL CORONISU+2E0F ( ⸏ ) PARAGRAPHOSU+2E10 ( ⸐ ) FORKED PARAGRAPHOSU+2E11 ( ⸑ ) REVERSED FORKED PARAGRAPHOSU+2E12 ( ⸒ ) HYPODIASTOLEU+2E13 ( ⸓ ) DOTTED OBELOSU+2E14 ( ⸔ ) DOWNWARDS ANCORAU+2E15 ( ⸕ ) UPWARDS ANCORAU+2E16 ( ⸖ ) DOTTED RIGHT-POINTING ANGLESupplemental Punctuation - Ancient Near-Eastern linguistic symbolU+2E17 ( ⸗ ) DOUBLE OBLIQUE HYPHENSupplemental Punctuation - Medievalist punctuationU+2E2A ( ⸪ ) TWO DOTS OVER ONE DOT PUNCTUATIONU+2E2B ( ⸫ ) ONE DOT OVER TWO DOTS PUNCTUATIONU+2E2C ( ⸬ ) SQUARED FOUR DOT PUNCTUATIONU+2E2D ( ⸭ ) FIVE DOT MARKU+2E2E ( ⸮ ) REVERSED QUESTION MARKU+2E2F ( ⸯ ) VERTICAL TILDEU+2E30 ( ⸰ ) RING POINTLatin Extended D - Additions for UPAU+A720 ( ꜠ ) MODIFIER LETTER STRESS AND HIGH TONEU+A721 ( ꜡ ) MODIFIER LETTER STRESS AND LOW TONELatin Extended D - Medievalist additionsU+A730 ( ꜰ ) LATIN LETTER SMALL CAPITAL FU+A731 ( ꜱ ) LATIN LETTER SMALL CAPITAL SU+A732 ( Ꜳ ) LATIN CAPITAL LETTER AAU+A733 ( ꜳ ) LATIN SMALL LETTER AAU+A734 ( Ꜵ ) LATIN CAPITAL LETTER AOU+A735 ( ꜵ ) LATIN SMALL LETTER AOU+A736 ( Ꜷ ) LATIN CAPITAL LETTER AUU+A737 ( ꜷ ) LATIN SMALL LETTER AUU+A738 ( Ꜹ ) LATIN CAPITAL LETTER AVU+A739 ( ꜹ ) LATIN SMALL LETTER AVU+A73A ( Ꜻ ) LATIN CAPITAL LETTER AV WITH HORIZONTAL BARU+A73B ( ꜻ ) LATIN SMALL LETTER AV WITH HORIZONTAL BARU+A73C ( Ꜽ ) LATIN CAPITAL LETTER AYU+A73D ( ꜽ ) LATIN SMALL LETTER AYU+A73E ( Ꜿ ) LATIN CAPITAL LETTER REVERSED C WITH DOTU+A73F ( ꜿ ) LATIN SMALL LETTER REVERSED C WITH DOTU+A740 ( Ꝁ ) LATIN CAPITAL LETTER K WITH STROKEU+A741 ( ꝁ ) LATIN SMALL LETTER K WITH STROKEU+A742 ( Ꝃ ) LATIN CAPITAL LETTER K WITH DIAGONAL STROKEU+A743 ( ꝃ ) LATIN SMALL LETTER K WITH DIAGONAL STROKEU+A744 ( Ꝅ ) LATIN CAPITAL LETTER K WITH STROKE AND DIAGONAL STROKEU+A745 ( ꝅ ) LATIN SMALL LETTER K WITH STROKE AND DIAGONAL STROKEU+A746 ( Ꝇ ) LATIN CAPITAL LETTER BROKEN LU+A747 ( ꝇ ) LATIN SMALL LETTER BROKEN LU+A748 ( Ꝉ ) LATIN CAPITAL LETTER L WITH HIGH STROKEU+A749 ( ꝉ ) LATIN SMALL LETTER L WITH HIGH STROKEU+A74A ( Ꝋ ) LATIN CAPITAL LETTER O WITH LONG STROKE OVERLAYU+A74B ( ꝋ ) LATIN SMALL LETTER O WITH LONG STROKE OVERLAYU+A74C ( Ꝍ ) LATIN CAPITAL LETTER O WITH LOOPU+A74D ( ꝍ ) LATIN SMALL LETTER O WITH LOOPU+A74E ( Ꝏ ) LATIN CAPITAL LETTER OOU+A74F ( ꝏ ) LATIN SMALL LETTER OOU+A750 ( Ꝑ ) LATIN CAPITAL LETTER P WITH STROKE THROUGH DESCENDERU+A751 ( ꝑ ) LATIN SMALL LETTER P WITH STROKE THROUGH DESCENDERU+A752 ( Ꝓ ) LATIN CAPITAL LETTER P WITH FLOURISHU+A753 ( ꝓ ) LATIN SMALL LETTER P WITH FLOURISHU+A754 ( Ꝕ ) LATIN CAPITAL LETTER P WITH SQUIRREL TAILU+A755 ( ꝕ ) LATIN SMALL LETTER P WITH SQUIRREL TAILU+A756 ( Ꝗ ) LATIN CAPITAL LETTER Q WITH STROKE THROUGH DESCENDERU+A757 ( ꝗ ) LATIN SMALL LETTER Q WITH STROKE THROUGH DESCENDERU+A758 ( Ꝙ ) LATIN CAPITAL LETTER Q WITH DIAGONAL STROKEU+A759 ( ꝙ ) LATIN SMALL LETTER Q WITH DIAGONAL STROKEU+A75A ( Ꝛ ) LATIN CAPITAL LETTER R ROTUNDAU+A75B ( ꝛ ) LATIN SMALL LETTER R ROTUNDAU+A75C ( Ꝝ ) LATIN CAPITAL LETTER RUM ROTUNDAU+A75D ( ꝝ ) LATIN SMALL LETTER RUM ROTUNDAU+A75E ( Ꝟ ) LATIN CAPITAL LETTER V WITH DIAGONAL STROKEU+A75F ( ꝟ ) LATIN SMALL LETTER V WITH DIAGONAL STROKEU+A760 ( Ꝡ ) LATIN CAPITAL LETTER VYU+A761 ( ꝡ ) LATIN SMALL LETTER VYU+A762 ( Ꝣ ) LATIN CAPITAL LETTER VISIGOTHIC ZU+A763 ( ꝣ ) LATIN SMALL LETTER VISIGOTHIC ZU+A764 ( Ꝥ ) LATIN CAPITAL LETTER THORN WITH STROKEU+A765 ( ꝥ ) LATIN SMALL LETTER THORN WITH STROKEU+A766 ( Ꝧ ) LATIN CAPITAL LETTER THORN WITH STROKE THROUGH DESCENDERU+A767 ( ꝧ ) LATIN SMALL LETTER THORN WITH STROKE THROUGH DESCENDERU+A768 ( Ꝩ ) LATIN CAPITAL LETTER VENDU+A769 ( ꝩ ) LATIN SMALL LETTER VENDU+A76A ( Ꝫ ) LATIN CAPITAL LETTER ETU+A76B ( ꝫ ) LATIN SMALL LETTER ETU+A76C ( Ꝭ ) LATIN CAPITAL LETTER ISU+A76D ( ꝭ ) LATIN SMALL LETTER ISU+A76E ( Ꝯ ) LATIN CAPITAL LETTER CONU+A76F ( ꝯ ) LATIN SMALL LETTER CONU+A770 ( ꝰ ) MODIFIER LETTER USU+A771 ( ꝱ ) LATIN SMALL LETTER DUMU+A772 ( ꝲ ) LATIN SMALL LETTER LUMU+A773 ( ꝳ ) LATIN SMALL LETTER MUMU+A774 ( ꝴ ) LATIN SMALL LETTER NUMU+A775 ( ꝵ ) LATIN SMALL LETTER RUMU+A776 ( ꝶ ) LATIN LETTER SMALL CAPITAL RUMU+A777 ( ꝷ ) LATIN SMALL LETTER TUMU+A778 ( ꝸ ) LATIN SMALL LETTER UMLatin Extended D - Ancient Roman epigraphic lettersU+A7FB ( ꟻ ) LATIN EPIGRAPHIC LETTER REVERSED FU+A7FC ( ꟼ ) LATIN EPIGRAPHIC LETTER REVERSED PU+A7FD ( ꟽ ) LATIN EPIGRAPHIC LETTER INVERTED MU+A7FE ( ꟾ ) LATIN EPIGRAPHIC LETTER I LONGAU+A7FF ( ꟿ ) LATIN EPIGRAPHIC LETTER ARCHAIC M
(4) Feedback AdditionsBased on suggestions in feedback. Also broadened to blocks if possible, and subtracting data above.[:block=Georgian Supplement:] + Greek And Coptic - Lowercase of editorial symbolsU+037B ( ͻ ) GREEK SMALL REVERSED LUNATE SIGMA SYMBOLU+037C ( ͼ ) GREEK SMALL DOTTED LUNATE SIGMA SYMBOLU+037D ( ͽ ) GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOLGreek And Coptic - Variant letterformsU+03CF ( Ϗ ) GREEK CAPITAL KAI SYMBOLGreek And Coptic - Editorial symbolsU+03FD ( Ͻ ) GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOLU+03FE ( Ͼ ) GREEK CAPITAL DOTTED LUNATE SIGMA SYMBOLU+03FF ( Ͽ ) GREEK CAPITAL REVERSED DOTTED LUNATE SIGMA SYMBOLLatin Extended B - Non-European and historic LatinU+0185 ( ƅ ) LATIN SMALL LETTER TONE SIXU+01A8 ( ƨ ) LATIN SMALL LETTER TONE TWOU+01BD ( ƽ ) LATIN SMALL LETTER TONE FIVEHebrew - Cantillation marksU+0591 ( ֑ ) HEBREW ACCENT ETNAHTAU+0592 ( ֒ ) HEBREW ACCENT SEGOLU+0593 ( ֓ ) HEBREW ACCENT SHALSHELETU+0594 ( ֔ ) HEBREW ACCENT ZAQEF QATANU+0595 ( ֕ ) HEBREW ACCENT ZAQEF GADOLU+0596 ( ֖ ) HEBREW ACCENT TIPEHAU+0597 ( ֗ ) HEBREW ACCENT REVIAU+0598 ( ֘ ) HEBREW ACCENT ZARQAU+0599 ( ֙ ) HEBREW ACCENT PASHTAU+059A ( ֚ ) HEBREW ACCENT YETIVU+059B ( ֛ ) HEBREW ACCENT TEVIRU+059C ( ֜ ) HEBREW ACCENT GERESHU+059D ( ֝ ) HEBREW ACCENT GERESH MUQDAMU+059E ( ֞ ) HEBREW ACCENT GERSHAYIMU+059F ( ֟ ) HEBREW ACCENT QARNEY PARAU+05A0 ( ֠ ) HEBREW ACCENT TELISHA GEDOLAU+05A1 ( ֡ ) HEBREW ACCENT PAZERU+05A2 ( ֢ ) HEBREW ACCENT ATNAH HAFUKHU+05A3 ( ֣ ) HEBREW ACCENT MUNAHU+05A4 ( ֤ ) HEBREW ACCENT MAHAPAKHU+05A5 ( ֥ ) HEBREW ACCENT MERKHAU+05A6 ( ֦ ) HEBREW ACCENT MERKHA KEFULAU+05A7 ( ֧ ) HEBREW ACCENT DARGAU+05A8 ( ֨ ) HEBREW ACCENT QADMAU+05A9 ( ֩ ) HEBREW ACCENT TELISHA QETANAU+05AA ( ֪ ) HEBREW ACCENT YERAH BEN YOMOU+05AB ( ֫ ) HEBREW ACCENT OLEU+05AC ( ֬ ) HEBREW ACCENT ILUYU+05AD ( ֭ ) HEBREW ACCENT DEHIU+05AE ( ֮ ) HEBREW ACCENT ZINORU+05AF ( ֯ ) HEBREW MARK MASORA CIRCLEHebrew - Puncta extraordinariaU+05C4 ( ׄ ) HEBREW MARK UPPER DOTU+05C5 ( ׅ ) HEBREW MARK LOWER DOTArabic - Koranic annotation signsU+0615 ( ؕ ) ARABIC SMALL HIGH TAHU+0616 ( ؖ ) ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEHU+0617 ( ؗ ) ARABIC SMALL HIGH ZAINU+0618 ( ؘ ) ARABIC SMALL FATHAU+0619 ( ؙ ) ARABIC SMALL DAMMAU+061A ( ؚ ) ARABIC SMALL KASRAU+06D6 ( ۖ ) ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURAU+06D7 ( ۗ ) ARABIC SMALL HIGH LIGATURE QAF WITH LAM WITH ALEF MAKSURAU+06D8 ( ۘ ) ARABIC SMALL HIGH MEEM INITIAL FORMU+06D9 ( ۙ ) ARABIC SMALL HIGH LAM ALEFU+06DA ( ۚ ) ARABIC SMALL HIGH JEEMU+06DB ( ۛ ) ARABIC SMALL HIGH THREE DOTSU+06DC ( ۜ ) ARABIC SMALL HIGH SEENU+06DD ( ) ARABIC END OF AYAHU+06DE ( ۞ ) ARABIC START OF RUB EL HIZBU+06DF ( ۟ ) ARABIC SMALL HIGH ROUNDED ZEROU+06E0 ( ۠ ) ARABIC SMALL HIGH UPRIGHT RECTANGULAR ZEROU+06E1 ( ۡ ) ARABIC SMALL HIGH DOTLESS HEAD OF KHAHU+06E2 ( ۢ ) ARABIC SMALL HIGH MEEM ISOLATED FORMU+06E3 ( ۣ ) ARABIC SMALL LOW SEENU+06E4 ( ۤ ) ARABIC SMALL HIGH MADDAU+06E5 ( ۥ ) ARABIC SMALL WAWU+06E6 ( ۦ ) ARABIC SMALL YEHU+06E7 ( ۧ ) ARABIC SMALL HIGH YEHU+06E8 ( ۨ ) ARABIC SMALL HIGH NOONU+06E9 ( ۩ ) ARABIC PLACE OF SAJDAHU+06EA ( ۪ ) ARABIC EMPTY CENTRE LOW STOPU+06EB ( ۫ ) ARABIC EMPTY CENTRE HIGH STOPU+06EC ( ۬ ) ARABIC ROUNDED HIGH STOP WITH FILLED CENTREU+06ED ( ۭ ) ARABIC SMALL LOW MEEMGeorgian - Capital letters (Khutsuri)U+10A0 ( Ⴀ ) GEORGIAN CAPITAL LETTER ANU+10A1 ( Ⴁ ) GEORGIAN CAPITAL LETTER BANU+10A2 ( Ⴂ ) GEORGIAN CAPITAL LETTER GANU+10A3 ( Ⴃ ) GEORGIAN CAPITAL LETTER DONU+10A4 ( Ⴄ ) GEORGIAN CAPITAL LETTER ENU+10A5 ( Ⴅ ) GEORGIAN CAPITAL LETTER VINU+10A6 ( Ⴆ ) GEORGIAN CAPITAL LETTER ZENU+10A7 ( Ⴇ ) GEORGIAN CAPITAL LETTER TANU+10A8 ( Ⴈ ) GEORGIAN CAPITAL LETTER INU+10A9 ( Ⴉ ) GEORGIAN CAPITAL LETTER KANU+10AA ( Ⴊ ) GEORGIAN CAPITAL LETTER LASU+10AB ( Ⴋ ) GEORGIAN CAPITAL LETTER MANU+10AC ( Ⴌ ) GEORGIAN CAPITAL LETTER NARU+10AD ( Ⴍ ) GEORGIAN CAPITAL LETTER ONU+10AE ( Ⴎ ) GEORGIAN CAPITAL LETTER PARU+10AF ( Ⴏ ) GEORGIAN CAPITAL LETTER ZHARU+10B0 ( Ⴐ ) GEORGIAN CAPITAL LETTER RAEU+10B1 ( Ⴑ ) GEORGIAN CAPITAL LETTER SANU+10B2 ( Ⴒ ) GEORGIAN CAPITAL LETTER TARU+10B3 ( Ⴓ ) GEORGIAN CAPITAL LETTER UNU+10B4 ( Ⴔ ) GEORGIAN CAPITAL LETTER PHARU+10B5 ( Ⴕ ) GEORGIAN CAPITAL LETTER KHARU+10B6 ( Ⴖ ) GEORGIAN CAPITAL LETTER GHANU+10B7 ( Ⴗ ) GEORGIAN CAPITAL LETTER QARU+10B8 ( Ⴘ ) GEORGIAN CAPITAL LETTER SHINU+10B9 ( Ⴙ ) GEORGIAN CAPITAL LETTER CHINU+10BA ( Ⴚ ) GEORGIAN CAPITAL LETTER CANU+10BB ( Ⴛ ) GEORGIAN CAPITAL LETTER JILU+10BC ( Ⴜ ) GEORGIAN CAPITAL LETTER CILU+10BD ( Ⴝ ) GEORGIAN CAPITAL LETTER CHARU+10BE ( Ⴞ ) GEORGIAN CAPITAL LETTER XANU+10BF ( Ⴟ ) GEORGIAN CAPITAL LETTER JHANU+10C0 ( Ⴠ ) GEORGIAN CAPITAL LETTER HAEU+10C1 ( Ⴡ ) GEORGIAN CAPITAL LETTER HEU+10C2 ( Ⴢ ) GEORGIAN CAPITAL LETTER HIEU+10C3 ( Ⴣ ) GEORGIAN CAPITAL LETTER WEU+10C4 ( Ⴤ ) GEORGIAN CAPITAL LETTER HARU+10C5 ( Ⴥ ) GEORGIAN CAPITAL LETTER HOEGeorgian - PunctuationU+10FB ( ჻ ) GEORGIAN PARAGRAPH SEPARATORPlus case closures for the above: U+0184 ( Ƅ ) LATIN CAPITAL LETTER TONE SIXU+01A7 ( Ƨ ) LATIN CAPITAL LETTER TONE TWOU+01B8 ( Ƹ ) LATIN CAPITAL LETTER EZH REVERSEDU+01BC ( Ƽ ) LATIN CAPITAL LETTER TONE FIVEU+01F7 ( Ƿ ) LATIN CAPITAL LETTER WYNNU+03F2 ( ϲ ) GREEK LUNATE SIGMA SYMBOLU+03F4 ( ϴ ) GREEK CAPITAL THETA SYMBOLU+2184 ( ↄ ) LATIN SMALL LETTER REVERSED CNote that the last is not the same as: U+0254 ( ɔ ) LATIN SMALL LETTER OPEN OPossibly also: Arabic - Extended Arabic lettersU+0682 ( ڂ ) ARABIC LETTER HAH WITH TWO DOTS VERTICAL ABOVEU+0690 ( ڐ ) ARABIC LETTER DAL WITH FOUR DOTS ABOVEU+069B ( ڛ ) ARABIC LETTER SEEN WITH THREE DOTS BELOWU+069F ( ڟ ) ARABIC LETTER TAH WITH THREE DOTS ABOVEU+06A0 ( ڠ ) ARABIC LETTER AIN WITH THREE DOTS ABOVEU+06AC ( ڬ ) ARABIC LETTER KAF WITH DOT ABOVEU+06B2 ( ڲ ) ARABIC LETTER GAF WITH TWO DOTS BELOWU+06B4 ( ڴ ) ARABIC LETTER GAF WITH THREE DOTS ABOVEU+06B8 ( ڸ ) ARABIC LETTER LAM WITH THREE DOTS BELOWU+06B9 ( ڹ ) ARABIC LETTER NOON WITH DOT BELOWSide NoteThe IPA characters [ɩɷɼɿʅ-ʇʓʖʗʚʞʠʣʥʦʨ-ʯ] are not official IPA, but are not obsolete or archaic; they are still used in some traditions. |
Unicode & Int’l SW > UTC >