Mark Davis, 2009-01-30
Live doc: http://www.macchiato.com/unicode/historic-unicode-characters
The following is a draft list of historic characters, meaning characters that are no longer customarily used in current modern languages in typical publications (corner newspapers, magazines, etc.). For example, you wouldn't expect to see words written in Cuneiform in the NY Times. They may of course still be used in in technical journals, especially those dealing with archaic languages, or have occasional decorative use, or be in academic documents, or be quoted in modern works, or be used in liturgical works.
For example, in modern Tibetan usage,the Phags-pa script is restricted to calligraphic, decorative and other special purposes, and is not used for writing extensive texts (as it used to be in the 13th and 14th
centuries). Thus Phags-pa is an archaic script (in the same way that in the West, blackletter is an archaic script, even though it is still used for calligraphic and decorative purposes).
In many cases in the Unicode Standard, we mark characters (such as with subheadings or block names) as being not in modern use, using terms like Obsolete, Old, Ancient, Archaic, and so on. But there are characters that are not clearly marked as such. In UAX31 we include information about blocks or scripts that are unsuitable for identifiers - this is often, but not always, based on whether they are archaic scripts or not. In UTS39 we have a tag for 'archaic' - and I have an action to update that UTS.
This does not yet include historic CJK characters.
It is useful for people to know which characters are in customary modern use, and which are not, so I set out to try to get a clearer picture of the situation. This is not intended as a proposal for a standard Unicode property; it is simply some information about Unicode characters that may be useful to some people in some contexts. For example, someone building a character picker could put historic characters into an 'extended' bucket for more advanced users. (See, for example, the mockup at http://macchiato.com/picker/MyApplication.html.)
For right now, this is targeted at an update of UTS39, plus possible improvements to NamesList annotations.
Here is the data. The lines marked with yellow are those that should be removed, based on feedback on the email lists. Additions from that feedback are listed at the end.
Current count: 4,441 Code Points
= 3,867 Code Points (if unassigned characters are removed from the blocks).
Both include the yellow items that should be removed, below, so the end figures would be somewhat smaller.
This is broadened from scripts to blocks where possible.
[:blk=Ancient_Greek_Musical_Notation:]
[:blk=Buginese:]
[:blk=Buhid:]
[:blk=Carian:]
[:blk=Coptic:]
[:blk=Cuneiform:]
[:blk=Cuneiform_Numbers_And_Punctuation:]
[:blk=Cypriot_Syllabary:]
[:blk=Deseret:]
[:blk=Glagolitic:]
[:blk=Gothic:]
[:blk=Hanunoo:]
[:blk=Kharoshthi:]
[:blk=Linear_B_Ideograms:]
[:blk=Linear_B_Syllabary:]
[:blk=Lycian:]
[:blk=Lydian:]
[:blk=Ogham:]
[:blk=Old_Italic:]
[:blk=Old_Persian:]
[:blk=Osmanya:]
[:blk=Phags_Pa:]
[:blk=Phaistos_Disc:]
[:blk=Phoenician:]
[:blk=Rejang:]
[:blk=Runic:]
[:blk=Shavian:]
[:blk=Sundanese:]
[:blk=Syloti_Nagri:]
[:blk=Syriac:]
[:blk=Tagalog:]
[:blk=Tagbanwa:]
[:blk=Ugaritic:]
[:sc=Copt:]
// Note that there is a revival effort for Coptic, that may move it out of this section over time.
Note that the Korean Jamo are listed because they would normally not appear in NFC. Also broadened to blocks if possible, and subtracting data above.
[:blk=Balinese:] // this needs to be removed
[:blk=Ancient_Greek_Numbers:]
[:Block=Hangul_Jamo:]
[:Block=Hangul_Compatibility_Jamo:]
+
U+018D ( ƍ ) LATIN SMALL LETTER TURNED DELTA
U+01AA ( ƪ ) LATIN LETTER REVERSED ESH LOOP
U+01AB ( ƫ ) LATIN SMALL LETTER T WITH PALATAL HOOK
U+01B9 ( ƹ ) LATIN SMALL LETTER EZH REVERSED
U+01BA ( ƺ ) LATIN SMALL LETTER EZH WITH TAIL
U+01BB ( ƻ ) LATIN LETTER TWO WITH STROKE
U+01BE ( ƾ ) LATIN LETTER INVERTED GLOTTAL STOP WITH STROKE
U+01BF ( ƿ ) LATIN LETTER WYNN
U+021C ( Ȝ ) LATIN CAPITAL LETTER YOGH
U+021D ( ȝ ) LATIN SMALL LETTER YOGH
U+025F ( ɟ ) LATIN SMALL LETTER DOTLESS J WITH STROKE
U+0277 ( ɷ ) LATIN SMALL LETTER CLOSED OMEGA
U+027C ( ɼ ) LATIN SMALL LETTER R WITH LONG LEG
U+029E ( ʞ ) LATIN SMALL LETTER TURNED K
U+0343 ( ̓ ) COMBINING GREEK KORONIS
U+03D0 ( ϐ ) GREEK BETA SYMBOL
U+03D1 ( ϑ ) GREEK THETA SYMBOL
U+03D5 ( ϕ ) GREEK PHI SYMBOL
U+03D6 ( ϖ ) GREEK PI SYMBOL
U+03D7 ( ϗ ) GREEK KAI SYMBOL
U+03D8 ( Ϙ ) GREEK LETTER ARCHAIC KOPPA
U+03D9 ( ϙ ) GREEK SMALL LETTER ARCHAIC KOPPA
U+03DA ( Ϛ ) GREEK LETTER STIGMA
U+03DB ( ϛ ) GREEK SMALL LETTER STIGMA
U+03DC ( Ϝ ) GREEK LETTER DIGAMMA
U+03DD ( ϝ ) GREEK SMALL LETTER DIGAMMA
U+03DE ( Ϟ ) GREEK LETTER KOPPA
U+03DF ( ϟ ) GREEK SMALL LETTER KOPPA
U+03E0 ( Ϡ ) GREEK LETTER SAMPI
U+03E1 ( ϡ ) GREEK SMALL LETTER SAMPI
U+03F7 ( Ϸ ) GREEK CAPITAL LETTER SHO
U+03F8 ( ϸ ) GREEK SMALL LETTER SHO
U+03F9 ( Ϲ ) GREEK CAPITAL LUNATE SIGMA SYMBOL
U+03FA ( Ϻ ) GREEK CAPITAL LETTER SAN
U+03FB ( ϻ ) GREEK SMALL LETTER SAN
U+0483 ( ҃ ) COMBINING CYRILLIC TITLO
U+0484 ( ҄ ) COMBINING CYRILLIC PALATALIZATION
U+0485 ( ҅ ) COMBINING CYRILLIC DASIA PNEUMATA
U+0486 ( ҆ ) COMBINING CYRILLIC PSILI PNEUMATA
U+05A2 ( ֢ ) HEBREW ACCENT ATNAH HAFUKH
U+05C5 ( ׅ ) HEBREW MARK LOWER DOT
U+05C6 ( ׆ ) HEBREW PUNCTUATION NUN HAFUKHA
U+05C7 ( ׇ ) HEBREW POINT QAMATS QATAN
U+066E ( ٮ ) ARABIC LETTER DOTLESS BEH
U+066F ( ٯ ) ARABIC LETTER DOTLESS QAF
U+068E ( ڎ ) ARABIC LETTER DUL
U+0CDE ( ೞ ) KANNADA LETTER FA
U+10F1 ( ჱ ) GEORGIAN LETTER HE
U+10F2 ( ჲ ) GEORGIAN LETTER HIE
U+10F3 ( ჳ ) GEORGIAN LETTER WE
U+10F4 ( ჴ ) GEORGIAN LETTER HAR
U+10F5 ( ჵ ) GEORGIAN LETTER HOE
U+10F6 ( ჶ ) GEORGIAN LETTER FI
U+17A8 ( ឨ ) KHMER INDEPENDENT VOWEL QUK
U+17D1 ( ៑ ) KHMER SIGN VIRIAM
U+17DD ( ៝ ) KHMER SIGN ATTHACAN
U+1DC0 ( ᷀ ) COMBINING DOTTED GRAVE ACCENT
U+1DC1 ( ᷁ ) COMBINING DOTTED ACUTE ACCENT
U+1DC2 ( ᷂ ) COMBINING SNAKE BELOW
U+1DC3 ( ᷃ ) COMBINING SUSPENSION MARK
U+A700 ( ꜀ ) MODIFIER LETTER CHINESE TONE YIN PING
U+A701 ( ꜁ ) MODIFIER LETTER CHINESE TONE YANG PING
U+A702 ( ꜂ ) MODIFIER LETTER CHINESE TONE YIN SHANG
U+A703 ( ꜃ ) MODIFIER LETTER CHINESE TONE YANG SHANG
U+A704 ( ꜄ ) MODIFIER LETTER CHINESE TONE YIN QU
U+A705 ( ꜅ ) MODIFIER LETTER CHINESE TONE YANG QU
U+A706 ( ꜆ ) MODIFIER LETTER CHINESE TONE YIN RU
U+A707 ( ꜇ ) MODIFIER LETTER CHINESE TONE YANG RU
The Unicode character with subheads, block names, or character names in the standard that contain any of the words:
Obsolete|Ancient|Archaic|Medieval|New Testament|UPA|Old|Early
minus the above (and with some small hand-editing for exceptional cases) Errors in these may indicate that we want to change some of the subheaders. Also broadened to blocks if possible, and subtracting data above.
[:blk=Ancient_Symbols:]
[:blk=Ancient_Greek_Musical_Notation:]
[:blk=Cyrillic_Extended_A:]
[:blk=Cyrillic_Extended_B:]
+
U+02EF ( ˯ ) MODIFIER LETTER LOW DOWN ARROWHEAD
U+02F0 ( ˰ ) MODIFIER LETTER LOW UP ARROWHEAD
U+02F1 ( ˱ ) MODIFIER LETTER LOW LEFT ARROWHEAD
U+02F2 ( ˲ ) MODIFIER LETTER LOW RIGHT ARROWHEAD
U+02F3 ( ˳ ) MODIFIER LETTER LOW RING
U+02F4 ( ˴ ) MODIFIER LETTER MIDDLE GRAVE ACCENT
U+02F5 ( ˵ ) MODIFIER LETTER MIDDLE DOUBLE GRAVE ACCENT
U+02F6 ( ˶ ) MODIFIER LETTER MIDDLE DOUBLE ACUTE ACCENT
U+02F7 ( ˷ ) MODIFIER LETTER LOW TILDE
U+02F8 ( ˸ ) MODIFIER LETTER RAISED COLON
U+02F9 ( ˹ ) MODIFIER LETTER BEGIN HIGH TONE
U+02FA ( ˺ ) MODIFIER LETTER END HIGH TONE
U+02FB ( ˻ ) MODIFIER LETTER BEGIN LOW TONE
U+02FC ( ˼ ) MODIFIER LETTER END LOW TONE
U+02FD ( ˽ ) MODIFIER LETTER SHELF
U+02FE ( ˾ ) MODIFIER LETTER OPEN SHELF
U+02FF ( ˿ ) MODIFIER LETTER LOW LEFT ARROW
U+0363 ( ͣ ) COMBINING LATIN SMALL LETTER A
U+0364 ( ͤ ) COMBINING LATIN SMALL LETTER E
U+0365 ( ͥ ) COMBINING LATIN SMALL LETTER I
U+0366 ( ͦ ) COMBINING LATIN SMALL LETTER O
U+0367 ( ͧ ) COMBINING LATIN SMALL LETTER U
U+0368 ( ͨ ) COMBINING LATIN SMALL LETTER C
U+0369 ( ͩ ) COMBINING LATIN SMALL LETTER D
U+036A ( ͪ ) COMBINING LATIN SMALL LETTER H
U+036B ( ͫ ) COMBINING LATIN SMALL LETTER M
U+036C ( ͬ ) COMBINING LATIN SMALL LETTER R
U+036D ( ͭ ) COMBINING LATIN SMALL LETTER T
U+036E ( ͮ ) COMBINING LATIN SMALL LETTER V
U+036F ( ͯ ) COMBINING LATIN SMALL LETTER X
U+0370 ( Ͱ ) GREEK CAPITAL LETTER HETA
U+0371 ( ͱ ) GREEK SMALL LETTER HETA
U+0372 ( Ͳ ) GREEK CAPITAL LETTER ARCHAIC SAMPI
U+0373 ( ͳ ) GREEK SMALL LETTER ARCHAIC SAMPI
U+0376 ( Ͷ ) GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA
U+0377 ( ͷ ) GREEK SMALL LETTER PAMPHYLIAN DIGAMMA
U+063B ( ػ ) ARABIC LETTER KEHEH WITH TWO DOTS ABOVE
U+063C ( ؼ ) ARABIC LETTER KEHEH WITH THREE DOTS BELOW
U+063E ( ؾ ) ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE
U+063F ( ؿ ) ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE
U+077E ( ݾ ) ARABIC LETTER SEEN WITH INVERTED V
U+077F ( ݿ ) ARABIC LETTER KAF WITH TWO DOTS ABOVE
U+07E8 ( ߨ ) NKO LETTER JONA JA
U+07E9 ( ߩ ) NKO LETTER JONA CHA
U+07EA ( ߪ ) NKO LETTER JONA RA
U+1DCE ( ᷎ ) COMBINING OGONEK ABOVE
U+1DCF ( ᷏ ) COMBINING ZIGZAG BELOW
U+1DD0 ( ᷐ ) COMBINING IS BELOW
U+1DD1 ( ᷑ ) COMBINING UR ABOVE
U+1DD2 ( ᷒ ) COMBINING US ABOVE
U+1DD3 ( ᷓ ) COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE
U+1DD4 ( ᷔ ) COMBINING LATIN SMALL LETTER AE
U+1DD5 ( ᷕ ) COMBINING LATIN SMALL LETTER AO
U+1DD6 ( ᷖ ) COMBINING LATIN SMALL LETTER AV
U+1DD7 ( ᷗ ) COMBINING LATIN SMALL LETTER C CEDILLA
U+1DD8 ( ᷘ ) COMBINING LATIN SMALL LETTER INSULAR D
U+1DD9 ( ᷙ ) COMBINING LATIN SMALL LETTER ETH
U+1DDA ( ᷚ ) COMBINING LATIN SMALL LETTER G
U+1DDB ( ᷛ ) COMBINING LATIN LETTER SMALL CAPITAL G
U+1DDC ( ᷜ ) COMBINING LATIN SMALL LETTER K
U+1DDD ( ᷝ ) COMBINING LATIN SMALL LETTER L
U+1DDE ( ᷞ ) COMBINING LATIN LETTER SMALL CAPITAL L
U+1DDF ( ᷟ ) COMBINING LATIN LETTER SMALL CAPITAL M
U+1DE0 ( ᷠ ) COMBINING LATIN SMALL LETTER N
U+1DE1 ( ᷡ ) COMBINING LATIN LETTER SMALL CAPITAL N
U+1DE2 ( ᷢ ) COMBINING LATIN LETTER SMALL CAPITAL R
U+1DE3 ( ᷣ ) COMBINING LATIN SMALL LETTER R ROTUNDA
U+1DE4 ( ᷤ ) COMBINING LATIN SMALL LETTER S
U+1DE5 ( ᷥ ) COMBINING LATIN SMALL LETTER LONG S
U+1DE6 ( ᷦ ) COMBINING LATIN SMALL LETTER Z
U+1DFE ( ᷾ ) COMBINING LEFT ARROWHEAD ABOVE
U+1DFF ( ᷿ ) COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
U+1E9C ( ẜ ) LATIN SMALL LETTER LONG S WITH DIAGONAL STROKE
U+1E9D ( ẝ ) LATIN SMALL LETTER LONG S WITH HIGH STROKE
U+1E9F ( ẟ ) LATIN SMALL LETTER DELTA
U+1EFA ( Ỻ ) LATIN CAPITAL LETTER MIDDLE-WELSH LL
U+1EFB ( ỻ ) LATIN SMALL LETTER MIDDLE-WELSH LL
U+1EFC ( Ỽ ) LATIN CAPITAL LETTER MIDDLE-WELSH V
U+1EFD ( ỽ ) LATIN SMALL LETTER MIDDLE-WELSH V
U+1EFE ( Ỿ ) LATIN CAPITAL LETTER Y WITH LOOP
U+1EFF ( ỿ ) LATIN SMALL LETTER Y WITH LOOP
U+2056 ( ⁖ ) THREE DOT PUNCTUATION
U+2058 ( ⁘ ) FOUR DOT PUNCTUATION
U+2059 ( ⁙ ) FIVE DOT PUNCTUATION
U+205A ( ⁚ ) TWO DOT PUNCTUATION
U+205B ( ⁛ ) FOUR DOT MARK
U+205C ( ⁜ ) DOTTED CROSS
U+205D ( ⁝ ) TRICOLON
U+205E ( ⁞ ) VERTICAL FOUR DOTS
U+2180 ( ↀ ) ROMAN NUMERAL ONE THOUSAND C D
U+2181 ( ↁ ) ROMAN NUMERAL FIVE THOUSAND
U+2182 ( ↂ ) ROMAN NUMERAL TEN THOUSAND
U+2183 ( Ↄ ) ROMAN NUMERAL REVERSED ONE HUNDRED
U+2185 ( ↅ ) ROMAN NUMERAL SIX LATE FORM
U+2186 ( ↆ ) ROMAN NUMERAL FIFTY EARLY FORM
U+2187 ( ↇ ) ROMAN NUMERAL FIFTY THOUSAND
U+2188 ( ↈ ) ROMAN NUMERAL ONE HUNDRED THOUSAND
U+2C77 ( ⱷ ) LATIN SMALL LETTER TAILLESS PHI
U+2C78 ( ⱸ ) LATIN SMALL LETTER E WITH NOTCH
U+2C79 ( ⱹ ) LATIN SMALL LETTER TURNED R WITH TAIL
U+2C7A ( ⱺ ) LATIN SMALL LETTER O WITH LOW RING INSIDE
U+2C7B ( ⱻ ) LATIN LETTER SMALL CAPITAL TURNED E
U+2C7C ( ⱼ ) LATIN SUBSCRIPT SMALL LETTER J
U+2C7D ( ⱽ ) MODIFIER LETTER CAPITAL V
U+2E00 ( ⸀ ) RIGHT ANGLE SUBSTITUTION MARKER
U+2E01 ( ⸁ ) RIGHT ANGLE DOTTED SUBSTITUTION MARKER
U+2E02 ( ⸂ ) LEFT SUBSTITUTION BRACKET
U+2E03 ( ⸃ ) RIGHT SUBSTITUTION BRACKET
U+2E04 ( ⸄ ) LEFT DOTTED SUBSTITUTION BRACKET
U+2E05 ( ⸅ ) RIGHT DOTTED SUBSTITUTION BRACKET
U+2E06 ( ⸆ ) RAISED INTERPOLATION MARKER
U+2E07 ( ⸇ ) RAISED DOTTED INTERPOLATION MARKER
U+2E08 ( ⸈ ) DOTTED TRANSPOSITION MARKER
U+2E09 ( ⸉ ) LEFT TRANSPOSITION BRACKET
U+2E0A ( ⸊ ) RIGHT TRANSPOSITION BRACKET
U+2E0B ( ⸋ ) RAISED SQUARE
U+2E0C ( ⸌ ) LEFT RAISED OMISSION BRACKET
U+2E0D ( ⸍ ) RIGHT RAISED OMISSION BRACKET
U+2E0E ( ⸎ ) EDITORIAL CORONIS
U+2E0F ( ⸏ ) PARAGRAPHOS
U+2E10 ( ⸐ ) FORKED PARAGRAPHOS
U+2E11 ( ⸑ ) REVERSED FORKED PARAGRAPHOS
U+2E12 ( ⸒ ) HYPODIASTOLE
U+2E13 ( ⸓ ) DOTTED OBELOS
U+2E14 ( ⸔ ) DOWNWARDS ANCORA
U+2E15 ( ⸕ ) UPWARDS ANCORA
U+2E16 ( ⸖ ) DOTTED RIGHT-POINTING ANGLE
U+2E17 ( ⸗ ) DOUBLE OBLIQUE HYPHEN
U+2E2A ( ⸪ ) TWO DOTS OVER ONE DOT PUNCTUATION
U+2E2B ( ⸫ ) ONE DOT OVER TWO DOTS PUNCTUATION
U+2E2C ( ⸬ ) SQUARED FOUR DOT PUNCTUATION
U+2E2D ( ⸭ ) FIVE DOT MARK
U+2E2E ( ⸮ ) REVERSED QUESTION MARK
U+2E2F ( ⸯ ) VERTICAL TILDE
U+2E30 ( ⸰ ) RING POINT
U+A720 ( ꜠ ) MODIFIER LETTER STRESS AND HIGH TONE
U+A721 ( ꜡ ) MODIFIER LETTER STRESS AND LOW TONE
U+A730 ( ꜰ ) LATIN LETTER SMALL CAPITAL F
U+A731 ( ꜱ ) LATIN LETTER SMALL CAPITAL S
U+A732 ( Ꜳ ) LATIN CAPITAL LETTER AA
U+A733 ( ꜳ ) LATIN SMALL LETTER AA
U+A734 ( Ꜵ ) LATIN CAPITAL LETTER AO
U+A735 ( ꜵ ) LATIN SMALL LETTER AO
U+A736 ( Ꜷ ) LATIN CAPITAL LETTER AU
U+A737 ( ꜷ ) LATIN SMALL LETTER AU
U+A738 ( Ꜹ ) LATIN CAPITAL LETTER AV
U+A739 ( ꜹ ) LATIN SMALL LETTER AV
U+A73A ( Ꜻ ) LATIN CAPITAL LETTER AV WITH HORIZONTAL BAR
U+A73B ( ꜻ ) LATIN SMALL LETTER AV WITH HORIZONTAL BAR
U+A73C ( Ꜽ ) LATIN CAPITAL LETTER AY
U+A73D ( ꜽ ) LATIN SMALL LETTER AY
U+A73E ( Ꜿ ) LATIN CAPITAL LETTER REVERSED C WITH DOT
U+A73F ( ꜿ ) LATIN SMALL LETTER REVERSED C WITH DOT
U+A740 ( Ꝁ ) LATIN CAPITAL LETTER K WITH STROKE
U+A741 ( ꝁ ) LATIN SMALL LETTER K WITH STROKE
U+A742 ( Ꝃ ) LATIN CAPITAL LETTER K WITH DIAGONAL STROKE
U+A743 ( ꝃ ) LATIN SMALL LETTER K WITH DIAGONAL STROKE
U+A744 ( Ꝅ ) LATIN CAPITAL LETTER K WITH STROKE AND DIAGONAL STROKE
U+A745 ( ꝅ ) LATIN SMALL LETTER K WITH STROKE AND DIAGONAL STROKE
U+A746 ( Ꝇ ) LATIN CAPITAL LETTER BROKEN L
U+A747 ( ꝇ ) LATIN SMALL LETTER BROKEN L
U+A748 ( Ꝉ ) LATIN CAPITAL LETTER L WITH HIGH STROKE
U+A749 ( ꝉ ) LATIN SMALL LETTER L WITH HIGH STROKE
U+A74A ( Ꝋ ) LATIN CAPITAL LETTER O WITH LONG STROKE OVERLAY
U+A74B ( ꝋ ) LATIN SMALL LETTER O WITH LONG STROKE OVERLAY
U+A74C ( Ꝍ ) LATIN CAPITAL LETTER O WITH LOOP
U+A74D ( ꝍ ) LATIN SMALL LETTER O WITH LOOP
U+A74E ( Ꝏ ) LATIN CAPITAL LETTER OO
U+A74F ( ꝏ ) LATIN SMALL LETTER OO
U+A750 ( Ꝑ ) LATIN CAPITAL LETTER P WITH STROKE THROUGH DESCENDER
U+A751 ( ꝑ ) LATIN SMALL LETTER P WITH STROKE THROUGH DESCENDER
U+A752 ( Ꝓ ) LATIN CAPITAL LETTER P WITH FLOURISH
U+A753 ( ꝓ ) LATIN SMALL LETTER P WITH FLOURISH
U+A754 ( Ꝕ ) LATIN CAPITAL LETTER P WITH SQUIRREL TAIL
U+A755 ( ꝕ ) LATIN SMALL LETTER P WITH SQUIRREL TAIL
U+A756 ( Ꝗ ) LATIN CAPITAL LETTER Q WITH STROKE THROUGH DESCENDER
U+A757 ( ꝗ ) LATIN SMALL LETTER Q WITH STROKE THROUGH DESCENDER
U+A758 ( Ꝙ ) LATIN CAPITAL LETTER Q WITH DIAGONAL STROKE
U+A759 ( ꝙ ) LATIN SMALL LETTER Q WITH DIAGONAL STROKE
U+A75A ( Ꝛ ) LATIN CAPITAL LETTER R ROTUNDA
U+A75B ( ꝛ ) LATIN SMALL LETTER R ROTUNDA
U+A75C ( Ꝝ ) LATIN CAPITAL LETTER RUM ROTUNDA
U+A75D ( ꝝ ) LATIN SMALL LETTER RUM ROTUNDA
U+A75E ( Ꝟ ) LATIN CAPITAL LETTER V WITH DIAGONAL STROKE
U+A75F ( ꝟ ) LATIN SMALL LETTER V WITH DIAGONAL STROKE
U+A760 ( Ꝡ ) LATIN CAPITAL LETTER VY
U+A761 ( ꝡ ) LATIN SMALL LETTER VY
U+A762 ( Ꝣ ) LATIN CAPITAL LETTER VISIGOTHIC Z
U+A763 ( ꝣ ) LATIN SMALL LETTER VISIGOTHIC Z
U+A764 ( Ꝥ ) LATIN CAPITAL LETTER THORN WITH STROKE
U+A765 ( ꝥ ) LATIN SMALL LETTER THORN WITH STROKE
U+A766 ( Ꝧ ) LATIN CAPITAL LETTER THORN WITH STROKE THROUGH DESCENDER
U+A767 ( ꝧ ) LATIN SMALL LETTER THORN WITH STROKE THROUGH DESCENDER
U+A768 ( Ꝩ ) LATIN CAPITAL LETTER VEND
U+A769 ( ꝩ ) LATIN SMALL LETTER VEND
U+A76A ( Ꝫ ) LATIN CAPITAL LETTER ET
U+A76B ( ꝫ ) LATIN SMALL LETTER ET
U+A76C ( Ꝭ ) LATIN CAPITAL LETTER IS
U+A76D ( ꝭ ) LATIN SMALL LETTER IS
U+A76E ( Ꝯ ) LATIN CAPITAL LETTER CON
U+A76F ( ꝯ ) LATIN SMALL LETTER CON
U+A770 ( ꝰ ) MODIFIER LETTER US
U+A771 ( ꝱ ) LATIN SMALL LETTER DUM
U+A772 ( ꝲ ) LATIN SMALL LETTER LUM
U+A773 ( ꝳ ) LATIN SMALL LETTER MUM
U+A774 ( ꝴ ) LATIN SMALL LETTER NUM
U+A775 ( ꝵ ) LATIN SMALL LETTER RUM
U+A776 ( ꝶ ) LATIN LETTER SMALL CAPITAL RUM
U+A777 ( ꝷ ) LATIN SMALL LETTER TUM
U+A778 ( ꝸ ) LATIN SMALL LETTER UM
U+A7FB ( ꟻ ) LATIN EPIGRAPHIC LETTER REVERSED F
U+A7FC ( ꟼ ) LATIN EPIGRAPHIC LETTER REVERSED P
U+A7FD ( ꟽ ) LATIN EPIGRAPHIC LETTER INVERTED M
U+A7FE ( ꟾ ) LATIN EPIGRAPHIC LETTER I LONGA
U+A7FF ( ꟿ ) LATIN EPIGRAPHIC LETTER ARCHAIC M
Based on suggestions in feedback. Also broadened to blocks if possible, and subtracting data above.
[:block=Georgian Supplement:]
+
U+037B ( ͻ ) GREEK SMALL REVERSED LUNATE SIGMA SYMBOL
U+037C ( ͼ ) GREEK SMALL DOTTED LUNATE SIGMA SYMBOL
U+037D ( ͽ ) GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL
U+03CF ( Ϗ ) GREEK CAPITAL KAI SYMBOL
U+03FD ( Ͻ ) GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL
U+03FE ( Ͼ ) GREEK CAPITAL DOTTED LUNATE SIGMA SYMBOL
U+03FF ( Ͽ ) GREEK CAPITAL REVERSED DOTTED LUNATE SIGMA SYMBOL
U+0185 ( ƅ ) LATIN SMALL LETTER TONE SIX
U+01A8 ( ƨ ) LATIN SMALL LETTER TONE TWO
U+01BD ( ƽ ) LATIN SMALL LETTER TONE FIVE
U+0591 ( ֑ ) HEBREW ACCENT ETNAHTA
U+0592 ( ֒ ) HEBREW ACCENT SEGOL
U+0593 ( ֓ ) HEBREW ACCENT SHALSHELET
U+0594 ( ֔ ) HEBREW ACCENT ZAQEF QATAN
U+0595 ( ֕ ) HEBREW ACCENT ZAQEF GADOL
U+0596 ( ֖ ) HEBREW ACCENT TIPEHA
U+0597 ( ֗ ) HEBREW ACCENT REVIA
U+0598 ( ֘ ) HEBREW ACCENT ZARQA
U+0599 ( ֙ ) HEBREW ACCENT PASHTA
U+059A ( ֚ ) HEBREW ACCENT YETIV
U+059B ( ֛ ) HEBREW ACCENT TEVIR
U+059C ( ֜ ) HEBREW ACCENT GERESH
U+059D ( ֝ ) HEBREW ACCENT GERESH MUQDAM
U+059E ( ֞ ) HEBREW ACCENT GERSHAYIM
U+059F ( ֟ ) HEBREW ACCENT QARNEY PARA
U+05A0 ( ֠ ) HEBREW ACCENT TELISHA GEDOLA
U+05A1 ( ֡ ) HEBREW ACCENT PAZER
U+05A2 ( ֢ ) HEBREW ACCENT ATNAH HAFUKH
U+05A3 ( ֣ ) HEBREW ACCENT MUNAH
U+05A4 ( ֤ ) HEBREW ACCENT MAHAPAKH
U+05A5 ( ֥ ) HEBREW ACCENT MERKHA
U+05A6 ( ֦ ) HEBREW ACCENT MERKHA KEFULA
U+05A7 ( ֧ ) HEBREW ACCENT DARGA
U+05A8 ( ֨ ) HEBREW ACCENT QADMA
U+05A9 ( ֩ ) HEBREW ACCENT TELISHA QETANA
U+05AA ( ֪ ) HEBREW ACCENT YERAH BEN YOMO
U+05AB ( ֫ ) HEBREW ACCENT OLE
U+05AC ( ֬ ) HEBREW ACCENT ILUY
U+05AD ( ֭ ) HEBREW ACCENT DEHI
U+05AE ( ֮ ) HEBREW ACCENT ZINOR
U+05AF ( ֯ ) HEBREW MARK MASORA CIRCLE
U+05C4 ( ׄ ) HEBREW MARK UPPER DOT
U+05C5 ( ׅ ) HEBREW MARK LOWER DOT
U+0615 ( ؕ ) ARABIC SMALL HIGH TAH
U+0616 ( ؖ ) ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH
U+0617 ( ؗ ) ARABIC SMALL HIGH ZAIN
U+0618 ( ؘ ) ARABIC SMALL FATHA
U+0619 ( ؙ ) ARABIC SMALL DAMMA
U+061A ( ؚ ) ARABIC SMALL KASRA
U+06D6 ( ۖ ) ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA
U+06D7 ( ۗ ) ARABIC SMALL HIGH LIGATURE QAF WITH LAM WITH ALEF MAKSURA
U+06D8 ( ۘ ) ARABIC SMALL HIGH MEEM INITIAL FORM
U+06D9 ( ۙ ) ARABIC SMALL HIGH LAM ALEF
U+06DA ( ۚ ) ARABIC SMALL HIGH JEEM
U+06DB ( ۛ ) ARABIC SMALL HIGH THREE DOTS
U+06DC ( ۜ ) ARABIC SMALL HIGH SEEN
U+06DD ( ) ARABIC END OF AYAH
U+06DE ( ۞ ) ARABIC START OF RUB EL HIZB
U+06DF ( ۟ ) ARABIC SMALL HIGH ROUNDED ZERO
U+06E0 ( ۠ ) ARABIC SMALL HIGH UPRIGHT RECTANGULAR ZERO
U+06E1 ( ۡ ) ARABIC SMALL HIGH DOTLESS HEAD OF KHAH
U+06E2 ( ۢ ) ARABIC SMALL HIGH MEEM ISOLATED FORM
U+06E3 ( ۣ ) ARABIC SMALL LOW SEEN
U+06E4 ( ۤ ) ARABIC SMALL HIGH MADDA
U+06E5 ( ۥ ) ARABIC SMALL WAW
U+06E6 ( ۦ ) ARABIC SMALL YEH
U+06E7 ( ۧ ) ARABIC SMALL HIGH YEH
U+06E8 ( ۨ ) ARABIC SMALL HIGH NOON
U+06E9 ( ۩ ) ARABIC PLACE OF SAJDAH
U+06EA ( ۪ ) ARABIC EMPTY CENTRE LOW STOP
U+06EB ( ۫ ) ARABIC EMPTY CENTRE HIGH STOP
U+06EC ( ۬ ) ARABIC ROUNDED HIGH STOP WITH FILLED CENTRE
U+06ED ( ۭ ) ARABIC SMALL LOW MEEM
U+10A0 ( Ⴀ ) GEORGIAN CAPITAL LETTER AN
U+10A1 ( Ⴁ ) GEORGIAN CAPITAL LETTER BAN
U+10A2 ( Ⴂ ) GEORGIAN CAPITAL LETTER GAN
U+10A3 ( Ⴃ ) GEORGIAN CAPITAL LETTER DON
U+10A4 ( Ⴄ ) GEORGIAN CAPITAL LETTER EN
U+10A5 ( Ⴅ ) GEORGIAN CAPITAL LETTER VIN
U+10A6 ( Ⴆ ) GEORGIAN CAPITAL LETTER ZEN
U+10A7 ( Ⴇ ) GEORGIAN CAPITAL LETTER TAN
U+10A8 ( Ⴈ ) GEORGIAN CAPITAL LETTER IN
U+10A9 ( Ⴉ ) GEORGIAN CAPITAL LETTER KAN
U+10AA ( Ⴊ ) GEORGIAN CAPITAL LETTER LAS
U+10AB ( Ⴋ ) GEORGIAN CAPITAL LETTER MAN
U+10AC ( Ⴌ ) GEORGIAN CAPITAL LETTER NAR
U+10AD ( Ⴍ ) GEORGIAN CAPITAL LETTER ON
U+10AE ( Ⴎ ) GEORGIAN CAPITAL LETTER PAR
U+10AF ( Ⴏ ) GEORGIAN CAPITAL LETTER ZHAR
U+10B0 ( Ⴐ ) GEORGIAN CAPITAL LETTER RAE
U+10B1 ( Ⴑ ) GEORGIAN CAPITAL LETTER SAN
U+10B2 ( Ⴒ ) GEORGIAN CAPITAL LETTER TAR
U+10B3 ( Ⴓ ) GEORGIAN CAPITAL LETTER UN
U+10B4 ( Ⴔ ) GEORGIAN CAPITAL LETTER PHAR
U+10B5 ( Ⴕ ) GEORGIAN CAPITAL LETTER KHAR
U+10B6 ( Ⴖ ) GEORGIAN CAPITAL LETTER GHAN
U+10B7 ( Ⴗ ) GEORGIAN CAPITAL LETTER QAR
U+10B8 ( Ⴘ ) GEORGIAN CAPITAL LETTER SHIN
U+10B9 ( Ⴙ ) GEORGIAN CAPITAL LETTER CHIN
U+10BA ( Ⴚ ) GEORGIAN CAPITAL LETTER CAN
U+10BB ( Ⴛ ) GEORGIAN CAPITAL LETTER JIL
U+10BC ( Ⴜ ) GEORGIAN CAPITAL LETTER CIL
U+10BD ( Ⴝ ) GEORGIAN CAPITAL LETTER CHAR
U+10BE ( Ⴞ ) GEORGIAN CAPITAL LETTER XAN
U+10BF ( Ⴟ ) GEORGIAN CAPITAL LETTER JHAN
U+10C0 ( Ⴠ ) GEORGIAN CAPITAL LETTER HAE
U+10C1 ( Ⴡ ) GEORGIAN CAPITAL LETTER HE
U+10C2 ( Ⴢ ) GEORGIAN CAPITAL LETTER HIE
U+10C3 ( Ⴣ ) GEORGIAN CAPITAL LETTER WE
U+10C4 ( Ⴤ ) GEORGIAN CAPITAL LETTER HAR
U+10C5 ( Ⴥ ) GEORGIAN CAPITAL LETTER HOE
U+10FB ( ჻ ) GEORGIAN PARAGRAPH SEPARATOR
Plus case closures for the above:
U+0184 ( Ƅ ) LATIN CAPITAL LETTER TONE SIX
U+01A7 ( Ƨ ) LATIN CAPITAL LETTER TONE TWO
U+01B8 ( Ƹ ) LATIN CAPITAL LETTER EZH REVERSED
U+01BC ( Ƽ ) LATIN CAPITAL LETTER TONE FIVE
U+01F7 ( Ƿ ) LATIN CAPITAL LETTER WYNN
U+03F2 ( ϲ ) GREEK LUNATE SIGMA SYMBOL
U+03F4 ( ϴ ) GREEK CAPITAL THETA SYMBOL
U+2184 ( ↄ ) LATIN SMALL LETTER REVERSED C
Note that the last is not the same as:
U+0254 ( ɔ ) LATIN SMALL LETTER OPEN O
Possibly also:
U+0682 ( ڂ ) ARABIC LETTER HAH WITH TWO DOTS VERTICAL ABOVE
U+0690 ( ڐ ) ARABIC LETTER DAL WITH FOUR DOTS ABOVE
U+069B ( ڛ ) ARABIC LETTER SEEN WITH THREE DOTS BELOW
U+069F ( ڟ ) ARABIC LETTER TAH WITH THREE DOTS ABOVE
U+06A0 ( ڠ ) ARABIC LETTER AIN WITH THREE DOTS ABOVE
U+06AC ( ڬ ) ARABIC LETTER KAF WITH DOT ABOVE
U+06B2 ( ڲ ) ARABIC LETTER GAF WITH TWO DOTS BELOW
U+06B4 ( ڴ ) ARABIC LETTER GAF WITH THREE DOTS ABOVE
U+06B8 ( ڸ ) ARABIC LETTER LAM WITH THREE DOTS BELOW
U+06B9 ( ڹ ) ARABIC LETTER NOON WITH DOT BELOW
The IPA characters [ɩɷɼɿʅ-ʇʓʖʗʚʞʠʣʥʦʨ-ʯ] are not official IPA, but are not obsolete or archaic; they are still used in some traditions.