*TODO Historic Unicode Characters

L2/09-058 R2

Mark Davis, 2009-01-30

Live doc: http://www.macchiato.com/unicode/historic-unicode-characters

The following is a draft list of historic characters, meaning characters that are no longer customarily used in current modern languages in typical publications (newspapers, magazines, etc.). For example, you wouldn't expect to see words written in Cuneiform in the NY Times. They may of course still be used in technical journals, especially those dealing with archaic languages, or have occasional decorative use, or be in academic documents, or be quoted in modern works, or be used in liturgical works.

For example, in modern Tibetan usage, the Phags-pa script is restricted to calligraphic, decorative, and other special purposes, and is not used for writing extensive texts (as it used to be in the 13th and 14th centuries). Thus Phags-pa is an archaic script (in the same way that in the West, blackletter is an archaic script, even though it is still used for calligraphic and decorative purposes).

In many cases in the Unicode Standard, we mark characters (such as with subheadings or block names) as being not in modern use, using terms like Obsolete, Old, Ancient, Archaic, and so on. But there are characters that are not clearly marked as such. In UAX31 we include information about blocks or scripts that are unsuitable for identifiers - this is often, but not always, based on whether they are archaic scripts or not. In UTS39 we have a tag for 'archaic' - and I have an action to update that UTS.

This does not yet include historic CJK characters.

It is useful for people to know which characters are in customary modern use, and which are not, so I set out to try to get a clearer picture of the situation. This is not intended as a proposal for a standard Unicode property; it is simply some information about Unicode characters that may be useful to some people in some contexts. For example, someone building a character picker could put historic characters into an 'extended' bucket for more advanced users. (See, for example, the mockup at http://macchiato.com/picker/MyApplication.html.)
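As a rough illustration of the character-picker idea, the sketch below sorts a character into a "main" or "extended" bucket. The function name `picker_bucket`, the bucket labels, and the tiny data tables are hypothetical; the tables hold only a small illustrative subset of the blocks and characters listed in this document, not the full data.

```python
# Illustrative subset only: three blocks and two single characters that
# appear in the lists in this document. A real picker would load the full data.
HISTORIC_RANGES = [
    (0x1680, 0x169F),    # Ogham block
    (0x16A0, 0x16FF),    # Runic block
    (0x10330, 0x1034F),  # Gothic block
]
HISTORIC_SINGLES = {
    0x01BF,  # LATIN LETTER WYNN
    0x021D,  # LATIN SMALL LETTER YOGH
}


def picker_bucket(ch: str) -> str:
    """Return 'extended' for characters in the historic subset, else 'main'."""
    cp = ord(ch)
    if cp in HISTORIC_SINGLES:
        return "extended"
    if any(lo <= cp <= hi for lo, hi in HISTORIC_RANGES):
        return "extended"
    return "main"
```

A picker built this way could show the "main" bucket by default and reveal the "extended" bucket only on request.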

For right now, this is targeted at an update of UTS39, plus possible improvements to NamesList annotations.

Here is the data. The lines marked with yellow are those that should be removed, based on feedback on the email lists. Additions from that feedback are listed at the end.

Current count: 4,441 Code Points

= 3,867 Code Points (if unassigned characters are removed from the blocks).

Both include the yellow items that should be removed, below, so the end figures would be somewhat smaller.

(1) Data from UAX31

This is broadened from scripts to blocks where possible.

[:blk=Ancient_Greek_Musical_Notation:]

[:blk=Buginese:]

[:blk=Buhid:]

[:blk=Carian:]

[:blk=Coptic:]

[:blk=Cuneiform:]

[:blk=Cuneiform_Numbers_And_Punctuation:]

[:blk=Cypriot_Syllabary:]

[:blk=Deseret:]

[:blk=Glagolitic:]

[:blk=Gothic:]

[:blk=Hanunoo:]

[:blk=Kharoshthi:]

[:blk=Linear_B_Ideograms:]

[:blk=Linear_B_Syllabary:]

[:blk=Lycian:]

[:blk=Lydian:]

[:blk=Ogham:]

[:blk=Old_Italic:]

[:blk=Old_Persian:]

[:blk=Osmanya:]

[:blk=Phags_Pa:]

[:blk=Phaistos_Disc:]

[:blk=Phoenician:]

[:blk=Rejang:]

[:blk=Runic:]

[:blk=Shavian:]

[:blk=Sundanese:]

[:blk=Syloti_Nagri:]

[:blk=Syriac:]

[:blk=Tagalog:]

[:blk=Tagbanwa:]

[:blk=Ugaritic:]

[:sc=Copt:]

// Note that there is a revival effort for Coptic, which may move it out of this section over time.

(2) Additional data from UTS39

Note that the Korean Jamo are listed because they would normally not appear in NFC. Also broadened to blocks if possible, and subtracting data above.
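The NFC point can be checked for conjoining jamo with Python's standard `unicodedata` module. This is a minimal sketch: the helper name `has_conjoining_jamo` and the sample string are assumptions made for illustration, and the check covers only the Hangul Jamo block (U+1100..U+11FF).

```python
import unicodedata


def has_conjoining_jamo(text: str) -> bool:
    """True if any character of text lies in the Hangul Jamo block."""
    return any(0x1100 <= ord(ch) <= 0x11FF for ch in text)


# A leading consonant, vowel, and trailing consonant written as separate jamo:
# HIEUH (U+1112), A (U+1161), NIEUN (U+11AB).
decomposed = "\u1112\u1161\u11AB"

# NFC composes the sequence into a single precomposed Hangul syllable,
# so no conjoining jamo remain in the normalized text.
composed = unicodedata.normalize("NFC", decomposed)
```

Here `has_conjoining_jamo(decomposed)` is true while `has_conjoining_jamo(composed)` is false, which is the sense in which these characters normally do not survive NFC.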

[:blk=Balinese:] // this needs to be removed

[:blk=Ancient_Greek_Numbers:]

[:Block=Hangul_Jamo:]

[:Block=Hangul_Compatibility_Jamo:]

+

Latin Extended B - Non-European and historic Latin

U+018D ( ƍ ) LATIN SMALL LETTER TURNED DELTA

U+01AA ( ƪ ) LATIN LETTER REVERSED ESH LOOP

U+01AB ( ƫ ) LATIN SMALL LETTER T WITH PALATAL HOOK

U+01B9 ( ƹ ) LATIN SMALL LETTER EZH REVERSED

U+01BA ( ƺ ) LATIN SMALL LETTER EZH WITH TAIL

U+01BB ( ƻ ) LATIN LETTER TWO WITH STROKE

U+01BE ( ƾ ) LATIN LETTER INVERTED GLOTTAL STOP WITH STROKE

U+01BF ( ƿ ) LATIN LETTER WYNN

Latin Extended B - Miscellaneous additions

U+021C ( Ȝ ) LATIN CAPITAL LETTER YOGH

U+021D ( ȝ ) LATIN SMALL LETTER YOGH

IPA Extensions - IPA extensions

U+025F ( ɟ ) LATIN SMALL LETTER DOTLESS J WITH STROKE

U+0277 ( ɷ ) LATIN SMALL LETTER CLOSED OMEGA

U+027C ( ɼ ) LATIN SMALL LETTER R WITH LONG LEG

U+029E ( ʞ ) LATIN SMALL LETTER TURNED K

Combining Diacritical Marks - Additions for Greek

U+0343 ( ̓ ) COMBINING GREEK KORONIS

Greek And Coptic - Variant letterforms

U+03D0 ( ϐ ) GREEK BETA SYMBOL

U+03D1 ( ϑ ) GREEK THETA SYMBOL

U+03D5 ( ϕ ) GREEK PHI SYMBOL

U+03D6 ( ϖ ) GREEK PI SYMBOL

U+03D7 ( ϗ ) GREEK KAI SYMBOL

Greek And Coptic - Archaic letters

U+03D8 ( Ϙ ) GREEK LETTER ARCHAIC KOPPA

U+03D9 ( ϙ ) GREEK SMALL LETTER ARCHAIC KOPPA

U+03DA ( Ϛ ) GREEK LETTER STIGMA

U+03DB ( ϛ ) GREEK SMALL LETTER STIGMA

U+03DC ( Ϝ ) GREEK LETTER DIGAMMA

U+03DD ( ϝ ) GREEK SMALL LETTER DIGAMMA

U+03DE ( Ϟ ) GREEK LETTER KOPPA

U+03DF ( ϟ ) GREEK SMALL LETTER KOPPA

U+03E0 ( Ϡ ) GREEK LETTER SAMPI

U+03E1 ( ϡ ) GREEK SMALL LETTER SAMPI

Greek And Coptic - Additional archaic letters for Bactrian

U+03F7 ( Ϸ ) GREEK CAPITAL LETTER SHO

U+03F8 ( ϸ ) GREEK SMALL LETTER SHO

Greek And Coptic - Variant letterform

U+03F9 ( Ϲ ) GREEK CAPITAL LUNATE SIGMA SYMBOL

Greek And Coptic - Archaic letters

U+03FA ( Ϻ ) GREEK CAPITAL LETTER SAN

U+03FB ( ϻ ) GREEK SMALL LETTER SAN

Cyrillic - Historic miscellaneous

U+0483 ( ҃ ) COMBINING CYRILLIC TITLO

U+0484 ( ҄ ) COMBINING CYRILLIC PALATALIZATION

U+0485 ( ҅ ) COMBINING CYRILLIC DASIA PNEUMATA

U+0486 ( ҆ ) COMBINING CYRILLIC PSILI PNEUMATA

Hebrew - Cantillation marks

U+05A2 ( ֢ ) HEBREW ACCENT ATNAH HAFUKH

Hebrew - Puncta extraordinaria

U+05C5 ( ׅ ) HEBREW MARK LOWER DOT

Hebrew - Points and punctuation

U+05C6 ( ‎׆‎ ) HEBREW PUNCTUATION NUN HAFUKHA

U+05C7 ( ׇ ) HEBREW POINT QAMATS QATAN

Arabic - Archaic letters

U+066E ( ‎ٮ‎ ) ARABIC LETTER DOTLESS BEH

U+066F ( ‎ٯ‎ ) ARABIC LETTER DOTLESS QAF

Arabic - Extended Arabic letters

U+068E ( ‎ڎ‎ ) ARABIC LETTER DUL

Kannada - Additional consonants

U+0CDE ( ೞ ) KANNADA LETTER FA

Georgian - Archaic letters

U+10F1 ( ჱ ) GEORGIAN LETTER HE

U+10F2 ( ჲ ) GEORGIAN LETTER HIE

U+10F3 ( ჳ ) GEORGIAN LETTER WE

U+10F4 ( ჴ ) GEORGIAN LETTER HAR

U+10F5 ( ჵ ) GEORGIAN LETTER HOE

U+10F6 ( ჶ ) GEORGIAN LETTER FI

Khmer - Independent vowels

U+17A8 ( ឨ ) KHMER INDEPENDENT VOWEL QUK

Khmer - Various signs

U+17D1 ( ៑ ) KHMER SIGN VIRIAM

U+17DD ( ៝ ) KHMER SIGN ATTHACAN

Combining Diacritical Marks Supplement - Used for Ancient Greek

U+1DC0 ( ᷀ ) COMBINING DOTTED GRAVE ACCENT

U+1DC1 ( ᷁ ) COMBINING DOTTED ACUTE ACCENT

Combining Diacritical Marks Supplement - Miscellaneous marks

U+1DC2 ( ᷂ ) COMBINING SNAKE BELOW

U+1DC3 ( ᷃ ) COMBINING SUSPENSION MARK

Modifier Tone Letters - Corner tone marks for Chinese

U+A700 ( ꜀ ) MODIFIER LETTER CHINESE TONE YIN PING

U+A701 ( ꜁ ) MODIFIER LETTER CHINESE TONE YANG PING

U+A702 ( ꜂ ) MODIFIER LETTER CHINESE TONE YIN SHANG

U+A703 ( ꜃ ) MODIFIER LETTER CHINESE TONE YANG SHANG

U+A704 ( ꜄ ) MODIFIER LETTER CHINESE TONE YIN QU

U+A705 ( ꜅ ) MODIFIER LETTER CHINESE TONE YANG QU

U+A706 ( ꜆ ) MODIFIER LETTER CHINESE TONE YIN RU

U+A707 ( ꜇ ) MODIFIER LETTER CHINESE TONE YANG RU

(3) Heuristically-Derived Data

The Unicode characters whose subheads, block names, or character names in the standard contain any of the words:

Obsolete|Ancient|Archaic|Medieval|New Testament|UPA|Old|Early

minus the above (and with some small hand-editing for exceptional cases). Errors in these may indicate that we want to change some of the subheaders. Also broadened to blocks if possible, and subtracting data above.
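The keyword test can be sketched as a regular expression over subhead or name strings. The exact matching rules used to build this data are not stated, so the choices here are assumptions: matching is case-insensitive and anchored only at the start of a word, so that "Medievalist" and "early Persian" match while a word such as "Bold" does not. The names `KEYWORDS`, `HISTORIC_RE`, and `looks_historic` are hypothetical.

```python
import re

# The keyword list quoted in the text above.
KEYWORDS = [
    "Obsolete", "Ancient", "Archaic", "Medieval",
    "New Testament", "UPA", "Old", "Early",
]

# Assumption: anchor at a leading word boundary only, ignoring case.
HISTORIC_RE = re.compile(
    r"\b(?:" + "|".join(re.escape(word) for word in KEYWORDS) + r")",
    re.IGNORECASE,
)


def looks_historic(label: str) -> bool:
    """True if a subhead, block name, or character name contains a keyword."""
    return HISTORIC_RE.search(label) is not None
```

A filter this loose will produce false positives, which is consistent with the hand-editing mentioned above.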

[:blk=Ancient_Symbols:]

[:blk=Ancient_Greek_Musical_Notation:]

[:blk=Cyrillic_Extended_A:]

[:blk=Cyrillic_Extended_B:]

+

Spacing Modifier Letters - UPA modifiers

U+02EF ( ˯ ) MODIFIER LETTER LOW DOWN ARROWHEAD

U+02F0 ( ˰ ) MODIFIER LETTER LOW UP ARROWHEAD

U+02F1 ( ˱ ) MODIFIER LETTER LOW LEFT ARROWHEAD

U+02F2 ( ˲ ) MODIFIER LETTER LOW RIGHT ARROWHEAD

U+02F3 ( ˳ ) MODIFIER LETTER LOW RING

U+02F4 ( ˴ ) MODIFIER LETTER MIDDLE GRAVE ACCENT

U+02F5 ( ˵ ) MODIFIER LETTER MIDDLE DOUBLE GRAVE ACCENT

U+02F6 ( ˶ ) MODIFIER LETTER MIDDLE DOUBLE ACUTE ACCENT

U+02F7 ( ˷ ) MODIFIER LETTER LOW TILDE

U+02F8 ( ˸ ) MODIFIER LETTER RAISED COLON

U+02F9 ( ˹ ) MODIFIER LETTER BEGIN HIGH TONE

U+02FA ( ˺ ) MODIFIER LETTER END HIGH TONE

U+02FB ( ˻ ) MODIFIER LETTER BEGIN LOW TONE

U+02FC ( ˼ ) MODIFIER LETTER END LOW TONE

U+02FD ( ˽ ) MODIFIER LETTER SHELF

U+02FE ( ˾ ) MODIFIER LETTER OPEN SHELF

U+02FF ( ˿ ) MODIFIER LETTER LOW LEFT ARROW

Combining Diacritical Marks - Medieval superscript letter diacritics

U+0363 ( ͣ ) COMBINING LATIN SMALL LETTER A

U+0364 ( ͤ ) COMBINING LATIN SMALL LETTER E

U+0365 ( ͥ ) COMBINING LATIN SMALL LETTER I

U+0366 ( ͦ ) COMBINING LATIN SMALL LETTER O

U+0367 ( ͧ ) COMBINING LATIN SMALL LETTER U

U+0368 ( ͨ ) COMBINING LATIN SMALL LETTER C

U+0369 ( ͩ ) COMBINING LATIN SMALL LETTER D

U+036A ( ͪ ) COMBINING LATIN SMALL LETTER H

U+036B ( ͫ ) COMBINING LATIN SMALL LETTER M

U+036C ( ͬ ) COMBINING LATIN SMALL LETTER R

U+036D ( ͭ ) COMBINING LATIN SMALL LETTER T

U+036E ( ͮ ) COMBINING LATIN SMALL LETTER V

U+036F ( ͯ ) COMBINING LATIN SMALL LETTER X

Greek And Coptic - Archaic letters

U+0370 ( Ͱ ) GREEK CAPITAL LETTER HETA

U+0371 ( ͱ ) GREEK SMALL LETTER HETA

U+0372 ( Ͳ ) GREEK CAPITAL LETTER ARCHAIC SAMPI

U+0373 ( ͳ ) GREEK SMALL LETTER ARCHAIC SAMPI

U+0376 ( Ͷ ) GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA

U+0377 ( ͷ ) GREEK SMALL LETTER PAMPHYLIAN DIGAMMA

Arabic - Additions for early Persian and Azerbaijani

U+063B ( ‎ػ‎ ) ARABIC LETTER KEHEH WITH TWO DOTS ABOVE

U+063C ( ‎ؼ‎ ) ARABIC LETTER KEHEH WITH THREE DOTS BELOW

U+063E ( ‎ؾ‎ ) ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE

U+063F ( ‎ؿ‎ ) ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE

Arabic Supplement - Additions for early Persian

U+077E ( ‎ݾ‎ ) ARABIC LETTER SEEN WITH INVERTED V

U+077F ( ‎ݿ‎ ) ARABIC LETTER KAF WITH TWO DOTS ABOVE

NKo - Archaic letters

U+07E8 ( ‎ߨ‎ ) NKO LETTER JONA JA

U+07E9 ( ‎ߩ‎ ) NKO LETTER JONA CHA

U+07EA ( ‎ߪ‎ ) NKO LETTER JONA RA

Combining Diacritical Marks Supplement - Medievalist additions

U+1DCE ( ᷎ ) COMBINING OGONEK ABOVE

U+1DCF ( ᷏ ) COMBINING ZIGZAG BELOW

U+1DD0 ( ᷐ ) COMBINING IS BELOW

U+1DD1 ( ᷑ ) COMBINING UR ABOVE

U+1DD2 ( ᷒ ) COMBINING US ABOVE

Combining Diacritical Marks Supplement - Medieval superscript letter diacritics

U+1DD3 ( ᷓ ) COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE

U+1DD4 ( ᷔ ) COMBINING LATIN SMALL LETTER AE

U+1DD5 ( ᷕ ) COMBINING LATIN SMALL LETTER AO

U+1DD6 ( ᷖ ) COMBINING LATIN SMALL LETTER AV

U+1DD7 ( ᷗ ) COMBINING LATIN SMALL LETTER C CEDILLA

U+1DD8 ( ᷘ ) COMBINING LATIN SMALL LETTER INSULAR D

U+1DD9 ( ᷙ ) COMBINING LATIN SMALL LETTER ETH

U+1DDA ( ᷚ ) COMBINING LATIN SMALL LETTER G

U+1DDB ( ᷛ ) COMBINING LATIN LETTER SMALL CAPITAL G

U+1DDC ( ᷜ ) COMBINING LATIN SMALL LETTER K

U+1DDD ( ᷝ ) COMBINING LATIN SMALL LETTER L

U+1DDE ( ᷞ ) COMBINING LATIN LETTER SMALL CAPITAL L

U+1DDF ( ᷟ ) COMBINING LATIN LETTER SMALL CAPITAL M

U+1DE0 ( ᷠ ) COMBINING LATIN SMALL LETTER N

U+1DE1 ( ᷡ ) COMBINING LATIN LETTER SMALL CAPITAL N

U+1DE2 ( ᷢ ) COMBINING LATIN LETTER SMALL CAPITAL R

U+1DE3 ( ᷣ ) COMBINING LATIN SMALL LETTER R ROTUNDA

U+1DE4 ( ᷤ ) COMBINING LATIN SMALL LETTER S

U+1DE5 ( ᷥ ) COMBINING LATIN SMALL LETTER LONG S

U+1DE6 ( ᷦ ) COMBINING LATIN SMALL LETTER Z

Combining Diacritical Marks Supplement - Additional marks for UPA

U+1DFE ( ᷾ ) COMBINING LEFT ARROWHEAD ABOVE

U+1DFF ( ᷿ ) COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW

Latin Extended Additional - Medievalist additions

U+1E9C ( ẜ ) LATIN SMALL LETTER LONG S WITH DIAGONAL STROKE

U+1E9D ( ẝ ) LATIN SMALL LETTER LONG S WITH HIGH STROKE

Latin Extended Additional - Medievalist addition

U+1E9F ( ẟ ) LATIN SMALL LETTER DELTA

Latin Extended Additional - Medievalist additions

U+1EFA ( Ỻ ) LATIN CAPITAL LETTER MIDDLE-WELSH LL

U+1EFB ( ỻ ) LATIN SMALL LETTER MIDDLE-WELSH LL

U+1EFC ( Ỽ ) LATIN CAPITAL LETTER MIDDLE-WELSH V

U+1EFD ( ỽ ) LATIN SMALL LETTER MIDDLE-WELSH V

U+1EFE ( Ỿ ) LATIN CAPITAL LETTER Y WITH LOOP

U+1EFF ( ỿ ) LATIN SMALL LETTER Y WITH LOOP

General Punctuation - Archaic punctuation

U+2056 ( ⁖ ) THREE DOT PUNCTUATION

U+2058 ( ⁘ ) FOUR DOT PUNCTUATION

U+2059 ( ⁙ ) FIVE DOT PUNCTUATION

U+205A ( ⁚ ) TWO DOT PUNCTUATION

U+205B ( ⁛ ) FOUR DOT MARK

U+205C ( ⁜ ) DOTTED CROSS

U+205D ( ⁝ ) TRICOLON

U+205E ( ⁞ ) VERTICAL FOUR DOTS

Number Forms - Archaic Roman numerals

U+2180 ( ↀ ) ROMAN NUMERAL ONE THOUSAND C D

U+2181 ( ↁ ) ROMAN NUMERAL FIVE THOUSAND

U+2182 ( ↂ ) ROMAN NUMERAL TEN THOUSAND

U+2183 ( Ↄ ) ROMAN NUMERAL REVERSED ONE HUNDRED

U+2185 ( ↅ ) ROMAN NUMERAL SIX LATE FORM

U+2186 ( ↆ ) ROMAN NUMERAL FIFTY EARLY FORM

U+2187 ( ↇ ) ROMAN NUMERAL FIFTY THOUSAND

U+2188 ( ↈ ) ROMAN NUMERAL ONE HUNDRED THOUSAND

Latin Extended C - Additions for UPA

U+2C77 ( ⱷ ) LATIN SMALL LETTER TAILLESS PHI

U+2C78 ( ⱸ ) LATIN SMALL LETTER E WITH NOTCH

U+2C79 ( ⱹ ) LATIN SMALL LETTER TURNED R WITH TAIL

U+2C7A ( ⱺ ) LATIN SMALL LETTER O WITH LOW RING INSIDE

U+2C7B ( ⱻ ) LATIN LETTER SMALL CAPITAL TURNED E

U+2C7C ( ⱼ ) LATIN SUBSCRIPT SMALL LETTER J

U+2C7D ( ⱽ ) MODIFIER LETTER CAPITAL V

Supplemental Punctuation - New Testament editorial symbols

U+2E00 ( ⸀ ) RIGHT ANGLE SUBSTITUTION MARKER

U+2E01 ( ⸁ ) RIGHT ANGLE DOTTED SUBSTITUTION MARKER

U+2E02 ( ⸂ ) LEFT SUBSTITUTION BRACKET

U+2E03 ( ⸃ ) RIGHT SUBSTITUTION BRACKET

U+2E04 ( ⸄ ) LEFT DOTTED SUBSTITUTION BRACKET

U+2E05 ( ⸅ ) RIGHT DOTTED SUBSTITUTION BRACKET

U+2E06 ( ⸆ ) RAISED INTERPOLATION MARKER

U+2E07 ( ⸇ ) RAISED DOTTED INTERPOLATION MARKER

U+2E08 ( ⸈ ) DOTTED TRANSPOSITION MARKER

U+2E09 ( ⸉ ) LEFT TRANSPOSITION BRACKET

U+2E0A ( ⸊ ) RIGHT TRANSPOSITION BRACKET

U+2E0B ( ⸋ ) RAISED SQUARE

U+2E0C ( ⸌ ) LEFT RAISED OMISSION BRACKET

U+2E0D ( ⸍ ) RIGHT RAISED OMISSION BRACKET

Supplemental Punctuation - Ancient Greek textual symbols

U+2E0E ( ⸎ ) EDITORIAL CORONIS

U+2E0F ( ⸏ ) PARAGRAPHOS

U+2E10 ( ⸐ ) FORKED PARAGRAPHOS

U+2E11 ( ⸑ ) REVERSED FORKED PARAGRAPHOS

U+2E12 ( ⸒ ) HYPODIASTOLE

U+2E13 ( ⸓ ) DOTTED OBELOS

U+2E14 ( ⸔ ) DOWNWARDS ANCORA

U+2E15 ( ⸕ ) UPWARDS ANCORA

U+2E16 ( ⸖ ) DOTTED RIGHT-POINTING ANGLE

Supplemental Punctuation - Ancient Near-Eastern linguistic symbol

U+2E17 ( ⸗ ) DOUBLE OBLIQUE HYPHEN

Supplemental Punctuation - Medievalist punctuation

U+2E2A ( ⸪ ) TWO DOTS OVER ONE DOT PUNCTUATION

U+2E2B ( ⸫ ) ONE DOT OVER TWO DOTS PUNCTUATION

U+2E2C ( ⸬ ) SQUARED FOUR DOT PUNCTUATION

U+2E2D ( ⸭ ) FIVE DOT MARK

U+2E2E ( ⸮ ) REVERSED QUESTION MARK

U+2E2F ( ⸯ ) VERTICAL TILDE

U+2E30 ( ⸰ ) RING POINT

Latin Extended D - Additions for UPA

U+A720 ( ꜠ ) MODIFIER LETTER STRESS AND HIGH TONE

U+A721 ( ꜡ ) MODIFIER LETTER STRESS AND LOW TONE

Latin Extended D - Medievalist additions

U+A730 ( ꜰ ) LATIN LETTER SMALL CAPITAL F

U+A731 ( ꜱ ) LATIN LETTER SMALL CAPITAL S

U+A732 ( Ꜳ ) LATIN CAPITAL LETTER AA

U+A733 ( ꜳ ) LATIN SMALL LETTER AA

U+A734 ( Ꜵ ) LATIN CAPITAL LETTER AO

U+A735 ( ꜵ ) LATIN SMALL LETTER AO

U+A736 ( Ꜷ ) LATIN CAPITAL LETTER AU

U+A737 ( ꜷ ) LATIN SMALL LETTER AU

U+A738 ( Ꜹ ) LATIN CAPITAL LETTER AV

U+A739 ( ꜹ ) LATIN SMALL LETTER AV

U+A73A ( Ꜻ ) LATIN CAPITAL LETTER AV WITH HORIZONTAL BAR

U+A73B ( ꜻ ) LATIN SMALL LETTER AV WITH HORIZONTAL BAR

U+A73C ( Ꜽ ) LATIN CAPITAL LETTER AY

U+A73D ( ꜽ ) LATIN SMALL LETTER AY

U+A73E ( Ꜿ ) LATIN CAPITAL LETTER REVERSED C WITH DOT

U+A73F ( ꜿ ) LATIN SMALL LETTER REVERSED C WITH DOT

U+A740 ( Ꝁ ) LATIN CAPITAL LETTER K WITH STROKE

U+A741 ( ꝁ ) LATIN SMALL LETTER K WITH STROKE

U+A742 ( Ꝃ ) LATIN CAPITAL LETTER K WITH DIAGONAL STROKE

U+A743 ( ꝃ ) LATIN SMALL LETTER K WITH DIAGONAL STROKE

U+A744 ( Ꝅ ) LATIN CAPITAL LETTER K WITH STROKE AND DIAGONAL STROKE

U+A745 ( ꝅ ) LATIN SMALL LETTER K WITH STROKE AND DIAGONAL STROKE

U+A746 ( Ꝇ ) LATIN CAPITAL LETTER BROKEN L

U+A747 ( ꝇ ) LATIN SMALL LETTER BROKEN L

U+A748 ( Ꝉ ) LATIN CAPITAL LETTER L WITH HIGH STROKE

U+A749 ( ꝉ ) LATIN SMALL LETTER L WITH HIGH STROKE

U+A74A ( Ꝋ ) LATIN CAPITAL LETTER O WITH LONG STROKE OVERLAY

U+A74B ( ꝋ ) LATIN SMALL LETTER O WITH LONG STROKE OVERLAY

U+A74C ( Ꝍ ) LATIN CAPITAL LETTER O WITH LOOP

U+A74D ( ꝍ ) LATIN SMALL LETTER O WITH LOOP

U+A74E ( Ꝏ ) LATIN CAPITAL LETTER OO

U+A74F ( ꝏ ) LATIN SMALL LETTER OO

U+A750 ( Ꝑ ) LATIN CAPITAL LETTER P WITH STROKE THROUGH DESCENDER

U+A751 ( ꝑ ) LATIN SMALL LETTER P WITH STROKE THROUGH DESCENDER

U+A752 ( Ꝓ ) LATIN CAPITAL LETTER P WITH FLOURISH

U+A753 ( ꝓ ) LATIN SMALL LETTER P WITH FLOURISH

U+A754 ( Ꝕ ) LATIN CAPITAL LETTER P WITH SQUIRREL TAIL

U+A755 ( ꝕ ) LATIN SMALL LETTER P WITH SQUIRREL TAIL

U+A756 ( Ꝗ ) LATIN CAPITAL LETTER Q WITH STROKE THROUGH DESCENDER

U+A757 ( ꝗ ) LATIN SMALL LETTER Q WITH STROKE THROUGH DESCENDER

U+A758 ( Ꝙ ) LATIN CAPITAL LETTER Q WITH DIAGONAL STROKE

U+A759 ( ꝙ ) LATIN SMALL LETTER Q WITH DIAGONAL STROKE

U+A75A ( Ꝛ ) LATIN CAPITAL LETTER R ROTUNDA

U+A75B ( ꝛ ) LATIN SMALL LETTER R ROTUNDA

U+A75C ( Ꝝ ) LATIN CAPITAL LETTER RUM ROTUNDA

U+A75D ( ꝝ ) LATIN SMALL LETTER RUM ROTUNDA

U+A75E ( Ꝟ ) LATIN CAPITAL LETTER V WITH DIAGONAL STROKE

U+A75F ( ꝟ ) LATIN SMALL LETTER V WITH DIAGONAL STROKE

U+A760 ( Ꝡ ) LATIN CAPITAL LETTER VY

U+A761 ( ꝡ ) LATIN SMALL LETTER VY

U+A762 ( Ꝣ ) LATIN CAPITAL LETTER VISIGOTHIC Z

U+A763 ( ꝣ ) LATIN SMALL LETTER VISIGOTHIC Z

U+A764 ( Ꝥ ) LATIN CAPITAL LETTER THORN WITH STROKE

U+A765 ( ꝥ ) LATIN SMALL LETTER THORN WITH STROKE

U+A766 ( Ꝧ ) LATIN CAPITAL LETTER THORN WITH STROKE THROUGH DESCENDER

U+A767 ( ꝧ ) LATIN SMALL LETTER THORN WITH STROKE THROUGH DESCENDER

U+A768 ( Ꝩ ) LATIN CAPITAL LETTER VEND

U+A769 ( ꝩ ) LATIN SMALL LETTER VEND

U+A76A ( Ꝫ ) LATIN CAPITAL LETTER ET

U+A76B ( ꝫ ) LATIN SMALL LETTER ET

U+A76C ( Ꝭ ) LATIN CAPITAL LETTER IS

U+A76D ( ꝭ ) LATIN SMALL LETTER IS

U+A76E ( Ꝯ ) LATIN CAPITAL LETTER CON

U+A76F ( ꝯ ) LATIN SMALL LETTER CON

U+A770 ( ꝰ ) MODIFIER LETTER US

U+A771 ( ꝱ ) LATIN SMALL LETTER DUM

U+A772 ( ꝲ ) LATIN SMALL LETTER LUM

U+A773 ( ꝳ ) LATIN SMALL LETTER MUM

U+A774 ( ꝴ ) LATIN SMALL LETTER NUM

U+A775 ( ꝵ ) LATIN SMALL LETTER RUM

U+A776 ( ꝶ ) LATIN LETTER SMALL CAPITAL RUM

U+A777 ( ꝷ ) LATIN SMALL LETTER TUM

U+A778 ( ꝸ ) LATIN SMALL LETTER UM

Latin Extended D - Ancient Roman epigraphic letters

U+A7FB ( ꟻ ) LATIN EPIGRAPHIC LETTER REVERSED F

U+A7FC ( ꟼ ) LATIN EPIGRAPHIC LETTER REVERSED P

U+A7FD ( ꟽ ) LATIN EPIGRAPHIC LETTER INVERTED M

U+A7FE ( ꟾ ) LATIN EPIGRAPHIC LETTER I LONGA

U+A7FF ( ꟿ ) LATIN EPIGRAPHIC LETTER ARCHAIC M

(4) Feedback Additions

Based on suggestions in feedback. Also broadened to blocks if possible, and subtracting data above.

[:blk=Georgian_Supplement:]

+

Greek And Coptic - Lowercase of editorial symbols

U+037B ( ͻ ) GREEK SMALL REVERSED LUNATE SIGMA SYMBOL

U+037C ( ͼ ) GREEK SMALL DOTTED LUNATE SIGMA SYMBOL

U+037D ( ͽ ) GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL

Greek And Coptic - Variant letterforms

U+03CF ( Ϗ ) GREEK CAPITAL KAI SYMBOL

Greek And Coptic - Editorial symbols

U+03FD ( Ͻ ) GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL

U+03FE ( Ͼ ) GREEK CAPITAL DOTTED LUNATE SIGMA SYMBOL

U+03FF ( Ͽ ) GREEK CAPITAL REVERSED DOTTED LUNATE SIGMA SYMBOL

Latin Extended B - Non-European and historic Latin

U+0185 ( ƅ ) LATIN SMALL LETTER TONE SIX

U+01A8 ( ƨ ) LATIN SMALL LETTER TONE TWO

U+01BD ( ƽ ) LATIN SMALL LETTER TONE FIVE

Hebrew - Cantillation marks

U+0591 ( ֑ ) HEBREW ACCENT ETNAHTA

U+0592 ( ֒ ) HEBREW ACCENT SEGOL

U+0593 ( ֓ ) HEBREW ACCENT SHALSHELET

U+0594 ( ֔ ) HEBREW ACCENT ZAQEF QATAN

U+0595 ( ֕ ) HEBREW ACCENT ZAQEF GADOL

U+0596 ( ֖ ) HEBREW ACCENT TIPEHA

U+0597 ( ֗ ) HEBREW ACCENT REVIA

U+0598 ( ֘ ) HEBREW ACCENT ZARQA

U+0599 ( ֙ ) HEBREW ACCENT PASHTA

U+059A ( ֚ ) HEBREW ACCENT YETIV

U+059B ( ֛ ) HEBREW ACCENT TEVIR

U+059C ( ֜ ) HEBREW ACCENT GERESH

U+059D ( ֝ ) HEBREW ACCENT GERESH MUQDAM

U+059E ( ֞ ) HEBREW ACCENT GERSHAYIM

U+059F ( ֟ ) HEBREW ACCENT QARNEY PARA

U+05A0 ( ֠ ) HEBREW ACCENT TELISHA GEDOLA

U+05A1 ( ֡ ) HEBREW ACCENT PAZER

U+05A2 ( ֢ ) HEBREW ACCENT ATNAH HAFUKH

U+05A3 ( ֣ ) HEBREW ACCENT MUNAH

U+05A4 ( ֤ ) HEBREW ACCENT MAHAPAKH

U+05A5 ( ֥ ) HEBREW ACCENT MERKHA

U+05A6 ( ֦ ) HEBREW ACCENT MERKHA KEFULA

U+05A7 ( ֧ ) HEBREW ACCENT DARGA

U+05A8 ( ֨ ) HEBREW ACCENT QADMA

U+05A9 ( ֩ ) HEBREW ACCENT TELISHA QETANA

U+05AA ( ֪ ) HEBREW ACCENT YERAH BEN YOMO

U+05AB ( ֫ ) HEBREW ACCENT OLE

U+05AC ( ֬ ) HEBREW ACCENT ILUY

U+05AD ( ֭ ) HEBREW ACCENT DEHI

U+05AE ( ֮ ) HEBREW ACCENT ZINOR

U+05AF ( ֯ ) HEBREW MARK MASORA CIRCLE

Hebrew - Puncta extraordinaria

U+05C4 ( ׄ ) HEBREW MARK UPPER DOT

U+05C5 ( ׅ ) HEBREW MARK LOWER DOT

Arabic - Koranic annotation signs

U+0615 ( ؕ ) ARABIC SMALL HIGH TAH

U+0616 ( ؖ ) ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH

U+0617 ( ؗ ) ARABIC SMALL HIGH ZAIN

U+0618 ( ؘ ) ARABIC SMALL FATHA

U+0619 ( ؙ ) ARABIC SMALL DAMMA

U+061A ( ؚ ) ARABIC SMALL KASRA

U+06D6 ( ۖ ) ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA

U+06D7 ( ۗ ) ARABIC SMALL HIGH LIGATURE QAF WITH LAM WITH ALEF MAKSURA

U+06D8 ( ۘ ) ARABIC SMALL HIGH MEEM INITIAL FORM

U+06D9 ( ۙ ) ARABIC SMALL HIGH LAM ALEF

U+06DA ( ۚ ) ARABIC SMALL HIGH JEEM

U+06DB ( ۛ ) ARABIC SMALL HIGH THREE DOTS

U+06DC ( ۜ ) ARABIC SMALL HIGH SEEN

U+06DD ( ۝ ) ARABIC END OF AYAH

U+06DE ( ۞ ) ARABIC START OF RUB EL HIZB

U+06DF ( ۟ ) ARABIC SMALL HIGH ROUNDED ZERO

U+06E0 ( ۠ ) ARABIC SMALL HIGH UPRIGHT RECTANGULAR ZERO

U+06E1 ( ۡ ) ARABIC SMALL HIGH DOTLESS HEAD OF KHAH

U+06E2 ( ۢ ) ARABIC SMALL HIGH MEEM ISOLATED FORM

U+06E3 ( ۣ ) ARABIC SMALL LOW SEEN

U+06E4 ( ۤ ) ARABIC SMALL HIGH MADDA

U+06E5 ( ‎ۥ‎ ) ARABIC SMALL WAW

U+06E6 ( ‎ۦ‎ ) ARABIC SMALL YEH

U+06E7 ( ۧ ) ARABIC SMALL HIGH YEH

U+06E8 ( ۨ ) ARABIC SMALL HIGH NOON

U+06E9 ( ۩ ) ARABIC PLACE OF SAJDAH

U+06EA ( ۪ ) ARABIC EMPTY CENTRE LOW STOP

U+06EB ( ۫ ) ARABIC EMPTY CENTRE HIGH STOP

U+06EC ( ۬ ) ARABIC ROUNDED HIGH STOP WITH FILLED CENTRE

U+06ED ( ۭ ) ARABIC SMALL LOW MEEM

Georgian - Capital letters (Khutsuri)

U+10A0 ( Ⴀ ) GEORGIAN CAPITAL LETTER AN

U+10A1 ( Ⴁ ) GEORGIAN CAPITAL LETTER BAN

U+10A2 ( Ⴂ ) GEORGIAN CAPITAL LETTER GAN

U+10A3 ( Ⴃ ) GEORGIAN CAPITAL LETTER DON

U+10A4 ( Ⴄ ) GEORGIAN CAPITAL LETTER EN

U+10A5 ( Ⴅ ) GEORGIAN CAPITAL LETTER VIN

U+10A6 ( Ⴆ ) GEORGIAN CAPITAL LETTER ZEN

U+10A7 ( Ⴇ ) GEORGIAN CAPITAL LETTER TAN

U+10A8 ( Ⴈ ) GEORGIAN CAPITAL LETTER IN

U+10A9 ( Ⴉ ) GEORGIAN CAPITAL LETTER KAN

U+10AA ( Ⴊ ) GEORGIAN CAPITAL LETTER LAS

U+10AB ( Ⴋ ) GEORGIAN CAPITAL LETTER MAN

U+10AC ( Ⴌ ) GEORGIAN CAPITAL LETTER NAR

U+10AD ( Ⴍ ) GEORGIAN CAPITAL LETTER ON

U+10AE ( Ⴎ ) GEORGIAN CAPITAL LETTER PAR

U+10AF ( Ⴏ ) GEORGIAN CAPITAL LETTER ZHAR

U+10B0 ( Ⴐ ) GEORGIAN CAPITAL LETTER RAE

U+10B1 ( Ⴑ ) GEORGIAN CAPITAL LETTER SAN

U+10B2 ( Ⴒ ) GEORGIAN CAPITAL LETTER TAR

U+10B3 ( Ⴓ ) GEORGIAN CAPITAL LETTER UN

U+10B4 ( Ⴔ ) GEORGIAN CAPITAL LETTER PHAR

U+10B5 ( Ⴕ ) GEORGIAN CAPITAL LETTER KHAR

U+10B6 ( Ⴖ ) GEORGIAN CAPITAL LETTER GHAN

U+10B7 ( Ⴗ ) GEORGIAN CAPITAL LETTER QAR

U+10B8 ( Ⴘ ) GEORGIAN CAPITAL LETTER SHIN

U+10B9 ( Ⴙ ) GEORGIAN CAPITAL LETTER CHIN

U+10BA ( Ⴚ ) GEORGIAN CAPITAL LETTER CAN

U+10BB ( Ⴛ ) GEORGIAN CAPITAL LETTER JIL

U+10BC ( Ⴜ ) GEORGIAN CAPITAL LETTER CIL

U+10BD ( Ⴝ ) GEORGIAN CAPITAL LETTER CHAR

U+10BE ( Ⴞ ) GEORGIAN CAPITAL LETTER XAN

U+10BF ( Ⴟ ) GEORGIAN CAPITAL LETTER JHAN

U+10C0 ( Ⴠ ) GEORGIAN CAPITAL LETTER HAE

U+10C1 ( Ⴡ ) GEORGIAN CAPITAL LETTER HE

U+10C2 ( Ⴢ ) GEORGIAN CAPITAL LETTER HIE

U+10C3 ( Ⴣ ) GEORGIAN CAPITAL LETTER WE

U+10C4 ( Ⴤ ) GEORGIAN CAPITAL LETTER HAR

U+10C5 ( Ⴥ ) GEORGIAN CAPITAL LETTER HOE

Georgian - Punctuation

U+10FB ( ჻ ) GEORGIAN PARAGRAPH SEPARATOR

Plus case closures for the above:

U+0184 ( Ƅ ) LATIN CAPITAL LETTER TONE SIX

U+01A7 ( Ƨ ) LATIN CAPITAL LETTER TONE TWO

U+01B8 ( Ƹ ) LATIN CAPITAL LETTER EZH REVERSED

U+01BC ( Ƽ ) LATIN CAPITAL LETTER TONE FIVE

U+01F7 ( Ƿ ) LATIN CAPITAL LETTER WYNN

U+03F2 ( ϲ ) GREEK LUNATE SIGMA SYMBOL

U+03F4 ( ϴ ) GREEK CAPITAL THETA SYMBOL

U+2184 ( ↄ ) LATIN SMALL LETTER REVERSED C

Note that the last is not the same as:

U+0254 ( ɔ ) LATIN SMALL LETTER OPEN O
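A one-step case closure of this kind can be approximated with Python's built-in case mappings. This is only a sketch: the helper name `simple_case_closure` is hypothetical, it applies `str.upper` and `str.lower` once rather than computing full case-folding equivalence classes, and it skips mappings that expand to more than one character. It is not claimed to be how the list above was produced.

```python
def simple_case_closure(chars):
    """Add the single-character upper- and lowercase forms of each character."""
    closed = set(chars)
    for ch in chars:
        for variant in (ch.upper(), ch.lower()):
            # Skip mappings that expand to more than one character.
            if len(variant) == 1:
                closed.add(variant)
    return closed


# Some characters from the lists above, by code point:
# U+0185 TONE SIX, U+01B9 EZH REVERSED, U+01BF WYNN,
# U+2183 ROMAN NUMERAL REVERSED ONE HUNDRED, U+03F9 CAPITAL LUNATE SIGMA SYMBOL.
sample = {"\u0185", "\u01B9", "\u01BF", "\u2183", "\u03F9"}
closure = simple_case_closure(sample)
```

Characters in `closure` but not in `sample` are candidates for the case-closure list. A closure based on case folding would reach further, so any result still needs review by hand.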

Possibly also:

Arabic - Extended Arabic letters

U+0682 ( ‎ڂ‎ ) ARABIC LETTER HAH WITH TWO DOTS VERTICAL ABOVE

U+0690 ( ‎ڐ‎ ) ARABIC LETTER DAL WITH FOUR DOTS ABOVE

U+069B ( ‎ڛ‎ ) ARABIC LETTER SEEN WITH THREE DOTS BELOW

U+069F ( ‎ڟ‎ ) ARABIC LETTER TAH WITH THREE DOTS ABOVE

U+06A0 ( ‎ڠ‎ ) ARABIC LETTER AIN WITH THREE DOTS ABOVE

U+06AC ( ‎ڬ‎ ) ARABIC LETTER KAF WITH DOT ABOVE

U+06B2 ( ‎ڲ‎ ) ARABIC LETTER GAF WITH TWO DOTS BELOW

U+06B4 ( ‎ڴ‎ ) ARABIC LETTER GAF WITH THREE DOTS ABOVE

U+06B8 ( ‎ڸ‎ ) ARABIC LETTER LAM WITH THREE DOTS BELOW

U+06B9 ( ‎ڹ‎ ) ARABIC LETTER NOON WITH DOT BELOW

Side Note

The IPA characters [ɩɷɼɿʅ-ʇʓʖʗʚʞʠʣʥʦʨ-ʯ] are not official IPA, but are not obsolete or archaic; they are still used in some traditions.
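For readers unfamiliar with the bracket notation, the sketch below expands a simple set pattern of this shape into individual characters. The function name `expand_simple_set` is hypothetical, and it handles only literal characters and `a-b` ranges inside one pair of brackets; it does not handle property expressions, escapes, or nested sets.

```python
def expand_simple_set(pattern: str) -> set:
    """Expand a pattern like '[a-cx]' into the set {'a', 'b', 'c', 'x'}."""
    body = pattern.strip()
    if body.startswith("[") and body.endswith("]"):
        body = body[1:-1]
    result = set()
    i = 0
    while i < len(body):
        # A character followed by '-' and another character is a range.
        if i + 2 < len(body) and body[i + 1] == "-":
            first, last = ord(body[i]), ord(body[i + 2])
            result.update(chr(cp) for cp in range(first, last + 1))
            i += 3
        else:
            result.add(body[i])
            i += 1
    return result


# The set from the side note, written with escapes for clarity.
ipa_side_note = expand_simple_set(
    "[\u0269\u0277\u027C\u027F\u0285-\u0287\u0293\u0296\u0297"
    "\u029A\u029E\u02A0\u02A3\u02A5\u02A6\u02A8-\u02AF]"
)
```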