The following was L2/09-029 R3. It is now a working draft of text and data.
In most cases, the assignment of scripts is straightfoward. There are, however, some edge cases that people may stumble over. In particular, the Common script value is too coarse for many applications. It only indicates that the character is used with multiple scripts, but not with which ones. Many applications need more information in order to do a reasonable job. For example, where a Character Picker organizes characters into buckets by script, such characters should show up in two (or more) buckets.
The following text provides information about such edge cases that may be useful to implementations. The following notation means that the following characters may, in some circumstances, be treated as if they had the listed scripts.
@Latin, Cyrillic,...
[Issue: should this be a data file instead?; it would be easier to use.]
[Ed note: define in #24 someplace that "explicit script" means a script value other than Common or Inherited.]
The following are functionally Latin script; applications may find it useful in certain circumstances to treat them as such.
U+005E ( ^ ) CIRCUMFLEX ACCENT
U+0060 ( ` ) GRAVE ACCENT
U+00A8 ( ¨ ) DIAERESIS
U+00AF ( ¯ ) MACRON
U+00B4 ( ´ ) ACUTE ACCENT
U+00B8 ( ¸ ) CEDILLA
U+02B9 ( ʹ ) MODIFIER LETTER PRIME
U+02BA ( ʺ ) MODIFIER LETTER DOUBLE PRIME
U+02BB ( ʻ ) MODIFIER LETTER TURNED COMMA
U+02BD ( ʽ ) MODIFIER LETTER REVERSED COMMA
U+02BE ( ʾ ) MODIFIER LETTER RIGHT HALF RING
U+02BF ( ʿ ) MODIFIER LETTER LEFT HALF RING
U+02C0 ( ˀ ) MODIFIER LETTER GLOTTAL STOP
U+02C1 ( ˁ ) MODIFIER LETTER REVERSED GLOTTAL STOP
U+02C2 ( ˂ ) MODIFIER LETTER LEFT ARROWHEAD
U+02C3 ( ˃ ) MODIFIER LETTER RIGHT ARROWHEAD
U+02C4 ( ˄ ) MODIFIER LETTER UP ARROWHEAD
U+02C5 ( ˅ ) MODIFIER LETTER DOWN ARROWHEAD
U+02C6 ( ˆ ) MODIFIER LETTER CIRCUMFLEX ACCENT
U+02C7 ( ˇ ) CARON
U+02C8 ( ˈ ) MODIFIER LETTER VERTICAL LINE
U+02C9 ( ˉ ) MODIFIER LETTER MACRON
U+02CA ( ˊ ) MODIFIER LETTER ACUTE ACCENT
U+02CB ( ˋ ) MODIFIER LETTER GRAVE ACCENT
U+02CC ( ˌ ) MODIFIER LETTER LOW VERTICAL LINE
U+02CD ( ˍ ) MODIFIER LETTER LOW MACRON
U+02CE ( ˎ ) MODIFIER LETTER LOW GRAVE ACCENT
U+02CF ( ˏ ) MODIFIER LETTER LOW ACUTE ACCENT
U+02D0 ( ː ) MODIFIER LETTER TRIANGULAR COLON
U+02D1 ( ˑ ) MODIFIER LETTER HALF TRIANGULAR COLON
U+02D2 ( ˒ ) MODIFIER LETTER CENTRED RIGHT HALF RING
U+02D3 ( ˓ ) MODIFIER LETTER CENTRED LEFT HALF RING
U+02D4 ( ˔ ) MODIFIER LETTER UP TACK
U+02D5 ( ˕ ) MODIFIER LETTER DOWN TACK
U+02D6 ( ˖ ) MODIFIER LETTER PLUS SIGN
U+02D7 ( ˗ ) MODIFIER LETTER MINUS SIGN
U+02D8 ( ˘ ) BREVE
U+02D9 ( ˙ ) DOT ABOVE
U+02DA ( ˚ ) RING ABOVE
U+02DB ( ˛ ) OGONEK
U+02DC ( ˜ ) SMALL TILDE
U+02DD ( ˝ ) DOUBLE ACUTE ACCENT
U+02DE ( ˞ ) MODIFIER LETTER RHOTIC HOOK
U+02DF ( ˟ ) MODIFIER LETTER CROSS ACCENT
U+02E5 ( ˥ ) MODIFIER LETTER EXTRA-HIGH TONE BAR
U+02E6 ( ˦ ) MODIFIER LETTER HIGH TONE BAR
U+02E7 ( ˧ ) MODIFIER LETTER MID TONE BAR
U+02E8 ( ˨ ) MODIFIER LETTER LOW TONE BAR
U+02E9 ( ˩ ) MODIFIER LETTER EXTRA-LOW TONE BAR
U+02EC ( ˬ ) MODIFIER LETTER VOICING
U+02ED ( ˭ ) MODIFIER LETTER UNASPIRATED
U+02EE ( ˮ ) MODIFIER LETTER DOUBLE APOSTROPHE
U+A700 ( ꜀ ) MODIFIER LETTER CHINESE TONE YIN PING
U+A701 ( ꜁ ) MODIFIER LETTER CHINESE TONE YANG PING
U+A702 ( ꜂ ) MODIFIER LETTER CHINESE TONE YIN SHANG
U+A703 ( ꜃ ) MODIFIER LETTER CHINESE TONE YANG SHANG
U+A704 ( ꜄ ) MODIFIER LETTER CHINESE TONE YIN QU
U+A705 ( ꜅ ) MODIFIER LETTER CHINESE TONE YANG QU
U+A706 ( ꜆ ) MODIFIER LETTER CHINESE TONE YIN RU
U+A707 ( ꜇ ) MODIFIER LETTER CHINESE TONE YANG RU
U+A708 ( ꜈ ) MODIFIER LETTER EXTRA-HIGH DOTTED TONE BAR
U+A709 ( ꜉ ) MODIFIER LETTER HIGH DOTTED TONE BAR
U+A70A ( ꜊ ) MODIFIER LETTER MID DOTTED TONE BAR
U+A70B ( ꜋ ) MODIFIER LETTER LOW DOTTED TONE BAR
U+A70C ( ꜌ ) MODIFIER LETTER EXTRA-LOW DOTTED TONE BAR
U+A70D ( ꜍ ) MODIFIER LETTER EXTRA-HIGH DOTTED LEFT-STEM TONE BAR
U+A70E ( ꜎ ) MODIFIER LETTER HIGH DOTTED LEFT-STEM TONE BAR
U+A70F ( ꜏ ) MODIFIER LETTER MID DOTTED LEFT-STEM TONE BAR
U+A710 ( ꜐ ) MODIFIER LETTER LOW DOTTED LEFT-STEM TONE BAR
U+A711 ( ꜑ ) MODIFIER LETTER EXTRA-LOW DOTTED LEFT-STEM TONE BAR
U+A712 ( ꜒ ) MODIFIER LETTER EXTRA-HIGH LEFT-STEM TONE BAR
U+A713 ( ꜓ ) MODIFIER LETTER HIGH LEFT-STEM TONE BAR
U+A714 ( ꜔ ) MODIFIER LETTER MID LEFT-STEM TONE BAR
U+A715 ( ꜕ ) MODIFIER LETTER LOW LEFT-STEM TONE BAR
U+A716 ( ꜖ ) MODIFIER LETTER EXTRA-LOW LEFT-STEM TONE BAR
U+A717 ( ꜗ ) MODIFIER LETTER DOT VERTICAL BAR
U+A718 ( ꜘ ) MODIFIER LETTER DOT SLASH
U+A719 ( ꜙ ) MODIFIER LETTER DOT HORIZONTAL BAR
U+A71A ( ꜚ ) MODIFIER LETTER LOWER RIGHT CORNER ANGLE
U+A71B ( ꜛ ) MODIFIER LETTER RAISED UP ARROW
U+A71C ( ꜜ ) MODIFIER LETTER RAISED DOWN ARROW
U+A71D ( ꜝ ) MODIFIER LETTER RAISED EXCLAMATION MARK
U+A71E ( ꜞ ) MODIFIER LETTER RAISED INVERTED EXCLAMATION MARK
U+A71F ( ꜟ ) MODIFIER LETTER LOW INVERTED EXCLAMATION MARK
U+A788 ( ꞈ ) MODIFIER LETTER LOW CIRCUMFLEX ACCENT
U+A789 ( ꞉ ) MODIFIER LETTER COLON
U+A78A ( ꞊ ) MODIFIER LETTER SHORT EQUALS SIGN
U+02EF ( ˯ ) MODIFIER LETTER LOW DOWN ARROWHEAD
U+02F0 ( ˰ ) MODIFIER LETTER LOW UP ARROWHEAD
U+02F1 ( ˱ ) MODIFIER LETTER LOW LEFT ARROWHEAD
U+02F2 ( ˲ ) MODIFIER LETTER LOW RIGHT ARROWHEAD
U+02F3 ( ˳ ) MODIFIER LETTER LOW RING
U+02F4 ( ˴ ) MODIFIER LETTER MIDDLE GRAVE ACCENT
U+02F5 ( ˵ ) MODIFIER LETTER MIDDLE DOUBLE GRAVE ACCENT
U+02F6 ( ˶ ) MODIFIER LETTER MIDDLE DOUBLE ACUTE ACCENT
U+02F7 ( ˷ ) MODIFIER LETTER LOW TILDE
U+02F8 ( ˸ ) MODIFIER LETTER RAISED COLON
U+02F9 ( ˹ ) MODIFIER LETTER BEGIN HIGH TONE
U+02FA ( ˺ ) MODIFIER LETTER END HIGH TONE
U+02FB ( ˻ ) MODIFIER LETTER BEGIN LOW TONE
U+02FC ( ˼ ) MODIFIER LETTER END LOW TONE
U+02FD ( ˽ ) MODIFIER LETTER SHELF
U+02FE ( ˾ ) MODIFIER LETTER OPEN SHELF
U+02FF ( ˿ ) MODIFIER LETTER LOW LEFT ARROW
U+A720 ( ꜠ ) MODIFIER LETTER STRESS AND HIGH TONE
U+A721 ( ꜡ ) MODIFIER LETTER STRESS AND LOW TONE
U+2102 ( ℂ ) DOUBLE-STRUCK CAPITAL C
U+210A ( ℊ ) SCRIPT SMALL G
U+210B ( ℋ ) SCRIPT CAPITAL H
U+210C ( ℌ ) BLACK-LETTER CAPITAL H
U+210D ( ℍ ) DOUBLE-STRUCK CAPITAL H
U+210E ( ℎ ) PLANCK CONSTANT
U+210F ( ℏ ) PLANCK CONSTANT OVER TWO PI
U+2110 ( ℐ ) SCRIPT CAPITAL I
U+2111 ( ℑ ) BLACK-LETTER CAPITAL I
U+2112 ( ℒ ) SCRIPT CAPITAL L
U+2113 ( ℓ ) SCRIPT SMALL L
U+2115 ( ℕ ) DOUBLE-STRUCK CAPITAL N
U+2119 ( ℙ ) DOUBLE-STRUCK CAPITAL P
U+211A ( ℚ ) DOUBLE-STRUCK CAPITAL Q
U+211B ( ℛ ) SCRIPT CAPITAL R
U+211C ( ℜ ) BLACK-LETTER CAPITAL R
U+211D ( ℝ ) DOUBLE-STRUCK CAPITAL R
U+2124 ( ℤ ) DOUBLE-STRUCK CAPITAL Z
U+2128 ( ℨ ) BLACK-LETTER CAPITAL Z
U+212C ( ℬ ) SCRIPT CAPITAL B
U+212D ( ℭ ) BLACK-LETTER CAPITAL C
U+212F ( ℯ ) SCRIPT SMALL E
U+2130 ( ℰ ) SCRIPT CAPITAL E
U+2131 ( ℱ ) SCRIPT CAPITAL F
U+2133 ( ℳ ) SCRIPT CAPITAL M
U+2134 ( ℴ ) SCRIPT SMALL O
U+2145 ( ⅅ ) DOUBLE-STRUCK ITALIC CAPITAL D
U+2146 ( ⅆ ) DOUBLE-STRUCK ITALIC SMALL D
U+2147 ( ⅇ ) DOUBLE-STRUCK ITALIC SMALL E
U+2148 ( ⅈ ) DOUBLE-STRUCK ITALIC SMALL I
U+2149 ( ⅉ ) DOUBLE-STRUCK ITALIC SMALL J
U+249C ( ⒜ ) PARENTHESIZED LATIN SMALL LETTER A
U+249D ( ⒝ ) PARENTHESIZED LATIN SMALL LETTER B
U+249E ( ⒞ ) PARENTHESIZED LATIN SMALL LETTER C
U+249F ( ⒟ ) PARENTHESIZED LATIN SMALL LETTER D
U+24A0 ( ⒠ ) PARENTHESIZED LATIN SMALL LETTER E
U+24A1 ( ⒡ ) PARENTHESIZED LATIN SMALL LETTER F
U+24A2 ( ⒢ ) PARENTHESIZED LATIN SMALL LETTER G
U+24A3 ( ⒣ ) PARENTHESIZED LATIN SMALL LETTER H
U+24A4 ( ⒤ ) PARENTHESIZED LATIN SMALL LETTER I
U+24A5 ( ⒥ ) PARENTHESIZED LATIN SMALL LETTER J
U+24A6 ( ⒦ ) PARENTHESIZED LATIN SMALL LETTER K
U+24A7 ( ⒧ ) PARENTHESIZED LATIN SMALL LETTER L
U+24A8 ( ⒨ ) PARENTHESIZED LATIN SMALL LETTER M
U+24A9 ( ⒩ ) PARENTHESIZED LATIN SMALL LETTER N
U+24AA ( ⒪ ) PARENTHESIZED LATIN SMALL LETTER O
U+24AB ( ⒫ ) PARENTHESIZED LATIN SMALL LETTER P
U+24AC ( ⒬ ) PARENTHESIZED LATIN SMALL LETTER Q
U+24AD ( ⒭ ) PARENTHESIZED LATIN SMALL LETTER R
U+24AE ( ⒮ ) PARENTHESIZED LATIN SMALL LETTER S
U+24AF ( ⒯ ) PARENTHESIZED LATIN SMALL LETTER T
U+24B0 ( ⒰ ) PARENTHESIZED LATIN SMALL LETTER U
U+24B1 ( ⒱ ) PARENTHESIZED LATIN SMALL LETTER V
U+24B2 ( ⒲ ) PARENTHESIZED LATIN SMALL LETTER W
U+24B3 ( ⒳ ) PARENTHESIZED LATIN SMALL LETTER X
U+24B4 ( ⒴ ) PARENTHESIZED LATIN SMALL LETTER Y
U+24B5 ( ⒵ ) PARENTHESIZED LATIN SMALL LETTER Z
U+24B6 ( Ⓐ ) CIRCLED LATIN CAPITAL LETTER A
U+24B7 ( Ⓑ ) CIRCLED LATIN CAPITAL LETTER B
U+24B8 ( Ⓒ ) CIRCLED LATIN CAPITAL LETTER C
U+24B9 ( Ⓓ ) CIRCLED LATIN CAPITAL LETTER D
U+24BA ( Ⓔ ) CIRCLED LATIN CAPITAL LETTER E
U+24BB ( Ⓕ ) CIRCLED LATIN CAPITAL LETTER F
U+24BC ( Ⓖ ) CIRCLED LATIN CAPITAL LETTER G
U+24BD ( Ⓗ ) CIRCLED LATIN CAPITAL LETTER H
U+24BE ( Ⓘ ) CIRCLED LATIN CAPITAL LETTER I
U+24BF ( Ⓙ ) CIRCLED LATIN CAPITAL LETTER J
U+24C0 ( Ⓚ ) CIRCLED LATIN CAPITAL LETTER K
U+24C1 ( Ⓛ ) CIRCLED LATIN CAPITAL LETTER L
U+24C2 ( Ⓜ ) CIRCLED LATIN CAPITAL LETTER M
U+24C3 ( Ⓝ ) CIRCLED LATIN CAPITAL LETTER N
U+24C4 ( Ⓞ ) CIRCLED LATIN CAPITAL LETTER O
U+24C5 ( Ⓟ ) CIRCLED LATIN CAPITAL LETTER P
U+24C6 ( Ⓠ ) CIRCLED LATIN CAPITAL LETTER Q
U+24C7 ( Ⓡ ) CIRCLED LATIN CAPITAL LETTER R
U+24C8 ( Ⓢ ) CIRCLED LATIN CAPITAL LETTER S
U+24C9 ( Ⓣ ) CIRCLED LATIN CAPITAL LETTER T
U+24CA ( Ⓤ ) CIRCLED LATIN CAPITAL LETTER U
U+24CB ( Ⓥ ) CIRCLED LATIN CAPITAL LETTER V
U+24CC ( Ⓦ ) CIRCLED LATIN CAPITAL LETTER W
U+24CD ( Ⓧ ) CIRCLED LATIN CAPITAL LETTER X
U+24CE ( Ⓨ ) CIRCLED LATIN CAPITAL LETTER Y
U+24CF ( Ⓩ ) CIRCLED LATIN CAPITAL LETTER Z
U+24D0 ( ⓐ ) CIRCLED LATIN SMALL LETTER A
U+24D1 ( ⓑ ) CIRCLED LATIN SMALL LETTER B
U+24D2 ( ⓒ ) CIRCLED LATIN SMALL LETTER C
U+24D3 ( ⓓ ) CIRCLED LATIN SMALL LETTER D
U+24D4 ( ⓔ ) CIRCLED LATIN SMALL LETTER E
U+24D5 ( ⓕ ) CIRCLED LATIN SMALL LETTER F
U+24D6 ( ⓖ ) CIRCLED LATIN SMALL LETTER G
U+24D7 ( ⓗ ) CIRCLED LATIN SMALL LETTER H
U+24D8 ( ⓘ ) CIRCLED LATIN SMALL LETTER I
U+24D9 ( ⓙ ) CIRCLED LATIN SMALL LETTER J
U+24DA ( ⓚ ) CIRCLED LATIN SMALL LETTER K
U+24DB ( ⓛ ) CIRCLED LATIN SMALL LETTER L
U+24DC ( ⓜ ) CIRCLED LATIN SMALL LETTER M
U+24DD ( ⓝ ) CIRCLED LATIN SMALL LETTER N
U+24DE ( ⓞ ) CIRCLED LATIN SMALL LETTER O
U+24DF ( ⓟ ) CIRCLED LATIN SMALL LETTER P
U+24E0 ( ⓠ ) CIRCLED LATIN SMALL LETTER Q
U+24E1 ( ⓡ ) CIRCLED LATIN SMALL LETTER R
U+24E2 ( ⓢ ) CIRCLED LATIN SMALL LETTER S
U+24E3 ( ⓣ ) CIRCLED LATIN SMALL LETTER T
U+24E4 ( ⓤ ) CIRCLED LATIN SMALL LETTER U
U+24E5 ( ⓥ ) CIRCLED LATIN SMALL LETTER V
U+24E6 ( ⓦ ) CIRCLED LATIN SMALL LETTER W
U+24E7 ( ⓧ ) CIRCLED LATIN SMALL LETTER X
U+24E8 ( ⓨ ) CIRCLED LATIN SMALL LETTER Y
U+24E9 ( ⓩ ) CIRCLED LATIN SMALL LETTER Z
U+3250 ( ㉐ ) PARTNERSHIP SIGN
U+32CC ( ㋌ ) SQUARE HG
U+32CD ( ㋍ ) SQUARE ERG
U+32CE ( ㋎ ) SQUARE EV
U+32CF ( ㋏ ) LIMITED LIABILITY SIGN
U+3371 ( ㍱ ) SQUARE HPA
U+3372 ( ㍲ ) SQUARE DA
U+3373 ( ㍳ ) SQUARE AU
U+3374 ( ㍴ ) SQUARE BAR
U+3375 ( ㍵ ) SQUARE OV
U+3376 ( ㍶ ) SQUARE PC
U+3377 ( ㍷ ) SQUARE DM
U+3378 ( ㍸ ) SQUARE DM SQUARED
U+3379 ( ㍹ ) SQUARE DM CUBED
U+337A ( ㍺ ) SQUARE IU
U+3380 ( ㎀ ) SQUARE PA AMPS
U+3381 ( ㎁ ) SQUARE NA
U+3382 ( ㎂ ) SQUARE MU A
U+3383 ( ㎃ ) SQUARE MA
U+3384 ( ㎄ ) SQUARE KA
U+3385 ( ㎅ ) SQUARE KB
U+3386 ( ㎆ ) SQUARE MB
U+3387 ( ㎇ ) SQUARE GB
U+3388 ( ㎈ ) SQUARE CAL
U+3389 ( ㎉ ) SQUARE KCAL
U+338A ( ㎊ ) SQUARE PF
U+338B ( ㎋ ) SQUARE NF
U+338C ( ㎌ ) SQUARE MU F
U+338D ( ㎍ ) SQUARE MU G
U+338E ( ㎎ ) SQUARE MG
U+338F ( ㎏ ) SQUARE KG
U+3390 ( ㎐ ) SQUARE HZ
U+3391 ( ㎑ ) SQUARE KHZ
U+3392 ( ㎒ ) SQUARE MHZ
U+3393 ( ㎓ ) SQUARE GHZ
U+3394 ( ㎔ ) SQUARE THZ
U+3395 ( ㎕ ) SQUARE MU L
U+3396 ( ㎖ ) SQUARE ML
U+3397 ( ㎗ ) SQUARE DL
U+3398 ( ㎘ ) SQUARE KL
U+3399 ( ㎙ ) SQUARE FM
U+339A ( ㎚ ) SQUARE NM
U+339B ( ㎛ ) SQUARE MU M
U+339C ( ㎜ ) SQUARE MM
U+339D ( ㎝ ) SQUARE CM
U+339E ( ㎞ ) SQUARE KM
U+339F ( ㎟ ) SQUARE MM SQUARED
U+33A0 ( ㎠ ) SQUARE CM SQUARED
U+33A1 ( ㎡ ) SQUARE M SQUARED
U+33A2 ( ㎢ ) SQUARE KM SQUARED
U+33A3 ( ㎣ ) SQUARE MM CUBED
U+33A4 ( ㎤ ) SQUARE CM CUBED
U+33A5 ( ㎥ ) SQUARE M CUBED
U+33A6 ( ㎦ ) SQUARE KM CUBED
U+33A7 ( ㎧ ) SQUARE M OVER S
U+33A8 ( ㎨ ) SQUARE M OVER S SQUARED
U+33A9 ( ㎩ ) SQUARE PA
U+33AA ( ㎪ ) SQUARE KPA
U+33AB ( ㎫ ) SQUARE MPA
U+33AC ( ㎬ ) SQUARE GPA
U+33AD ( ㎭ ) SQUARE RAD
U+33AE ( ㎮ ) SQUARE RAD OVER S
U+33AF ( ㎯ ) SQUARE RAD OVER S SQUARED
U+33B0 ( ㎰ ) SQUARE PS
U+33B1 ( ㎱ ) SQUARE NS
U+33B2 ( ㎲ ) SQUARE MU S
U+33B3 ( ㎳ ) SQUARE MS
U+33B4 ( ㎴ ) SQUARE PV
U+33B5 ( ㎵ ) SQUARE NV
U+33B6 ( ㎶ ) SQUARE MU V
U+33B7 ( ㎷ ) SQUARE MV
U+33B8 ( ㎸ ) SQUARE KV
U+33B9 ( ㎹ ) SQUARE MV MEGA
U+33BA ( ㎺ ) SQUARE PW
U+33BB ( ㎻ ) SQUARE NW
U+33BC ( ㎼ ) SQUARE MU W
U+33BD ( ㎽ ) SQUARE MW
U+33BE ( ㎾ ) SQUARE KW
U+33BF ( ㎿ ) SQUARE MW MEGA
U+33C0 ( ㏀ ) SQUARE K OHM
U+33C1 ( ㏁ ) SQUARE M OHM
U+33C2 ( ㏂ ) SQUARE AM
U+33C3 ( ㏃ ) SQUARE BQ
U+33C4 ( ㏄ ) SQUARE CC
U+33C5 ( ㏅ ) SQUARE CD
U+33C6 ( ㏆ ) SQUARE C OVER KG
U+33C7 ( ㏇ ) SQUARE CO
U+33C8 ( ㏈ ) SQUARE DB
U+33C9 ( ㏉ ) SQUARE GY
U+33CA ( ㏊ ) SQUARE HA
U+33CB ( ㏋ ) SQUARE HP
U+33CC ( ㏌ ) SQUARE IN
U+33CD ( ㏍ ) SQUARE KK
U+33CE ( ㏎ ) SQUARE KM CAPITAL
U+33CF ( ㏏ ) SQUARE KT
U+33D0 ( ㏐ ) SQUARE LM
U+33D1 ( ㏑ ) SQUARE LN
U+33D2 ( ㏒ ) SQUARE LOG
U+33D3 ( ㏓ ) SQUARE LX
U+33D4 ( ㏔ ) SQUARE MB SMALL
U+33D5 ( ㏕ ) SQUARE MIL
U+33D6 ( ㏖ ) SQUARE MOL
U+33D7 ( ㏗ ) SQUARE PH
U+33D8 ( ㏘ ) SQUARE PM
U+33D9 ( ㏙ ) SQUARE PPM
U+33DA ( ㏚ ) SQUARE PR
U+33DB ( ㏛ ) SQUARE SR
U+33DC ( ㏜ ) SQUARE SV
U+33DD ( ㏝ ) SQUARE WB
U+33DE ( ㏞ ) SQUARE V OVER M
U+33DF ( ㏟ ) SQUARE A OVER M
U+33FF ( ㏿ ) SQUARE GAL
The following is primarily used in Cyrillic and Latin.
U+02BC ( ʼ ) MODIFIER LETTER APOSTROPHE
While the following have the form of Greek or Cyrillic letters, they are functionally Latin phonetic characters.
U+1D26 ( ᴦ ) GREEK LETTER SMALL CAPITAL GAMMA
U+1D27 ( ᴧ ) GREEK LETTER SMALL CAPITAL LAMDA
U+1D28 ( ᴨ ) GREEK LETTER SMALL CAPITAL PI
U+1D29 ( ᴩ ) GREEK LETTER SMALL CAPITAL RHO
U+1D2A ( ᴪ ) GREEK LETTER SMALL CAPITAL PSI
U+1D2B ( ᴫ ) CYRILLIC LETTER SMALL CAPITAL EL
U+1D5D ( ᵝ ) MODIFIER LETTER SMALL BETA
U+1D5E ( ᵞ ) MODIFIER LETTER SMALL GREEK GAMMA
U+1D5F ( ᵟ ) MODIFIER LETTER SMALL DELTA
U+1D60 ( ᵠ ) MODIFIER LETTER SMALL GREEK PHI
U+1D61 ( ᵡ ) MODIFIER LETTER SMALL CHI
U+1D66 ( ᵦ ) GREEK SUBSCRIPT SMALL LETTER BETA
U+1D67 ( ᵧ ) GREEK SUBSCRIPT SMALL LETTER GAMMA
U+1D68 ( ᵨ ) GREEK SUBSCRIPT SMALL LETTER RHO
U+1D69 ( ᵩ ) GREEK SUBSCRIPT SMALL LETTER PHI
U+1D6A ( ᵪ ) GREEK SUBSCRIPT SMALL LETTER CHI
U+1D78 ( ᵸ ) MODIFIER LETTER CYRILLIC EN
U+1DBF ( ᶿ ) MODIFIER LETTER SMALL THETA
These have no explicit script just because they map to general punctuation marks or modifier letters.
U+0374 ( ʹ ) GREEK NUMERAL SIGN
U+037E ( ; ) GREEK QUESTION MARK
U+0385 ( ΅ ) GREEK DIALYTIKA TONOS
U+0387 ( · ) GREEK ANO TELEIA
In contrast, the following does have an explicit script, and is the only Sk (Modifier_Symbol) that does. Note that the corresponding U+0374 is a Modifier_Letter.
U+0375 ( ͵ ) GREEK LOWER NUMERAL SIGN
U+00B5 ( µ ) MICRO SIGN
U+0589 ( ։ ) ARMENIAN FULL STOP
U+0600 ( ) ARABIC NUMBER SIGN
U+0601 ( ) ARABIC SIGN SANAH
U+0602 ( ) ARABIC FOOTNOTE MARKER
U+0603 ( ) ARABIC SIGN SAFHA
U+FDFD ( ﷽ ) ARABIC LIGATURE BISMILLAH AR-RAHMAN AR-RAHEEM
(Note that the U+06Fx EXTENDED ARABIC-INDIC DIGIT x characters have already the specific script Arabic)
U+0660 ( ٠ ) ARABIC-INDIC DIGIT ZERO
U+0661 ( ١ ) ARABIC-INDIC DIGIT ONE
U+0662 ( ٢ ) ARABIC-INDIC DIGIT TWO
U+0663 ( ٣ ) ARABIC-INDIC DIGIT THREE
U+0664 ( ٤ ) ARABIC-INDIC DIGIT FOUR
U+0665 ( ٥ ) ARABIC-INDIC DIGIT FIVE
U+0666 ( ٦ ) ARABIC-INDIC DIGIT SIX
U+0667 ( ٧ ) ARABIC-INDIC DIGIT SEVEN
U+0668 ( ٨ ) ARABIC-INDIC DIGIT EIGHT
U+0669 ( ٩ ) ARABIC-INDIC DIGIT NINE
U+060C ( ، ) ARABIC COMMA
U+061B ( ؛ ) ARABIC SEMICOLON
U+061F ( ؟ ) ARABIC QUESTION MARK
U+06DD ( ) ARABIC END OF AYAH
U+0640 ( ـ ) ARABIC TATWEEL
U+064B ( ً ) ARABIC FATHATAN
U+064C ( ٌ ) ARABIC DAMMATAN
U+064D ( ٍ ) ARABIC KASRATAN
U+064E ( َ ) ARABIC FATHA
U+064F ( ُ ) ARABIC DAMMA
U+0650 ( ِ ) ARABIC KASRA
U+0651 ( ّ ) ARABIC SHADDA
U+0652 ( ْ ) ARABIC SUKUN
U+0653 ( ٓ ) ARABIC MADDAH ABOVE
U+0654 ( ٔ ) ARABIC HAMZA ABOVE
U+0655 ( ٕ ) ARABIC HAMZA BELOW
U+0670 ( ٰ ) ARABIC LETTER SUPERSCRIPT ALEF
U+02EA ( ˪ ) MODIFIER LETTER YIN DEPARTING TONE MARK
U+02EB ( ˫ ) MODIFIER LETTER YANG DEPARTING TONE MARK
U+0951 ( ॑ ) DEVANAGARI STRESS SIGN UDATTA
U+0952 ( ॒ ) DEVANAGARI STRESS SIGN ANUDATTA
U+0970 ( ॰ ) DEVANAGARI ABBREVIATION SIGN
The annotations say "scripts of India". The dandas are not normally used with Malayalam, Kannada, Telugu, Tamil and Gujarati, and presumably these are not used with Urdu, etc.
U+0964 ( । ) DEVANAGARI DANDA
U+0965 ( ॥ ) DEVANAGARI DOUBLE DANDA
The annotations say "The Vedic signs for jihvamuliya and upadhmaniya were encoded in the Kannada block, but are intended for general Vedic use with all scripts", that probably means "with all Brahmi-based Indic scripts".
U+0CF1 ( ೱ ) KANNADA SIGN JIHVAMULIYA
U+0CF2 ( ೲ ) KANNADA SIGN UPADHMANIYA
Note: historic, Latin, Cyrillic, Greek, Coptic
U+10FB ( ჻ ) GEORGIAN PARAGRAPH SEPARATOR
U+16EB ( ᛫ ) RUNIC SINGLE PUNCTUATION
U+16EC ( ᛬ ) RUNIC MULTIPLE PUNCTUATION
U+16ED ( ᛭ ) RUNIC CROSS PUNCTUATION
U+1735 ( ᜵ ) PHILIPPINE SINGLE PUNCTUATION
U+1736 ( ᜶ ) PHILIPPINE DOUBLE PUNCTUATION
U+1802 ( ᠂ ) MONGOLIAN COMMA
U+1803 ( ᠃ ) MONGOLIAN FULL STOP
U+1805 ( ᠅ ) MONGOLIAN FOUR DOTS
U+3031 ( 〱 ) VERTICAL KANA REPEAT MARK
U+3032 ( 〲 ) VERTICAL KANA REPEAT WITH VOICED SOUND MARK
U+3033 ( 〳 ) VERTICAL KANA REPEAT MARK UPPER HALF
U+3034 ( 〴 ) VERTICAL KANA REPEAT WITH VOICED SOUND MARK UPPER HALF
U+3035 ( 〵 ) VERTICAL KANA REPEAT MARK LOWER HALF
U+3099 ( ゙ ) COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK
U+309A ( ゚ ) COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
U+309B ( ゛ ) KATAKANA-HIRAGANA VOICED SOUND MARK
U+309C ( ゜ ) KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
U+30A0 ( ゠ ) KATAKANA-HIRAGANA DOUBLE HYPHEN
U+30FB ( ・ ) KATAKANA MIDDLE DOT
U+30FC ( ー ) KATAKANA-HIRAGANA PROLONGED SOUND MARK
U+FF65 ( ・ ) HALFWIDTH KATAKANA MIDDLE DOT
U+FF70 ( ー ) HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
U+FF9E ( ゙ ) HALFWIDTH KATAKANA VOICED SOUND MARK
U+FF9F ( ゚ ) HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
The following are used with the Japanese Writing system:
[:Block=Kanbun:]
U+302E ( 〮 ) HANGUL SINGLE DOT TONE MARK
U+302F ( 〯 ) HANGUL DOUBLE DOT TONE MARK
General comment, use the special codes for writing systems: Jpan, Kore,...
[:Block=Ideographic_Description_Characters:]
U+31C0 ( ㇀ ) CJK STROKE T
U+31C1 ( ㇁ ) CJK STROKE WG
U+31C2 ( ㇂ ) CJK STROKE XG
U+31C3 ( ㇃ ) CJK STROKE BXG
U+31C4 ( ㇄ ) CJK STROKE SW
U+31C5 ( ㇅ ) CJK STROKE HZZ
U+31C6 ( ㇆ ) CJK STROKE HZG
U+31C7 ( ㇇ ) CJK STROKE HP
U+31C8 ( ㇈ ) CJK STROKE HZWG
U+31C9 ( ㇉ ) CJK STROKE SZWG
U+31CA ( ㇊ ) CJK STROKE HZT
U+31CB ( ㇋ ) CJK STROKE HZZP
U+31CC ( ㇌ ) CJK STROKE HPWG
U+31CD ( ㇍ ) CJK STROKE HZW
U+31CE ( ㇎ ) CJK STROKE HZZZ
U+31CF ( ㇏ ) CJK STROKE N
U+31D0 ( ㇐ ) CJK STROKE H
U+31D1 ( ㇑ ) CJK STROKE S
U+31D2 ( ㇒ ) CJK STROKE P
U+31D3 ( ㇓ ) CJK STROKE SP
U+31D4 ( ㇔ ) CJK STROKE D
U+31D5 ( ㇕ ) CJK STROKE HZ
U+31D6 ( ㇖ ) CJK STROKE HG
U+31D7 ( ㇗ ) CJK STROKE SZ
U+31D8 ( ㇘ ) CJK STROKE SWZ
U+31D9 ( ㇙ ) CJK STROKE ST
U+31DA ( ㇚ ) CJK STROKE SG
U+31DB ( ㇛ ) CJK STROKE PD
U+31DC ( ㇜ ) CJK STROKE PZ
U+31DD ( ㇝ ) CJK STROKE TN
U+31DE ( ㇞ ) CJK STROKE SZZ
U+31DF ( ㇟ ) CJK STROKE SWG
U+31E0 ( ㇠ ) CJK STROKE HXWG
U+31E1 ( ㇡ ) CJK STROKE HZZZG
U+31E2 ( ㇢ ) CJK STROKE PG
U+31E3 ( ㇣ ) CJK STROKE Q
U+3220 ( ㈠ ) PARENTHESIZED IDEOGRAPH ONE
U+3221 ( ㈡ ) PARENTHESIZED IDEOGRAPH TWO
U+3222 ( ㈢ ) PARENTHESIZED IDEOGRAPH THREE
U+3223 ( ㈣ ) PARENTHESIZED IDEOGRAPH FOUR
U+3224 ( ㈤ ) PARENTHESIZED IDEOGRAPH FIVE
U+3225 ( ㈥ ) PARENTHESIZED IDEOGRAPH SIX
U+3226 ( ㈦ ) PARENTHESIZED IDEOGRAPH SEVEN
U+3227 ( ㈧ ) PARENTHESIZED IDEOGRAPH EIGHT
U+3228 ( ㈨ ) PARENTHESIZED IDEOGRAPH NINE
U+3229 ( ㈩ ) PARENTHESIZED IDEOGRAPH TEN
U+322A ( ㈪ ) PARENTHESIZED IDEOGRAPH MOON
U+322B ( ㈫ ) PARENTHESIZED IDEOGRAPH FIRE
U+322C ( ㈬ ) PARENTHESIZED IDEOGRAPH WATER
U+322D ( ㈭ ) PARENTHESIZED IDEOGRAPH WOOD
U+322E ( ㈮ ) PARENTHESIZED IDEOGRAPH METAL
U+322F ( ㈯ ) PARENTHESIZED IDEOGRAPH EARTH
U+3230 ( ㈰ ) PARENTHESIZED IDEOGRAPH SUN
U+3231 ( ㈱ ) PARENTHESIZED IDEOGRAPH STOCK
U+3232 ( ㈲ ) PARENTHESIZED IDEOGRAPH HAVE
U+3233 ( ㈳ ) PARENTHESIZED IDEOGRAPH SOCIETY
U+3234 ( ㈴ ) PARENTHESIZED IDEOGRAPH NAME
U+3235 ( ㈵ ) PARENTHESIZED IDEOGRAPH SPECIAL
U+3236 ( ㈶ ) PARENTHESIZED IDEOGRAPH FINANCIAL
U+3237 ( ㈷ ) PARENTHESIZED IDEOGRAPH CONGRATULATION
U+3238 ( ㈸ ) PARENTHESIZED IDEOGRAPH LABOR
U+3239 ( ㈹ ) PARENTHESIZED IDEOGRAPH REPRESENT
U+323A ( ㈺ ) PARENTHESIZED IDEOGRAPH CALL
U+323B ( ㈻ ) PARENTHESIZED IDEOGRAPH STUDY
U+323C ( ㈼ ) PARENTHESIZED IDEOGRAPH SUPERVISE
U+323D ( ㈽ ) PARENTHESIZED IDEOGRAPH ENTERPRISE
U+323E ( ㈾ ) PARENTHESIZED IDEOGRAPH RESOURCE
U+323F ( ㈿ ) PARENTHESIZED IDEOGRAPH ALLIANCE
U+3240 ( ㉀ ) PARENTHESIZED IDEOGRAPH FESTIVAL
U+3241 ( ㉁ ) PARENTHESIZED IDEOGRAPH REST
U+3242 ( ㉂ ) PARENTHESIZED IDEOGRAPH SELF
U+3243 ( ㉃ ) PARENTHESIZED IDEOGRAPH REACH
U+3280 ( ㊀ ) CIRCLED IDEOGRAPH ONE
U+3281 ( ㊁ ) CIRCLED IDEOGRAPH TWO
U+3282 ( ㊂ ) CIRCLED IDEOGRAPH THREE
U+3283 ( ㊃ ) CIRCLED IDEOGRAPH FOUR
U+3284 ( ㊄ ) CIRCLED IDEOGRAPH FIVE
U+3285 ( ㊅ ) CIRCLED IDEOGRAPH SIX
U+3286 ( ㊆ ) CIRCLED IDEOGRAPH SEVEN
U+3287 ( ㊇ ) CIRCLED IDEOGRAPH EIGHT
U+3288 ( ㊈ ) CIRCLED IDEOGRAPH NINE
U+3289 ( ㊉ ) CIRCLED IDEOGRAPH TEN
U+328A ( ㊊ ) CIRCLED IDEOGRAPH MOON
U+328B ( ㊋ ) CIRCLED IDEOGRAPH FIRE
U+328C ( ㊌ ) CIRCLED IDEOGRAPH WATER
U+328D ( ㊍ ) CIRCLED IDEOGRAPH WOOD
U+328E ( ㊎ ) CIRCLED IDEOGRAPH METAL
U+328F ( ㊏ ) CIRCLED IDEOGRAPH EARTH
U+3290 ( ㊐ ) CIRCLED IDEOGRAPH SUN
U+3291 ( ㊑ ) CIRCLED IDEOGRAPH STOCK
U+3292 ( ㊒ ) CIRCLED IDEOGRAPH HAVE
U+3293 ( ㊓ ) CIRCLED IDEOGRAPH SOCIETY
U+3294 ( ㊔ ) CIRCLED IDEOGRAPH NAME
U+3295 ( ㊕ ) CIRCLED IDEOGRAPH SPECIAL
U+3296 ( ㊖ ) CIRCLED IDEOGRAPH FINANCIAL
U+3297 ( ㊗ ) CIRCLED IDEOGRAPH CONGRATULATION
U+3298 ( ㊘ ) CIRCLED IDEOGRAPH LABOR
U+3299 ( ㊙ ) CIRCLED IDEOGRAPH SECRET
U+329A ( ㊚ ) CIRCLED IDEOGRAPH MALE
U+329B ( ㊛ ) CIRCLED IDEOGRAPH FEMALE
U+329C ( ㊜ ) CIRCLED IDEOGRAPH SUITABLE
U+329D ( ㊝ ) CIRCLED IDEOGRAPH EXCELLENT
U+329E ( ㊞ ) CIRCLED IDEOGRAPH PRINT
U+329F ( ㊟ ) CIRCLED IDEOGRAPH ATTENTION
U+32A0 ( ㊠ ) CIRCLED IDEOGRAPH ITEM
U+32A1 ( ㊡ ) CIRCLED IDEOGRAPH REST
U+32A2 ( ㊢ ) CIRCLED IDEOGRAPH COPY
U+32A3 ( ㊣ ) CIRCLED IDEOGRAPH CORRECT
U+32A4 ( ㊤ ) CIRCLED IDEOGRAPH HIGH
U+32A5 ( ㊥ ) CIRCLED IDEOGRAPH CENTRE
U+32A6 ( ㊦ ) CIRCLED IDEOGRAPH LOW
U+32A7 ( ㊧ ) CIRCLED IDEOGRAPH LEFT
U+32A8 ( ㊨ ) CIRCLED IDEOGRAPH RIGHT
U+32A9 ( ㊩ ) CIRCLED IDEOGRAPH MEDICINE
U+32AA ( ㊪ ) CIRCLED IDEOGRAPH RELIGION
U+32AB ( ㊫ ) CIRCLED IDEOGRAPH STUDY
U+32AC ( ㊬ ) CIRCLED IDEOGRAPH SUPERVISE
U+32AD ( ㊭ ) CIRCLED IDEOGRAPH ENTERPRISE
U+32AE ( ㊮ ) CIRCLED IDEOGRAPH RESOURCE
U+32AF ( ㊯ ) CIRCLED IDEOGRAPH ALLIANCE
U+32B0 ( ㊰ ) CIRCLED IDEOGRAPH NIGHT
U+32C0 ( ㋀ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR JANUARY
U+32C1 ( ㋁ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR FEBRUARY
U+32C2 ( ㋂ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR MARCH
U+32C3 ( ㋃ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR APRIL
U+32C4 ( ㋄ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR MAY
U+32C5 ( ㋅ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR JUNE
U+32C6 ( ㋆ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR JULY
U+32C7 ( ㋇ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR AUGUST
U+32C8 ( ㋈ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR SEPTEMBER
U+32C9 ( ㋉ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR OCTOBER
U+32CA ( ㋊ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR NOVEMBER
U+32CB ( ㋋ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DECEMBER
U+3358 ( ㍘ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR ZERO
U+3359 ( ㍙ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR ONE
U+335A ( ㍚ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWO
U+335B ( ㍛ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR THREE
U+335C ( ㍜ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR FOUR
U+335D ( ㍝ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR FIVE
U+335E ( ㍞ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR SIX
U+335F ( ㍟ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR SEVEN
U+3360 ( ㍠ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR EIGHT
U+3361 ( ㍡ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR NINE
U+3362 ( ㍢ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TEN
U+3363 ( ㍣ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR ELEVEN
U+3364 ( ㍤ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWELVE
U+3365 ( ㍥ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR THIRTEEN
U+3366 ( ㍦ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR FOURTEEN
U+3367 ( ㍧ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR FIFTEEN
U+3368 ( ㍨ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR SIXTEEN
U+3369 ( ㍩ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR SEVENTEEN
U+336A ( ㍪ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR EIGHTEEN
U+336B ( ㍫ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR NINETEEN
U+336C ( ㍬ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWENTY
U+336D ( ㍭ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWENTY-ONE
U+336E ( ㍮ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWENTY-TWO
U+336F ( ㍯ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWENTY-THREE
U+3370 ( ㍰ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TWENTY-FOUR
U+337B ( ㍻ ) SQUARE ERA NAME HEISEI
U+337C ( ㍼ ) SQUARE ERA NAME SYOUWA
U+337D ( ㍽ ) SQUARE ERA NAME TAISYOU
U+337E ( ㍾ ) SQUARE ERA NAME MEIZI
U+337F ( ㍿ ) SQUARE CORPORATION
U+33E0 ( ㏠ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY ONE
U+33E1 ( ㏡ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWO
U+33E2 ( ㏢ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY THREE
U+33E3 ( ㏣ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY FOUR
U+33E4 ( ㏤ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY FIVE
U+33E5 ( ㏥ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY SIX
U+33E6 ( ㏦ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY SEVEN
U+33E7 ( ㏧ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY EIGHT
U+33E8 ( ㏨ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY NINE
U+33E9 ( ㏩ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TEN
U+33EA ( ㏪ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY ELEVEN
U+33EB ( ㏫ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWELVE
U+33EC ( ㏬ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY THIRTEEN
U+33ED ( ㏭ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY FOURTEEN
U+33EE ( ㏮ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY FIFTEEN
U+33EF ( ㏯ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY SIXTEEN
U+33F0 ( ㏰ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY SEVENTEEN
U+33F1 ( ㏱ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY EIGHTEEN
U+33F2 ( ㏲ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY NINETEEN
U+33F3 ( ㏳ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY
U+33F4 ( ㏴ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-ONE
U+33F5 ( ㏵ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-TWO
U+33F6 ( ㏶ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-THREE
U+33F7 ( ㏷ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-FOUR
U+33F8 ( ㏸ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-FIVE
U+33F9 ( ㏹ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-SIX
U+33FA ( ㏺ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-SEVEN
U+33FB ( ㏻ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-EIGHT
U+33FC ( ㏼ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY TWENTY-NINE
U+33FD ( ㏽ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY THIRTY
U+33FE ( ㏾ ) IDEOGRAPHIC TELEGRAPH SYMBOL FOR DAY THIRTY-ONE
U+3001 ( 、 ) IDEOGRAPHIC COMMA
U+3002 ( 。 ) IDEOGRAPHIC FULL STOP
U+FF61 ( 。 ) HALFWIDTH IDEOGRAPHIC FULL STOP
U+FF64 ( 、 ) HALFWIDTH IDEOGRAPHIC COMMA
Jpan = Japanese (alias for Han + Hiragana + Katakana)
U+3012 ( 〒 ) POSTAL MARK
U+3036 ( 〶 ) CIRCLED POSTAL MARK
Kore = Korean (alias for Hangul + Han)
U+327F ( ㉿ ) KOREAN STANDARD SYMBOL
For easier comparison, these are also broken down by General Category
U+3001 ( 、 ) IDEOGRAPHIC COMMA
U+3002 ( 。 ) IDEOGRAPHIC FULL STOP
U+3003 ( 〃 ) DITTO MARK
U+301C ( 〜 ) WAVE DASH
U+301D ( 〝 ) REVERSED DOUBLE PRIME QUOTATION MARK
U+301E ( 〞 ) DOUBLE PRIME QUOTATION MARK
U+301F ( 〟 ) LOW DOUBLE PRIME QUOTATION MARK
U+3030 ( 〰 ) WAVY DASH
U+303D ( 〽 ) PART ALTERNATION MARK
U+FE45 ( ﹅ ) SESAME DOT
U+FE46 ( ﹆ ) WHITE SESAME DOT
U+3004 ( 〄 ) JAPANESE INDUSTRIAL STANDARD SYMBOL
U+3020 ( 〠 ) POSTAL MARK FACE
U+3013 ( 〓 ) GETA MARK
U+3037 ( 〷 ) IDEOGRAPHIC TELEGRAPH LINE FEED SEPARATOR SYMBOL
U+303E ( 〾 ) IDEOGRAPHIC VARIATION INDICATOR
U+303F ( 〿 ) IDEOGRAPHIC HALF FILL SPACE
U+3006 ( 〆 ) IDEOGRAPHIC CLOSING MARK
U+303C ( 〼 ) MASU MARK
U+3008 ( 〈 ) LEFT ANGLE BRACKET
U+3009 ( 〉 ) RIGHT ANGLE BRACKET
U+300A ( 《 ) LEFT DOUBLE ANGLE BRACKET
U+300B ( 》 ) RIGHT DOUBLE ANGLE BRACKET
U+300C ( 「 ) LEFT CORNER BRACKET
U+300D ( 」 ) RIGHT CORNER BRACKET
U+300E ( 『 ) LEFT WHITE CORNER BRACKET
U+300F ( 』 ) RIGHT WHITE CORNER BRACKET
U+3010 ( 【 ) LEFT BLACK LENTICULAR BRACKET
U+3011 ( 】 ) RIGHT BLACK LENTICULAR BRACKET
U+3014 ( 〔 ) LEFT TORTOISE SHELL BRACKET
U+3015 ( 〕 ) RIGHT TORTOISE SHELL BRACKET
U+3016 ( 〖 ) LEFT WHITE LENTICULAR BRACKET
U+3017 ( 〗 ) RIGHT WHITE LENTICULAR BRACKET
U+3018 ( 〘 ) LEFT WHITE TORTOISE SHELL BRACKET
U+3019 ( 〙 ) RIGHT WHITE TORTOISE SHELL BRACKET
U+301A ( 〚 ) LEFT WHITE SQUARE BRACKET
U+301B ( 〛 ) RIGHT WHITE SQUARE BRACKET
U+FF62 ( 「 ) HALFWIDTH LEFT CORNER BRACKET
U+FF63 ( 」 ) HALFWIDTH RIGHT CORNER BRACKET
U+302A ( 〪 ) IDEOGRAPHIC LEVEL TONE MARK
U+302B ( 〫 ) IDEOGRAPHIC RISING TONE MARK
U+302C ( 〬 ) IDEOGRAPHIC DEPARTING TONE MARK
U+302D ( 〭 ) IDEOGRAPHIC ENTERING TONE MARK
U+2100 ( ℀ ) ACCOUNT OF
U+2101 ( ℁ ) ADDRESSED TO THE SUBJECT
U+2103 ( ℃ ) DEGREE CELSIUS
U+2104 ( ℄ ) CENTRE LINE SYMBOL
U+2105 ( ℅ ) CARE OF
U+2106 ( ℆ ) CADA UNA
U+2107 ( ℇ ) EULER CONSTANT
U+2108 ( ℈ ) SCRUPLE
U+2109 ( ℉ ) DEGREE FAHRENHEIT
U+2114 ( ℔ ) L B BAR SYMBOL
U+2116 ( № ) NUMERO SIGN
U+2117 ( ℗ ) SOUND RECORDING COPYRIGHT
U+2118 ( ℘ ) SCRIPT CAPITAL P
U+211E ( ℞ ) PRESCRIPTION TAKE
U+211F ( ℟ ) RESPONSE
U+2120 ( ℠ ) SERVICE MARK
U+2121 ( ℡ ) TELEPHONE SIGN
U+2122 ( ™ ) TRADE MARK SIGN
U+2123 ( ℣ ) VERSICLE
U+2125 ( ℥ ) OUNCE SIGN
U+2127 ( ℧ ) INVERTED OHM SIGN
U+212E ( ℮ ) ESTIMATED SYMBOL
U+2139 ( ℹ ) INFORMATION SOURCE
U+213A ( ℺ ) ROTATED CAPITAL Q
U+213B ( ℻ ) FACSIMILE SIGN
U+214A ( ⅊ ) PROPERTY LINE
U+214C ( ⅌ ) PER SIGN
U+214D ( ⅍ ) AKTIESELSKAB
The following are the only Sm (Math_Symbol) with explicit scripts or are Letterlike-symbols, and may need to be treated as Common.
U+03F6 ( ϶ ) GREEK REVERSED LUNATE EPSILON SYMBOL
U+0606 ( ؆ ) ARABIC-INDIC CUBE ROOT
U+0607 ( ؇ ) ARABIC-INDIC FOURTH ROOT
U+0608 ( ؈ ) ARABIC RAY
U+2140 ( ⅀ ) DOUBLE-STRUCK N-ARY SUMMATION
U+2141 ( ⅁ ) TURNED SANS-SERIF CAPITAL G
U+2142 ( ⅂ ) TURNED SANS-SERIF CAPITAL L
U+2143 ( ⅃ ) REVERSED SANS-SERIF CAPITAL L
U+2144 ( ⅄ ) TURNED SANS-SERIF CAPITAL Y
U+214B ( ⅋ ) TURNED AMPERSAND
The following characters are Script=Greek, but their canonical equivalents are Script=Common. These are the only such characters that change from an explicit script to Common.
U+1FC1 ( ῁ ) GREEK DIALYTIKA AND PERISPOMENI
U+1FED ( ῭ ) GREEK DIALYTIKA AND VARIA
U+1FEE ( ΅ ) GREEK DIALYTIKA AND OXIA
U+1FEF ( ` ) GREEK VARIA
U+1FFD ( ´ ) GREEK OXIA
This is not an issue for the other Modifier Symbols (Sk) in Greek blocks:
U+0375 ( ͵ ) GREEK LOWER NUMERAL SIGN // the only one without a compat decomp.
U+0384 ( ΄ ) GREEK TONOS
U+0385 ( ΅ ) GREEK DIALYTIKA TONOS
U+1FBD ( ᾽ ) GREEK KORONIS
U+1FBF ( ᾿ ) GREEK PSILI
U+1FC0 ( ῀ ) GREEK PERISPOMENI
U+1FCD ( ῍ ) GREEK PSILI AND VARIA
U+1FCE ( ῎ ) GREEK PSILI AND OXIA
U+1FCF ( ῏ ) GREEK PSILI AND PERISPOMENI
U+1FDD ( ῝ ) GREEK DASIA AND VARIA
U+1FDE ( ῞ ) GREEK DASIA AND OXIA
U+1FDF ( ῟ ) GREEK DASIA AND PERISPOMENI
U+1FFE ( ῾ ) GREEK DASIA
Or the other Modifier Letters (Lm) in Greek blocks:
U+0374 ( ʹ ) GREEK NUMERAL SIGN
U+037A ( ͺ ) GREEK YPOGEGRAMMENI