The following was L2/09-029 R3. It is now a working draft of text and data. 99. Script Edge CasesIn most cases, the assignment of scripts is straightfoward. There are, however, some edge cases that people may stumble over. In particular, the Common script value is too coarse for many applications. It only indicates that the character is used with multiple scripts, but not with which ones. Many applications need more information in order to do a reasonable job. For example, where a Character Picker organizes characters into buckets by script, such characters should show up in two (or more) buckets. The following text provides information about such edge cases that may be useful to implementations. The following notation means that the following characters may, in some circumstances, be treated as if they had the listed scripts. @Latin, Cyrillic,... [Issue: should this be a data file instead?; it would be easier to use.] [Ed note: define in #24 someplace that "explicit script" means a
script value other than Common or Inherited.] @LatinThe following are functionally Latin script; applications may find it useful in certain circumstances to treat them as such.Basic Latin - ASCII punctuation and symbolsLatin 1 Supplement - Latin-1 punctuation and symbolsSpacing Modifier Letters - Miscellaneous phonetic modifiersU+02B9 ( ʹ ) MODIFIER LETTER PRIMEU+02BA ( ʺ ) MODIFIER LETTER DOUBLE PRIMEU+02BB ( ʻ ) MODIFIER LETTER TURNED COMMAU+02BD ( ʽ ) MODIFIER LETTER REVERSED COMMAU+02BE ( ʾ ) MODIFIER LETTER RIGHT HALF RINGU+02BF ( ʿ ) MODIFIER LETTER LEFT HALF RINGU+02C0 ( ˀ ) MODIFIER LETTER GLOTTAL STOPU+02C1 ( ˁ ) MODIFIER LETTER REVERSED GLOTTAL STOPU+02C2 ( ˂ ) MODIFIER LETTER LEFT ARROWHEADU+02C3 ( ˃ ) MODIFIER LETTER RIGHT ARROWHEADU+02C4 ( ˄ ) MODIFIER LETTER UP ARROWHEADU+02C5 ( ˅ ) MODIFIER LETTER DOWN ARROWHEADU+02C6 ( ˆ ) MODIFIER LETTER CIRCUMFLEX ACCENTU+02C7 ( ˇ ) CARONU+02C8 ( ˈ ) MODIFIER LETTER VERTICAL LINEU+02C9 ( ˉ ) MODIFIER LETTER MACRONU+02CA ( ˊ ) MODIFIER LETTER ACUTE ACCENTU+02CB ( ˋ ) MODIFIER LETTER GRAVE ACCENTU+02CC ( ˌ ) MODIFIER LETTER LOW VERTICAL LINEU+02CD ( ˍ ) MODIFIER LETTER LOW MACRONU+02CE ( ˎ ) MODIFIER LETTER LOW GRAVE ACCENTU+02CF ( ˏ ) MODIFIER LETTER LOW ACUTE ACCENTU+02D0 ( ː ) MODIFIER LETTER TRIANGULAR COLONU+02D1 ( ˑ ) MODIFIER LETTER HALF TRIANGULAR COLONU+02D2 ( ˒ ) MODIFIER LETTER CENTRED RIGHT HALF RINGU+02D3 ( ˓ ) MODIFIER LETTER CENTRED LEFT HALF RINGU+02D4 ( ˔ ) MODIFIER LETTER UP TACKU+02D5 ( ˕ ) MODIFIER LETTER DOWN TACKU+02D6 ( ˖ ) MODIFIER LETTER PLUS SIGNU+02D7 ( ˗ ) MODIFIER LETTER MINUS SIGNSpacing Modifier Letters - Spacing clones of diacriticsU+02D8 ( ˘ ) BREVEU+02D9 ( ˙ ) DOT ABOVEU+02DA ( ˚ ) RING ABOVEU+02DB ( ˛ ) OGONEKU+02DC ( ˜ ) SMALL TILDEU+02DD ( ˝ ) DOUBLE ACUTE ACCENTSpacing Modifier Letters - Additions based on 1989 IPASpacing Modifier Letters - Tone lettersU+02E5 ( ˥ ) MODIFIER LETTER EXTRA-HIGH TONE BARU+02E6 ( ˦ ) MODIFIER LETTER HIGH TONE BARU+02E7 ( ˧ ) MODIFIER LETTER MID TONE BARU+02E8 ( ˨ ) MODIFIER LETTER LOW TONE BARU+02E9 ( ˩ ) MODIFIER LETTER EXTRA-LOW TONE BARSpacing Modifier Letters - IPA modifiersSpacing Modifier Letters - Other modifier letterU+02EE ( ˮ ) MODIFIER LETTER DOUBLE APOSTROPHEModifier Tone Letters - Corner tone marks for ChineseU+A700 ( ꜀ ) MODIFIER LETTER CHINESE TONE YIN PINGU+A701 ( ꜁ ) MODIFIER LETTER CHINESE TONE YANG PINGU+A702 ( ꜂ ) MODIFIER LETTER CHINESE TONE YIN SHANGU+A703 ( ꜃ ) MODIFIER LETTER CHINESE TONE YANG SHANGU+A704 ( ꜄ ) MODIFIER LETTER CHINESE TONE YIN QUU+A705 ( ꜅ ) MODIFIER LETTER CHINESE TONE YANG QUU+A706 ( ꜆ ) MODIFIER LETTER CHINESE TONE YIN RUU+A707 ( ꜇ ) MODIFIER LETTER CHINESE TONE YANG RUModifier Tone Letters - Dotted tone lettersU+A708 ( ꜈ ) MODIFIER LETTER EXTRA-HIGH DOTTED TONE BARU+A709 ( ꜉ ) MODIFIER LETTER HIGH DOTTED TONE BARU+A70A ( ꜊ ) MODIFIER LETTER MID DOTTED TONE BARU+A70B ( ꜋ ) MODIFIER LETTER LOW DOTTED TONE BARU+A70C ( ꜌ ) MODIFIER LETTER EXTRA-LOW DOTTED TONE BARU+A70D ( ꜍ ) MODIFIER LETTER EXTRA-HIGH DOTTED LEFT-STEM TONE BARU+A70E ( ꜎ ) MODIFIER LETTER HIGH DOTTED LEFT-STEM TONE BARU+A70F ( ꜏ ) MODIFIER LETTER MID DOTTED LEFT-STEM TONE BARU+A710 ( ꜐ ) MODIFIER LETTER LOW DOTTED LEFT-STEM TONE BARU+A711 ( ꜑ ) MODIFIER LETTER EXTRA-LOW DOTTED LEFT-STEM TONE BARModifier Tone Letters - Left-stem tone lettersU+A712 ( ꜒ ) MODIFIER LETTER EXTRA-HIGH LEFT-STEM TONE BARU+A713 ( ꜓ ) MODIFIER LETTER HIGH LEFT-STEM TONE BARU+A714 ( ꜔ ) MODIFIER LETTER MID LEFT-STEM TONE BARU+A715 ( ꜕ ) MODIFIER LETTER LOW LEFT-STEM TONE BARU+A716 ( ꜖ ) MODIFIER LETTER EXTRA-LOW LEFT-STEM TONE BARModifier Tone Letters - Chinantec tone marksU+A717 ( ꜗ ) MODIFIER LETTER DOT VERTICAL BARU+A718 ( ꜘ ) MODIFIER LETTER DOT SLASHU+A719 ( ꜙ ) MODIFIER LETTER DOT HORIZONTAL BARU+A71A ( ꜚ ) MODIFIER LETTER LOWER RIGHT CORNER ANGLEModifier Tone Letters - Africanist tone lettersU+A71B ( ꜛ ) MODIFIER LETTER RAISED UP ARROWU+A71C ( ꜜ ) MODIFIER LETTER RAISED DOWN ARROWU+A71D ( ꜝ ) MODIFIER LETTER RAISED EXCLAMATION MARKU+A71E ( ꜞ ) MODIFIER LETTER RAISED INVERTED EXCLAMATION MARKU+A71F ( ꜟ ) MODIFIER LETTER LOW INVERTED EXCLAMATION MARKLatin Extended D - Modifier lettersU+A788 ( ꞈ ) MODIFIER LETTER LOW CIRCUMFLEX ACCENTU+A789 ( ꞉ ) MODIFIER LETTER COLONU+A78A ( ꞊ ) MODIFIER LETTER SHORT EQUALS SIGNSpacing Modifier Letters - UPA modifiersU+02EF ( ˯ ) MODIFIER LETTER LOW DOWN ARROWHEADU+02F0 ( ˰ ) MODIFIER LETTER LOW UP ARROWHEADU+02F1 ( ˱ ) MODIFIER LETTER LOW LEFT ARROWHEADU+02F2 ( ˲ ) MODIFIER LETTER LOW RIGHT ARROWHEADU+02F3 ( ˳ ) MODIFIER LETTER LOW RINGU+02F4 ( ˴ ) MODIFIER LETTER MIDDLE GRAVE ACCENTU+02F5 ( ˵ ) MODIFIER LETTER MIDDLE DOUBLE GRAVE ACCENTU+02F6 ( ˶ ) MODIFIER LETTER MIDDLE DOUBLE ACUTE ACCENTU+02F7 ( ˷ ) MODIFIER LETTER LOW TILDEU+02F8 ( ˸ ) MODIFIER LETTER RAISED COLONU+02F9 ( ˹ ) MODIFIER LETTER BEGIN HIGH TONEU+02FA ( ˺ ) MODIFIER LETTER END HIGH TONEU+02FB ( ˻ ) MODIFIER LETTER BEGIN LOW TONEU+02FC ( ˼ ) MODIFIER LETTER END LOW TONEU+02FD ( ˽ ) MODIFIER LETTER SHELFU+02FE ( ˾ ) MODIFIER LETTER OPEN SHELFU+02FF ( ˿ ) MODIFIER LETTER LOW LEFT ARROWLatin Extended D - Additions for UPAU+A720 ( ꜠ ) MODIFIER LETTER STRESS AND HIGH TONEU+A721 ( ꜡ ) MODIFIER LETTER STRESS AND LOW TONELetterlike Symbols - Letterlike symbolsU+2102 ( ℂ ) DOUBLE-STRUCK CAPITAL CU+210A ( ℊ ) SCRIPT SMALL GU+210B ( ℋ ) SCRIPT CAPITAL HU+210C ( ℌ ) BLACK-LETTER CAPITAL HU+210D ( ℍ ) DOUBLE-STRUCK CAPITAL HU+210E ( ℎ ) PLANCK CONSTANTU+210F ( ℏ ) PLANCK CONSTANT OVER TWO PIU+2110 ( ℐ ) SCRIPT CAPITAL IU+2111 ( ℑ ) BLACK-LETTER CAPITAL IU+2112 ( ℒ ) SCRIPT CAPITAL LU+2113 ( ℓ ) SCRIPT SMALL LU+2115 ( ℕ ) DOUBLE-STRUCK CAPITAL NU+2119 ( ℙ ) DOUBLE-STRUCK CAPITAL PU+211A ( ℚ ) DOUBLE-STRUCK CAPITAL QU+211B ( ℛ ) SCRIPT CAPITAL RU+211C ( ℜ ) BLACK-LETTER CAPITAL RU+211D ( ℝ ) DOUBLE-STRUCK CAPITAL RU+2124 ( ℤ ) DOUBLE-STRUCK CAPITAL ZU+2128 ( ℨ ) BLACK-LETTER CAPITAL ZU+212C ( ℬ ) SCRIPT CAPITAL BU+212D ( ℭ ) BLACK-LETTER CAPITAL CU+212F ( ℯ ) SCRIPT SMALL EU+2130 ( ℰ ) SCRIPT CAPITAL EU+2131 ( ℱ ) SCRIPT CAPITAL FU+2133 ( ℳ ) SCRIPT CAPITAL MU+2134 ( ℴ ) SCRIPT SMALL OLetterlike Symbols - Double-struck italic math symbolsU+2145 ( ⅅ ) DOUBLE-STRUCK ITALIC CAPITAL DU+2146 ( ⅆ ) DOUBLE-STRUCK ITALIC SMALL DU+2147 ( ⅇ ) DOUBLE-STRUCK ITALIC SMALL EU+2148 ( ⅈ ) DOUBLE-STRUCK ITALIC SMALL IU+2149 ( ⅉ ) DOUBLE-STRUCK ITALIC SMALL JEnclosed Alphanumerics - Parenthesized Latin lettersU+249C ( ⒜ ) PARENTHESIZED LATIN SMALL LETTER AU+249D ( ⒝ ) PARENTHESIZED LATIN SMALL LETTER BU+249E ( ⒞ ) PARENTHESIZED LATIN SMALL LETTER CU+249F ( ⒟ ) PARENTHESIZED LATIN SMALL LETTER DU+24A0 ( ⒠ ) PARENTHESIZED LATIN SMALL LETTER EU+24A1 ( ⒡ ) PARENTHESIZED LATIN SMALL LETTER FU+24A2 ( ⒢ ) PARENTHESIZED LATIN SMALL LETTER GU+24A3 ( ⒣ ) PARENTHESIZED LATIN SMALL LETTER HU+24A4 ( ⒤ ) PARENTHESIZED LATIN SMALL LETTER IU+24A5 ( ⒥ ) PARENTHESIZED LATIN SMALL LETTER JU+24A6 ( ⒦ ) PARENTHESIZED LATIN SMALL LETTER KU+24A7 ( ⒧ ) PARENTHESIZED LATIN SMALL LETTER LU+24A8 ( ⒨ ) PARENTHESIZED LATIN SMALL LETTER MU+24A9 ( ⒩ ) PARENTHESIZED LATIN SMALL LETTER NU+24AA ( ⒪ ) PARENTHESIZED LATIN SMALL LETTER OU+24AB ( ⒫ ) PARENTHESIZED LATIN SMALL LETTER PU+24AC ( ⒬ ) PARENTHESIZED LATIN SMALL LETTER QU+24AD ( ⒭ ) PARENTHESIZED LATIN SMALL LETTER RU+24AE ( ⒮ ) PARENTHESIZED LATIN SMALL LETTER SU+24AF ( ⒯ ) PARENTHESIZED LATIN SMALL LETTER TU+24B0 ( ⒰ ) PARENTHESIZED LATIN SMALL LETTER UU+24B1 ( ⒱ ) PARENTHESIZED LATIN SMALL LETTER VU+24B2 ( ⒲ ) PARENTHESIZED LATIN SMALL LETTER WU+24B3 ( ⒳ ) PARENTHESIZED LATIN SMALL LETTER XU+24B4 ( ⒴ ) PARENTHESIZED LATIN SMALL LETTER YU+24B5 ( ⒵ ) PARENTHESIZED LATIN SMALL LETTER ZEnclosed Alphanumerics - Circled Latin lettersU+24B6 ( Ⓐ ) CIRCLED LATIN CAPITAL LETTER AU+24B7 ( Ⓑ ) CIRCLED LATIN CAPITAL LETTER BU+24B8 ( Ⓒ ) CIRCLED LATIN CAPITAL LETTER CU+24B9 ( Ⓓ ) CIRCLED LATIN CAPITAL LETTER DU+24BA ( Ⓔ ) CIRCLED LATIN CAPITAL LETTER EU+24BB ( Ⓕ ) CIRCLED LATIN CAPITAL LETTER FU+24BC ( Ⓖ ) CIRCLED LATIN CAPITAL LETTER GU+24BD ( Ⓗ ) CIRCLED LATIN CAPITAL LETTER HU+24BE ( Ⓘ ) CIRCLED LATIN CAPITAL LETTER IU+24BF ( Ⓙ ) CIRCLED LATIN CAPITAL LETTER JU+24C0 ( Ⓚ ) CIRCLED LATIN CAPITAL LETTER KU+24C1 ( Ⓛ ) CIRCLED LATIN CAPITAL LETTER LU+24C2 ( Ⓜ ) CIRCLED LATIN CAPITAL LETTER MU+24C3 ( Ⓝ ) CIRCLED LATIN CAPITAL LETTER NU+24C4 ( Ⓞ ) CIRCLED LATIN CAPITAL LETTER OU+24C5 ( Ⓟ ) CIRCLED LATIN CAPITAL LETTER PU+24C6 ( Ⓠ ) CIRCLED LATIN CAPITAL LETTER QU+24C7 ( Ⓡ ) CIRCLED LATIN CAPITAL LETTER RU+24C8 ( Ⓢ ) CIRCLED LATIN CAPITAL LETTER SU+24C9 ( Ⓣ ) CIRCLED LATIN CAPITAL LETTER TU+24CA ( Ⓤ ) CIRCLED LATIN CAPITAL LETTER UU+24CB ( Ⓥ ) CIRCLED LATIN CAPITAL LETTER VU+24CC ( Ⓦ ) CIRCLED LATIN CAPITAL LETTER WU+24CD ( Ⓧ ) CIRCLED LATIN CAPITAL LETTER XU+24CE ( Ⓨ ) CIRCLED LATIN CAPITAL LETTER YU+24CF ( Ⓩ ) CIRCLED LATIN CAPITAL LETTER ZU+24D0 ( ⓐ ) CIRCLED LATIN SMALL LETTER AU+24D1 ( ⓑ ) CIRCLED LATIN SMALL LETTER BU+24D2 ( ⓒ ) CIRCLED LATIN SMALL LETTER CU+24D3 ( ⓓ ) CIRCLED LATIN SMALL LETTER DU+24D4 ( ⓔ ) CIRCLED LATIN SMALL LETTER EU+24D5 ( ⓕ ) CIRCLED LATIN SMALL LETTER FU+24D6 ( ⓖ ) CIRCLED LATIN SMALL LETTER GU+24D7 ( ⓗ ) CIRCLED LATIN SMALL LETTER HU+24D8 ( ⓘ ) CIRCLED LATIN SMALL LETTER IU+24D9 ( ⓙ ) CIRCLED LATIN SMALL LETTER JU+24DA ( ⓚ ) CIRCLED LATIN SMALL LETTER KU+24DB ( ⓛ ) CIRCLED LATIN SMALL LETTER LU+24DC ( ⓜ ) CIRCLED LATIN SMALL LETTER MU+24DD ( ⓝ ) CIRCLED LATIN SMALL LETTER NU+24DE ( ⓞ ) CIRCLED LATIN SMALL LETTER OU+24DF ( ⓟ ) CIRCLED LATIN SMALL LETTER PU+24E0 ( ⓠ ) CIRCLED LATIN SMALL LETTER QU+24E1 ( ⓡ ) CIRCLED LATIN SMALL LETTER RU+24E2 ( ⓢ ) CIRCLED LATIN SMALL LETTER SU+24E3 ( ⓣ ) CIRCLED LATIN SMALL LETTER TU+24E4 ( ⓤ ) CIRCLED LATIN SMALL LETTER UU+24E5 ( ⓥ ) CIRCLED LATIN SMALL LETTER VU+24E6 ( ⓦ ) CIRCLED LATIN SMALL LETTER WU+24E7 ( ⓧ ) CIRCLED LATIN SMALL LETTER XU+24E8 ( ⓨ ) CIRCLED LATIN SMALL LETTER YU+24E9 ( ⓩ ) CIRCLED LATIN SMALL LETTER ZEnclosed CJK Letters And Months - Squared Latin abbreviationU+3250 ( ㉐ ) PARTNERSHIP SIGNEnclosed CJK Letters And Months - Squared Latin abbreviationsU+32CC ( ㋌ ) SQUARE HGU+32CD ( ㋍ ) SQUARE ERGU+32CE ( ㋎ ) SQUARE EVU+32CF ( ㋏ ) LIMITED LIABILITY SIGNCJK Compatibility - Squared Latin abbreviationsU+3371 ( ㍱ ) SQUARE HPAU+3372 ( ㍲ ) SQUARE DAU+3373 ( ㍳ ) SQUARE AUU+3374 ( ㍴ ) SQUARE BARU+3375 ( ㍵ ) SQUARE OVU+3376 ( ㍶ ) SQUARE PCU+3377 ( ㍷ ) SQUARE DMU+3378 ( ㍸ ) SQUARE DM SQUAREDU+3379 ( ㍹ ) SQUARE DM CUBEDU+337A ( ㍺ ) SQUARE IUU+3380 ( ㎀ ) SQUARE PA AMPSU+3381 ( ㎁ ) SQUARE NAU+3382 ( ㎂ ) SQUARE MU AU+3383 ( ㎃ ) SQUARE MAU+3384 ( ㎄ ) SQUARE KAU+3385 ( ㎅ ) SQUARE KBU+3386 ( ㎆ ) SQUARE MBU+3387 ( ㎇ ) SQUARE GBU+3388 ( ㎈ ) SQUARE CALU+3389 ( ㎉ ) SQUARE KCALU+338A ( ㎊ ) SQUARE PFU+338B ( ㎋ ) SQUARE NFU+338C ( ㎌ ) SQUARE MU FU+338D ( ㎍ ) SQUARE MU GU+338E ( ㎎ ) SQUARE MGU+338F ( ㎏ ) SQUARE KGU+3390 ( ㎐ ) SQUARE HZU+3391 ( ㎑ ) SQUARE KHZU+3392 ( ㎒ ) SQUARE MHZU+3393 ( ㎓ ) SQUARE GHZU+3394 ( ㎔ ) SQUARE THZU+3395 ( ㎕ ) SQUARE MU LU+3396 ( ㎖ ) SQUARE MLU+3397 ( ㎗ ) SQUARE DLU+3398 ( ㎘ ) SQUARE KLU+3399 ( ㎙ ) SQUARE FMU+339A ( ㎚ ) SQUARE NMU+339B ( ㎛ ) SQUARE MU MU+339C ( ㎜ ) SQUARE MMU+339D ( ㎝ ) SQUARE CMU+339E ( ㎞ ) SQUARE KMU+339F ( ㎟ ) SQUARE MM SQUAREDU+33A0 ( ㎠ ) SQUARE CM SQUAREDU+33A1 ( ㎡ ) SQUARE M SQUAREDU+33A2 ( ㎢ ) SQUARE KM SQUAREDU+33A3 ( ㎣ ) SQUARE MM CUBEDU+33A4 ( ㎤ ) SQUARE CM CUBEDU+33A5 ( ㎥ ) SQUARE M CUBEDU+33A6 ( ㎦ ) SQUARE KM CUBEDU+33A7 ( ㎧ ) SQUARE M OVER SU+33A8 ( ㎨ ) SQUARE M OVER S SQUAREDU+33A9 ( ㎩ ) SQUARE PAU+33AA ( ㎪ ) SQUARE KPAU+33AB ( ㎫ ) SQUARE MPAU+33AC ( ㎬ ) SQUARE GPAU+33AD ( ㎭ ) SQUARE RADU+33AE ( ㎮ ) SQUARE RAD OVER SU+33AF ( ㎯ ) SQUARE RAD OVER S SQUAREDU+33B0 ( ㎰ ) SQUARE PSU+33B1 ( ㎱ ) SQUARE NSU+33B2 ( ㎲ ) SQUARE MU SU+33B3 ( ㎳ ) SQUARE MSU+33B4 ( ㎴ ) SQUARE PVU+33B5 ( ㎵ ) SQUARE NVU+33B6 ( ㎶ ) SQUARE MU VU+33B7 ( ㎷ ) SQUARE MVU+33B8 ( ㎸ ) SQUARE KVU+33B9 ( ㎹ ) SQUARE MV MEGAU+33BA ( ㎺ ) SQUARE PWU+33BB ( ㎻ ) SQUARE NWU+33BC ( ㎼ ) SQUARE MU WU+33BD ( ㎽ ) SQUARE MWU+33BE ( ㎾ ) SQUARE KWU+33BF ( ㎿ ) SQUARE MW MEGAU+33C0 ( ㏀ ) SQUARE K OHMU+33C1 ( ㏁ ) SQUARE M OHMU+33C2 ( ㏂ ) SQUARE AMU+33C3 ( ㏃ ) SQUARE BQU+33C4 ( ㏄ ) SQUARE CCU+33C5 ( ㏅ ) SQUARE CDU+33C6 ( ㏆ ) SQUARE C OVER KGU+33C7 ( ㏇ ) SQUARE COU+33C8 ( ㏈ ) SQUARE DBU+33C9 ( ㏉ ) SQUARE GYU+33CA ( ㏊ ) SQUARE HAU+33CB ( ㏋ ) SQUARE HPU+33CC ( ㏌ ) SQUARE INU+33CD ( ㏍ ) SQUARE KKU+33CE ( ㏎ ) SQUARE KM CAPITALU+33CF ( ㏏ ) SQUARE KTU+33D0 ( ㏐ ) SQUARE LMU+33D1 ( ㏑ ) SQUARE LNU+33D2 ( ㏒ ) SQUARE LOGU+33D3 ( ㏓ ) SQUARE LXU+33D4 ( ㏔ ) SQUARE MB SMALLU+33D5 ( ㏕ ) SQUARE MILU+33D6 ( ㏖ ) SQUARE MOLU+33D7 ( ㏗ ) SQUARE PHU+33D8 ( ㏘ ) SQUARE PMU+33D9 ( ㏙ ) SQUARE PPMU+33DA ( ㏚ ) SQUARE PRU+33DB ( ㏛ ) SQUARE SRU+33DC ( ㏜ ) SQUARE SVU+33DD ( ㏝ ) SQUARE WBU+33DE ( ㏞ ) SQUARE V OVER MU+33DF ( ㏟ ) SQUARE A OVER MCJK Compatibility - Squared Latin abbreviationU+33FF ( ㏿ ) SQUARE GAL@Latin, CyrillicThe following is primarily used in Cyrillic and Latin.Spacing Modifier Letters - Miscellaneous phonetic modifiersU+02BC ( ʼ ) MODIFIER LETTER APOSTROPHE@LatinWhile the following have the form of Greek or Cyrillic letters, they are functionally Latin phonetic characters.Phonetic Extensions - Greek lettersU+1D26 ( ᴦ ) GREEK LETTER SMALL CAPITAL GAMMAU+1D27 ( ᴧ ) GREEK LETTER SMALL CAPITAL LAMDAU+1D28 ( ᴨ ) GREEK LETTER SMALL CAPITAL PIU+1D29 ( ᴩ ) GREEK LETTER SMALL CAPITAL RHOU+1D2A ( ᴪ ) GREEK LETTER SMALL CAPITAL PSIPhonetic Extensions - Cyrillic letterU+1D2B ( ᴫ ) CYRILLIC LETTER SMALL CAPITAL ELPhonetic Extensions - Greek superscript modifier lettersU+1D5D ( ᵝ ) MODIFIER LETTER SMALL BETAU+1D5E ( ᵞ ) MODIFIER LETTER SMALL GREEK GAMMAU+1D5F ( ᵟ ) MODIFIER LETTER SMALL DELTAU+1D60 ( ᵠ ) MODIFIER LETTER SMALL GREEK PHIU+1D61 ( ᵡ ) MODIFIER LETTER SMALL CHIPhonetic Extensions - Greek subscript modifier lettersU+1D66 ( ᵦ ) GREEK SUBSCRIPT SMALL LETTER BETAU+1D67 ( ᵧ ) GREEK SUBSCRIPT SMALL LETTER GAMMAU+1D68 ( ᵨ ) GREEK SUBSCRIPT SMALL LETTER RHOU+1D69 ( ᵩ ) GREEK SUBSCRIPT SMALL LETTER PHIU+1D6A ( ᵪ ) GREEK SUBSCRIPT SMALL LETTER CHIPhonetic Extensions - Caucasian linguisticsU+1D78 ( ᵸ ) MODIFIER LETTER CYRILLIC ENPhonetic Extensions Supplement - Modifier lettersU+1DBF ( ᶿ ) MODIFIER LETTER SMALL THETA@Greek
These have no explicit script just because they map to general punctuation marks or modifier letters. Greek And Coptic - Numeral signsU+0374 ( ʹ ) GREEK NUMERAL SIGNGreek And Coptic - PunctuationU+037E ( ; ) GREEK QUESTION MARKGreek And Coptic - Spacing accent marksU+0385 ( ΅ ) GREEK DIALYTIKA TONOSGreek And Coptic - PunctuationIn contrast, the following does have an explicit script, and is the only Sk (Modifier_Symbol) that does. Note that the corresponding U+0374 is a Modifier_Letter.U+0375 ( ͵ ) GREEK LOWER NUMERAL SIGNLatin 1 Supplement - Latin-1 punctuation and symbolsU+00B5 ( µ ) MICRO SIGN@Armenian, GeorgianArmenian - PunctuationU+0589 ( ։ ) ARMENIAN FULL STOP@ArabicArabic - Subtending marksU+0603 ( ) ARABIC SIGN SAFHAArabic Presentation Forms A - SymbolU+FDFD ( ﷽ ) ARABIC LIGATURE BISMILLAH AR-RAHMAN AR-RAHEEM@Arabic, ThaanaArabic - Arabic-Indic digits(Note that the U+06Fx EXTENDED ARABIC-INDIC DIGIT x characters have already the specific script Arabic)U+0660 ( ٠ ) ARABIC-INDIC DIGIT ZEROU+0661 ( ١ ) ARABIC-INDIC DIGIT ONEU+0662 ( ٢ ) ARABIC-INDIC DIGIT TWOU+0663 ( ٣ ) ARABIC-INDIC DIGIT THREEU+0664 ( ٤ ) ARABIC-INDIC DIGIT FOURU+0665 ( ٥ ) ARABIC-INDIC DIGIT FIVEU+0666 ( ٦ ) ARABIC-INDIC DIGIT SIXU+0667 ( ٧ ) ARABIC-INDIC DIGIT SEVENU+0668 ( ٨ ) ARABIC-INDIC DIGIT EIGHTU+0669 ( ٩ ) ARABIC-INDIC DIGIT NINE@Arabic, Syriac, ThaanaArabic - Punctuation@CommonArabic - Koranic annotation signsU+06DD ( ) ARABIC END OF AYAH@Arabic, SyriacArabic - Based on ISO 8859-6U+0640 ( ـ ) ARABIC TATWEELArabic - Points from ISO 8859-6U+064B ( ً ) ARABIC FATHATANU+064C ( ٌ ) ARABIC DAMMATANU+064D ( ٍ ) ARABIC KASRATANU+064E ( َ ) ARABIC FATHAU+064F ( ُ ) ARABIC DAMMAU+0650 ( ِ ) ARABIC KASRAU+0651 ( ّ ) ARABIC SHADDAU+0652 ( ْ ) ARABIC SUKUNArabic - Combining maddah and hamzaArabic - PointU+0670 ( ٰ ) ARABIC LETTER SUPERSCRIPT ALEF@BopomofoSpacing Modifier Letters - Extended Bopomofo tone marksU+02EA ( ˪ ) MODIFIER LETTER YIN DEPARTING TONE MARKU+02EB ( ˫ ) MODIFIER LETTER YANG DEPARTING TONE MARK@DevanagariDevanagari - Various signsDevanagari - Devanagari-specific additionsU+0970 ( ॰ ) DEVANAGARI ABBREVIATION SIGN
@Devanagari, Bengali, Gurmukhi, Oriya
The annotations say "scripts of India". The dandas are not normally used with Malayalam, Kannada, Telugu, Tamil and Gujarati, and presumably these are not used with Urdu, etc. |
Unicode & Int’l SW > UTC >