diff --git a/tools/unicodetools/com/ibm/text/UCD/idn-charsHeader.html b/tools/unicodetools/com/ibm/text/UCD/idn-charsHeader.html
new file mode 100644
index 00000000000..e00d1b0869b
--- /dev/null
+++ b/tools/unicodetools/com/ibm/text/UCD/idn-charsHeader.html
@@ -0,0 +1,71 @@
+
+
+
+ + + + +$Date: 2005/03/29 18:31:15 $, MED
+This page lists all valid output IDN characters, broken down by category. By "output" IDN
+characters, we mean those that can result from nameprep. Characters are grouped first by script, and
+then by subcategory. Within each subcategory, characters are sorted according to the default
+UCA order. Tooltips provide the character code
+and name (in browsers that support tooltips).
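The notion of an "output" character can be approximated in a few lines. The sketch below is not taken from the tool that generates this page. It assumes that a character counts as output when nameprep maps it to itself, and it relies on `nameprep` from Python's standard `encodings.idna` module, which is based on Unicode 3.2 data. The helper name `is_output_char` is hypothetical.

```python
from encodings.idna import nameprep


def is_output_char(ch: str) -> bool:
    """Return True if nameprep maps the single character `ch` to itself.

    Assumption: a fixed point of nameprep is treated as an "output"
    character. This approximates the definition in the text.
    """
    try:
        return nameprep(ch) == ch
    except UnicodeError:
        # nameprep raises for prohibited characters.
        return False


if __name__ == "__main__":
    for ch in ("a", "A", "\u00df", "\u00ad"):
        print(f"U+{ord(ch):04X}", is_output_char(ch))
```

Under this approximation, "a" is an output character, while "A" (case-folded to "a"), U+00DF (mapped to "ss"), and U+00AD (mapped to nothing) are not.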
Subcategory | Description |
---|---|
Atomic | Characters that don't fall into any of the following subcategories. |
Atomic-no-uppercase | For bicameral scripts, Atomic characters without an uppercase form. |
Pattern_Syntax | Characters recommended as a basis for syntax, as in UAX #31: Identifier and Pattern Syntax. Excludes the word characters in Section 4, Word Boundaries, of UAX #29, as listed in the Word_Break property and the notes at the end of that section. |
Non-XID | Characters not recommended as a basis for identifiers, as in UAX #31: Identifier and Pattern Syntax (that is, characters outside XID_Continue). Excludes the word characters in Section 4, Word Boundaries, of UAX #29, as listed in the Word_Break property and the notes at the end of that section. |
Decomposable | Characters with NFC decompositions. |
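Three of the subcategories above can be approximated with Python's standard `unicodedata` module. This is a rough sketch under stated assumptions, not the logic used to build the page: "Decomposable" is taken to mean that NFD changes the character, "Atomic-no-uppercase" is approximated as a lowercase letter (general category `Ll`) whose uppercase mapping is itself, and Decomposable is checked first. Pattern_Syntax and Non-XID are left out because the standard library does not expose those properties. The function name `subcategory` is hypothetical.

```python
import unicodedata


def subcategory(ch: str) -> str:
    """Approximate the subcategory of a single character."""
    # Assumption: "Decomposable" means the canonical decomposition (NFD)
    # differs from the character itself.
    if unicodedata.normalize("NFD", ch) != ch:
        return "Decomposable"
    # Assumption: general category Ll stands in for "belongs to a
    # bicameral script"; no distinct uppercase means upper() is a no-op.
    if unicodedata.category(ch) == "Ll" and ch.upper() == ch:
        return "Atomic-no-uppercase"
    # Everything else is treated as Atomic in this sketch.
    return "Atomic"


if __name__ == "__main__":
    for ch in ("a", "\u00e9", "\u0138"):
        print(f"U+{ord(ch):04X}", subcategory(ch))
```

Here U+00E9 (e with acute) is reported as Decomposable, U+0138 (Latin small letter kra, which has no uppercase) as Atomic-no-uppercase, and "a" as Atomic.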