This is the cl-unicode Reference Manual, version 0.1.6, generated automatically by Declt version 4.0 beta 2 "William Riker" on Sun Sep 15 04:33:59 2024 GMT+0.
cl-unicode/cl-unicode.asd
cl-unicode/conditions.lisp
cl-unicode/lists.lisp
cl-unicode/hash-tables.lisp
cl-unicode/api.lisp
cl-unicode/methods.lisp
cl-unicode/test-functions.lisp
cl-unicode/derived.lisp
cl-unicode/alias.lisp
cl-unicode/base/packages.lisp
cl-unicode/base/specials.lisp
cl-unicode/base/util.lisp
The main system appears first, followed by any subsystem dependency.
cl-unicode
Portable Unicode Library
BSD-2-Clause
0.1.6
cl-unicode/base
(system).
conditions.lisp
(file).
lists.lisp
(file).
hash-tables.lisp
(file).
api.lisp
(file).
methods.lisp
(file).
test-functions.lisp
(file).
derived.lisp
(file).
alias.lisp
(file).
cl-unicode/base
BSD-2-Clause
cl-ppcre
(system).
packages.lisp
(file).
specials.lisp
(file).
util.lisp
(file).
Files are sorted by type and then listed depth-first from the systems components trees.
cl-unicode/cl-unicode.asd
cl-unicode/conditions.lisp
cl-unicode/lists.lisp
cl-unicode/hash-tables.lisp
cl-unicode/api.lisp
cl-unicode/methods.lisp
cl-unicode/test-functions.lisp
cl-unicode/derived.lisp
cl-unicode/alias.lisp
cl-unicode/base/packages.lisp
cl-unicode/base/specials.lisp
cl-unicode/base/util.lisp
cl-unicode/conditions.lisp
cl-unicode
(system).
unicode-error
(condition).
character-not-found
(condition).
signal-unicode-error
(function).
simple-unicode-error
(condition).
cl-unicode/lists.lisp
conditions.lisp
(file).
cl-unicode
(system).
cl-unicode/hash-tables.lisp
lists.lisp
(file).
cl-unicode
(system).
cl-unicode/api.lisp
hash-tables.lisp
(file).
cl-unicode
(system).
age
(generic function).
bidi-class
(generic function).
bidi-classes
(function).
bidi-mirroring-glyph
(function).
binary-properties
(function).
canonical-decomposition
(generic function).
case-fold-mapping
(function).
character-named
(function).
code-block
(generic function).
code-blocks
(function).
combining-class
(generic function).
disable-alternative-character-syntax
(macro).
enable-alternative-character-syntax
(macro).
general-categories
(function).
general-category
(generic function).
has-binary-property
(generic function).
has-property
(function).
list-all-characters
(function).
lowercase-mapping
(function).
normalization-form-c
(generic function).
normalization-form-d
(generic function).
normalization-form-k-c
(generic function).
normalization-form-k-d
(generic function).
numeric-type
(generic function).
numeric-value
(generic function).
property-test
(generic function).
recognized-properties
(function).
script
(generic function).
scripts
(function).
titlecase-mapping
(function).
unicode-name
(generic function).
unicode1-name
(generic function).
uppercase-mapping
(function).
word-break
(generic function).
%disable-alternative-character-syntax
(function).
%enable-alternative-character-syntax
(function).
compatibility-decomposition
(generic function).
try-abbreviations
(function).
unicode-name-reader
(function).
cl-unicode/methods.lisp
api.lisp
(file).
cl-unicode
(system).
age
(method).
bidi-class
(method).
code-block
(method).
combining-class
(method).
general-category
(method).
idna-mapping
(method).
numeric-type
(method).
numeric-value
(method).
script
(method).
word-break
(method).
bidi-mirroring-glyph%
(method).
binary-props
(method).
case-folding-mapping
(method).
decomposition-mapping
(method).
cl-unicode/test-functions.lisp
methods.lisp
(file).
cl-unicode
(system).
install-bidi-class-tests
(function).
install-binary-properties-tests
(function).
install-code-block-tests
(function).
install-general-category-tests
(function).
install-miscellaneous-tests
(function).
install-one-test
(function).
install-script-tests
(function).
install-tests
(function).
cl-unicode/derived.lisp
test-functions.lisp
(file).
cl-unicode
(system).
*derived-map*
(special variable).
+xid-difference+
(constant).
build-derived-test-function
(function).
build-derived-test-functions
(function).
cl-unicode/alias.lisp
derived.lisp
(file).
cl-unicode
(system).
*alias-map*
(special variable).
*bidi-alias-map*
(special variable).
build-all-property-tests
(function).
create-alias
(function).
create-aliases
(function).
cl-unicode/base/specials.lisp
packages.lisp
(file).
cl-unicode/base
(system).
*scripts-to-try*
(special variable).
*try-abbreviations-p*
(special variable).
*try-hex-notation-p*
(special variable).
*try-lisp-names-p*
(special variable).
*try-unicode1-names-p*
(special variable).
+code-point-limit+
(constant).
*bidi-classes*
(special variable).
*binary-properties*
(special variable).
*canonical-names*
(special variable).
*case-mappings*
(special variable).
*code-blocks*
(special variable).
*code-points-to-names*
(special variable).
*code-points-to-unicode1-names*
(special variable).
*compatibility-formatting-tags*
(special variable).
*composition-mappings*
(special variable).
*general-categories*
(special variable).
*hangul-syllables-to-code-points*
(special variable).
*hyperdoc-base-uri*
(special variable).
*jamo-short-names*
(special variable).
*names-to-code-points*
(special variable).
*previous-readtables*
(special variable).
*property-aliases*
(special variable).
*property-map*
(special variable).
*property-tests*
(special variable).
*scripts*
(special variable).
*special-case-mappings*
(special variable).
*unicode1-names-to-code-points*
(special variable).
hyperdoc-lookup
(function).
cl-unicode/base/util.lisp
specials.lisp
(file).
cl-unicode/base
(system).
canonicalize-name
(function).
property-name
(function).
property-symbol
(function).
+first-hangul-syllable+
(constant).
+l-base+
(constant).
+last-hangul-syllable+
(constant).
+n-count+
(constant).
+s-base+
(constant).
+t-base+
(constant).
+t-count+
(constant).
+v-base+
(constant).
+v-count+
(constant).
add-hangul-names
(function).
canonical-composition
(function).
canonical-sort
(function).
case-folding
(generic function).
cjk-unified-ideograph-p
(function).
compute-hangul-decomposition
(function).
compute-hangul-name
(function).
define-hangul-constant
(macro).
ensure-code-point
(macro).
evaluate-casing-condition
(function).
hangul-syllable-p
(function).
lookup-property-alias
(function).
mapping
(generic function).
maybe-compute-cjk-name
(function).
maybe-compute-hangul-syllable-name
(function).
maybe-find-cjk-code-point
(function).
maybe-find-hangul-syllable-code-point
(function).
parse-hex
(function).
register-property-symbol
(function).
special-mapping
(generic function).
tree-lookup
(function).
Packages are listed by definition order.
cl-unicode
common-lisp
.
*scripts-to-try*
(special variable).
*try-abbreviations-p*
(special variable).
*try-hex-notation-p*
(special variable).
*try-lisp-names-p*
(special variable).
*try-unicode1-names-p*
(special variable).
+code-point-limit+
(constant).
age
(generic function).
bidi-class
(generic function).
bidi-classes
(function).
bidi-mirroring-glyph
(function).
binary-properties
(function).
canonical-decomposition
(generic function).
canonicalize-name
(function).
case-fold-mapping
(function).
character-named
(function).
code-block
(generic function).
code-blocks
(function).
combining-class
(generic function).
disable-alternative-character-syntax
(macro).
enable-alternative-character-syntax
(macro).
general-categories
(function).
general-category
(generic function).
has-binary-property
(generic function).
has-property
(function).
idna-mapping
(generic function).
list-all-characters
(function).
lowercase-mapping
(function).
normalization-form-c
(generic function).
normalization-form-d
(generic function).
normalization-form-k-c
(generic function).
normalization-form-k-d
(generic function).
numeric-type
(generic function).
numeric-value
(generic function).
property-name
(function).
property-symbol
(function).
property-test
(generic function).
recognized-properties
(function).
script
(generic function).
scripts
(function).
titlecase-mapping
(function).
unicode-error
(condition).
unicode-name
(generic function).
unicode1-name
(generic function).
uppercase-mapping
(function).
word-break
(generic function).
%disable-alternative-character-syntax
(function).
%enable-alternative-character-syntax
(function).
*alias-map*
(special variable).
*bidi-alias-map*
(special variable).
*bidi-classes*
(special variable).
*binary-properties*
(special variable).
*canonical-names*
(special variable).
*case-mappings*
(special variable).
*code-blocks*
(special variable).
*code-points-to-names*
(special variable).
*code-points-to-unicode1-names*
(special variable).
*compatibility-formatting-tags*
(special variable).
*composition-mappings*
(special variable).
*derived-map*
(special variable).
*general-categories*
(special variable).
*hangul-syllables-to-code-points*
(special variable).
*hyperdoc-base-uri*
(special variable).
*jamo-short-names*
(special variable).
*names-to-code-points*
(special variable).
*previous-readtables*
(special variable).
*property-aliases*
(special variable).
*property-map*
(special variable).
*property-tests*
(special variable).
*scripts*
(special variable).
*special-case-mappings*
(special variable).
*unicode1-names-to-code-points*
(special variable).
+first-hangul-syllable+
(constant).
+l-base+
(constant).
+last-hangul-syllable+
(constant).
+n-count+
(constant).
+s-base+
(constant).
+t-base+
(constant).
+t-count+
(constant).
+v-base+
(constant).
+v-count+
(constant).
+xid-difference+
(constant).
add-hangul-names
(function).
bidi-mirroring-glyph%
(generic function).
binary-props
(generic function).
build-all-property-tests
(function).
build-derived-test-function
(function).
build-derived-test-functions
(function).
canonical-composition
(function).
canonical-sort
(function).
case-folding
(generic function).
case-folding-mapping
(generic function).
character-not-found
(condition).
cjk-unified-ideograph-p
(function).
compatibility-decomposition
(generic function).
compute-hangul-decomposition
(function).
compute-hangul-name
(function).
create-alias
(function).
create-aliases
(function).
decomposition-mapping
(generic function).
define-hangul-constant
(macro).
ensure-code-point
(macro).
evaluate-casing-condition
(function).
hangul-syllable-p
(function).
hyperdoc-lookup
(function).
install-bidi-class-tests
(function).
install-binary-properties-tests
(function).
install-code-block-tests
(function).
install-general-category-tests
(function).
install-miscellaneous-tests
(function).
install-one-test
(function).
install-script-tests
(function).
install-tests
(function).
lookup-property-alias
(function).
mapping
(generic function).
maybe-compute-cjk-name
(function).
maybe-compute-hangul-syllable-name
(function).
maybe-find-cjk-code-point
(function).
maybe-find-hangul-syllable-code-point
(function).
parse-hex
(function).
register-property-symbol
(function).
signal-unicode-error
(function).
simple-unicode-error
(condition).
special-mapping
(generic function).
tree-lookup
(function).
try-abbreviations
(function).
unicode-name-reader
(function).
Definitions are sorted by export status, category, package, and then by lexicographic order.
The smallest integer which is not a code point in the Unicode codespace.
This is the default value for the :SCRIPTS-TO-TRY keyword argument to CHARACTER-NAMED.
This is the default value for the :TRY-ABBREVIATIONS-P keyword argument to CHARACTER-NAMED.
This is the default value for the :TRY-HEX-NOTATION-P keyword argument to CHARACTER-NAMED.
This is the default value for the :TRY-LISP-NAMES-P keyword argument to CHARACTER-NAMED.
This is the default value for the :TRY-UNICODE1-NAMES-P keyword argument to CHARACTER-NAMED.
Restores the readtable which was active before the last call to
ENABLE-ALTERNATIVE-CHARACTER-SYNTAX. If there was no such call, the
standard readtable is used.
This macro expands into an EVAL-WHEN so that if you use it as a top-level form in a file to be loaded and/or compiled it’ll do what you expect. Technically, this’ll pop a readtable from the stack described in ENABLE-ALTERNATIVE-CHARACTER-SYNTAX so that matching calls of these macros can be nested.
Enables an alternative Lisp character syntax which /replaces/ the
usual syntax: After a sharpsign and a backslash have been read, at
least one more character is read. Reading then continues as long as
ASCII letters, digits, underlines, hyphens, colons, or plus signs are
read. The resulting string is then used as input to CHARACTER-NAMED
to produce a character.
This macro expands into an EVAL-WHEN so that if you use it as a
top-level form in a file to be loaded and/or compiled it’ll do what
you expect. Technically, this’ll push the current readtable on a
stack so that matching calls of this macro and
DISABLE-ALTERNATIVE-CHARACTER-SYNTAX can be nested.
Note that by default the alternative character syntax is not enabled after loading CL-UNICODE.
Returns a sorted list of all Bidi classes known to CL-UNICODE. These are the possible return values of BIDI-CLASS.
Returns the Bidi mirroring glyph for a character if the character
has the "BidiMirrored" property and an appropriate mirroring glyph
is defined. C can be the character’s code point (a positive integer)
or a (Lisp) character assuming its character code is also its Unicode
code point.
Returns the code point instead of the character if WANT-CODE-POINT-P is true. This can be especially useful for Lisp implementations where CHAR-CODE-LIMIT is smaller than +CODE-POINT-LIMIT+.
Returns a sorted list of all binary properties known to CL-UNICODE. These are the allowed second arguments (modulo canonicalization) to HAS-BINARY-PROPERTY.
Converts the string NAME into a "canonicalized" name which can be
used for unambiguous look-ups by removing all whitespace, hyphens, and
underline characters.
Tries not to remove hyphens preceded by spaces if this could lead to
ambiguities as described in
<http://unicode.org/unicode/reports/tr18/#Name_Properties>.
All CL-UNICODE functions which accept string "names" for characters or properties will canonicalize the name first using this function and will then look up the name case-insensitively.
Returns the character which has the name NAME (a string) by
looking up the Unicode name (see UNICODE-NAME).
If TRY-UNICODE1-NAMES is true, the Unicode 1.0 name (see
UNICODE1-NAME) will be used as a fallback.
If TRY-ABBREVIATIONS-P is true, NAME is treated as an abbreviation as
follows: If NAME contains a colon, it is interpreted as
"<script>:<short-name>" and the function tries to look up, in turn,
the characters named "<script> <size> LETTER <short-name>",
"<script> LETTER <short-name>", and "<script> <short-name>" where
<size> is "SMALL" if none of the characters in <short-name> is
uppercase, "CAPITAL" otherwise. If NAME does not contain a colon,
the same algorithm as above is tried with NAME instead of <short-name>
and each element of the list of strings SCRIPTS-TO-TRY as <string>.
(SCRIPTS-TO-TRY can also be a single string which is interpreted as a
one-element list.)
If TRY-HEX-NOTATION-P is true, NAME can be of the form "U+<x>" where
<x> is a hexadecimal number with four to six digits with the obvious
meaning.
If TRY-LISP-NAMES-P is true, the function returns the character with
the character name NAME (if there is one) or, if NAME is exactly one
character, it returns this character.
All the keyword-governed alternatives are tried in the order they’re
described here.
See also *TRY-UNICODE1-NAMES-P*, *TRY-ABBREVIATIONS-P*,
*SCRIPTS-TO-TRY*, *TRY-HEX-NOTATION-P*, and *TRY-LISP-NAMES-P*.
Returns the code point instead of the character if WANT-CODE-POINT-P is true. This can be especially useful for Lisp implementations where CHAR-CODE-LIMIT is smaller than +CODE-POINT-LIMIT+.
Returns a sorted list of all blocks known to CL-UNICODE. These are the possible return values of CODE-BLOCK.
Returns a sorted list of all general categories known to CL-UNICODE. These are the possible return values of GENERAL-CATEGORY.
Checks whether a character has the named property PROPERTY.
PROPERTY can be a string naming a property (which will be used for
look-up after canonicalization) or it can be a property symbol (see
PROPERTY-SYMBOL). C can be the character’s code point (a positive
integer) or a (Lisp) character assuming its character code is also
its Unicode code point.
"Properties" in the sense of CL-UNICODE can be names of general
categories, scripts, blocks, binary properties, or Bidi classes,
amongst other things. If there are a block and a script with the same
name (like, say, "Cyrillic"), the bare name denotes the script.
Prepend "Block:" to the name to refer to the block. (You can also
prepend "Script:" to refer to the script unambiguously.) Names of
Bidi classes must be prepended with "BidiClass:" if there’s a
potential for ambiguity.
This function also recognizes several aliases for properties (like
"Symbol" for "S") and you can, as in Perl, prepend block names
with "In" instead of "Block:" and most other properties with
"Is". See RECOGNIZED-PROPERTIES.
See also PROPERTY-TEST.
Lists all character (ordered by code point) which have the property PROPERTY where PROPERTY is interpreted as in HAS-PROPERTY. If WANT-CODE-POINT-P is true, a list of code points instead of a list of characters is returned. (If CHAR-CODE-LIMIT is smaller than +CODE-POINT-LIMIT+ in your Lisp implementation, the list of code points can actually be longer than the list of characters.).
Returns the lowercase mapping of a character. C can be the
character’s code point (a positive integer) or a (Lisp) character
assuming its character code is also its Unicode code point. Returns
the character itself if no such mapping is explicitly defined. Note
that case mapping only makes sense for characters with the "LC"
property.
Returns the code point instead of the character if WANT-CODE-POINT-P is true. Returns a list of code points if WANT-SPECIAL-P is true. This can be especially useful for Lisp implementations where CHAR-CODE-LIMIT is smaller than +CODE-POINT-LIMIT+.
Returns a name (not "the" name) for a property symbol SYMBOL if
it is known to CL-UNICODE. Note that
(STRING= (PROPERTY-NAME (PROPERTY-SYMBOL <string>)) <string>)
is not necessarily true even if the property name is not NIL while
(EQ (PROPERTY-SYMBOL (PROPERTY-NAME <symbol>)) <symbol>)
always holds if there is a property name for <symbol>.
See also PROPERTY-SYMBOL.
Returns a symbol in the CL-UNICODE-NAMES package (which is only
used for this purpose) which can stand in for the string NAME in
look-ups. The symbol’s name is the result of "canonicalizing" and
then upcasing NAME.
A symbol returned by this function is only really useful and only
actually a property symbol if the second return value is true.
All exported functions of CL-UNICODE which return strings which are property names return the corresponding property symbol as their second return value. All exported functions of CL-UNICODE which accept property names as arguments will also accept property symbols.
See also PROPERTY-NAME.
Returns a list of all property names known to CL-UNICODE. These are the allowed second arguments (modulo canonicalization) to HAS-PROPERTY. If ALL is true, known aliases (like "Letter" for "L") are also included.
Returns a sorted list of all scripts known to CL-UNICODE. These are the possible return values of SCRIPT.
Returns the titlecase mapping of a character. C can be the
character’s code point (a positive integer) or a (Lisp) character
assuming its character code is also its Unicode code point. Returns
the character itself if no such mapping is explicitly defined. Note
that case mapping only makes sense for characters with the "LC"
property.
Returns the code point instead of the character if WANT-CODE-POINT-P is true. Returns a list of code points if WANT-SPECIAL-P is true. This can be especially useful for Lisp implementations where CHAR-CODE-LIMIT is smaller than +CODE-POINT-LIMIT+.
Returns the simple uppercase mapping of a character. C can be the
character’s code point (a positive integer) or a (Lisp) character
assuming its character code is also its Unicode code point. Returns
the character itself if no such mapping is explicitly defined. Note
that case mapping only makes sense for characters with the "LC"
property.
Returns the code point instead of the character if WANT-CODE-POINT-P is true. Returns a list of code points if WANT-SPECIAL-P is true. This can be especially useful for Lisp implementations where CHAR-CODE-LIMIT is smaller than +CODE-POINT-LIMIT+.
Returns the "age" of a character or NIL if there
is no age entry for that particular character. The age of a character
is a list of two integers denoting the major and minor number of the
Unicode version where the character first appeared. C can be the
character’s code point (a positive integer) or a (Lisp) character
assuming its character code is also its Unicode code point.
Returns the bidirectional ("Bidi") class of a
character as a string or NIL if there is no bidirectional class for
that particular character. C can be the character’s code point (a
positive integer) or a (Lisp) character assuming its character code
is also its Unicode code point. The second return value (if there is
one) is the property symbol of the class.
See also BIDI-CLASSES
Decomposes input according to Unicode Canonical Decomposition rules.
Returns the block of a character as a string or NIL
if there is no block for that particular character. C can be the
character’s code point (a positive integer) or a (Lisp) character
assuming its character code is also its Unicode code point. The
second return value (if there is one) is the property symbol of the
block.
See also CODE-BLOCKS.
Returns the combining class of a character as a
non-negative integer. C can be the character’s code point (a
positive integer) or a (Lisp) character assuming its character code
is also its Unicode code point.
Returns the general category of a character as a
string. C can be the character’s code point (a positive integer) or
a (Lisp) character assuming its character code is also its Unicode
code point. The second return value is the property symbol of the
category.
See also GENERAL-CATEGORIES.
Checks whether a character has the binary property
PROPERTY. C can be the character’s code point (a positive integer)
or a (Lisp) character assuming its character code is also its Unicode
code point. PROPERTY can be a string naming the property or the
corresponding property symbol. If a true value is returned, it is the
property symbol.
See also BINARY-PROPERTIES.
integer
)) ¶NFC normalization - per character canonical decomposition followed by canonical sort and canonical composition. Returns list of code points.
NFD decomposition - per character canonical decomposition followed by canonical sort. Returns list of code points.
NFKC normalization - per character compatibility decomposition followed by canonical sort and canonical composition. Returns list of code points.
NFKD decomposition - per character compatibility decomposition followed by canonical sort. Returns list of code points.
Returns the numeric type of a character as a string
or NIL if that particular character has no numeric type. C can be the
character’s code point (a positive integer) or a (Lisp) character
assuming its character code is also its Unicode code point. The
second return value (if there is one) is the property symbol of the
numeric type.
Returns the numeric value of a character as a Lisp
rational or NIL (for NaN). C can be the character’s code point (a
positive integer) or a (Lisp) character assuming its character code
is also its Unicode code point.
Returns a unary function which can test code points
or Lisp characters for the property PROPERTY. PROPERTY is interpreted
as in HAS-PROPERTY and PROPERTY-TEST is actually used internally by
HAS-PROPERTY but might come in handy if you need a faster way to test
for PROPERTY (as you’re saving the time to look up the property).
Returns NIL if no property named PROPERTY was found or signals an error if ERRORP is true.
Returns the script of a character as a string or
NIL if there is no script for that particular character. C can be the
character’s code point (a positive integer) or a (Lisp) character
assuming its character code is also its Unicode code point. The
second return value (if there is one) is the property symbol of the
script.
See also SCRIPTS.
Returns the Unicode name of a character as a string
or NIL if there is no name for that particular character. C can be
the character’s code point (a positive integer) or a (Lisp)
character assuming its character code is also its Unicode code
point.
Returns the Unicode 1.0 name of a character as a
string or NIL if there is no name for that particular character. This
name is only non-NIL if it is significantly different from the Unicode
name (see UNICODE-NAME). For control characters, sometimes the ISO
6429 name is returned instead.
C can be the character’s code point (a positive integer) or a (Lisp) character assuming its character code is also its Unicode code point.
Returns the Word_Break property of a character as a
string. C can be the character’s code point (a positive integer) or a
(Lisp) character assuming its character code is also its Unicode code
point. The second return value (if there is one) is the property symbol of the
word break.
All errors signalled by CL-UNICODE are of this type.
error
.
The code point of the first Hangul syllable the name of which can be algorithmically derived.
The constant ‘LBase’ from chapter 3 of the Unicode book.
The code point of the last Hangul syllable the name of which can be algorithmically derived.
The constant ‘NCount’ from chapter 3 of the Unicode book.
The constant ‘SBase’ from chapter 3 of the Unicode book.
The constant ‘TBase’ from chapter 3 of the Unicode book.
The constant ‘TCount’ from chapter 3 of the Unicode book.
The constant ‘VBase’ from chapter 3 of the Unicode book.
The constant ‘VCount’ from chapter 3 of the Unicode book.
A list of all property symbols which denote Bidi classes.
A list of all property symbols which denote binary properties.
A hash tables which maps property symbols (see PROPERTY-SYMBOL) to their "canonical names", i.e. to strings.
A hash table which maps code points to three-element lists containing the lowercase, uppercase, and titlecasse mapping of the corresponding character (unless all of them are NIL).
A list of all property symbols which denote blocks.
A hash table which maps code points to the corresponding character names.
A hash table which maps code points to the corresponding Unicode 1.0 character names.
A list of Character Decomposition compatibility formatting tags.
A list of all property symbols which denote general categories.
A hash table which (case-insensitively) maps Hangul syllable name parts to their code points.
A hash table which maps code points to their Jamo short names. Needed to compute Hangul syllable names - see COMPUTE-HANGUL-NAME.
A hash table which (case-insensitively) maps "canonicalized" character names to their code points.
A stack which holds the previous readtables that have been pushed here by ENABLE-ALTERNATIVE-CHARACTER-SYNTAX.
A hash table which maps property names to the long name for the property.
A hash table which (case-insensitively) maps "canonicalized" property names (including aliases) to the corresponding property symbols.
A hash table which maps property symbols to a test function which tests for the corresponding property.
A list of all property symbols which denote scripts.
A hash table which maps code points to a list of special case mapping rules.
A hash table which (case-insensitively) maps "canonicalized" Unicode 1.0 character names to their code points.
Simple helper macro to define some constants needed for the Hangul algorithm below.
Helper macro so that C can be treated like a code point even if it is a Lisp character.
Internal function used to restore previous readtable.
Internal function used to enable alternative character syntax and store current readtable on stack.
Computes the names for all Hangul syllables and registers them in the *HANGUL-SYLLABLES-TO-CODE-POINTS* hash table. Used for CHARACTER-NAMED.
Unicode Canonical Composition algorithm. See: https://dev.w3.org/cvsweb/~checkout~/charlint/charlint.pl?rev=1.28;content-type=text%2Fx-perl
Unicode Canonical Sort algorithm
Returns a true value if CODE-POINT is the code point of a CJK unified ideograph for which we can algorithmically derive the name.
Algorithmically derives the Hangul syllable canonical decomposition.
Algorithmically derives the Hangul syllable name (the part behind "HANGUL SYLLABLE ") of the character with code point CODE-POINT as described in section 3.12 of the Unicode book.
Evaluates casing condition. Requires proper implementation. Currently handles unconditional cases.
Returns a true value if CODE-POINT is the code point of a Hangul syllable for which we can algorithmically derive the name.
Returns the long-name of the given property alias
Computes the name for CODE-POINT if CODE-POINT denotes a CJK unified ideograph the name of which can be algorithmically derived.
Computes the name for CODE-POINT if CODE-POINT denotes a Hangul syllable the name of which can be algorithmically derived.
Computes the code point for NAME if NAME is the name of a CJK unified ideograph the name of which can be algorithmically derived.
Computes the code point for NAME if NAME is the name of a Hangul syllable the name of which can be algorithmically derived.
Parses STRING as a hexadecimal number.
Converts NAME to a property symbol using PROPERTY-SYMBOL and "registers" it in the *CANONICAL-NAMES* hash table.
Utility function to signal conditions of type SIMPLE-UNICODE-ERROR.
Looks up an attribute for CODE-POINT in the binary search tree TREE. TREE is a tree as created by BUILD-TREE.
Helper function called by CHARACTER-NAMED when the :TRY-ABBREVIATIONS-P keyword argument is true. Tries to interpret NAME as an abbreviation for a longer Unicode name and returns the corresponding code point if it succeeds.
The reader function used when the alternative character syntax is enabled.
integer
)) ¶integer
)) ¶Return case folding for a character or list of characters.
integer
)) ¶Decomposes input according to Unicode Compatibility Decomposition rules.
integer
)) ¶Returns the simple case mapping for the character C
(a code point or a Lisp character) in position POSITION where 0 means
lowercase, 1 uppercase, and 2 titlecase. Returns a character if
WANT-CODE-POINT-P is NIL and a code point otherwise.
Returns the special case mapping for the character C
(a code point or a Lisp character) in position POSITION where 0 means
lowercase, 1 uppercase, and 2 titlecase. Returns a code point list.
Error signalled by UNICODE-NAME-READER if a character wasn’t found.
reader-error
.
unicode-error
.
:name
A subclass of UNICODE-ERROR where the caller can supply a format control and format arguments.
simple-condition
.
unicode-error
.
Jump to: | %
A B C D E F G H I L M N P R S T U W |
---|
Jump to: | %
A B C D E F G H I L M N P R S T U W |
---|
Jump to: | *
+
C N S |
---|
Jump to: | *
+
C N S |
---|
Jump to: | A C D F H L M P S T U |
---|
Jump to: | A C D F H L M P S T U |
---|