This is the uax-15 Reference Manual, version 0.1.3, generated automatically by Declt version 4.0 beta 2 "William Riker" on Sun Dec 08 19:50:31 2024 GMT+0.
The main system appears first, followed by any subsystem dependency.
uax-15
Common lisp implementation of Unicode normalization functions :nfc, :nfd, :nfkc and :nfkd (Uax-15)
Takeru Ohta, Sabra Crolleton <sabra.crolleton@gmail.com>
MIT
0.1.3
split-sequence
(system).
cl-ppcre
(system).
src
(module).
Modules are listed depth-first from the system components tree.
uax-15/src
uax-15
(system).
package.lisp
(file).
utilities.lisp
(file).
trivial-utf-16.lisp
(file).
precomputed-tables.lisp
(file).
normalize-backend.lisp
(file).
uax-15.lisp
(file).
Files are sorted by type and then listed depth-first from the systems components trees.
uax-15/uax-15.asd
uax-15/src/package.lisp
uax-15/src/utilities.lisp
uax-15/src/trivial-utf-16.lisp
uax-15/src/precomputed-tables.lisp
uax-15/src/normalize-backend.lisp
uax-15/src/uax-15.lisp
uax-15/src/utilities.lisp
package.lisp
(file).
src
(module).
print-object
(method).
bad-char-error
(function).
bad-char-error
(condition).
bad-char-error-message
(reader method).
(setf bad-char-error-message)
(writer method).
bad-char-error-normalization-form
(reader method).
(setf bad-char-error-normalization-form)
(writer method).
bad-char-error-value
(reader method).
(setf bad-char-error-value)
(writer method).
int-to-hex-string
(function).
nconcf
(macro).
parse-hex-list-to-string
(function).
parse-hex-string-to-char
(function).
parse-hex-string-to-int
(function).
parse-hex-string-to-string
(function).
uax-15/src/trivial-utf-16.lisp
package.lisp
(file).
src
(module).
codepoint-as-utf-16
(function).
from-unicode-string
(function).
surrogates-to-codepoint
(function).
to-unicode-string
(function).
unicode-string
(type).
char-from-hexstring
(function).
codepoint-to-unicode-point
(function).
decode-utf-16
(function).
encode-utf-16
(function).
high-surrogate
(type).
low-surrogate
(type).
unicode-point
(type).
unicode-point-p
(function).
uax-15/src/precomputed-tables.lisp
package.lisp
(file).
utilities.lisp
(file).
trivial-utf-16.lisp
(file).
src
(module).
*canonical-combining-class*
(special variable).
*canonical-comp-map*
(special variable).
*canonical-decomp-map*
(special variable).
*compatible-decomp-map*
(special variable).
*composition-exclusions-data*
(special variable).
*data-directory*
(special variable).
*unicode-data*
(special variable).
*unicode-letters*
(special variable).
uax-15/src/normalize-backend.lisp
package.lisp
(file).
utilities.lisp
(file).
precomputed-tables.lisp
(file).
trivial-utf-16.lisp
(file).
src
(module).
canonical-ordering
(function).
compose
(function).
compose-hangul
(function).
decompose
(function).
decompose-char
(function).
decompose-hangul-char
(function).
get-canonical-combining-class
(function).
nfc
(function).
nfd
(function).
nfkc
(function).
nfkd
(function).
uax-15/src/uax-15.lisp
package.lisp
(file).
utilities.lisp
(file).
normalize-backend.lisp
(file).
trivial-utf-16.lisp
(file).
src
(module).
get-canonical-combining-class-map
(function).
get-illegal-char-list
(function).
get-mapping
(function).
normalize
(function).
unicode-letter-p
(function).
*derived-normalization-props-data*
(special variable).
*derived-normalization-props-data-file*
(special variable).
normalize-char
(function).
Packages are listed by definition order.
uax-15
common-lisp
.
codepoint-as-utf-16
(function).
from-unicode-string
(function).
get-canonical-combining-class-map
(function).
get-illegal-char-list
(function).
get-mapping
(function).
normalize
(function).
surrogates-to-codepoint
(function).
to-unicode-string
(function).
unicode-letter-p
(function).
unicode-string
(type).
*canonical-combining-class*
(special variable).
*canonical-comp-map*
(special variable).
*canonical-decomp-map*
(special variable).
*compatible-decomp-map*
(special variable).
*composition-exclusions-data*
(special variable).
*data-directory*
(special variable).
*derived-normalization-props-data*
(special variable).
*derived-normalization-props-data-file*
(special variable).
*unicode-data*
(special variable).
*unicode-letters*
(special variable).
bad-char-error
(function).
bad-char-error
(condition).
bad-char-error-message
(generic reader).
(setf bad-char-error-message)
(generic writer).
bad-char-error-normalization-form
(generic reader).
(setf bad-char-error-normalization-form)
(generic writer).
bad-char-error-value
(generic reader).
(setf bad-char-error-value)
(generic writer).
canonical-ordering
(function).
char-from-hexstring
(function).
codepoint-to-unicode-point
(function).
compose
(function).
compose-hangul
(function).
decode-utf-16
(function).
decompose
(function).
decompose-char
(function).
decompose-hangul-char
(function).
encode-utf-16
(function).
get-canonical-combining-class
(function).
high-surrogate
(type).
int-to-hex-string
(function).
low-surrogate
(type).
nconcf
(macro).
nfc
(function).
nfd
(function).
nfkc
(function).
nfkd
(function).
normalize-char
(function).
parse-hex-list-to-string
(function).
parse-hex-string-to-char
(function).
parse-hex-string-to-int
(function).
parse-hex-string-to-string
(function).
unicode-point
(type).
unicode-point-p
(function).
Definitions are sorted by export status, category, package, and then by lexicographic order.
Translate a Unicode code point to its UTF-16 representation. Returns a list of one or two codepoints. Passes surrogate code points straight through.
Take a vector of Unicode code points and turn it into a Lisp string.
Takes a normalization form, e.g. :nfkc and returns a list of lists of form (#NO-BREAK_SPACE NIL) where the first item is the character name and the second item has the value N or M or nil indicating whether the character may require renormalization.
Note no mapping for :nfkc
Base external function which calls the appropriate normalization for the normalization form. The default normaliation form is :nfkc, but :nfd, :nfkd and :nfc are also available.
Translate a pair of surrogate codepoints to a non-BMP codepoint. Returns the codepoint as an integer.
Take a Lisp string and turn it into a vector of Unicode code points.
Returns T if the character is one of the unicode characters falling into a letter category: uppercase, lowercase, titlecase, modifier and other.
Translating the first char from *unicode-data* hex codepoint string to a lisp character.
Translates an integer to a unicode-point
Turn a vector of UTF-16 code units into a vector of Unicode code points. Passes unpaired surrogate codepoints straight through.
Turn a vector of Unicode code points into a vector of UTF-16 code units. Indifferent to unpaired surrogates.
Runs normalize on a single character input and returns a single character string. You must provide the normalization form (:nfd, :nfkd, :nfc, or :nfkc)
Takes a list of numbers and returns a string of characters
Parse a hex string which is a single character into a character using code-char.
Parse a string which is a single character in hex to a decimal.
Takes a string which may be one or more hex numbers e.g. ’0044 0307’, builds an array of characters, coerces to string and returns the string. Mostly used for testing.
bad-char-error
)) ¶bad-char-error
)) ¶bad-char-error
)) ¶bad-char-error
)) ¶bad-char-error
)) ¶bad-char-error
)) ¶error
.
Text message indicating what went wrong with the validation.
(quote nil)
:message
The value of the field for which the error is signalled.
(quote nil)
:value
The normalization form for the error was signalled.
(quote nil)
:normalization-form
Jump to: | (
B C D E F G I M N P S T U |
---|
Jump to: | (
B C D E F G I M N P S T U |
---|
Jump to: | *
M N S V |
---|
Jump to: | *
M N S V |
---|
Jump to: | B C F H L M N P S T U |
---|
Jump to: | B C F H L M N P S T U |
---|