The langutils Reference Manual

This is the langutils Reference Manual, version 1.0, generated automatically by Declt version 4.0 beta 2 "William Riker" on Mon Feb 26 15:23:13 2024 GMT+0.

1 Introduction
2 Systems
- 2.1 langutils
3 Modules
- 3.1 langutils/src
4 Files
- 4.1 Lisp
5 Packages
6 Definitions
- 6.1 Public Interface
- 6.2 Internals
Appendix A Indexes

1 Introduction

2 Systems

The main system appears first, followed by any subsystem dependency.

langutils

2.1 `langutils`

Language utilities

Author

Ian Eslick

License

BSD

Version

1.0

Dependencies

s-xml-rpc (system).
stdutils (system).

Source

langutils.asd.

Child Component

src (module).

3 Modules

Modules are listed depth-first from the system components tree.

langutils/src

3.1 `langutils/src`

Source

langutils.asd.

Parent Component

langutils (system).

Child Components

package.lisp (file).
config.lisp (file).
tokens.lisp (file).
reference.lisp (file).
stopwords.lisp (file).
my-meta.lisp (file).
tokenize.lisp (file).
lexicon.lisp (file).
lemma.lisp (file).
porter.lisp (file).
contextual-rule-parser.lisp (file).
tagger-data.lisp (file).
tagger.lisp (file).
chunker-constants.lisp (file).
chunker.lisp (file).
concept.lisp (file).
init.lisp (file).

4 Files

Files are sorted by type and then listed depth-first from the systems components trees.

Lisp

4.1 Lisp

langutils/langutils.asd
langutils/src/package.lisp
langutils/src/config.lisp
langutils/src/tokens.lisp
langutils/src/reference.lisp
langutils/src/stopwords.lisp
langutils/src/my-meta.lisp
langutils/src/tokenize.lisp
langutils/src/lexicon.lisp
langutils/src/lemma.lisp
langutils/src/porter.lisp
langutils/src/contextual-rule-parser.lisp
langutils/src/tagger-data.lisp
langutils/src/tagger.lisp
langutils/src/chunker-constants.lisp
langutils/src/chunker.lisp
langutils/src/concept.lisp
langutils/src/init.lisp

4.1.1 `langutils/langutils.asd`

Source: langutils.asd.
Parent Component: langutils (system).
ASDF Systems: langutils.
Packages: langutils.system.

4.1.2 `langutils/src/package.lisp`

Source

langutils.asd.

Parent Component

src (module).

Packages

my-meta.
langutils.
langutils-tokenize.

4.1.3 `langutils/src/config.lisp`

Dependency

package.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Internals

*auto-init* (special variable).
*config-paths* (special variable).
*default-concise-stopwords-file* (special variable).
*default-contextual-rule-file* (special variable).
*default-lexical-rule-file* (special variable).
*default-lexicon-file* (special variable).
*default-stems-file* (special variable).
*default-stopwords-file* (special variable).
*default-token-map-file* (special variable).
*report-status* (special variable).
handle-config-entry (function).
read-config (function).
relative-pathname (function).
write-log (macro).

4.1.4 `langutils/src/tokens.lisp`

Dependency

config.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

get-token-count (function).
id-for-token (function).
ids-for-tokens (function).
string->token-array (function).
suspicious-string? (function).
suspicious-word? (method).
token-for-id (function).
tokens-for-ids (function).

Internals

*add-to-map-hook* (special variable).
*external-token-map* (special variable).
*id-for-token-hook* (special variable).
*id-table* (special variable).
*max-token-nums* (constant).
*max-token-others* (constant).
*suspicious-words* (special variable).
*token-counter* (special variable).
*token-counter-hook* (special variable).
*token-dirty-bit* (special variable).
*token-for-id-hook* (special variable).
*token-table* (special variable).
*tokens-load-file* (special variable).
*whitespace-chars* (constant).
add-external-mapping (function).
add-to-map-hook (function).
ensure-token-counts (function).
id-for-token-hook (function).
ids-for-string (function).
initialize-tokens (function).
reset-token-counts (function).
token-counter-hook (function).
token-for-id-hook (function).

4.1.5 `langutils/src/reference.lisp`

Dependency

tokens.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

add-word (method).
altered-phrase (class).
change-word (method).
change-word (method).
document-annotations (reader method).
(setf document-annotations) (writer method).
document-tags (reader method).
(setf document-tags) (writer method).
document-text (reader method).
(setf document-text) (writer method).
find-phrase (method).
find-phrase-intervals (method).
find-phrase-intervals (method).
get-annotation (method).
get-annotation (method).
get-tag (method).
get-tag (method).
get-tag (method).
get-token-id (method).
get-token-id (method).
get-token-id (method).
lemmatize-phrase (method).
lemmatize-phrase (method).
length-of (method).
make-alterable-phrase (method).
make-phrase (function).
make-phrase-from-sentence (function).
make-phrase-from-vdoc (function).
make-vector-document (function).
phrase (class).
phrase->string (method).
phrase->token-array (method).
phrase-distance (method).
phrase-document (method).
phrase-document (reader method).
(setf phrase-document) (writer method).
phrase-end (method).
phrase-end (reader method).
(setf phrase-end) (writer method).
phrase-equal (method).
phrase-lemmas (method).
phrase-length (method).
phrase-length (method).
phrase-overlap (method).
phrase-start (method).
phrase-start (reader method).
(setf phrase-start) (writer method).
phrase-type (reader method).
(setf phrase-type) (writer method).
phrase-words (function).
print-object (method).
print-phrase (method).
print-phrase-lemmas (method).
print-vector-document (method).
print-window (method).
read-vector-document (method).
read-vector-document-to-string (method).
remove-word (method).
remove-word (method).
set-annotation (method).
set-annotation (method).
string-tag (function).
string-tag-tokenized (function).
unset-annotation (method).
unset-annotation (method).
vector-document (function).
vector-document (class).
vector-document-string (method).
vector-document-words (method).
write-vector-document (method).

Internals

*temp-phrase* (special variable).
*test* (special variable).
altered-phrase-custom-document (reader method).
(setf altered-phrase-custom-document) (writer method).
copy-phrase (method).
document-window-as-string (method).
make-document-from-phrase (method).
person-token-offset (function).
phrase-annotations (reader method).
(setf phrase-annotations) (writer method).
print-token-array (function).
temp-phrase (function).
token-array->words (function).
vector-doc-as-ids (method).
vector-doc-as-words (method).

4.1.6 `langutils/src/stopwords.lisp`

Dependency

reference.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

concise-stopword? (function).
contains-is? (function).
stopword? (function).
string-concise-stopword? (function).
string-contains-is? (function).
string-stopword? (function).

Internals

*concise-stopwords* (special variable).
*is-token* (special variable).
*s-token* (special variable).
*stopwords* (special variable).
clean-stopwords (function).
init-concise-stopwords (function).
init-stopwords (function).
init-word-test (function).

4.1.7 `langutils/src/my-meta.lisp`

Dependency

stopwords.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

disable-meta-syntax (function).
enable-meta-syntax (function).
print-object (method).
with-list-meta (macro).
with-stream-meta (macro).
with-string-meta (macro).

Internals

*meta-readtable* (special variable).
*saved-readtable* (special variable).
compile-list (function).
compileit (function).
copy-meta (function).
list-match (macro).
list-match-type (macro).
make-meta (function).
meta (structure).
meta-char (reader).
(setf meta-char) (writer).
meta-form (reader).
(setf meta-form) (writer).
meta-p (function).
meta-reader (function).
stream-match (macro).
stream-match-type (macro).
string-match (macro).
string-match-type (macro).
symbol-name-equal (function).

4.1.8 `langutils/src/tokenize.lisp`

Dependency

my-meta.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

tokenize-stream (function).
tokenize-string (function).

Internals

alpha (type).
alpha-lower (type).
alpha-lowercase (function).
alpha-misc (function).
alpha-upper (type).
alpha-uppercase (function).
alphanum (type).
digit (type).
end-of-sentence (condition).
known-abbreviations (special variable).
non-digit (type).
non-digit-or-ws (type).
non-punc-or-white (type).
non-whitespace (type).
punctuation (type).
tokenize-file2 (function).
whitespace (type).

4.1.9 `langutils/src/lexicon.lisp`

Dependency

tokenize.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

get-lexicon-case-forms (function).
get-lexicon-default-pos (function).
(setf get-lexicon-entry) (setf expander).
get-lexicon-entry (function).
lexicon-entry (structure).
lexicon-entry-id (reader).
(setf lexicon-entry-id) (writer).
lexicon-entry-roots (reader).
(setf lexicon-entry-roots) (writer).
lexicon-entry-surface-forms (reader).
(setf lexicon-entry-surface-forms) (writer).
lexicon-entry-tag (function).
lexicon-entry-tags (reader).
(setf lexicon-entry-tags) (writer).

Internals

*lexicon* (special variable).
add-basic-entry (function).
add-root (function).
add-root-forms (function).
add-roots (function).
add-surface-form (function).
add-unknown-lexicon-entry (function).
clean-lexicon (function).
copy-lexicon-entry (function).
ensure-lexicon-entry (function).
init-lexicon (function).
lexicon-entry-case-forms (reader).
(setf lexicon-entry-case-forms) (writer).
lexicon-entry-p (function).
make-cases (function).
make-lexicon-entry (function).
set-lexicon-entry (function).
with-static-memory-allocation (macro).

4.1.10 `langutils/src/lemma.lisp`

Dependency

lexicon.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

get-lemma (function).
get-lemma-for-id (function).
in-pos-class? (function).
lemmatize (method).
lemmatize (method).
morph-case-surface-forms (function).
morph-surface-forms (function).
morph-surface-forms-text (function).

Internals

*get-determiners* (function).
*pos-class-map* (special variable).
select-token (function).

4.1.11 `langutils/src/porter.lisp`

Dependency

lemma.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Internals

consonantp (function).
cvc (function).
doublec (function).
ends (function).
m (function).
r (function).
setto (function).
stem (function).
step1ab (function).
step1c (function).
step2 (function).
step3 (function).
step4 (function).
step5 (function).
vowelinstem (function).

4.1.12 `langutils/src/contextual-rule-parser.lisp`

Dependency

porter.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Internals

*contextual-rule-args* (special variable).
def-contextual-rule-parser (macro).
gen-rule-arg-bindings (function).
gen-rule-arg-decls (function).
gen-rule-closure (function).
gen-rule-closure-decl (function).
gen-rule-match (function).
get-bind-entry (function).

4.1.13 `langutils/src/tagger-data.lisp`

Dependency

contextual-rule-parser.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Internals

apply-rules (function).
guess-tag (function).
load-contextual-rules (function).
load-lexical-rules (function).
make-contextual-rule (function).
make-lexical-rule (function).

4.1.14 `langutils/src/tagger.lisp`

Dependency

tagger-data.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

clean-tagger (function).
init-tagger (function).
initial-tag (function).
read-and-tag-file (function).
read-file-as-tagged-document (function).
tag (function).
tag-tokenized (function).
vector-tag (function).
vector-tag-tokenized (function).

Internals

*tagger-bigrams* (special variable).
*tagger-contextual-rules* (special variable).
*tagger-lexical-rules* (special variable).
*tagger-wordlist* (special variable).
apply-contextual-rules (function).
default-tag (function).
duplicate-from (function).
load-tagger-files (function).
read-file-to-string (function).
return-vector-doc (function).
test-vector-tag-tokenized (function).
write-temp (function).

4.1.15 `langutils/src/chunker-constants.lisp`

Dependency

tagger.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Internals

adv-pattern (constant).
noun-pattern (constant).
p-pattern (constant).
verb-pattern (constant).

4.1.16 `langutils/src/chunker.lisp`

Dependency

chunker-constants.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

chunk (function).
chunk-tokenized (function).
get-adverb-chunks (method).
get-event-chunks (method).
get-extended-event-chunks1 (method).
get-extended-event-chunks2 (method).
get-imperative-chunks (method).
get-nx-chunks (method).
get-p-chunks (method).
get-pp-chunks (method).
get-vx-chunks (method).
head-verb (function).
head-verbs (function).
root-noun (function).
root-nouns (function).

Internals

*common-verbs* (special variable).
all-vx+nx-phrases (function).
ensure-common-verbs (function).
get-basic-chunks (method).
test-phrase (function).

4.1.17 `langutils/src/concept.lisp`

Dependency

chunker.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

associate-concepts (function).
concat-concepts (method).
concept->string (method).
concept->token-array (method).
concept->words (method).
concept-contains (method).
conceptually-equal (method).
conceptually-equal (method).
conceptually-equal (method).
conceptually-equal (method).
force-concept (function).
make-concept (function).
phrase->concept (function).
print-object (method).
string->concept (function).
token-array->concept (function).
token-vector (reader method).
words->concept (function).

Internals

*concept-store-scratch-array* (special variable).
*concept-vhash* (special variable).
clear-concept-cache (method).
concept (class).
ensure-concept (function).
lookup-canonical-concept-instance (method).
lookup-canonical-concept-instance (method).
register-new-concept-instance (method).
test-concept-equality (function).

4.1.18 `langutils/src/init.lisp`

Dependency

concept.lisp (file).

Source

langutils.asd.

Parent Component

src (module).

Public Interface

clean-langutils (function).
init-langutils (function).
reset-langutils (function).

5 Packages

Packages are listed by definition order.

my-meta
langutils
langutils-tokenize
langutils.system

5.1 `my-meta`

Source

package.lisp.

Use List

common-lisp.

Used By List

langutils-tokenize.

Public Interface

disable-meta-syntax (function).
enable-meta-syntax (function).
with-list-meta (macro).
with-stream-meta (macro).
with-string-meta (macro).

Internals

*meta-readtable* (special variable).
*saved-readtable* (special variable).
compile-list (function).
compileit (function).
copy-meta (function).
list-match (macro).
list-match-type (macro).
make-meta (function).
meta (structure).
meta-char (reader).
(setf meta-char) (writer).
meta-form (reader).
(setf meta-form) (writer).
meta-p (function).
meta-reader (function).
stream-match (macro).
stream-match-type (macro).
string-match (macro).
string-match-type (macro).
symbol-name-equal (function).

5.2 `langutils`

Source

package.lisp.

Use List

common-lisp.
stdutils.

Public Interface

add-word (generic function).
altered-phrase (class).
associate-concepts (function).
change-word (generic function).
chunk (function).
chunk-tokenized (function).
clean-langutils (function).
clean-tagger (function).
concat-concepts (generic function).
concept->string (generic function).
concept->token-array (generic function).
concept->words (generic function).
concept-contains (generic function).
conceptually-equal (generic function).
concise-stopword? (function).
contains-is? (function).
document-annotations (generic reader).
(setf document-annotations) (generic writer).
document-tags (generic reader).
(setf document-tags) (generic writer).
document-text (generic reader).
(setf document-text) (generic writer).
find-phrase (generic function).
find-phrase-intervals (generic function).
force-concept (function).
get-adverb-chunks (generic function).
get-annotation (generic function).
get-event-chunks (generic function).
get-extended-event-chunks1 (generic function).
get-extended-event-chunks2 (generic function).
get-imperative-chunks (generic function).
get-lemma (function).
get-lemma-for-id (function).
get-lexicon-case-forms (function).
get-lexicon-default-pos (function).
(setf get-lexicon-entry) (setf expander).
get-lexicon-entry (function).
get-nx-chunks (generic function).
get-p-chunks (generic function).
get-pp-chunks (generic function).
get-tag (generic function).
get-token-count (function).
get-token-id (generic function).
get-vx-chunks (generic function).
head-verb (function).
head-verbs (function).
id-for-token (function).
ids-for-tokens (function).
in-pos-class? (function).
init-langutils (function).
init-tagger (function).
initial-tag (function).
lemmatize (generic function).
lemmatize-phrase (generic function).
length-of (generic function).
lexicon-entry (structure).
lexicon-entry-id (reader).
(setf lexicon-entry-id) (writer).
lexicon-entry-roots (reader).
(setf lexicon-entry-roots) (writer).
lexicon-entry-surface-forms (reader).
(setf lexicon-entry-surface-forms) (writer).
lexicon-entry-tag (function).
lexicon-entry-tags (reader).
(setf lexicon-entry-tags) (writer).
make-alterable-phrase (generic function).
make-concept (function).
make-phrase (function).
make-phrase-from-sentence (function).
make-phrase-from-vdoc (function).
make-vector-document (function).
morph-case-surface-forms (function).
morph-surface-forms (function).
morph-surface-forms-text (function).
phrase (class).
phrase->concept (function).
phrase->string (generic function).
phrase->token-array (generic function).
phrase-distance (generic function).
phrase-document (generic function).
(setf phrase-document) (generic writer).
phrase-end (generic function).
(setf phrase-end) (generic writer).
phrase-equal (generic function).
phrase-lemmas (generic function).
phrase-length (generic function).
phrase-overlap (generic function).
phrase-start (generic function).
(setf phrase-start) (generic writer).
phrase-type (generic reader).
(setf phrase-type) (generic writer).
phrase-words (function).
print-phrase (generic function).
print-phrase-lemmas (generic function).
print-vector-document (generic function).
print-window (generic function).
read-and-tag-file (function).
read-file-as-tagged-document (function).
read-vector-document (generic function).
read-vector-document-to-string (generic function).
remove-word (generic function).
reset-langutils (function).
root-noun (function).
root-nouns (function).
set-annotation (generic function).
stopword? (function).
string->concept (function).
string->token-array (function).
string-concise-stopword? (function).
string-contains-is? (function).
string-stopword? (function).
string-tag (function).
string-tag-tokenized (function).
suspicious-string? (function).
suspicious-word? (generic function).
tag (function).
tag-tokenized (function).
token-array->concept (function).
token-for-id (function).
token-vector (generic reader).
tokens-for-ids (function).
unset-annotation (generic function).
vector-document (function).
vector-document (class).
vector-document-string (generic function).
vector-document-words (generic function).
vector-tag (function).
vector-tag-tokenized (function).
words->concept (function).
write-vector-document (generic function).

Internals

*add-to-map-hook* (special variable).
*auto-init* (special variable).
*common-verbs* (special variable).
*concept-store-scratch-array* (special variable).
*concept-vhash* (special variable).
*concise-stopwords* (special variable).
*config-paths* (special variable).
*contextual-rule-args* (special variable).
*default-concise-stopwords-file* (special variable).
*default-contextual-rule-file* (special variable).
*default-lexical-rule-file* (special variable).
*default-lexicon-file* (special variable).
*default-stems-file* (special variable).
*default-stopwords-file* (special variable).
*default-token-map-file* (special variable).
*external-token-map* (special variable).
*get-determiners* (function).
*id-for-token-hook* (special variable).
*id-table* (special variable).
*is-token* (special variable).
*lexicon* (special variable).
*max-token-nums* (constant).
*max-token-others* (constant).
*pos-class-map* (special variable).
*report-status* (special variable).
*s-token* (special variable).
*stopwords* (special variable).
*suspicious-words* (special variable).
*tagger-bigrams* (special variable).
*tagger-contextual-rules* (special variable).
*tagger-lexical-rules* (special variable).
*tagger-wordlist* (special variable).
*temp-phrase* (special variable).
*test* (special variable).
*token-counter* (special variable).
*token-counter-hook* (special variable).
*token-dirty-bit* (special variable).
*token-for-id-hook* (special variable).
*token-table* (special variable).
*tokens-load-file* (special variable).
*whitespace-chars* (constant).
add-basic-entry (function).
add-external-mapping (function).
add-root (function).
add-root-forms (function).
add-roots (function).
add-surface-form (function).
add-to-map-hook (function).
add-unknown-lexicon-entry (function).
adv-pattern (constant).
all-vx+nx-phrases (function).
altered-phrase-custom-document (generic reader).
(setf altered-phrase-custom-document) (generic writer).
apply-contextual-rules (function).
apply-rules (function).
clean-lexicon (function).
clean-stopwords (function).
clear-concept-cache (generic function).
concept (class).
consonantp (function).
copy-lexicon-entry (function).
copy-phrase (generic function).
cvc (function).
def-contextual-rule-parser (macro).
default-tag (function).
document-window-as-string (generic function).
doublec (function).
duplicate-from (function).
ends (function).
ensure-common-verbs (function).
ensure-concept (function).
ensure-lexicon-entry (function).
ensure-token-counts (function).
gen-rule-arg-bindings (function).
gen-rule-arg-decls (function).
gen-rule-closure (function).
gen-rule-closure-decl (function).
gen-rule-match (function).
get-basic-chunks (generic function).
get-bind-entry (function).
guess-tag (function).
handle-config-entry (function).
id-for-token-hook (function).
ids-for-string (function).
init-concise-stopwords (function).
init-lexicon (function).
init-stopwords (function).
init-word-test (function).
initialize-tokens (function).
lexicon-entry-case-forms (reader).
(setf lexicon-entry-case-forms) (writer).
lexicon-entry-p (function).
load-contextual-rules (function).
load-lexical-rules (function).
load-tagger-files (function).
lookup-canonical-concept-instance (generic function).
m (function).
make-cases (function).
make-contextual-rule (function).
make-document-from-phrase (generic function).
make-lexical-rule (function).
make-lexicon-entry (function).
noun-pattern (constant).
p-pattern (constant).
person-token-offset (function).
phrase-annotations (generic reader).
(setf phrase-annotations) (generic writer).
print-token-array (function).
r (function).
read-config (function).
read-file-to-string (function).
register-new-concept-instance (generic function).
relative-pathname (function).
reset-token-counts (function).
return-vector-doc (function).
select-token (function).
set-lexicon-entry (function).
setto (function).
stem (function).
step1ab (function).
step1c (function).
step2 (function).
step3 (function).
step4 (function).
step5 (function).
temp-phrase (function).
test-concept-equality (function).
test-phrase (function).
test-vector-tag-tokenized (function).
token-array->words (function).
token-counter-hook (function).
token-for-id-hook (function).
vector-doc-as-ids (generic function).
vector-doc-as-words (generic function).
verb-pattern (constant).
vowelinstem (function).
with-static-memory-allocation (macro).
write-log (macro).
write-temp (function).

5.3 `langutils-tokenize`

Source

package.lisp.

Use List

common-lisp.
my-meta.

Public Interface

tokenize-stream (function).
tokenize-string (function).

Internals

alpha (type).
alpha-lower (type).
alpha-lowercase (function).
alpha-misc (function).
alpha-upper (type).
alpha-uppercase (function).
alphanum (type).
digit (type).
end-of-sentence (condition).
known-abbreviations (special variable).
non-digit (type).
non-digit-or-ws (type).
non-punc-or-white (type).
non-whitespace (type).
punctuation (type).
tokenize-file2 (function).
whitespace (type).

5.4 `langutils.system`

Source

langutils.asd.

Use List

asdf/interface.
common-lisp.

6 Definitions

Definitions are sorted by export status, category, package, and then by lexicographic order.

Public Interface
Internals

6.1 Public Interface

6.1.1 Macros

Macro: with-list-meta ((source-symbol list) &body body)

Package: my-meta.
Source: my-meta.lisp.

Macro: with-stream-meta ((source-symbol stream) &body body)

Package: my-meta.
Source: my-meta.lisp.

Macro: with-string-meta ((source-symbol string-buffer &key start end) &body body)

Package: my-meta.
Source: my-meta.lisp.

6.1.2 Setf expanders

Setf Expander: (setf get-lexicon-entry) (word)

Package: langutils.
Source: lexicon.lisp.
Reader: get-lexicon-entry (function).
Writer: set-lexicon-entry (function).
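
The reader/writer pairing listed above is the shape produced by the short form of `defsetf`. The following is a minimal, self-contained sketch of that mechanism, not the langutils source: the names `*sketch-lexicon*`, `sketch-get-entry`, and `sketch-set-entry` are hypothetical, and both the hash-table storage and the writer's argument order are assumptions.

```lisp
;; Hypothetical sketch, not langutils code.
(defvar *sketch-lexicon* (make-hash-table :test #'equal)
  "Maps word strings to entries.")

(defun sketch-get-entry (word)
  "Reader: return the entry stored for WORD, or NIL if there is none."
  (values (gethash word *sketch-lexicon*)))

(defun sketch-set-entry (word entry)
  "Writer: store ENTRY for WORD and return ENTRY."
  (setf (gethash word *sketch-lexicon*) entry))

;; Short-form DEFSETF: (setf (sketch-get-entry w) e)
;; expands into a call to (sketch-set-entry w e).
(defsetf sketch-get-entry sketch-set-entry)
```

With these definitions, `(setf (sketch-get-entry "cat") '(:noun))` stores the entry through the writer, and `(sketch-get-entry "cat")` reads it back.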

6.1.3 Ordinary functions

Function: associate-concepts (phrases)

Returns the given phrases, lists, or token arrays as a list of pairs. The first element of each pair is the original item and the second is its canonicalized concept instance.

Package: langutils.
Source: concept.lisp.
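
The pairing described for associate-concepts can be sketched without the library. This is an illustration only: `sketch-associate` is a hypothetical name, the canonicalizing function is passed in by the caller, and representing each pair as a cons cell is an assumption that the manual does not confirm.

```lisp
;; Hypothetical sketch, not langutils code.
(defun sketch-associate (items canonicalize)
  "Pair each element of ITEMS with the result of calling CANONICALIZE on it.
Each pair is a cons: (original . canonical)."
  (mapcar (lambda (item)
            (cons item (funcall canonicalize item)))
          items))
```

For example, `(sketch-associate '("The Dogs" "a cat") #'string-downcase)` uses `string-downcase` as a stand-in for real concept canonicalization and keeps every original string next to its normalized form.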

Function: chunk (text)

Returns a phrase-list for the provided text.

Package: langutils.
Source: chunker.lisp.

Function: chunk-tokenized (text)

Returns a phrase-list for the provided tokenized string.

Package: langutils.
Source: chunker.lisp.

Function: clean-langutils ()

Package: langutils.
Source: init.lisp.

Function: clean-tagger ()

Package: langutils.
Source: tagger.lisp.

Function: concise-stopword? (id)

Tests whether id is a 'concise-stopword' word. The concise stopwords are a very small list of words, mainly pronouns and determiners.

Package: langutils.
Source: stopwords.lisp.

Function: contains-is? (ids)

Tests a list of ids for 'is' words.

Package: langutils.
Source: stopwords.lisp.

Function: disable-meta-syntax ()

Package: my-meta.
Source: my-meta.lisp.

Function: enable-meta-syntax ()

Package: my-meta.
Source: my-meta.lisp.

Function: force-concept (c)

Package: langutils.
Source: concept.lisp.

Function: get-lemma (word &key pos noun porter)

Returns the root word string for the provided word string.

Package: langutils.
Source: lemma.lisp.

Function: get-lemma-for-id (id &key pos noun porter)

Returns a lemma id for the provided word id. When pos is supplied, only the root for that pos type is returned and the noun argument is ignored. By default, noun stems nouns to the singular form. porter determines whether the Porter algorithm is used for unknown terms.

Package: langutils.
Source: lemma.lisp.
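
The lookup-then-stem behaviour described for get-lemma-for-id can be sketched in isolation. This is a hedged illustration, not the langutils implementation: every name below is hypothetical, the sketch works on strings instead of token ids, the tiny table stands in for the real lexicon, and `sketch-strip-s` is a crude stand-in that is not the Porter algorithm. Returning unknown words unchanged when stemming is disabled is also an assumption.

```lisp
;; Hypothetical sketch, not langutils code.
(defparameter *sketch-roots*
  (let ((table (make-hash-table :test #'equal)))
    (setf (gethash "mice" table) "mouse"
          (gethash "went" table) "go")
    table)
  "Stand-in lexicon mapping a surface form to its root.")

(defun sketch-strip-s (word)
  "Crude stand-in for a stemmer: drop one trailing s.
This is NOT the Porter algorithm."
  (if (and (> (length word) 1)
           (char= (char word (1- (length word))) #\s))
      (subseq word 0 (1- (length word)))
      word))

(defun sketch-lemma (word &key stem-unknown)
  "Return the root of WORD from the table. For words missing from the
table, apply the fallback stemmer only when STEM-UNKNOWN is true."
  (or (gethash word *sketch-roots*)
      (if stem-unknown (sketch-strip-s word) word)))
```

Known words always come from the table, so `:stem-unknown` only changes the result for words the table does not contain.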

Function: get-lexicon-case-forms (word)

Package: langutils.
Source: lexicon.lisp.

Function: get-lexicon-default-pos (word)

Package: langutils.
Source: lexicon.lisp.

Function: get-lexicon-entry (word)

Package: langutils.
Source: lexicon.lisp.
Setf expander for this function: (setf get-lexicon-entry).

Function: get-token-count ()

Returns the current token counter.

Package: langutils.
Source: tokens.lisp.

Function: head-verb (phrase &key filter-common)

Package: langutils.
Source: chunker.lisp.

Function: head-verbs (phrases &key filter-common)

Package: langutils.
Source: chunker.lisp.

Function: id-for-token (token &optional trim)

Takes a token string and returns a unique id for that character sequence. Beware of whitespace and similar stray characters in the string.

Package: langutils.
Source: tokens.lisp.
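
The whitespace warning for id-for-token follows from keying ids on the exact character sequence. A minimal, self-contained sketch of such string interning is shown here. It is not the langutils implementation: `*sketch-ids*`, `*sketch-counter*`, and `sketch-id-for-token` are hypothetical names, and the single hash table is an assumption about layout.

```lisp
;; Hypothetical sketch, not langutils code.
(defvar *sketch-ids* (make-hash-table :test #'equal)
  "Maps token strings to integer ids. EQUAL compares strings character by character.")

(defvar *sketch-counter* 0
  "The next unused id.")

(defun sketch-id-for-token (token)
  "Return the id already assigned to TOKEN, or assign and return a fresh one."
  (or (gethash token *sketch-ids*)
      (setf (gethash token *sketch-ids*)
            (prog1 *sketch-counter*
              (incf *sketch-counter*)))))
```

In this sketch `"cat"` and `"cat "` receive different ids because the strings differ by one character. Normalizing first, for instance with the standard `(string-trim " " "cat ")`, maps both spellings to the same id.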

Function: ids-for-tokens (tokens)

Package: langutils.
Source: tokens.lisp.

Function: in-pos-class? (element class)

Package: langutils.
Source: lemma.lisp.

Function: init-langutils ()

Package: langutils.
Source: init.lisp.

Function: init-tagger (&optional lexical-rule-file contextual-rule-file)

Package: langutils.
Source: tagger.lisp.

Function: initial-tag (token)

Returns an initial tag for a given token string using the langutils lexicon and the tagger lexical rules (via guess-tag).

Package: langutils.
Source: tagger.lisp.

Reader: lexicon-entry-id (instance)

Writer: (setf lexicon-entry-id) (instance)

Package: langutils.
Source: lexicon.lisp.
Target Slot: id.

Reader: lexicon-entry-roots (instance)

Writer: (setf lexicon-entry-roots) (instance)

Package: langutils.
Source: lexicon.lisp.
Target Slot: roots.

Reader: lexicon-entry-surface-forms (instance)

Writer: (setf lexicon-entry-surface-forms) (instance)

Package: langutils.
Source: lexicon.lisp.
Target Slot: surface-forms.

Function: lexicon-entry-tag (entry) ¶

Package: langutils.
Source: lexicon.lisp.

Reader: lexicon-entry-tags (instance) ¶

Writer: (setf lexicon-entry-tags) (instance) ¶

Package: langutils.
Source: lexicon.lisp.
Target Slot: tags.

Function: make-concept (ta) ¶

Package: langutils.
Source: concept.lisp.

Function: make-phrase (text-array tag-array &optional type) ¶

Takes two arrays, one of text and one of tags, and creates a phrase that points at a vdoc created from the two arrays.

Package: langutils.
Source: reference.lisp.

Function: make-phrase-from-sentence (tok-string &optional tag-array) ¶

Package: langutils.
Source: reference.lisp.

Function: make-phrase-from-vdoc (doc start len &optional type) ¶

Package: langutils.
Source: reference.lisp.

Function: make-vector-document (text &optional tags) ¶

Package: langutils.
Source: reference.lisp.

Function: morph-case-surface-forms (root &optional pos-class) ¶

All cases of morphological surface forms of the provided root

Package: langutils.
Source: lemma.lisp.

Function: morph-surface-forms (root &optional pos-class) ¶

Takes a word or id and returns all surface form ids, or only the forms of class ’pos-class’, where pos-class is one of the symbols V, A, or N in the langutils package.

Package: langutils.
Source: lemma.lisp.

Function: morph-surface-forms-text (root &optional pos-class) ¶

Package: langutils.
Source: lemma.lisp.

Function: phrase->concept (p &key lemmatized) ¶

Create a canonical concept from an arbitrary phrase by removing determiners and lemmatizing verbs.

Package: langutils.
Source: concept.lisp.
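
The canonicalization described for phrase->concept (drop determiners, lemmatize verbs) can be sketched as follows. This is a hedged illustration in plain Common Lisp, not the library's code: the input representation (a list of word/tag pairs), the Penn-Treebank-style tags "DT" and "VB...", and the tiny lemma table are all assumptions made for the example.

```lisp
;; Hypothetical illustration only; not the langutils implementation.
(defparameter *sketch-lemmas*
  '(("ate" . "eat") ("running" . "run"))
  "Toy verb lemma table; a real lexicon would be far larger.")

(defun sketch-verb-tag-p (tag)
  "True when TAG starts with VB (assumed Penn-Treebank-style verb tags)."
  (and (>= (length tag) 2)
       (string= "VB" tag :end2 2)))

(defun sketch-phrase->concept (tagged-words)
  "TAGGED-WORDS is a list of (word . tag) pairs. Returns a list of words
with determiners removed and verbs replaced by their lemma when known."
  (loop for (word . tag) in tagged-words
        unless (string= tag "DT")
          collect (if (sketch-verb-tag-p tag)
                      (or (cdr (assoc word *sketch-lemmas* :test #'string=))
                          word)
                      word)))
```

Under these assumptions, the pairs for "the dog ate" reduce to the two words "dog" and "eat".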

Function: phrase-words (phrase &optional index) ¶

Package: langutils.
Source: reference.lisp.

Function: read-and-tag-file (file) ¶

Package: langutils.
Source: tagger.lisp.

Function: read-file-as-tagged-document (file) ¶

Package: langutils.
Source: tagger.lisp.

Function: reset-langutils () ¶

Package: langutils.
Source: init.lisp.

Function: root-noun (phrase) ¶

Package: langutils.
Source: chunker.lisp.

Function: root-nouns (phrases) ¶

Package: langutils.
Source: chunker.lisp.

Function: stopword? (id) ¶

Identifies id as a ’stopword’

Package: langutils.
Source: stopwords.lisp.

Function: string->concept (s &key lemmatized) ¶

Package: langutils.
Source: concept.lisp.

Function: string->token-array (string) ¶

Package: langutils.
Source: tokens.lisp.

Function: string-concise-stopword? (word) ¶

Checks whether the word is a ’concise-stopword’. Concise stopwords are a *very* small list of words, mainly pronouns and determiners.

Package: langutils.
Source: stopwords.lisp.

Function: string-contains-is? (words) ¶

Checks the list for a string containing ’is’

Package: langutils.
Source: stopwords.lisp.

Function: string-stopword? (word) ¶

Package: langutils.
Source: stopwords.lisp.

Function: string-tag (string &optional stream) ¶

Tokenizes and tags the string, returning a standard tagged string using ’/’ as a separator.

Package: langutils.
Source: reference.lisp.
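
The ’/’-separated tagged-string format mentioned for string-tag can be illustrated with a small sketch in plain Common Lisp. It does not call langutils; the `sketch-` functions are hypothetical, and the tags shown in the usage note are example values assumed for illustration rather than output of the tagger.

```lisp
;; Hypothetical illustration only; not the langutils implementation.
(defun sketch-join-tagged (tokens tags)
  "Join the parallel lists TOKENS and TAGS into a word/TAG word/TAG string."
  (format nil "~{~a/~a~^ ~}"
          (loop for token in tokens
                for tag in tags
                collect token
                collect tag)))

(defun sketch-split-tagged (pair)
  "Split a word/TAG string at its last slash; returns word and tag.
Assumes PAIR contains at least one slash."
  (let ((slash (position #\/ pair :from-end t)))
    (values (subseq pair 0 slash)
            (subseq pair (1+ slash)))))
```

For instance, joining the tokens "The", "dog", "barks" with the assumed tags "DT", "NN", "VBZ" gives "The/DT dog/NN barks/VBZ". Splitting at the last slash keeps tokens that themselves contain a slash, such as "1/2", intact.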

Function: string-tag-tokenized (string &optional stream) ¶

Package: langutils.
Source: reference.lisp.

Function: suspicious-string? (string) ¶

Determines whether the balance of alphanumeric characters and digits is reasonable for linguistic processing, or whether non-alphanumeric characters are present.

Package: langutils.
Source: tokens.lisp.
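
One way to picture the kind of check described for suspicious-string? is a simple character count. The sketch below is plain Common Lisp and is not the library's code; the function name and both threshold parameters (and their default values) are assumptions chosen only to make the example concrete.

```lisp
;; Hypothetical illustration only; thresholds are invented for the example.
(defun sketch-suspicious-string-p (string &key (max-nums 3) (max-others 1))
  "True when STRING holds more than MAX-NUMS digits or more than
MAX-OTHERS characters that are neither letters nor digits."
  (let ((nums (count-if #'digit-char-p string))
        (others (count-if-not #'alphanumericp string)))
    (or (> nums max-nums)
        (> others max-others))))
```

With these assumed defaults, "hello" and "x86" pass, while "a1b2c3d4" (four digits) and "@#$%" (four non-alphanumerics) are flagged.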

Function: tag (string) ¶

Package: langutils.
Source: tagger.lisp.

Function: tag-tokenized (string) ¶

Package: langutils.
Source: tagger.lisp.

Function: token-array->concept (tokens &key lemmatized) ¶

Package: langutils.
Source: concept.lisp.

Function: token-for-id (id) ¶

Return a string token for a given token id

Package: langutils.
Source: tokens.lisp.

Function: tokenize-stream (stream &key by-sentence fragment) ¶

Converts a stream into a string and tokenizes it, optionally one sentence at a time, which is useful for large files. Pretty hairy code: a token processor inside a stream scanner. The stream scanner walks the input stream and tokenizes all punctuation (except periods). After a sequence of non-whitespace characters has been read, the inline tokenizer looks at the end of the string for mis-tokenized words (can ’ t -> ca n’t).

Package: langutils-tokenize.
Source: tokenize.lisp.
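
The end-of-word fix-up described for tokenize-stream, where a negative contraction is regrouped as a stem plus "n't", can be sketched in isolation. This is a minimal illustration in plain Common Lisp under the assumption that the word arrives as one string with an ASCII apostrophe; it is not the tokenizer's actual code, and `sketch-split-contraction` is a hypothetical name.

```lisp
;; Hypothetical illustration only; not the langutils tokenizer.
(defun sketch-split-contraction (word)
  "Return a list of tokens for WORD. A trailing n't is split off as its
own token; any other word is returned unchanged in a one-element list."
  (let ((len (length word)))
    (if (and (> len 3)
             (string-equal "n't" word :start2 (- len 3)))
        (list (subseq word 0 (- len 3)) "n't")
        (list word))))
```

Applied to "can't" this yields the two tokens "ca" and "n't", matching the grouping shown in the docstring; a word such as "cat" is left alone.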

Function: tokenize-string (string) ¶

Returns a fresh, linguistically tokenized string

Package: langutils-tokenize.
Source: tokenize.lisp.

Function: tokens-for-ids (ids) ¶

Return a list of string tokens for each id in ids

Package: langutils.
Source: tokens.lisp.

Function: vector-document (input) ¶

Package: langutils.
Source: reference.lisp.

Function: vector-tag (string) ¶

Returns a ’document’ which is a class containing a pair of vectors representing the string in the internal token format. Handles arbitrary data.

Package: langutils.
Source: tagger.lisp.

Function: vector-tag-tokenized (string &key end-tokens) ¶

Returns a document representing the string using the internal token dictionary; requires the string to be tokenized. Parses the string into tokens (whitespace separators), then populates two temporary arrays with token ids and initial tags. Contextual rules are applied and a new vector document is produced, which is a copy of the enclosed data. This is all done at once so good compilers can open-code the array refs and simplify the calling of the labels functions.

Package: langutils.
Source: tagger.lisp.
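
The first two steps described for vector-tag-tokenized (split on whitespace, then fill parallel arrays) can be sketched as below. This is an illustrative outline in plain Common Lisp only: it keeps token strings rather than ids, takes the initial-tag function as a caller-supplied argument, and omits the contextual rules entirely. All `sketch-` names are hypothetical.

```lisp
;; Hypothetical illustration only; not the langutils tagger.
(defun sketch-split-tokens (string)
  "Split STRING on spaces, tabs and newlines; returns a vector of tokens."
  (let ((tokens '())
        (start nil))
    (dotimes (i (length string))
      (let ((ws (member (char string i) '(#\Space #\Tab #\Newline))))
        (cond ((and ws start)
               ;; A token just ended at position I.
               (push (subseq string start i) tokens)
               (setf start nil))
              ((and (not ws) (not start))
               ;; A new token begins at position I.
               (setf start i)))))
    (when start
      (push (subseq string start) tokens))
    (coerce (nreverse tokens) 'vector)))

(defun sketch-vectorize (string initial-tag-fn)
  "Return two parallel vectors: the tokens of STRING and the tag that
INITIAL-TAG-FN assigns to each token."
  (let* ((tokens (sketch-split-tokens string))
         (tags (map 'vector initial-tag-fn tokens)))
    (values tokens tags)))
```

Keeping tokens and tags in two vectors of equal length means position i in one always describes position i in the other, which is the layout the docstring attributes to the vector document.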

Function: words->concept (slist &key lemmatized) ¶

Package: langutils.
Source: concept.lisp.

6.1.4 Generic functions

Generic Function: add-word (p index word tag) ¶

Package

	Index Entry	Section

*
	`add-to-map-hook`:	Private special variables
	`auto-init`:	Private special variables
	`common-verbs`:	Private special variables
	`concept-store-scratch-array`:	Private special variables
	`concept-vhash`:	Private special variables
	`concise-stopwords`:	Private special variables
	`config-paths`:	Private special variables
	`contextual-rule-args`:	Private special variables
	`default-concise-stopwords-file`:	Private special variables
	`default-contextual-rule-file`:	Private special variables
	`default-lexical-rule-file`:	Private special variables
	`default-lexicon-file`:	Private special variables
	`default-stems-file`:	Private special variables
	`default-stopwords-file`:	Private special variables
	`default-token-map-file`:	Private special variables
	`external-token-map`:	Private special variables
	`id-for-token-hook`:	Private special variables
	`id-table`:	Private special variables
	`is-token`:	Private special variables
	`lexicon`:	Private special variables
	`max-token-nums`:	Private constants
	`max-token-others`:	Private constants
	`meta-readtable`:	Private special variables
	`pos-class-map`:	Private special variables
	`report-status`:	Private special variables
	`s-token`:	Private special variables
	`saved-readtable`:	Private special variables
	`stopwords`:	Private special variables
	`suspicious-words`:	Private special variables
	`tagger-bigrams`:	Private special variables
	`tagger-contextual-rules`:	Private special variables
	`tagger-lexical-rules`:	Private special variables
	`tagger-wordlist`:	Private special variables
	`temp-phrase`:	Private special variables
	`test`:	Private special variables
	`token-counter`:	Private special variables
	`token-counter-hook`:	Private special variables
	`token-dirty-bit`:	Private special variables
	`token-for-id-hook`:	Private special variables
	`token-table`:	Private special variables
	`tokens-load-file`:	Private special variables
	`whitespace-chars`:	Private constants

A
	`adv-pattern`:	Private constants
	`annotations`:	Public classes
	`annotations`:	Public classes

C
	`case-forms`:	Public structures
	`char`:	Private structures
	`Constant, max-token-nums`:	Private constants
	`Constant, max-token-others`:	Private constants
	`Constant, whitespace-chars`:	Private constants
	`Constant, adv-pattern`:	Private constants
	`Constant, noun-pattern`:	Private constants
	`Constant, p-pattern`:	Private constants
	`Constant, verb-pattern`:	Private constants
	`custom-document`:	Public classes

D
	`document`:	Public classes

E
	`end`:	Public classes

F
	`form`:	Private structures

I
	`id`:	Public structures

K
	`known-abbreviations`:	Private special variables

N
	`noun-pattern`:	Private constants

P
	`p-pattern`:	Private constants

R
	`roots`:	Public structures

S
	`Slot, annotations`:	Public classes
	`Slot, annotations`:	Public classes
	`Slot, case-forms`:	Public structures
	`Slot, char`:	Private structures
	`Slot, custom-document`:	Public classes
	`Slot, document`:	Public classes
	`Slot, end`:	Public classes
	`Slot, form`:	Private structures
	`Slot, id`:	Public structures
	`Slot, roots`:	Public structures
	`Slot, start`:	Public classes
	`Slot, surface-forms`:	Public structures
	`Slot, tags`:	Public structures
	`Slot, tags`:	Public classes
	`Slot, text`:	Public classes
	`Slot, token-vector`:	Private classes
	`Slot, type`:	Public classes
	`Special Variable, add-to-map-hook`:	Private special variables
	`Special Variable, auto-init`:	Private special variables
	`Special Variable, common-verbs`:	Private special variables
	`Special Variable, concept-store-scratch-array`:	Private special variables
	`Special Variable, concept-vhash`:	Private special variables
	`Special Variable, concise-stopwords`:	Private special variables
	`Special Variable, config-paths`:	Private special variables
	`Special Variable, contextual-rule-args`:	Private special variables
	`Special Variable, default-concise-stopwords-file`:	Private special variables
	`Special Variable, default-contextual-rule-file`:	Private special variables
	`Special Variable, default-lexical-rule-file`:	Private special variables
	`Special Variable, default-lexicon-file`:	Private special variables
	`Special Variable, default-stems-file`:	Private special variables
	`Special Variable, default-stopwords-file`:	Private special variables
	`Special Variable, default-token-map-file`:	Private special variables
	`Special Variable, external-token-map`:	Private special variables
	`Special Variable, id-for-token-hook`:	Private special variables
	`Special Variable, id-table`:	Private special variables
	`Special Variable, is-token`:	Private special variables
	`Special Variable, lexicon`:	Private special variables
	`Special Variable, meta-readtable`:	Private special variables
	`Special Variable, pos-class-map`:	Private special variables
	`Special Variable, report-status`:	Private special variables
	`Special Variable, s-token`:	Private special variables
	`Special Variable, saved-readtable`:	Private special variables
	`Special Variable, stopwords`:	Private special variables
	`Special Variable, suspicious-words`:	Private special variables
	`Special Variable, tagger-bigrams`:	Private special variables
	`Special Variable, tagger-contextual-rules`:	Private special variables
	`Special Variable, tagger-lexical-rules`:	Private special variables
	`Special Variable, tagger-wordlist`:	Private special variables
	`Special Variable, temp-phrase`:	Private special variables
	`Special Variable, test`:	Private special variables
	`Special Variable, token-counter`:	Private special variables
	`Special Variable, token-counter-hook`:	Private special variables
	`Special Variable, token-dirty-bit`:	Private special variables
	`Special Variable, token-for-id-hook`:	Private special variables
	`Special Variable, token-table`:	Private special variables
	`Special Variable, tokens-load-file`:	Private special variables
	`Special Variable, known-abbreviations`:	Private special variables
	`start`:	Public classes
	`surface-forms`:	Public structures

T
	`tags`:	Public structures
	`tags`:	Public classes
	`text`:	Public classes
	`token-vector`:	Private classes
	`type`:	Public classes

V
	`verb-pattern`:	Private constants

	Index Entry	Section

A
	`alpha`:	Private types
	`alpha-lower`:	Private types
	`alpha-upper`:	Private types
	`alphanum`:	Private types
	`altered-phrase`:	Public classes

C
	`chunker-constants.lisp`:	The langutils/src/chunker-constants․lisp file
	`chunker.lisp`:	The langutils/src/chunker․lisp file
	`Class, altered-phrase`:	Public classes
	`Class, concept`:	Private classes
	`Class, phrase`:	Public classes
	`Class, vector-document`:	Public classes
	`concept`:	Private classes
	`concept.lisp`:	The langutils/src/concept․lisp file
	`Condition, end-of-sentence`:	Private conditions
	`config.lisp`:	The langutils/src/config․lisp file
	`contextual-rule-parser.lisp`:	The langutils/src/contextual-rule-parser․lisp file

D
	`digit`:	Private types

E
	`end-of-sentence`:	Private conditions

F
	`File, chunker-constants.lisp`:	The langutils/src/chunker-constants․lisp file
	`File, chunker.lisp`:	The langutils/src/chunker․lisp file
	`File, concept.lisp`:	The langutils/src/concept․lisp file
	`File, config.lisp`:	The langutils/src/config․lisp file
	`File, contextual-rule-parser.lisp`:	The langutils/src/contextual-rule-parser․lisp file
	`File, init.lisp`:	The langutils/src/init․lisp file
	`File, langutils.asd`:	The langutils/langutils․asd file
	`File, lemma.lisp`:	The langutils/src/lemma․lisp file
	`File, lexicon.lisp`:	The langutils/src/lexicon․lisp file
	`File, my-meta.lisp`:	The langutils/src/my-meta․lisp file
	`File, package.lisp`:	The langutils/src/package․lisp file
	`File, porter.lisp`:	The langutils/src/porter․lisp file
	`File, reference.lisp`:	The langutils/src/reference․lisp file
	`File, stopwords.lisp`:	The langutils/src/stopwords․lisp file
	`File, tagger-data.lisp`:	The langutils/src/tagger-data․lisp file
	`File, tagger.lisp`:	The langutils/src/tagger․lisp file
	`File, tokenize.lisp`:	The langutils/src/tokenize․lisp file
	`File, tokens.lisp`:	The langutils/src/tokens․lisp file

I
	`init.lisp`:	The langutils/src/init․lisp file

L
	`langutils`:	The langutils system
	`langutils`:	The langutils package
	`langutils-tokenize`:	The langutils-tokenize package
	`langutils.asd`:	The langutils/langutils․asd file
	`langutils.system`:	The langutils․system package
	`lemma.lisp`:	The langutils/src/lemma․lisp file
	`lexicon-entry`:	Public structures
	`lexicon.lisp`:	The langutils/src/lexicon․lisp file

M
	`meta`:	Private structures
	`Module, src`:	The langutils/src module
	`my-meta`:	The my-meta package
	`my-meta.lisp`:	The langutils/src/my-meta․lisp file

N
	`non-digit`:	Private types
	`non-digit-or-ws`:	Private types
	`non-punc-or-white`:	Private types
	`non-whitespace`:	Private types

P
	`Package, langutils`:	The langutils package
	`Package, langutils-tokenize`:	The langutils-tokenize package
	`Package, langutils.system`:	The langutils․system package
	`Package, my-meta`:	The my-meta package
	`package.lisp`:	The langutils/src/package․lisp file
	`phrase`:	Public classes
	`porter.lisp`:	The langutils/src/porter․lisp file
	`punctuation`:	Private types

R
	`reference.lisp`:	The langutils/src/reference․lisp file

S
	`src`:	The langutils/src module
	`stopwords.lisp`:	The langutils/src/stopwords․lisp file
	`Structure, lexicon-entry`:	Public structures
	`Structure, meta`:	Private structures
	`System, langutils`:	The langutils system

T
	`tagger-data.lisp`:	The langutils/src/tagger-data․lisp file
	`tagger.lisp`:	The langutils/src/tagger․lisp file
	`tokenize.lisp`:	The langutils/src/tokenize․lisp file
	`tokens.lisp`:	The langutils/src/tokens․lisp file
	`Type, alpha`:	Private types
	`Type, alpha-lower`:	Private types
	`Type, alpha-upper`:	Private types
	`Type, alphanum`:	Private types
	`Type, digit`:	Private types
	`Type, non-digit`:	Private types
	`Type, non-digit-or-ws`:	Private types
	`Type, non-punc-or-white`:	Private types
	`Type, non-whitespace`:	Private types
	`Type, punctuation`:	Private types
	`Type, whitespace`:	Private types

V
	`vector-document`:	Public classes

W
	`whitespace`:	Private types

The langutils Reference Manual

Table of Contents

1 Introduction

2 Systems

2.1 langutils

3 Modules

3.1 langutils/src

4 Files

4.1 Lisp

4.1.1 langutils/langutils.asd

4.1.2 langutils/src/package.lisp

4.1.3 langutils/src/config.lisp

4.1.4 langutils/src/tokens.lisp

4.1.5 langutils/src/reference.lisp

4.1.6 langutils/src/stopwords.lisp

4.1.7 langutils/src/my-meta.lisp

4.1.8 langutils/src/tokenize.lisp

4.1.9 langutils/src/lexicon.lisp

4.1.10 langutils/src/lemma.lisp

4.1.11 langutils/src/porter.lisp

4.1.12 langutils/src/contextual-rule-parser.lisp

4.1.13 langutils/src/tagger-data.lisp

4.1.14 langutils/src/tagger.lisp

4.1.15 langutils/src/chunker-constants.lisp

4.1.16 langutils/src/chunker.lisp

4.1.17 langutils/src/concept.lisp

4.1.18 langutils/src/init.lisp

5 Packages

5.1 my-meta

5.2 langutils

5.3 langutils-tokenize

5.4 langutils.system

6 Definitions

6.1 Public Interface

6.1.1 Macros

6.1.2 Setf expanders

6.1.3 Ordinary functions

6.1.4 Generic functions

6.1.5 Standalone methods

6.1.6 Structures

6.1.7 Classes

6.2 Internals

6.2.1 Constants

6.2.2 Special variables

6.2.3 Macros

6.2.4 Ordinary functions

6.2.5 Generic functions

6.2.6 Conditions

6.2.7 Structures

6.2.8 Classes

6.2.9 Types

Appendix A Indexes

A.1 Concepts

A.2 Functions

A.3 Variables

A.4 Data types
