This is the vas-string-metrics Reference Manual, generated automatically by Declt version 2.4 "Will Decker" on Wed Jun 20 12:44:32 2018 GMT+0.
• Introduction: What vas-string-metrics is all about
• Systems: The systems documentation
• Files: The files documentation
• Packages: The packages documentation
• Definitions: The symbols documentation
• Indexes: Concepts, functions, variables and data types
Introduction
vas-string-metrics provides the Jaro, Jaro-Winkler, Soerensen-Dice, Levenshtein, and normalized Levenshtein string distance/similarity metrics.
The Jaro (function jaro-distance), Jaro-Winkler (function jaro-winkler-distance), Soerensen-Dice (function soerensen-dice-coefficient), and normalized Levenshtein (function normalized-levenshtein-distance) algorithms return a number in the range 0 to 1 indicating how similar two given strings are, where 0 indicates no similarity and 1 indicates a perfect match.
The Jaro-Winkler metric is a heuristic suitable for shorter strings (such as place and people names). The Levenshtein distance (function levenshtein-distance) is the minimum number of insertions, deletions, or substitutions needed to transform one string into the other. The Soerensen-Dice coefficient is a statistic suitable for heterogeneous data sets and gives less weight to outliers [1].
The code is distributed under the terms of the LLGPLv3 (see LICENSE for details), except for the unit tests, which are in the public domain.
[1] https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient#Applications
Systems
The main system appears first, followed by any subsystem dependency.
• The vas-string-metrics system
Author: Vladimir Sedach <vsedach@gmail.com>
License: LLGPLv3
Description: Jaro-Winkler and Levenshtein string distance algorithms.
Source: vas-string-metrics.asd (file)
Files
Files are sorted by type and then listed depth-first from the systems' component trees.
• Lisp files
The vas-string-metrics.asd file
Location: vas-string-metrics.asd
Systems: vas-string-metrics (system)
The vas-string-metrics/package.lisp file
Parent: vas-string-metrics (system)
Location: package.lisp
The vas-string-metrics/levenshtein.lisp file
Dependency: package.lisp (file)
Parent: vas-string-metrics (system)
Location: levenshtein.lisp
The vas-string-metrics/jaro-winkler.lisp file
Dependency: package.lisp (file)
Parent: vas-string-metrics (system)
Location: jaro-winkler.lisp
The vas-string-metrics/soerensen-dice.lisp file
Dependency: package.lisp (file)
Parent: vas-string-metrics (system)
Location: soerensen-dice.lisp
Exported Definitions: soerensen-dice-coefficient (function)
Packages
Packages are listed by definition order.
• The vas-string-metrics package
Source: package.lisp (file)
Use List: common-lisp
Definitions
Definitions are sorted by export status, category, package, and then by lexicographic order.
• Exported definitions
• Internal definitions
Exported definitions
• Exported functions
Function: jaro-distance s1 s2
Finds the Jaro distance (measure of similarity) from string s1 to string s2. Returns a value in the range from 0 (no similarity) to 1 (exact match).
Source: jaro-winkler.lisp (file)
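The Jaro measure described above can be sketched in a few lines. The Python below is an independent illustration of the textbook definition, not the code in jaro-winkler.lisp; the name jaro_similarity, the handling of empty strings, and the match-window formula are assumptions taken from the usual description of the metric.

```python
def jaro_similarity(s1: str, s2: str) -> float:
    """Textbook Jaro similarity: 0.0 for no similarity, 1.0 for an exact match."""
    if not s1 or not s2:
        # Assumption: two empty strings count as identical.
        return 1.0 if s1 == s2 else 0.0

    # Characters match only if they are equal and at most `window` positions apart.
    window = max(max(len(s1), len(s2)) // 2 - 1, 0)
    matched1 = [False] * len(s1)
    matched2 = [False] * len(s2)
    matches = 0

    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len(s2), i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break

    if matches == 0:
        return 0.0

    # Matched characters that line up in a different order are counted,
    # and each pair of out-of-order characters is one transposition.
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2

    return (matches / len(s1)
            + matches / len(s2)
            + (matches - transpositions) / matches) / 3
```

For the classic pair "MARTHA" and "MARHTA" there are six matches and one transposition, so the sketch gives (1 + 1 + 5/6) / 3 = 17/18, roughly 0.944.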
Function: jaro-winkler-distance s1 s2
Finds the Jaro-Winkler distance (measure of similarity) from string s1 to string s2. Returns a value in the range from 0 (no similarity) to 1 (exact match).
Source: jaro-winkler.lisp (file)
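The Winkler adjustment rewards strings that share a common prefix, which is why the metric suits short strings such as names. The sketch below applies the commonly cited formula to an already computed Jaro score. The function name winkler_boost, the scaling factor 0.1, and the prefix cap of 4 are assumptions drawn from the usual definition; whether the library uses the same constants, or applies a minimum-score threshold before boosting, is not stated in this manual.

```python
def winkler_boost(jaro_score: float, s1: str, s2: str,
                  scaling: float = 0.1, max_prefix: int = 4) -> float:
    """Raise a Jaro score according to the length of the shared prefix."""
    # Length of the common prefix, capped at max_prefix characters.
    prefix = 0
    for a, b in zip(s1[:max_prefix], s2[:max_prefix]):
        if a != b:
            break
        prefix += 1
    # The boost shrinks as the base score approaches 1, so the result stays in [0, 1]
    # as long as scaling * max_prefix does not exceed 1.
    return jaro_score + prefix * scaling * (1.0 - jaro_score)
```

With a Jaro score of 17/18 for "MARTHA" and "MARHTA" and a shared prefix "MAR" of length 3, the boosted score is 17/18 + 0.3 * (1/18), roughly 0.961.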
Function: levenshtein-distance s1 s2
Finds the Levenshtein distance (minimum number of edits) from string s1 to string s2.
Source: levenshtein.lisp (file)
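The minimum number of edits is usually computed by dynamic programming. The following Python is a minimal sketch of that standard approach, keeping only one row of the table at a time; it is not the implementation in levenshtein.lisp, and the name levenshtein_distance is chosen here for illustration.

```python
def levenshtein_distance(s1: str, s2: str) -> int:
    """Minimum number of single-character insertions, deletions, or substitutions."""
    # previous[j] holds the distance between the processed prefix of s1 and s2[:j].
    previous = list(range(len(s2) + 1))
    for i, c1 in enumerate(s1, start=1):
        current = [i]  # turning s1[:i] into the empty string takes i deletions
        for j, c2 in enumerate(s2, start=1):
            cost = 0 if c1 == c2 else 1
            current.append(min(
                previous[j] + 1,         # delete c1
                current[j - 1] + 1,      # insert c2
                previous[j - 1] + cost,  # substitute c1 with c2, or keep a match
            ))
        previous = current
    return previous[-1]
```

For example, "kitten" becomes "sitting" with two substitutions and one insertion, so the distance is 3.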
Function: normalized-levenshtein-distance s1 s2
Finds the normalized Levenshtein distance (from 0 for no similarity to 1 for exact match) from string s1 to string s2.
Source: levenshtein.lisp (file)
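One common way to map an edit distance onto the 0 to 1 scale described above is to divide by the length of the longer string and subtract the result from 1. The sketch below assumes that normalization; the manual does not say which divisor the library uses, and the function name and the treatment of two empty strings are likewise assumptions.

```python
def normalized_levenshtein(distance: int, len1: int, len2: int) -> float:
    """Map an edit distance to a similarity: 1.0 for an exact match, 0.0 for none."""
    longest = max(len1, len2)
    if longest == 0:
        # Assumption: two empty strings are treated as an exact match.
        return 1.0
    # The edit distance never exceeds the longer length, so the result stays in [0, 1].
    return 1.0 - distance / longest
```

With the distance of 3 between "kitten" (6 characters) and "sitting" (7 characters), this gives 1 - 3/7, roughly 0.571.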
Function: soerensen-dice-coefficient
Source: soerensen-dice.lisp (file)
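This entry carries no description, so the sketch below illustrates the Soerensen-Dice coefficient as it is commonly applied to strings: twice the number of shared character bigrams divided by the total number of bigrams in both strings. Using bigrams, counting repeated bigrams as a multiset, and the behavior for strings shorter than two characters are all assumptions; the library's choices may differ.

```python
from collections import Counter


def bigrams(s: str) -> Counter:
    """Multiset of adjacent character pairs in s."""
    return Counter(s[i:i + 2] for i in range(len(s) - 1))


def dice_coefficient(s1: str, s2: str) -> float:
    """Soerensen-Dice coefficient over character bigrams, in the range 0.0 to 1.0."""
    b1, b2 = bigrams(s1), bigrams(s2)
    total = sum(b1.values()) + sum(b2.values())
    if total == 0:
        # Assumption: with no bigrams at all, fall back to plain equality.
        return 1.0 if s1 == s2 else 0.0
    # Counter intersection keeps the minimum count of each shared bigram.
    overlap = sum((b1 & b2).values())
    return 2 * overlap / total
```

For "night" and "nacht" the only shared bigram is "ht", out of four bigrams in each string, so the coefficient is 2 * 1 / 8 = 0.25.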
Internal definitions
• Internal functions
soerensen-dice.lisp (file)
jaro-winkler.lisp (file)
jaro-winkler.lisp (file)
soerensen-dice.lisp (file)
Indexes
• Concept index
• Function index
• Variable index
• Data type index