Levenshtein is a Python C extension and C library that computes Levenshtein string distance, string similarity, normal and generalized string set medians, and other related quantities.
| Tags | Scientific/Engineering, Software Development, Libraries, Python Modules, Text Processing |
|---|---|
| Licenses | GPL |
| Operating Systems | OS Independent |
| Implementation | C, Python |
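
For readers unfamiliar with the quantity named in the description, the following is a minimal pure-Python sketch of Levenshtein distance using the standard two-row dynamic-programming approach. It is an illustration only: the function name `levenshtein_distance` is hypothetical, and this is not the library's C implementation or its API.

```python
def levenshtein_distance(a, b):
    """Return the minimum number of single-character insertions,
    deletions, and substitutions needed to turn a into b.

    Illustrative sketch; the name is an assumption, not the library's API.
    """
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a

    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))

    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current

    return previous[-1]
```

For example, `levenshtein_distance("kitten", "sitting")` evaluates to 3: two substitutions and one insertion.
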
Recent releases


Changes: Broken apply_edit() for Unicode was fixed, and a subtract_edit() function was added.


Changes: Hamming distance and Jaro and Jaro-Winkler string similarity metrics were added. Functions raising ValueErrors on bad types were fixed to raise TypeErrors.
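
As a rough illustration of the Hamming metric mentioned in this entry, and of the difference between a bad type and a bad value, here is a minimal pure-Python sketch. The function name `hamming_distance` and the choice to raise `ValueError` for unequal lengths are assumptions made for this example, not a description of the extension's actual behavior.

```python
def hamming_distance(a, b):
    """Count the positions at which two equal-length strings differ.

    Illustrative sketch; the name and error policy are assumptions.
    """
    # A wrong argument type is a TypeError, not a ValueError.
    if not isinstance(a, str) or not isinstance(b, str):
        raise TypeError("hamming_distance() expects two strings")

    # Hamming distance is only defined for strings of the same length.
    if len(a) != len(b):
        raise ValueError("strings must have the same length")

    # Booleans sum as 0/1, so this counts the mismatching positions.
    return sum(x != y for x, y in zip(a, b))
```

For example, `hamming_distance("karolin", "kathrin")` evaluates to 3.
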


Changes: A poor-but-fast generalized median method quickmedian() was added. Several new auxiliary functions are exported in the C API.


Changes: Compilation problems with non-GCC compilers were fixed.


Changes: A new function for improving generalized median strings was added. An arbitrary length limit imposed on greedy median results was removed. "Out of memory" errors should now be handled more gracefully.