commit | ac5bd01dc27857069cf0212b3307442418eb67e8 | [log] [tgz] |
---|---|---|
author | James Turk <dev@jamesturk.net> | Sun Mar 26 18:40:34 2023 -0500 |
committer | James Turk <dev@jamesturk.net> | Sun Mar 26 18:40:34 2023 -0500 |
tree | efffbd8346505e9086f9a23afdf8a129ae9dc7f8 | |
parent | 48dc2c1fc391001d47a1276f0ec12573b5cafcb7 [diff] |
damerau: use a suggested improvement, a flattened array, 10% speedup
jellyfish is a library for approximate & phonetic matching of strings.
Source: https://github.com/jamesturk/jellyfish
Documentation: https://jamesturk.github.io/jellyfish/
Issues: https://github.com/jamesturk/jellyfish/issues
String comparison:
Phonetic encoding:
>>> import jellyfish >>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish') 2 >>> jellyfish.jaro_distance('jellyfish', 'smellyfish') 0.89629629629629637 >>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs') 1 >>> jellyfish.metaphone('Jellyfish') 'JLFX' >>> jellyfish.soundex('Jellyfish') 'J412' >>> jellyfish.nysiis('Jellyfish') 'JALYF' >>> jellyfish.match_rating_codex('Jellyfish') 'JLLFSH'