commit | d9bd5996b2318bf8385027c3d2e3a35f5ddc4d73 | |
---|---|---|
author | James Turk <dev@jamesturk.net> | Sun Mar 26 18:03:36 2023 -0500 |
committer | James Turk <dev@jamesturk.net> | Sun Mar 26 18:03:36 2023 -0500 |
tree | ecf31aed36fbacc6aa5c0de5db9d7f5461d24f9c | |
parent | 149e9421c04a6df94c91fe57f5be3b2e29ba00f8 | |
more FastVec usage, with benchmarks to measure time improvement
jellyfish is a library for approximate & phonetic matching of strings.
Source: https://github.com/jamesturk/jellyfish
Documentation: https://jamesturk.github.io/jellyfish/
Issues: https://github.com/jamesturk/jellyfish/issues
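String comparison: Levenshtein distance, Damerau-Levenshtein distance, Jaro distance, Jaro-Winkler distance, Match Rating Approach comparison, Hamming distance.
Phonetic encoding: American Soundex, Metaphone, NYSIIS (New York State Identification and Intelligence System), Match Rating Codex.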
>>> import jellyfish
>>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
2
>>> jellyfish.jaro_distance('jellyfish', 'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
1
>>> jellyfish.metaphone('Jellyfish')
'JLFX'
>>> jellyfish.soundex('Jellyfish')
'J412'
>>> jellyfish.nysiis('Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex('Jellyfish')
'JLLFSH'
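
To make the first result above concrete, here is a minimal pure-Python sketch of the classic dynamic-programming Levenshtein distance. It is an illustration only, not jellyfish's own implementation, and the function name `levenshtein` is a hypothetical stand-in for `jellyfish.levenshtein_distance`.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the insertions, deletions and substitutions needed to turn a into b.

    Illustrative sketch only; not the code jellyfish ships.
    """
    # prev[j] is the distance between the prefix of `a` processed so far and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(
                prev[j] + 1,         # delete ca
                cur[j - 1] + 1,      # insert cb
                prev[j - 1] + cost,  # substitute ca with cb, or keep a match
            ))
        prev = cur
    return prev[-1]


print(levenshtein("jellyfish", "smellyfish"))  # prints 2
```

Turning 'jellyfish' into 'smellyfish' takes one insertion ('s') and one substitution ('j' to 'm'), which matches the distance of 2 reported above.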