commit | 149e9421c04a6df94c91fe57f5be3b2e29ba00f8 | [log] [tgz] |
---|---|---|
author | James Turk <dev@jamesturk.net> | Sun Mar 26 17:40:00 2023 -0500 |
committer | James Turk <dev@jamesturk.net> | Sun Mar 26 17:40:00 2023 -0500 |
tree | ba0e6d566e668f1da1dfdb316f85ff6eeb4ea292 | |
parent | 6373b9e14b7e6fec522896c65c9c8874fe3a7a22 [diff] |
add FastVec usage to levenshtein: from 5.33us to 2.70us (C was 1.55us)
jellyfish is a library for approximate & phonetic matching of strings.
Source: https://github.com/jamesturk/jellyfish
Documentation: https://jamesturk.github.io/jellyfish/
Issues: https://github.com/jamesturk/jellyfish/issues
String comparison:
Phonetic encoding:
>>> import jellyfish >>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish') 2 >>> jellyfish.jaro_distance('jellyfish', 'smellyfish') 0.89629629629629637 >>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs') 1 >>> jellyfish.metaphone('Jellyfish') 'JLFX' >>> jellyfish.soundex('Jellyfish') 'J412' >>> jellyfish.nysiis('Jellyfish') 'JALYF' >>> jellyfish.match_rating_codex('Jellyfish') 'JLLFSH'