commit | d94848dd1f659f8c9e195629ee90ab33dc1b6dfc | [log] [tgz] |
---|---|---|
author | James Turk <dev@jamesturk.net> | Sun Mar 26 19:46:28 2023 -0500 |
committer | James Turk <dev@jamesturk.net> | Sun Mar 26 19:46:28 2023 -0500 |
tree | 97cf211288b94f1251ee8965faef8b1ef028b50a | |
parent | 89399df155deef1f7062b1a2324f476d283a11d9 [diff] |
update testdata with broken Unicode
jellyfish is a library for approximate & phonetic matching of strings.
Source: https://github.com/jamesturk/jellyfish
Documentation: https://jamesturk.github.io/jellyfish/
Issues: https://github.com/jamesturk/jellyfish/issues
String comparison:
Phonetic encoding:
>>> import jellyfish >>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish') 2 >>> jellyfish.jaro_distance('jellyfish', 'smellyfish') 0.89629629629629637 >>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs') 1 >>> jellyfish.metaphone('Jellyfish') 'JLFX' >>> jellyfish.soundex('Jellyfish') 'J412' >>> jellyfish.nysiis('Jellyfish') 'JALYF' >>> jellyfish.match_rating_codex('Jellyfish') 'JLLFSH'