commit | b7bbeddbf1c5342629449154e272f60ba329c3b1 | |
---|---|---|
author | Julian Gilbey <julian-git@d-and-j.net> | Sun Jan 02 11:47:04 2022 +0000 |
committer | Julian Gilbey <julian-git@d-and-j.net> | Sun Jan 02 11:47:04 2022 +0000 |
tree | 9a37b93a55ca5862a0bc02462bbb81f8daf19d0e | |
parent | e1be2f9055c698ba9e89c588b7ac321f8ff540b1 |
Handle spaces correctly in MRA algorithm
jellyfish is a library for approximate & phonetic matching of strings.
Source: https://github.com/jamesturk/jellyfish
Documentation: https://jamesturk.github.io/jellyfish/
Issues: https://github.com/jamesturk/jellyfish/issues
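String comparison algorithms include Levenshtein distance, Damerau-Levenshtein distance, and Jaro distance.
Phonetic encoding algorithms include Metaphone, Soundex, NYSIIS, and the Match Rating Approach (MRA) codex.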
>>> import jellyfish
>>> jellyfish.levenshtein_distance(u'jellyfish', u'smellyfish')
2
>>> jellyfish.jaro_distance(u'jellyfish', u'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance(u'jellyfish', u'jellyfihs')
1
>>> jellyfish.metaphone(u'Jellyfish')
'JLFX'
>>> jellyfish.soundex(u'Jellyfish')
'J412'
>>> jellyfish.nysiis(u'Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex(u'Jellyfish')
'JLLFSH'
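To make the first result in the session above less opaque, here is a minimal pure-Python sketch of the textbook Levenshtein edit distance. It is an illustration only and is not taken from jellyfish's source; the function name `levenshtein` is hypothetical, and the library's own implementation may differ in structure and performance.

```python
def levenshtein(a: str, b: str) -> int:
    """Illustrative edit distance (insert, delete, substitute), two-row DP."""
    # previous[j] is the distance between the prefix of `a` processed so far
    # (initially the empty prefix) and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        # Turning a[:i] into the empty string takes i deletions.
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca from a
                current[j - 1] + 1,      # insert cb into a
                previous[j - 1] + cost,  # substitute ca with cb, or match
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    print(levenshtein("jellyfish", "smellyfish"))  # one insertion + one substitution
    print(levenshtein("jellyfish", "jellyfihs"))   # a swap costs two substitutions here
```

Plain Levenshtein counts the swapped `sh`/`hs` as two edits, which is why the session above reports 1 for the same pair under `damerau_levenshtein_distance`: that variant treats an adjacent transposition as a single operation.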