commit | 54f2c425291a283fb415e47a952b740bbe8637a8 | [log] [tgz] |
---|---|---|
author | James Turk <dev@jamesturk.net> | Mon Jan 03 15:10:00 2022 -0500 |
committer | GitHub <noreply@github.com> | Mon Jan 03 15:10:00 2022 -0500 |
tree | 6870039b002bdd8e61536a3f7b6f681b59a7e21c | |
parent | 1d9f0476e1a5c6767dbac2c5e6e66dd5957cc484 [diff] | |
parent | b7bbeddbf1c5342629449154e272f60ba329c3b1 [diff] |
Merge pull request #158 from juliangilbey/fix-mra-algorithm Handle spaces correctly in MRA algorithm
jellyfish is a library for approximate & phonetic matching of strings.
Source: https://github.com/jamesturk/jellyfish
Documentation: https://jamesturk.github.io/jellyfish/
Issues: https://github.com/jamesturk/jellyfish/issues
String comparison:
Phonetic encoding:
>>> import jellyfish >>> jellyfish.levenshtein_distance(u'jellyfish', u'smellyfish') 2 >>> jellyfish.jaro_distance(u'jellyfish', u'smellyfish') 0.89629629629629637 >>> jellyfish.damerau_levenshtein_distance(u'jellyfish', u'jellyfihs') 1 >>> jellyfish.metaphone(u'Jellyfish') 'JLFX' >>> jellyfish.soundex(u'Jellyfish') 'J412' >>> jellyfish.nysiis(u'Jellyfish') 'JALYF' >>> jellyfish.match_rating_codex(u'Jellyfish') 'JLLFSH'