A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to polisher in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
polisher (0) - 1 freq
polished (1) - 39 freq
pisher (2) - 1 freq
plister (2) - 1 freq
polismen (2) - 2 freq
publisher (2) - 10 freq
polisht (2) - 5 freq
polyshen (2) - 1 freq
polishin (2) - 10 freq
poalished (2) - 1 freq
posher (2) - 1 freq
polish (2) - 70 freq
pishes (3) - 2 freq
publishers (3) - 16 freq
notished (3) - 1 freq
oliver (3) - 10 freq
plitter (3) - 1 freq
policie (3) - 23 freq
poleeshed (3) - 1 freq
plish (3) - 2 freq
plaster (3) - 14 freq
kosher (3) - 1 freq
'offisher (3) - 1 freq
pooshen (3) - 1 freq
clister (3) - 1 freq
polisher (0) - 1 freq
polished (2) - 39 freq
polishin (3) - 10 freq
poalished (3) - 1 freq
pleesher (3) - 2 freq
polyshen (3) - 1 freq
polish (3) - 70 freq
posher (3) - 1 freq
polisht (3) - 5 freq
pisher (3) - 1 freq
plister (3) - 1 freq
publisher (3) - 10 freq
cleisher (4) - 1 freq
publishar (4) - 1 freq
flesher (4) - 5 freq
plaister (4) - 20 freq
plashed (4) - 1 freq
plashes (4) - 1 freq
plester (4) - 2 freq
pusher (4) - 3 freq
poailshed (4) - 1 freq
pilsner (4) - 1 freq
plaster (4) - 14 freq
preesher (4) - 5 freq
polismen (4) - 2 freq
SoundEx code - P426
playgrund - 29 freq
pilgrim's - 7 freq
pleasure - 74 freq
pleisures - 4 freq
plaisure - 2 freq
pleesure - 23 freq
plooshares - 1 freq
pilgrimage - 12 freq
pleasures - 11 freq
pleisur - 33 freq
playgroup - 4 freq
pleisure - 60 freq
pleyscrievin - 2 freq
playgrun - 17 freq
playgruns - 1 freq
pilgrims - 16 freq
pilgrim - 14 freq
pleisurit - 1 freq
pleesher - 2 freq
playgroond - 11 freq
pleisour - 1 freq
play-gruns - 1 freq
playground - 6 freq
polygraph - 1 freq
pleesuir - 2 freq
pleesuirs - 2 freq
pleesures - 3 freq
pliesjir - 1 freq
plagiarist - 1 freq
plaisir - 3 freq
plaesur - 2 freq
ploushare - 2 freq
plooshare - 2 freq
pleasour - 1 freq
pilgrimer - 4 freq
pluscarden - 1 freq
pleisurable - 1 freq
pilgrimers - 2 freq
pleisir - 3 freq
plagiarisin - 1 freq
playgrunn - 2 freq
pleisur-snowker - 1 freq
plagiarism - 2 freq
plei-sured - 1 freq
pleesurin - 1 freq
pilgremer - 1 freq
pleygroup - 1 freq
pleisured - 1 freq
pleisurs - 1 freq
pleygroups - 1 freq
policework - 1 freq
placards - 2 freq
placard - 2 freq
pleygrund - 1 freq
pleygrun - 1 freq
playgroun - 2 freq
pylqzqr - 1 freq
pauljcorrigan - 1 freq
pilchard - 1 freq
polisher - 1 freq
paulgardinerdj - 1 freq
plucker - 1 freq
MetaPhone code - PLXR
pleesher - 2 freq
ploushare - 2 freq
plooshare - 2 freq
polisher - 1 freq
POLISHER
Time to execute Levenshtein function - 0.320293 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.609592 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030859 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.046441 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000875 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.