A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to amymacleanpod in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
amymacleanpod (0) - 1 freq
maclean's (6) - 4 freq
robmacleansport (6) - 2 freq
alexmacleod (6) - 1 freq
macleod (6) - 11 freq
macleans (6) - 1 freq
dry-cleaned (6) - 1 freq
maclean (6) - 15 freq
cleaned (7) - 35 freq
johnmacleanma (7) - 2 freq
shivmaclean (7) - 1 freq
cleansed (7) - 5 freq
manacled (7) - 1 freq
amscanlon (7) - 1 freq
amalkadog (7) - 20 freq
mcleanÂ’s (7) - 1 freq
macleoid (7) - 1 freq
ruamaclennan (7) - 1 freq
anclapped (7) - 1 freq
mclean's (7) - 1 freq
mclean (7) - 13 freq
americanos (7) - 1 freq
amyspineapple (7) - 1 freq
mcleod (7) - 10 freq
americano (7) - 2 freq
amymacleanpod (0) - 1 freq
maclean (9) - 15 freq
macleod (9) - 11 freq
macleans (9) - 1 freq
maclean's (9) - 4 freq
anclapped (10) - 1 freq
macleoid (10) - 1 freq
mclean's (10) - 1 freq
mcleod (10) - 10 freq
mclean (10) - 13 freq
manacled (10) - 1 freq
ruamaclennan (10) - 1 freq
dry-cleaned (10) - 1 freq
cleansed (10) - 5 freq
alexmacleod (10) - 1 freq
cleaned (10) - 35 freq
immaculately (11) - 1 freq
clanked (11) - 1 freq
moorlaand (11) - 1 freq
clamped (11) - 7 freq
merkland (11) - 4 freq
monicalennon (11) - 2 freq
mossland (11) - 1 freq
sclanced (11) - 2 freq
emmaissandy (11) - 1 freq
SoundEx code - A552
amang - 699 freq
amangst - 40 freq
among - 128 freq
amongst - 62 freq
anyhing - 33 freq
announcement - 10 freq
announces - 12 freq
announced - 33 freq
announcer's - 1 freq
aming - 3 freq
anyone's - 2 freq
annoyance - 12 freq
announcements - 5 freq
annuncee - 1 freq
amaing - 1 freq
amencg - 1 freq
annoyingly - 2 freq
amung - 3 freq
annunciation - 1 freq
announcer - 1 freq
aaaamang - 1 freq
annoying - 11 freq
annoonce - 4 freq
'among - 1 freq
annoyince - 2 freq
amungst - 2 freq
anooncements - 2 freq
announce - 7 freq
annoonced - 9 freq
announcing - 4 freq
anoonced - 2 freq
ananias - 2 freq
amangits - 1 freq
annunced - 2 freq
annooncement - 2 freq
anyhin's - 1 freq
annooncet - 1 freq
animacy - 1 freq
announcers - 1 freq
€˜among - 1 freq
amangat - 1 freq
annuncement - 1 freq
annuncit - 1 freq
announcin - 4 freq
annoonces - 1 freq
annooncements - 1 freq
annooncit - 2 freq
anenst - 52 freq
-anenst - 1 freq
anaemic - 1 freq
annooncin - 3 freq
anoonce - 1 freq
amunsgt - 1 freq
€˜amang - 1 freq
€™anyhing - 1 freq
amymacleanpod - 1 freq
annemclaughlin - 8 freq
anumqaisarjaved - 1 freq
MetaPhone code - AMMKLNPT
amymacleanpod - 1 freq
AMYMACLEANPOD
Time to execute Levenshtein function - 0.563652 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.584073 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030617 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.049548 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000904 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.