A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to purim in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
purim (0) - 1 freq
prim (1) - 2 freq
purrin (2) - 20 freq
purrs (2) - 3 freq
uri (2) - 7 freq
paris (2) - 52 freq
peri (2) - 2 freq
prima (2) - 7 freq
yuri (2) - 1 freq
aurum (2) - 1 freq
pur (2) - 5 freq
scrim (2) - 1 freq
furm (2) - 20 freq
wurm (2) - 1 freq
grim (2) - 53 freq
ferim (2) - 1 freq
purist (2) - 3 freq
pubic (2) - 1 freq
rim (2) - 14 freq
wurid (2) - 1 freq
puil (2) - 37 freq
eurig (2) - 1 freq
porin (2) - 1 freq
puin (2) - 6 freq
purity (2) - 5 freq
purim (0) - 1 freq
prim (1) - 2 freq
prima (2) - 7 freq
pirm (2) - 1 freq
prom (2) - 8 freq
perm (2) - 3 freq
prum (2) - 1 freq
pram (2) - 18 freq
prime (2) - 36 freq
jorim (3) - 3 freq
purk (3) - 1 freq
puir (3) - 548 freq
primp (3) - 5 freq
pum (3) - 3 freq
purpie (3) - 23 freq
purse (3) - 38 freq
penim (3) - 1 freq
pirie (3) - 5 freq
trim (3) - 16 freq
poyim (3) - 3 freq
pursie (3) - 2 freq
purn (3) - 1 freq
purl (3) - 3 freq
parami (3) - 4 freq
frim (3) - 1 freq
SoundEx code - P650
parin - 4 freq
prim - 2 freq
preen - 44 freq
peeryin - 1 freq
poorin - 45 freq
premnay - 1 freq
prima - 7 freq
purrin - 20 freq
peerin - 34 freq
prom - 8 freq
prayin - 64 freq
pram - 18 freq
pourin - 23 freq
paranoia - 10 freq
powerin - 3 freq
pairin' - 1 freq
preen' - 1 freq
poorin' - 3 freq
preyin - 3 freq
prime - 36 freq
perm - 3 freq
pryin - 1 freq
preein - 11 freq
purim - 1 freq
pooren - 2 freq
peeren - 3 freq
prune - 5 freq
prone - 11 freq
'prawn - 1 freq
prawn - 3 freq
praan - 4 freq
porno - 2 freq
purn - 1 freq
prn - 1 freq
pierian - 1 freq
prayan - 2 freq
prein - 1 freq
pooran - 1 freq
porny - 1 freq
porin - 1 freq
prum - 1 freq
peeran - 2 freq
pirn - 8 freq
pæprin - 1 freq
pirm - 1 freq
purran - 3 freq
pran - 1 freq
'preein' - 1 freq
príe-in - 1 freq
'preen - 1 freq
pranny - 19 freq
preen- - 1 freq
pooerin - 1 freq
pourin' - 1 freq
prayin' - 1 freq
pron - 1 freq
pyran - 1 freq
pernee - 1 freq
porn - 2 freq
promo - 2 freq
perrin - 2 freq
parami - 4 freq
prma - 1 freq
MetaPhone code - PRM
prim - 2 freq
prima - 7 freq
prom - 8 freq
pram - 18 freq
prime - 36 freq
perm - 3 freq
purim - 1 freq
prum - 1 freq
pirm - 1 freq
promo - 2 freq
parami - 4 freq
prma - 1 freq
PURIM
Time to execute Levenshtein function - 0.200717 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.346853 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027574 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037411 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000923 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.