A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dullion in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dullion (0) - 1 freq
bullion (1) - 1 freq
dullin (1) - 2 freq
mullion (1) - 1 freq
rullion (1) - 3 freq
dillion (1) - 3 freq
pullin (2) - 107 freq
full-on (2) - 5 freq
million (2) - 156 freq
mullin (2) - 3 freq
dillon (2) - 2 freq
dunlin (2) - 2 freq
zillion (2) - 1 freq
dullies (2) - 2 freq
scullion (2) - 2 freq
hallion (2) - 16 freq
wullin (2) - 23 freq
dublin (2) - 29 freq
pallion (2) - 1 freq
pillion (2) - 1 freq
llion (2) - 57 freq
duellin (2) - 2 freq
dellin (2) - 5 freq
billion (2) - 34 freq
kullin (2) - 1 freq
dullion (0) - 1 freq
dullin (1) - 2 freq
dillion (1) - 3 freq
dillon (2) - 2 freq
duellin (2) - 2 freq
mullion (2) - 1 freq
bullion (2) - 1 freq
rullion (2) - 3 freq
dellin (2) - 5 freq
llion (3) - 57 freq
pillion (3) - 1 freq
pallion (3) - 1 freq
diallin (3) - 4 freq
dillan (3) - 1 freq
dublin (3) - 29 freq
fullin (3) - 29 freq
kullin (3) - 1 freq
billion (3) - 34 freq
mullin (3) - 3 freq
wullin (3) - 23 freq
pullin (3) - 107 freq
dunlin (3) - 2 freq
million (3) - 156 freq
dullies (3) - 2 freq
hallion (3) - 16 freq
SoundEx code - D450
dealin - 30 freq
daelin - 6 freq
dillion - 3 freq
dylan - 15 freq
dwallin - 15 freq
dullin - 2 freq
dileema - 1 freq
delayin - 3 freq
diallin - 4 freq
dwellin - 4 freq
dilemma - 10 freq
dullion - 1 freq
dial-an - 1 freq
dolin - 1 freq
dillan - 1 freq
deelin - 1 freq
dellin - 5 freq
dalin - 4 freq
delaney - 10 freq
duellin - 2 freq
dillon - 2 freq
diallan - 1 freq
dailin - 1 freq
dlooney - 1 freq
dalhanna - 1 freq
dwlen - 1 freq
MetaPhone code - TLN
tellin - 611 freq
dealin - 30 freq
daelin - 6 freq
dillion - 3 freq
dylan - 15 freq
toilin - 2 freq
toilan - 1 freq
dullin - 2 freq
tillin - 3 freq
tellen - 3 freq
tellin' - 6 freq
diallin - 4 freq
tail-en - 2 freq
dullion - 1 freq
dial-an - 1 freq
dolin - 1 freq
tail-an - 1 freq
tellan - 13 freq
telleen - 1 freq
dillan - 1 freq
deelin - 1 freq
dellin - 5 freq
dalin - 4 freq
toulon - 1 freq
delaney - 10 freq
duellin - 2 freq
tailen - 1 freq
dillon - 2 freq
diallan - 1 freq
tailin - 1 freq
telllin - 1 freq
dailin - 1 freq
€˜taliani - 1 freq
toolin - 1 freq
dlooney - 1 freq
tail-eyne - 1 freq
dwlen - 1 freq
tellinÂ’ - 1 freq
DULLION
Time to execute Levenshtein function - 0.210418 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.566679 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.043861 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039438 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000953 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.