A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to title in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
title (0) - 65 freq
title' (1) - 1 freq
tistle (1) - 3 freq
titles (1) - 13 freq
tile (1) - 5 freq
tithe (1) - 4 freq
titlet (1) - 1 freq
titled (1) - 2 freq
tite (1) - 2 freq
oiyle (2) - 1 freq
tntae (2) - 1 freq
taikle (2) - 1 freq
aisle (2) - 27 freq
aizle (2) - 1 freq
aite (2) - 4 freq
little (2) - 421 freq
pitie (2) - 1 freq
aigle (2) - 4 freq
fite (2) - 199 freq
titch (2) - 5 freq
ainle (2) - 2 freq
bile (2) - 56 freq
tiefe (2) - 1 freq
cite (2) - 2 freq
tipple (2) - 3 freq
title (0) - 65 freq
ettle (2) - 190 freq
eittle (2) - 1 freq
tetley (2) - 1 freq
taitl (2) - 1 freq
teetle (2) - 10 freq
tite (2) - 2 freq
tootle (2) - 1 freq
titled (2) - 2 freq
title' (2) - 1 freq
tistle (2) - 3 freq
titles (2) - 13 freq
tile (2) - 5 freq
tithe (2) - 4 freq
titlet (2) - 1 freq
titi (3) - 1 freq
tytler (3) - 1 freq
mittle (3) - 1 freq
tito (3) - 2 freq
thule (3) - 5 freq
tale (3) - 300 freq
tanle (3) - 1 freq
tattle (3) - 3 freq
tele (3) - 32 freq
httle (3) - 1 freq
SoundEx code - T340
that'll - 151 freq
totally - 121 freq
toatlly - 1 freq
total - 98 freq
that''ll - 1 freq
title - 65 freq
teetle - 10 freq
tattle - 3 freq
'that'll - 9 freq
toodle - 2 freq
tweeddale - 4 freq
tod-hole - 3 freq
toddle - 3 freq
tidal - 9 freq
twiddle - 1 freq
thit'll - 4 freq
tottle - 1 freq
tetley - 1 freq
tootle - 1 freq
tea-towel - 3 freq
title' - 1 freq
tiddely - 11 freq
'tiddely - 2 freq
tod'll - 1 freq
'total - 1 freq
twiddly - 1 freq
twaddle - 1 freq
teitil - 6 freq
tweedale - 2 freq
tiddle - 1 freq
tidily - 1 freq
totallie - 1 freq
tweedle - 3 freq
tweddle - 1 freq
tiddly - 1 freq
totalee - 1 freq
taitl - 1 freq
thuathail' - 1 freq
that’ll - 1 freq
totalleo - 1 freq
toodleoo - 1 freq
thatll - 5 freq
MetaPhone code - TTL
doddle - 4 freq
totally - 121 freq
toatlly - 1 freq
total - 98 freq
daidle - 2 freq
detail - 43 freq
daudle - 1 freq
dau-dle - 1 freq
title - 65 freq
deadly - 18 freq
deidly - 19 freq
teetle - 10 freq
doodle - 10 freq
'deadly - 1 freq
dtll - 1 freq
tattle - 3 freq
dottle - 1 freq
toodle - 2 freq
toddle - 3 freq
tidal - 9 freq
dawdle - 10 freq
deedly - 2 freq
tottle - 1 freq
dettol - 1 freq
deidlie - 7 freq
tetley - 1 freq
daadle - 1 freq
tootle - 1 freq
dad'll - 2 freq
dat'll - 10 freq
title' - 1 freq
tiddely - 11 freq
'tiddely - 2 freq
diddle - 14 freq
tod'll - 1 freq
'total - 1 freq
deedlie - 1 freq
deedle - 2 freq
dudley - 11 freq
teitil - 6 freq
tiddle - 1 freq
d'italia - 1 freq
tidily - 1 freq
totallie - 1 freq
tiddly - 1 freq
totalee - 1 freq
taitl - 1 freq
diddly - 1 freq
dat’ll - 2 freq
totalleo - 1 freq
toodleoo - 1 freq
dettel - 1 freq
TITLE
Time to execute Levenshtein function - 0.385605 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.406573 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028413 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040124 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000943 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.