A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to unspokken in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
unspokken (0) - 1 freq
unspoken (1) - 5 freq
unspoke (2) - 1 freq
spokken (2) - 47 freq
b'spikken (3) - 1 freq
unbroken (3) - 4 freq
spokkan (3) - 1 freq
unsportin (3) - 1 freq
unlokket (3) - 1 freq
spikken (3) - 15 freq
on-spoken (3) - 1 freq
slokken (3) - 1 freq
spokkin (3) - 1 freq
spukken (3) - 10 freq
spocken (3) - 3 freq
spakken (3) - 2 freq
sokken (3) - 1 freq
spoken (3) - 217 freq
unstickan (4) - 1 freq
nokket (4) - 2 freq
unseen (4) - 29 freq
pouken (4) - 1 freq
soaken (4) - 2 freq
unpickin (4) - 1 freq
smokkin (4) - 3 freq
unspokken (0) - 1 freq
unspoken (2) - 5 freq
spokken (3) - 47 freq
spokkin (4) - 1 freq
spukken (4) - 10 freq
spikken (4) - 15 freq
spakken (4) - 2 freq
spokkan (4) - 1 freq
unspoke (4) - 1 freq
sokken (5) - 1 freq
spoken (5) - 217 freq
unsportin (5) - 1 freq
spikkin (5) - 189 freq
spakkin (5) - 1 freq
spocken (5) - 3 freq
spikkan (5) - 4 freq
b'spikken (5) - 1 freq
on-spoken (5) - 1 freq
slokken (5) - 1 freq
stukken (6) - 7 freq
pikken (6) - 6 freq
spokan (6) - 3 freq
sppken (6) - 1 freq
unspun (6) - 1 freq
spaken (6) - 4 freq
SoundEx code - U521
unexpeckit - 10 freq
unexpected - 16 freq
unexpectit - 9 freq
unceevil - 1 freq
unsavoury - 1 freq
unspelt - 1 freq
unexpectedly - 4 freq
unexpectitly - 3 freq
unspecified - 6 freq
unspoken - 5 freq
uncivil - 2 freq
unspeakable - 2 freq
uncouple - 1 freq
unsupervised - 1 freq
unspoke - 1 freq
'unacceptable - 1 freq
uncoupled - 1 freq
unsafe - 2 freq
unspret - 1 freq
unspaekable - 2 freq
unsoupled - 1 freq
unspared - 1 freq
unsubstantiate - 1 freq
unspaed - 1 freq
unspokken - 1 freq
unspun - 1 freq
ungebrochnen - 1 freq
unexpeckitlie - 2 freq
unsportin - 1 freq
unspikkable - 1 freq
€œungava - 1 freq
unexpress - 1 freq
unkept - 2 freq
uncovered - 2 freq
unship - 1 freq
unshiftable - 1 freq
unexpectit-like - 1 freq
unexplained - 1 freq
unmissable - 2 freq
unseparably - 1 freq
uynsieboke - 1 freq
unexplainedpod - 1 freq
unacceptable - 1 freq
MetaPhone code - UNSPKN
unspoken - 5 freq
unspokken - 1 freq
UNSPOKKEN
Time to execute Levenshtein function - 0.374372 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.549439 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.037066 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.048924 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001156 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.