A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sandollar in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sandollar (0) - 2 freq
dollar (3) - 12 freq
saiddlar (3) - 1 freq
sammillar (3) - 1 freq
mandolins (4) - 1 freq
randomly (4) - 3 freq
man-o-war (4) - 2 freq
sandals (4) - 17 freq
sanctuar (4) - 5 freq
sandiloos (4) - 1 freq
sandpaper (4) - 3 freq
sandhole (4) - 3 freq
sanquhar (4) - 9 freq
ayallar (4) - 4 freq
sadler (4) - 1 freq
sandras (4) - 1 freq
sandiloo (4) - 1 freq
singular (4) - 33 freq
ancillary (4) - 1 freq
iancoll (4) - 1 freq
randolph (4) - 1 freq
mandolin (4) - 14 freq
dollars (4) - 7 freq
angular (4) - 1 freq
andorian (4) - 1 freq
sandollar (0) - 2 freq
saiddlar (5) - 1 freq
sammillar (5) - 1 freq
dollar (5) - 12 freq
handler (6) - 1 freq
sandal (6) - 1 freq
saidill (6) - 1 freq
sillar (6) - 3 freq
randall (6) - 7 freq
sandiloch (6) - 1 freq
sandle (6) - 1 freq
sandra (6) - 51 freq
sandancer (6) - 3 freq
schullar (6) - 1 freq
candill (6) - 1 freq
sander (6) - 1 freq
ancillary (6) - 1 freq
sandier (6) - 2 freq
sellar (6) - 15 freq
sandhole (6) - 3 freq
sandiloo (6) - 1 freq
sandpaper (6) - 3 freq
sandiloos (6) - 1 freq
sandals (6) - 17 freq
singular (6) - 33 freq
SoundEx code - S534
soondless - 2 freq
scandalised - 2 freq
soundly - 3 freq
sandyhills - 3 freq
scandal - 10 freq
smittle - 3 freq
smuithely - 1 freq
soundhole - 2 freq
sauntlik - 1 freq
smoothly - 5 freq
sandals - 17 freq
scandalous - 1 freq
smithhill - 1 freq
some'dy'll - 1 freq
sun'tl - 1 freq
saintlie - 1 freq
scantlbury - 1 freq
seendil - 2 freq
sentle - 1 freq
sandiloch - 1 freq
sundial - 1 freq
smoothlie - 1 freq
sindle - 1 freq
sauntlie - 1 freq
scandalizin - 1 freq
smittal - 6 freq
sun-dial - 1 freq
saandiloos - 1 freq
scantlins - 1 freq
skantlins - 1 freq
sandhole - 3 freq
smuithlik - 1 freq
snodly - 1 freq
sandle - 1 freq
soundless - 1 freq
smuithly - 2 freq
soundlessly - 1 freq
scantily - 1 freq
scintillating - 1 freq
sandollar - 2 freq
sandal - 1 freq
somewhataldente - 2 freq
sandiloos - 1 freq
sandiloo - 1 freq
scandals - 1 freq
samatlounge - 1 freq
swindells - 1 freq
MetaPhone code - SNTLR
sandollar - 2 freq
SANDOLLAR
Time to execute Levenshtein function - 0.616534 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.725043 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.065482 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039710 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000885 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.