A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sonjahern in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sonjahern (0) - 1 freq
southern (3) - 16 freq
soothern (3) - 1 freq
sinnahard (4) - 1 freq
snichert (4) - 2 freq
sootherin (4) - 7 freq
snocher (4) - 6 freq
consarn (4) - 6 freq
southers (4) - 1 freq
'onywhere (4) - 1 freq
sojier (4) - 1 freq
snocherin (4) - 7 freq
sojourn (4) - 1 freq
soajer (4) - 1 freq
sniggern (4) - 9 freq
sodjers (4) - 23 freq
sojer (4) - 23 freq
sojers (4) - 54 freq
southert (4) - 1 freq
soaken (4) - 2 freq
sowthert (4) - 2 freq
concern (4) - 38 freq
ongaein (4) - 4 freq
southeran (4) - 1 freq
sowther (4) - 5 freq
sonjahern (0) - 1 freq
snocherin (5) - 7 freq
snicherin (5) - 1 freq
snicheran (5) - 1 freq
snicheren (5) - 2 freq
southern (5) - 16 freq
soothern (5) - 1 freq
southeran (6) - 1 freq
snochert (6) - 1 freq
sowtherin (6) - 2 freq
sunsheen (6) - 6 freq
sunshein (6) - 1 freq
sniggern (6) - 9 freq
snashers (6) - 1 freq
snicher (6) - 3 freq
sootherin (6) - 7 freq
snocher (6) - 6 freq
snichert (6) - 2 freq
sojourn (6) - 1 freq
sinnahard (6) - 1 freq
benthorn (7) - 1 freq
snarin (7) - 2 freq
swutherin (7) - 3 freq
santerin (7) - 1 freq
stjerne (7) - 1 freq
SoundEx code - S526
snickerin - 5 freq
smacher - 2 freq
singers - 55 freq
singer - 47 freq
singer's - 2 freq
snocherin - 7 freq
smokers' - 4 freq
sincerely - 8 freq
sincere - 13 freq
sneegart - 1 freq
sneegert - 1 freq
sniggerin - 6 freq
sincerity - 3 freq
snochert - 1 freq
snickert - 1 freq
smoker - 2 freq
smasher - 4 freq
sanquhar - 9 freq
semi-circles - 1 freq
smickered - 1 freq
smickert - 1 freq
snicheren - 2 freq
smokers - 8 freq
sing-greet - 1 freq
sniggers - 5 freq
smugger - 1 freq
smickerin - 2 freq
sangria - 2 freq
snichered - 2 freq
snicher - 3 freq
skincare - 1 freq
smachrie - 2 freq
sniggered - 4 freq
singers' - 2 freq
sniggern - 9 freq
sniggert - 5 freq
smoker's - 2 freq
snooker - 16 freq
synchronised - 1 freq
snocher - 6 freq
snochered - 1 freq
smackaroonies - 2 freq
snicker - 1 freq
shae-makker - 1 freq
shae-maakers - 1 freq
smaikrie - 1 freq
sensory-seutid - 1 freq
sneegired - 1 freq
sneegirs - 1 freq
sanger - 1 freq
sheanchara - 1 freq
snicheran - 1 freq
snicherin - 1 freq
snickered - 1 freq
snigger' - 1 freq
snichert - 2 freq
sanchar - 1 freq
smacker - 3 freq
syncretism - 1 freq
snashers - 1 freq
snickerin' - 1 freq
shoemaker-levy - 1 freq
smacherie - 1 freq
synchronise - 1 freq
snookert - 1 freq
sensoryattachmentintervention - 1 freq
sensory - 4 freq
snochrie - 12 freq
singer-sangwriters - 1 freq
sanskrit - 1 freq
semi-circle - 1 freq
somequhaar - 1 freq
smicker - 1 freq
€œsincerely - 1 freq
synchronically - 1 freq
smacherry - 2 freq
sunscreem - 6 freq
smokiered - 16 freq
samcornwell - 1 freq
skinnycortado - 1 freq
sincerest - 1 freq
shingaurds - 1 freq
smoocher - 3 freq
shaunwkearney - 1 freq
sunnygradio - 1 freq
sensored - 1 freq
sonjahern - 1 freq
sinker - 1 freq
samgray - 29 freq
seanmccrory - 1 freq
snookers - 1 freq
sensors - 1 freq
snickers - 1 freq
sammgreer - 14 freq
suncream - 1 freq
MetaPhone code - SNJHRN
sonjahern - 1 freq
SONJAHERN
Time to execute Levenshtein function - 0.810417 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 2.178474 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.102582 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.189044 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000889 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.