A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to sardines in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sardines (0) - 3 freq
sardine (1) - 1 freq
sarnies (2) - 1 freq
saidnes (2) - 1 freq
sarvints (2) - 10 freq
gardiner (2) - 1 freq
cardies (2) - 1 freq
hardiness (2) - 2 freq
fardins (2) - 1 freq
sardane (2) - 1 freq
sarpints (2) - 1 freq
jardine (2) - 29 freq
bardies (2) - 1 freq
hardies (2) - 1 freq
gardies (2) - 1 freq
sardinia (2) - 3 freq
carlines (2) - 3 freq
sarvices (2) - 5 freq
darknes (3) - 1 freq
erlines (3) - 2 freq
spraings (3) - 1 freq
spurdies (3) - 3 freq
sabine (3) - 3 freq
birdies (3) - 27 freq
arises (3) - 5 freq
Double Levenshtein
sardines (0) - 3 freq
sardine (2) - 1 freq
sardinia (3) - 3 freq
sardane (3) - 1 freq
fardins (3) - 1 freq
saidnes (3) - 1 freq
sarnies (3) - 1 freq
sardinian (4) - 3 freq
herdins (4) - 3 freq
gardens (4) - 10 freq
shaidins (4) - 6 freq
saddens (4) - 1 freq
sardonic (4) - 1 freq
hairdnes (4) - 1 freq
soartins (4) - 2 freq
gairdins (4) - 2 freq
sidins (4) - 1 freq
wirdins (4) - 1 freq
wardens (4) - 2 freq
hardiness (4) - 2 freq
sarvices (4) - 5 freq
cardies (4) - 1 freq
gardiner (4) - 1 freq
sarvints (4) - 10 freq
jardine (4) - 29 freq
SoundEx code - S635
scartin - 18 freq
skirtin - 12 freq
scrattin - 10 freq
seartin - 1 freq
scriddans - 2 freq
skyrie-tonguit - 1 freq
serten - 4 freq
sorten - 2 freq
scrooteneer's - 4 freq
scrootenized - 1 freq
scrootinizing - 1 freq
scratten - 1 freq
seratonin - 1 freq
squirtin - 4 freq
sardinia - 3 freq
scrutiny - 5 freq
sortin - 24 freq
sweertness - 1 freq
soartin - 5 freq
sartin - 6 freq
sheridan's - 2 freq
sheridan - 5 freq
sardinian - 3 freq
scrittin - 1 freq
scrutinised - 1 freq
sortan - 1 freq
skartin - 4 freq
sardines - 3 freq
skirteen - 2 freq
shreddan - 1 freq
skirteens - 1 freq
shoartens - 1 freq
scartins - 2 freq
shortened - 2 freq
shorthaand - 1 freq
sardane - 1 freq
shortenin - 2 freq
skrattin - 1 freq
sword-dauncin - 1 freq
scartin' - 1 freq
skirtins - 1 freq
shortening - 1 freq
sardine - 1 freq
scrattins - 1 freq
schrödinger - 2 freq
soartins - 2 freq
shoardin - 1 freq
shortness - 2 freq
sorting - 2 freq
shreadin' - 1 freq
scrutinise - 1 freq
shoartenin - 2 freq
sardonic - 1 freq
sardonicism - 1 freq
“scartin - 1 freq
shorthaund - 1 freq
saortony - 1 freq
schrodinger's - 1 freq
MetaPhone code - SRTNS
sardines - 3 freq
soartins - 2 freq
Manually curated
SARDINES
Time to execute Levenshtein function - 0.198955 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
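As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not this site's own code; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    or deletions needed to turn word a into word b."""
    # previous[j] = distance between the processed prefix of a
    # and the first j characters of b
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


for word in ("sardines", "sardine", "sarnies", "birdies"):
    print(word, levenshtein("sardines", word))
```

The distances printed for these four words can be compared with the figures in brackets in the Levenshtein list for sardines.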
Time to execute Double Levenshtein function - 0.550189 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
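A possible reading of that description, sketched in Python. The names `strip_vowels` and `double_levenshtein` are made up for this example, and treating only a, e, i, o, u as vowels is an assumption; the site may handle y or accented letters differently.

```python
VOWELS = set("aeiou")  # assumption: y counts as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so only the consonant skeleton remains."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant
    skeletons, so a consonant change is counted twice."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("sardine", "sardinia", "jardine"):
    print(word, double_levenshtein("sardines", word))
```

Under these assumptions, sardine scores 1 on the full words plus 1 on the skeletons (srdns against srdn), which is why a word at plain distance 1 can appear at distance 2 in the Double Levenshtein list.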
Time to execute SoundEx function - 0.059505 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
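For readers curious how a code such as S635 comes about, below is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, skip repeats of the same digit, and pad to four characters. It is an illustration, not the function this page calls; ignoring non-letters such as hyphens is an assumption.

```python
# Consonant groups and their Soundex digits
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, e.g. a letter and three digits."""
    # Assumption: anything outside a-z (hyphens, apostrophes, accents) is ignored
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeated digit may follow it
    return result.ljust(4, "0")


for word in ("sardines", "scartin", "sheridan"):
    print(word, soundex(word))
```

Because only the first three coded consonants after the initial letter are kept, quite different words such as scartin and sheridan end up in the same bucket as sardines.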
Time to execute MetaPhone function - 0.037535 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
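The full Metaphone rule set is too long to reproduce here, so the Python sketch below implements only a heavily reduced, hypothetical subset of Metaphone-style rules (drop non-initial vowels, merge a few digraphs, fold d into T, and so on). It is not Philips's algorithm and will disagree with a real Metaphone implementation on many words; it is only meant to show how a key like SRTNS can arise.

```python
VOWELS = set("aeiou")
# Assumed subset of letter-pair rules ("0" stands for the "th" sound)
DIGRAPHS = {"ph": "F", "sh": "X", "ch": "X", "th": "0", "ck": "K"}
# Assumed subset of single-letter rules; other consonants map to themselves
SINGLE = {"d": "T", "g": "K", "q": "K", "z": "S", "v": "F", "x": "KS"}


def simple_metaphone(word: str) -> str:
    """Very reduced Metaphone-like key (illustrative only)."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    key = []
    i = 0
    while i < len(letters):
        ch = letters[i]
        nxt = letters[i + 1] if i + 1 < len(letters) else ""
        if i > 0 and ch == letters[i - 1] and ch != "c":
            i += 1  # skip the second letter of a doubled pair
            continue
        if ch + nxt in DIGRAPHS:
            key.append(DIGRAPHS[ch + nxt])
            i += 2
            continue
        if ch in VOWELS:
            if i == 0:
                key.append(ch.upper())  # vowels survive only at the start
        elif ch == "c":
            key.append("S" if nxt in ("e", "i", "y") else "K")
        elif ch in "hwy":
            if nxt in VOWELS:
                key.append(ch.upper())  # kept only before a vowel
        else:
            key.append(SINGLE.get(ch, ch.upper()))
        i += 1
    return "".join(key)


for word in ("sardines", "soartins"):
    print(word, simple_metaphone(word))
```

Unlike Soundex, the key is not cut to a fixed length, so the trailing s of sardines is kept and far fewer words share its key.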
Time to execute Manually curated function - 0.000937 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
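A lookup table of this kind can be sketched as a plain dictionary. The structure and function names below are assumptions for illustration, and the entries are hypothetical apart from sardines mapping to SARDINES, which is the one result shown for this search.

```python
from typing import List, Optional

# Hand-maintained table: word form -> lemma.
# Only "sardines" -> "SARDINES" comes from the results shown for this search;
# the other entries are hypothetical examples of a variant and a typo.
LEXICON = {
    "sardines": "SARDINES",
    "sardine": "SARDINES",   # hypothetical entry
    "sardiens": "SARDINES",  # hypothetical typo entry
}


def lemma_of(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(word: str) -> List[str]:
    """Return every other listed form that shares the word's lemma."""
    lemma = lemma_of(word)
    if lemma is None:
        return []
    return sorted(form for form, lem in LEXICON.items()
                  if lem == lemma and form != word.lower())


print(lemma_of("sardines"))
print(variants_of("sardines"))
print(lemma_of("ahint"))  # not in this toy table, so None
```

A dictionary lookup does no string comparison at all, which fits with this method having the smallest execution time of the five.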