A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sardine in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sardine (0) - 1 freq
sardines (1) - 3 freq
sardane (1) - 1 freq
jardine (1) - 29 freq
wardin (2) - 1 freq
sabine (2) - 3 freq
marine (2) - 15 freq
karine (2) - 2 freq
sarkin (2) - 2 freq
sartin (2) - 6 freq
sarvin (2) - 22 freq
shrine (2) - 12 freq
fardin (2) - 1 freq
sarvice (2) - 5 freq
sarvint (2) - 22 freq
gardiner (2) - 1 freq
jardin (2) - 2 freq
hardie (2) - 26 freq
fardins (2) - 1 freq
sandie (2) - 32 freq
bardin (2) - 1 freq
sardinia (2) - 3 freq
sadie (2) - 25 freq
sarpint (2) - 3 freq
cardie (2) - 2 freq
sardine (0) - 1 freq
sardane (1) - 1 freq
sardinia (2) - 3 freq
jardine (2) - 29 freq
sardines (2) - 3 freq
jardin (3) - 2 freq
sarnie (3) - 3 freq
fardin (3) - 1 freq
cardin (3) - 2 freq
sarne (3) - 1 freq
bardin (3) - 1 freq
sartin (3) - 6 freq
sarkin (3) - 2 freq
wardin (3) - 1 freq
sarvin (3) - 22 freq
saidna (4) - 1 freq
seartin (4) - 1 freq
hoardin (4) - 6 freq
saurin (4) - 1 freq
ardoyne (4) - 2 freq
yirdin (4) - 8 freq
girdin (4) - 2 freq
soartin (4) - 5 freq
guardin (4) - 3 freq
survin (4) - 1 freq
SoundEx code - S635
scartin - 18 freq
skirtin - 12 freq
scrattin - 10 freq
seartin - 1 freq
scriddans - 2 freq
skyrie-tonguit - 1 freq
serten - 4 freq
sorten - 2 freq
scrooteneer's - 4 freq
scrootenized - 1 freq
scrootinizing - 1 freq
scratten - 1 freq
seratonin - 1 freq
squirtin - 4 freq
sardinia - 3 freq
scrutiny - 5 freq
sortin - 24 freq
sweertness - 1 freq
soartin - 5 freq
sartin - 6 freq
sheridan's - 2 freq
sheridan - 5 freq
sardinian - 3 freq
scrittin - 1 freq
scrutinised - 1 freq
sortan - 1 freq
skartin - 4 freq
sardines - 3 freq
skirteen - 2 freq
shreddan - 1 freq
skirteens - 1 freq
shoartens - 1 freq
scartins - 2 freq
shortened - 2 freq
shorthaand - 1 freq
sardane - 1 freq
shortenin - 2 freq
skrattin - 1 freq
sword-dauncin - 1 freq
scartin' - 1 freq
skirtins - 1 freq
shortening - 1 freq
sardine - 1 freq
scrattins - 1 freq
schrödinger - 2 freq
soartins - 2 freq
shoardin - 1 freq
shortness - 2 freq
sorting - 2 freq
shreadin' - 1 freq
scrutinise - 1 freq
shoartenin - 2 freq
sardonic - 1 freq
sardonicism - 1 freq
“scartin - 1 freq
shorthaund - 1 freq
saortony - 1 freq
schrodinger's - 1 freq
MetaPhone code - SRTN
certain - 150 freq
seartin - 1 freq
certin - 1 freq
serten - 4 freq
sorten - 2 freq
sardinia - 3 freq
sortin - 24 freq
soartin - 5 freq
sartin - 6 freq
sortan - 1 freq
certane - 2 freq
certan - 1 freq
sardane - 1 freq
certayne - 1 freq
sardine - 1 freq
saortony - 1 freq
SARDINE
Time to execute Levenshtein function - 0.242737 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.438046 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032943 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043042 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001064 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.