A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to goves in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
goves (0) - 2 freq
toves (1) - 1 freq
govts (1) - 1 freq
moves (1) - 32 freq
govies (1) - 1 freq
gloves (1) - 48 freq
loves (1) - 68 freq
godes (1) - 2 freq
roves (1) - 1 freq
joves (1) - 2 freq
goges (1) - 5 freq
gives (1) - 20 freq
doves (1) - 2 freq
goved (1) - 2 freq
goes (1) - 331 freq
groves (1) - 2 freq
gove (1) - 14 freq
lover (2) - 27 freq
gowns (2) - 2 freq
govt (2) - 7 freq
moses (2) - 68 freq
lovies (2) - 1 freq
doses (2) - 9 freq
golem (2) - 1 freq
dovey (2) - 3 freq
goves (0) - 2 freq
gives (1) - 20 freq
govies (1) - 1 freq
doves (2) - 2 freq
guvs (2) - 1 freq
goved (2) - 2 freq
groves (2) - 2 freq
goges (2) - 5 freq
gove (2) - 14 freq
gvs (2) - 1 freq
goes (2) - 331 freq
govts (2) - 1 freq
joves (2) - 2 freq
moves (2) - 32 freq
toves (2) - 1 freq
gloves (2) - 48 freq
roves (2) - 1 freq
godes (2) - 2 freq
loves (2) - 68 freq
saves (3) - 11 freq
gavel (3) - 6 freq
movs (3) - 2 freq
gunes (3) - 2 freq
gooms (3) - 4 freq
fives (3) - 7 freq
SoundEx code - G120
gaps - 11 freq
gapes - 1 freq
guffs - 4 freq
gives - 20 freq
gaffs - 4 freq
gypes - 17 freq
gps - 4 freq
gaups - 2 freq
gubs - 1 freq
gabs - 3 freq
gibbous - 2 freq
gap's - 1 freq
gobs - 4 freq
guffaws - 2 freq
gavse - 1 freq
gowp's - 1 freq
gappas - 2 freq
gappus - 3 freq
gabsie - 1 freq
gowps - 3 freq
goves - 2 freq
govies - 1 freq
gypsy - 2 freq
gebbies - 1 freq
'gobbag - 1 freq
gsfc - 2 freq
gapus - 1 freq
guavas - 1 freq
gobbs - 20 freq
gawps - 1 freq
gbwg - 1 freq
gbpx - 1 freq
gavzo - 2 freq
gpaj - 1 freq
gypos - 1 freq
gvs - 1 freq
gype's - 1 freq
gebs - 2 freq
gfqzq - 1 freq
gwupz - 1 freq
gxewbfc - 1 freq
gibbsy - 1 freq
gpsg - 1 freq
geebies - 1 freq
guvs - 1 freq
gjbwz - 1 freq
MetaPhone code - KFS
cuffs - 7 freq
guffs - 4 freq
caves - 17 freq
caufs - 3 freq
gaffs - 4 freq
coughs - 6 freq
cauves - 7 freq
coffees - 1 freq
cvs - 1 freq
guffaws - 2 freq
caffies' - 2 freq
gavse - 1 freq
cafés - 2 freq
coofs - 5 freq
caiaphas - 2 freq
coafs - 1 freq
cafés - 3 freq
cofos - 2 freq
goves - 2 freq
govies - 1 freq
coffs - 2 freq
cave's - 1 freq
cafes - 6 freq
guavas - 1 freq
€œcaves - 2 freq
cuifs - 1 freq
gavzo - 2 freq
coof's - 1 freq
coveÂ’s - 1 freq
gvs - 1 freq
qyfws - 1 freq
cfiz - 1 freq
guvs - 1 freq
covey's - 1 freq
GOVES
Time to execute Levenshtein function - 0.211867 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.338581 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027714 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037153 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000877 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.