A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cattyish in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cattyish (0) - 20 freq
catfish (2) - 1 freq
fattish (2) - 1 freq
catties (3) - 3 freq
scoattish (3) - 2 freq
cattrick (3) - 1 freq
fatty's (3) - 2 freq
ttish (3) - 1 freq
earlyish (3) - 1 freq
matty's (3) - 15 freq
cathy's (3) - 27 freq
catty (3) - 27 freq
patty's (3) - 1 freq
a-tish (3) - 3 freq
cattlie (3) - 2 freq
scottish (3) - 1237 freq
paitish (3) - 1 freq
cattie (3) - 2 freq
waftish (3) - 1 freq
daftish (3) - 2 freq
cattil (3) - 2 freq
kitty's (4) - 1 freq
sattles (4) - 2 freq
captains (4) - 3 freq
oaffish (4) - 1 freq
cattyish (0) - 20 freq
fattish (3) - 1 freq
catfish (3) - 1 freq
ttish (4) - 1 freq
scottish (4) - 1237 freq
scoattish (4) - 2 freq
catties (4) - 3 freq
cattie (5) - 2 freq
paitish (5) - 1 freq
cutties (5) - 3 freq
cotts (5) - 3 freq
cattlie (5) - 2 freq
daftish (5) - 2 freq
waftish (5) - 1 freq
cattil (5) - 2 freq
catty (5) - 27 freq
a-tish (5) - 3 freq
cutting (6) - 16 freq
cants (6) - 1 freq
cartoonish (6) - 1 freq
patties (6) - 2 freq
british (6) - 232 freq
cuttins (6) - 5 freq
cattler (6) - 3 freq
cathie (6) - 1 freq
SoundEx code - C320
cottage - 49 freq
catch - 353 freq
cities - 43 freq
city's - 10 freq
cats - 124 freq
cat's - 32 freq
ceeties - 7 freq
cuddies - 49 freq
cuddie's - 10 freq
codes - 4 freq
cuts - 45 freq
coats - 28 freq
chats - 3 freq
cotch - 8 freq
cds - 20 freq
cadiz - 2 freq
cute's - 1 freq
cd's - 2 freq
cothous - 9 freq
cot-hous - 2 freq
cahoots - 2 freq
chat's - 1 freq
cuits - 2 freq
cuddies' - 2 freq
cut's - 2 freq
cit's - 1 freq
cathoose - 1 freq
cautch - 1 freq
coat's - 4 freq
cathy's - 27 freq
'cathy's - 2 freq
'cuddies - 1 freq
citz - 2 freq
cautious - 8 freq
cheats - 2 freq
cats' - 2 freq
catties - 3 freq
c-c-d's - 1 freq
'cheats' - 2 freq
couttie's - 3 freq
chotce - 1 freq
cïties - 2 freq
cottage' - 1 freq
'catch - 2 freq
cuithes - 4 freq
cots - 2 freq
cadgy - 1 freq
châteaus - 1 freq
chates - 1 freq
cadgie - 2 freq
cahootchie - 2 freq
catchy - 3 freq
cotts - 3 freq
cöts - 3 freq
caddies - 2 freq
chutes - 1 freq
cutties - 3 freq
cyties - 1 freq
cities' - 1 freq
cits - 1 freq
'catchie' - 1 freq
cites - 12 freq
caats - 4 freq
codgie - 2 freq
cowtious - 1 freq
cahoutchy - 1 freq
cootch - 3 freq
ceities - 8 freq
codds - 2 freq
cadge - 2 freq
coits - 1 freq
coutch - 1 freq
cutesy - 1 freq
chits - 1 freq
cíties - 1 freq
coattage - 1 freq
cooties - 1 freq
chaotic - 3 freq
coots - 1 freq
cods - 2 freq
€˜cuddies - 1 freq
coads - 1 freq
catchie - 2 freq
cottige - 1 freq
ceuithes - 1 freq
cattie's - 1 freq
czdq - 1 freq
caddis - 1 freq
cts - 1 freq
cuddy's - 1 freq
cattyish - 20 freq
cuddys - 1 freq
chdk - 1 freq
cyatcy - 1 freq
czdxi - 1 freq
cuties - 2 freq
coutts - 1 freq
caithess - 1 freq
ctdg - 1 freq
cedk - 1 freq
catwawk - 1 freq
MetaPhone code - KTYX
cattyish - 20 freq
CATTYISH
Time to execute Levenshtein function - 0.275175 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.883042 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.083004 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.093228 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000869 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.