A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to malaya in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
malaya (0) - 3 freq
malay (1) - 1 freq
malaga (1) - 1 freq
maara (2) - 1 freq
maya (2) - 2 freq
alya (2) - 1 freq
malawi (2) - 5 freq
masayl (2) - 1 freq
millya (2) - 1 freq
malady (2) - 1 freq
masala (2) - 1 freq
malaysia (2) - 1 freq
walays (2) - 1 freq
malty (2) - 6 freq
palava (2) - 1 freq
alana (2) - 12 freq
malky (2) - 5 freq
maaa (2) - 4 freq
mally (2) - 1 freq
maana (2) - 2 freq
malta (2) - 2 freq
adaya (2) - 1 freq
mawlana (2) - 1 freq
malaria (2) - 3 freq
lally (3) - 2 freq
malaya (0) - 3 freq
malay (1) - 1 freq
malaga (2) - 1 freq
mily (3) - 2 freq
male (3) - 59 freq
malta (3) - 2 freq
mally (3) - 1 freq
malaria (3) - 3 freq
maana (3) - 2 freq
mla (3) - 1 freq
melea (3) - 3 freq
mela (3) - 3 freq
mal (3) - 2 freq
mealy (3) - 5 freq
maaa (3) - 4 freq
mula (3) - 1 freq
malky (3) - 5 freq
malawi (3) - 5 freq
millya (3) - 1 freq
alya (3) - 1 freq
maya (3) - 2 freq
maara (3) - 1 freq
malady (3) - 1 freq
masayl (3) - 1 freq
malty (3) - 6 freq
SoundEx code - M400
malawi - 5 freq
meal - 130 freq
mool - 3 freq
mile - 278 freq
mull - 35 freq
male - 59 freq
mill - 82 freq
mell - 23 freq
millie - 5 freq
mealy - 5 freq
moolah - 1 freq
mollie - 16 freq
mail - 54 freq
moyle - 6 freq
melee - 3 freq
mle - 1 freq
mole - 11 freq
mal - 2 freq
meyle - 1 freq
meel - 3 freq
myle - 15 freq
mellow - 4 freq
mily - 2 freq
mel - 4 freq
mealie - 9 freq
mille - 1 freq
m'll - 2 freq
mall - 4 freq
mule - 4 freq
molly - 10 freq
ma'll - 1 freq
melea - 3 freq
mallae - 4 freq
moil - 1 freq
mál - 1 freq
malaya - 3 freq
ml - 12 freq
myll - 2 freq
mally - 1 freq
'mole' - 1 freq
me'll - 4 freq
millya - 1 freq
mahal - 3 freq
mêlée - 1 freq
mellie - 2 freq
meal- - 1 freq
mailie - 3 freq
moo'll - 1 freq
mellay - 1 freq
mele - 1 freq
mul - 2 freq
€˜male - 1 freq
maw'ill - 1 freq
Émile - 1 freq
meelie - 3 freq
mael - 2 freq
malay - 1 freq
-mile - 1 freq
mayol - 1 freq
maol - 1 freq
milieu - 1 freq
mela - 3 freq
milo - 1 freq
mnl - 1 freq
miuli - 1 freq
molloy - 4 freq
mla - 1 freq
mula - 1 freq
mlle - 1 freq
mml - 1 freq
maalie - 7 freq
moul - 1 freq
mlih - 1 freq
milla - 1 freq
myla - 1 freq
MetaPhone code - MLY
malaya - 3 freq
millya - 1 freq
MALAYA
Time to execute Levenshtein function - 0.262163 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.480793 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033149 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043063 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000976 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.