A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to penicillin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
penicillin (0) - 5 freq
pencilthin (3) - 1 freq
feiillit (4) - 1 freq
penicuik (4) - 5 freq
deciplin (4) - 1 freq
cancellin (4) - 1 freq
recallin (4) - 1 freq
tentillie (4) - 2 freq
pedallin (4) - 1 freq
pencil's (4) - 1 freq
pencils (4) - 9 freq
tribillin (4) - 1 freq
unwillin (4) - 7 freq
pencil (4) - 42 freq
snivellin (4) - 1 freq
refillin (4) - 1 freq
elicitin (4) - 1 freq
pencils'd (4) - 1 freq
picklin (4) - 2 freq
panickin (4) - 3 freq
ticklin (5) - 6 freq
pendicle (5) - 5 freq
pickin (5) - 111 freq
craichlin (5) - 1 freq
pensie-lik (5) - 1 freq
penicillin (0) - 5 freq
pencilthin (5) - 1 freq
cancellin (5) - 1 freq
pencil (6) - 42 freq
unwillin (6) - 7 freq
snivellin (6) - 1 freq
picklin (6) - 2 freq
panickin (6) - 3 freq
pencils (6) - 9 freq
punchline (6) - 2 freq
recallin (6) - 1 freq
pencil's (6) - 1 freq
pedallin (6) - 1 freq
scuillin (7) - 3 freq
nicoll (7) - 5 freq
inclin (7) - 1 freq
pochlin (7) - 1 freq
pensell (7) - 2 freq
manically (7) - 1 freq
lenabellina (7) - 1 freq
pensells (7) - 3 freq
pincil (7) - 22 freq
pincils (7) - 2 freq
pullin (7) - 107 freq
pillion (7) - 1 freq
SoundEx code - P524
phone-calls - 1 freq
pencil - 42 freq
pencils'd - 1 freq
pingle - 2 freq
pincil - 22 freq
pincils - 2 freq
pencils - 9 freq
penjulim - 1 freq
painkillers - 3 freq
penicillin - 5 freq
pencil-box - 3 freq
pencil-case - 5 freq
pensie-lik - 1 freq
pingils - 1 freq
pencilthin - 1 freq
phonecalls - 1 freq
pinklin - 2 freq
pensie-like - 1 freq
pensell - 2 freq
pensells - 3 freq
pensel - 1 freq
phone-caals - 1 freq
phone-caal - 1 freq
pennsylvania - 1 freq
pingle-pan - 1 freq
pingilt - 1 freq
pencil's - 1 freq
peencils - 1 freq
phonecall - 2 freq
pinnacle - 1 freq
panglish - 1 freq
punchline - 2 freq
pinglin - 1 freq
pencil-shapit - 1 freq
phone-call - 1 freq
pan-slavic - 1 freq
pengelly - 3 freq
pmacgiollabhain - 1 freq
painkiller - 1 freq
MetaPhone code - PNSLN
penicillin - 5 freq
PENICILLIN
Time to execute Levenshtein function - 0.703263 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.219365 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.091819 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.109436 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001212 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.