A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pencil-case in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pencil-case (0) - 5 freq
pencils (4) - 9 freq
pencil's (4) - 1 freq
pencil-box (4) - 3 freq
pendils (5) - 1 freq
pensie-like (5) - 1 freq
test-case (5) - 1 freq
peencils (5) - 1 freq
penchance (5) - 1 freq
pencilthin (5) - 1 freq
packin-cases (5) - 1 freq
deil-lake (5) - 1 freq
pincils (5) - 2 freq
mentalcases (5) - 1 freq
pendice (5) - 2 freq
pendicle (5) - 5 freq
pencil (5) - 42 freq
pen-name (5) - 4 freq
enchiladas (5) - 1 freq
penniless (5) - 1 freq
enclave (5) - 3 freq
inculcate (5) - 1 freq
pontificate (5) - 1 freq
pencil-shapit (5) - 1 freq
pencils'd (5) - 1 freq
pencil-case (0) - 5 freq
pencil-box (6) - 3 freq
pencil's (6) - 1 freq
pencils (6) - 9 freq
pincils (7) - 2 freq
peencils (7) - 1 freq
pendils (8) - 1 freq
penniless (8) - 1 freq
pencils'd (8) - 1 freq
punch-ups (8) - 1 freq
inculcate (8) - 1 freq
pencil (8) - 42 freq
penchance (8) - 1 freq
pencilthin (8) - 1 freq
penels (9) - 7 freq
enceladus (9) - 1 freq
pingils (9) - 1 freq
pensells (9) - 3 freq
pillowcase (9) - 1 freq
panic's (9) - 1 freq
pint-cans (9) - 1 freq
pences (9) - 3 freq
pincil (9) - 22 freq
angel-wise (9) - 1 freq
pendicles (9) - 1 freq
SoundEx code - P524
phone-calls - 1 freq
pencil - 42 freq
pencils'd - 1 freq
pingle - 2 freq
pincil - 22 freq
pincils - 2 freq
pencils - 9 freq
penjulim - 1 freq
painkillers - 3 freq
penicillin - 5 freq
pencil-box - 3 freq
pencil-case - 5 freq
pensie-lik - 1 freq
pingils - 1 freq
pencilthin - 1 freq
phonecalls - 1 freq
pinklin - 2 freq
pensie-like - 1 freq
pensell - 2 freq
pensells - 3 freq
pensel - 1 freq
phone-caals - 1 freq
phone-caal - 1 freq
pennsylvania - 1 freq
pingle-pan - 1 freq
pingilt - 1 freq
pencil's - 1 freq
peencils - 1 freq
phonecall - 2 freq
pinnacle - 1 freq
panglish - 1 freq
punchline - 2 freq
pinglin - 1 freq
pencil-shapit - 1 freq
phone-call - 1 freq
pan-slavic - 1 freq
pengelly - 3 freq
pmacgiollabhain - 1 freq
painkiller - 1 freq
MetaPhone code - PNSLKS
pencil-case - 5 freq
PENCIL-CASE
Time to execute Levenshtein function - 0.294776 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.514156 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.037600 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.048174 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000944 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.