A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to waarned in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
waarned (0) - 2 freq
wairned (1) - 2 freq
waarmed (1) - 3 freq
warned (1) - 36 freq
worned (2) - 1 freq
bairned (2) - 3 freq
waarneen (2) - 1 freq
harned (2) - 3 freq
waarn (2) - 1 freq
waarnt (2) - 1 freq
waaled (2) - 2 freq
warner (2) - 1 freq
swaaned (2) - 1 freq
waand (2) - 2 freq
wared (2) - 9 freq
wearied (2) - 9 freq
waanted (2) - 19 freq
learned (2) - 79 freq
gaaned (2) - 1 freq
warmed (2) - 18 freq
yarned (2) - 2 freq
waakened (2) - 12 freq
waarnin (2) - 4 freq
airned (2) - 1 freq
darned (2) - 2 freq
waarned (0) - 2 freq
warned (1) - 36 freq
wairned (1) - 2 freq
waarmed (2) - 3 freq
worned (2) - 1 freq
warked (3) - 23 freq
earned (3) - 19 freq
waurked (3) - 1 freq
airned (3) - 1 freq
weared (3) - 1 freq
darned (3) - 2 freq
larned (3) - 12 freq
waned (3) - 6 freq
laerned (3) - 9 freq
lairned (3) - 14 freq
waarld (3) - 1 freq
waired (3) - 1 freq
warped (3) - 5 freq
warnen (3) - 1 freq
waarnin (3) - 4 freq
waard (3) - 2 freq
wakened (3) - 9 freq
waarnt (3) - 1 freq
warner (3) - 1 freq
waarn (3) - 1 freq
SoundEx code - W653
wairmth - 8 freq
warmed - 18 freq
wormed - 1 freq
warned - 36 freq
waarnt - 1 freq
warmt - 2 freq
warmth - 59 freq
warrant - 19 freq
worm-eeten - 1 freq
wormit - 7 freq
wairned - 2 freq
whirwind - 1 freq
weren't - 4 freq
warrandice - 2 freq
warnt - 20 freq
wormwidd - 77 freq
wormwidd's - 4 freq
wormwidds' - 1 freq
'wormwidd - 1 freq
waarmth - 12 freq
weerin't - 1 freq
warmit - 2 freq
worned - 1 freq
waarmed - 3 freq
warn't - 1 freq
worn-oot - 2 freq
waarantie - 3 freq
where-inti - 1 freq
where-intil - 9 freq
worm-taen - 1 freq
waarmt - 3 freq
waarint - 1 freq
waar-naitered - 1 freq
warranted - 1 freq
warranty - 1 freq
warrandyces - 1 freq
warrandyce - 1 freq
warrender - 1 freq
wirmwid - 1 freq
wirmed - 2 freq
warrand - 5 freq
waarned - 2 freq
wirm-etten - 1 freq
warranties - 2 freq
wirmit - 1 freq
werent - 1 freq
weren’t - 1 freq
MetaPhone code - WRNT
warned - 36 freq
waarnt - 1 freq
warrant - 19 freq
wairned - 2 freq
weren't - 4 freq
warnt - 20 freq
weerin't - 1 freq
worned - 1 freq
warn't - 1 freq
worn-oot - 2 freq
waarantie - 3 freq
where-inti - 1 freq
waarint - 1 freq
warranty - 1 freq
warrand - 5 freq
waarned - 2 freq
werent - 1 freq
weren’t - 1 freq
WAARNED
Time to execute Levenshtein function - 0.472728 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.809548 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027723 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.083266 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000911 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.