Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to amymacleanpod in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
amymacleanpod (0) - 1 freq maclean's (6) - 4 freq robmacleansport (6) - 2 freq alexmacleod (6) - 1 freq macleod (6) - 11 freq macleans (6) - 1 freq dry-cleaned (6) - 1 freq maclean (6) - 15 freq cleaned (7) - 35 freq johnmacleanma (7) - 2 freq shivmaclean (7) - 1 freq cleansed (7) - 5 freq manacled (7) - 1 freq amscanlon (7) - 1 freq amalkadog (7) - 20 freq mcleans (7) - 1 freq macleoid (7) - 1 freq ruamaclennan (7) - 1 freq anclapped (7) - 1 freq mclean's (7) - 1 freq mclean (7) - 13 freq americanos (7) - 1 freq amyspineapple (7) - 1 freq mcleod (7) - 10 freq americano (7) - 2 freq	amymacleanpod (0) - 1 freq maclean (9) - 15 freq macleod (9) - 11 freq macleans (9) - 1 freq maclean's (9) - 4 freq anclapped (10) - 1 freq macleoid (10) - 1 freq mclean's (10) - 1 freq mcleod (10) - 10 freq mclean (10) - 13 freq manacled (10) - 1 freq ruamaclennan (10) - 1 freq dry-cleaned (10) - 1 freq cleansed (10) - 5 freq alexmacleod (10) - 1 freq cleaned (10) - 35 freq immaculately (11) - 1 freq clanked (11) - 1 freq moorlaand (11) - 1 freq clamped (11) - 7 freq merkland (11) - 4 freq monicalennon (11) - 2 freq mossland (11) - 1 freq sclanced (11) - 2 freq emmaissandy (11) - 1 freq	SoundEx code - A552 amang - 699 freq amangst - 40 freq among - 128 freq amongst - 62 freq anyhing - 33 freq announcement - 10 freq announces - 12 freq announced - 33 freq announcer's - 1 freq aming - 3 freq anyone's - 2 freq annoyance - 12 freq announcements - 5 freq annuncee - 1 freq amaing - 1 freq amencg - 1 freq annoyingly - 2 freq amung - 3 freq annunciation - 1 freq announcer - 1 freq aaaamang - 1 freq annoying - 11 freq annoonce - 4 freq 'among - 1 freq annoyince - 2 freq amungst - 2 freq anooncements - 2 freq announce - 7 freq annoonced - 9 freq announcing - 4 freq anoonced - 2 freq ananias - 2 freq amangits - 1 freq annunced - 2 freq annooncement - 2 freq anyhin's - 1 freq annooncet - 1 freq animacy - 1 freq announcers - 1 freq ��among - 1 freq amangat - 1 freq annuncement - 1 freq annuncit - 1 freq announcin - 4 freq annoonces - 1 freq annooncements - 1 freq annooncit - 2 freq anenst - 52 freq -anenst - 1 freq anaemic - 1 freq annooncin - 3 freq anoonce - 1 freq amunsgt - 1 freq ��amang - 1 freq ��anyhing - 1 freq amymacleanpod - 1 freq annemclaughlin - 8 freq anumqaisarjaved - 1 freq	MetaPhone code - AMMKLNPT amymacleanpod - 1 freq	AMYMACLEANPOD
Time to execute Levenshtein function - 0.563652 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.584073 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.030617 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.049548 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000904 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics