Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts


Levenshtein Distance


Words similar to kach in the Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
kach (0) - 1 freq klach (1) - 1 freq kath (1) - 3 freq each (1) - 652 freq rach (1) - 5 freq kauch (1) - 3 freq ach (1) - 313 freq dach (1) - 1 freq cach (1) - 1 freq yach (1) - 1 freq bach (1) - 2 freq hach (1) - 3 freq aach (1) - 1 freq wach (1) - 9 freq mach (1) - 2 freq kace (1) - 2 freq kmch (1) - 1 freq kich (1) - 1 freq lach (1) - 62 freq 'ach (1) - 27 freq keach (1) - 2 freq kfcs (2) - 1 freq kecp (2) - 1 freq path (2) - 178 freq latch (2) - 11 freq	kach (0) - 1 freq keach (1) - 2 freq kich (1) - 1 freq kauch (1) - 3 freq kace (2) - 2 freq mach (2) - 2 freq wach (2) - 9 freq kmch (2) - 1 freq keich (2) - 8 freq aach (2) - 1 freq lach (2) - 62 freq keech (2) - 43 freq 'ach (2) - 27 freq rach (2) - 5 freq hach (2) - 3 freq klach (2) - 1 freq kath (2) - 3 freq ach (2) - 313 freq each (2) - 652 freq bach (2) - 2 freq cach (2) - 1 freq dach (2) - 1 freq yach (2) - 1 freq awch (3) - 2 freq ketch (3) - 9 freq	SoundEx code - K200 keek - 203 freq keys - 63 freq kis - 142 freq kick - 122 freq kiss - 123 freq kicks - 29 freq keech - 43 freq keich - 8 freq kegs - 8 freq kowk - 1 freq kich - 1 freq keeks - 16 freq kecks - 5 freq key's - 1 freq kweek - 2 freq kesh - 1 freq kwik - 2 freq keekiy - 1 freq kock - 1 freq keach - 2 freq keik - 13 freq keess - 10 freq kaiys - 1 freq kush - 1 freq kay's - 2 freq kiz - 18 freq kess - 5 freq kieks - 3 freq kik - 1 freq kishie - 12 freq koko - 1 freq kyiss - 1 freq khaki - 5 freq kek - 1 freq kiosks - 1 freq kaas - 2 freq 'kyiss - 1 freq kissie - 1 freq keekie - 1 freq kyaaks - 1 freq kwesi - 1 freq kiek - 1 freq kauch - 3 freq kace - 2 freq kies - 1 freq kake - 1 freq kiosk - 3 freq kays - 1 freq kehs - 1 freq kezia - 5 freq kayak - 1 freq keg - 2 freq keks - 4 freq kecksie - 1 freq kach - 1 freq kyg - 1 freq khza - 1 freq kuq - 1 freq kaz - 3 freq kycc - 1 freq kic - 1 freq kxiac - 1 freq kazoo - 1 freq ksac - 1 freq 'keek' - 3 freq koxo - 2 freq kwaz - 1 freq kwc - 1 freq keogh - 1 freq kix - 1 freq kes - 1 freq kiss' - 1 freq 
kzhz - 1 freq kswiz - 3 freq kwhowj - 1 freq	MetaPhone code - KX catch - 353 freq cosh - 12 freq ketch - 9 freq cash - 85 freq couch - 67 freq keech - 43 freq keich - 8 freq quaich - 25 freq gash - 14 freq coach - 51 freq cushie - 15 freq cotch - 8 freq cushy - 3 freq kitchie - 71 freq kich - 1 freq cautch - 1 freq gush - 2 freq quash - 1 freq qu'she - 2 freq goach - 1 freq 'cash - 1 freq gouch - 1 freq kesh - 1 freq goochee' - 1 freq keach - 2 freq quiche - 3 freq catia - 1 freq cooch - 7 freq 'catch - 2 freq kush - 1 freq cösh - 1 freq 'gosh - 1 freq catchy - 3 freq kishie - 12 freq 'catchie' - 1 freq cootch - 3 freq kauch - 3 freq coutch - 1 freq goch - 1 freq gotcha - 2 freq catchie - 2 freq kach - 1 freq gwcia - 1 freq gooch - 2 freq cashe - 1 freq gosh - 2 freq cach - 1 freq 'cach - 1 freq	KACH
Time to execute Levenshtein function - 0.213196 milliseconds. The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.	Time to execute Double Levenshtein function - 0.334454 milliseconds. In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.	Time to execute SoundEx function - 0.027798 milliseconds. Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.037932 milliseconds. Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.	Time to execute Manually curated function - 0.000864 milliseconds. Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
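
The descriptions above can be illustrated with a minimal Python sketch of Levenshtein, Double Levenshtein and Soundex. The code behind this page is not shown, so everything here is an assumption: the function names, the vowel set (a, e, i, o, u) used for the second pass, and the Soundex variant, in which h and w break a run of identical codes the same way vowels do.

```python
def levenshtein(a: str, b: str) -> int:
    """Number of single-character replacements, insertions or deletions."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # delete ca
                               current[j - 1] + 1,       # insert cb
                               previous[j - 1] + cost))  # replace or match
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    # Assumption: only a, e, i, o, u count as vowels.
    return "".join(ch for ch in word if ch.lower() not in "aeiou")


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance with vowels removed."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# Soundex digit for each coded consonant; every other letter has no code.
SOUNDEX_CODES = {
    ch: digit
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6"))
    for ch in letters
}


def soundex(word: str) -> str:
    """Simplified Soundex: first letter plus up to three digits, zero-padded."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != last:
            code += digit
        last = digit  # uncoded letters (vowels, h, w, y) reset the run
    return (code + "000")[:4]
```

With these definitions, `double_levenshtein("kach", "kace")` is 1 + 1 = 2 and `soundex("kach")` is `K200`, in line with the lists above.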
