A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to tarf in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
tarf (0) - 1 freq
tar (1) - 26 freq
tara (1) - 16 freq
targ (1) - 2 freq
tharf (1) - 1 freq
taf (1) - 1 freq
turf (1) - 13 freq
tare (1) - 4 freq
tart (1) - 12 freq
barf (1) - 7 freq
tarp (1) - 1 freq
tay (2) - 186 freq
darl (2) - 1 freq
raf (2) - 7 freq
barx (2) - 1 freq
taffy (2) - 1 freq
tear (2) - 103 freq
toarn (2) - 1 freq
start (2) - 592 freq
lara (2) - 1 freq
nard (2) - 3 freq
tax (2) - 61 freq
tash (2) - 4 freq
baf (2) - 1 freq
zarg (2) - 1 freq
tarf (0) - 1 freq
turf (1) - 13 freq
tare (2) - 4 freq
barf (2) - 7 freq
tarp (2) - 1 freq
tar (2) - 26 freq
tart (2) - 12 freq
tara (2) - 16 freq
taf (2) - 1 freq
targ (2) - 2 freq
tharf (2) - 1 freq
turfs (3) - 1 freq
raaf (3) - 1 freq
turr (3) - 1 freq
toff (3) - 12 freq
tarot (3) - 5 freq
tre (3) - 4 freq
tyre (3) - 17 freq
tqyf (3) - 1 freq
tears (3) - 307 freq
taury (3) - 1 freq
taur (3) - 9 freq
terfs (3) - 1 freq
tairt (3) - 3 freq
arfu (3) - 1 freq
SoundEx code - T610
trivia - 2 freq
trap - 62 freq
throb - 5 freq
threip - 35 freq
trip - 150 freq
tirravee - 3 freq
troupe - 15 freq
trev - 73 freq
threep - 2 freq
thrave - 8 freq
tirrivee - 10 freq
tribe - 26 freq
thrive - 22 freq
threap - 34 freq
turravee - 4 freq
trippie - 1 freq
troop - 11 freq
thrap - 2 freq
turf - 13 freq
trophie - 1 freq
therapy - 11 freq
terribie - 1 freq
turriff - 3 freq
thrab - 2 freq
trophy - 15 freq
trippy - 2 freq
tripe - 17 freq
tirivee - 3 freq
trb - 1 freq
thereby - 9 freq
trep - 2 freq
tyrefu - 1 freq
tirrivie - 4 freq
tarf - 1 freq
trawphy - 1 freq
traep - 6 freq
thraip - 1 freq
trive - 1 freq
tarbh - 1 freq
throve - 1 freq
tharefae - 1 freq
thairof - 4 freq
thereof - 5 freq
tharf - 1 freq
tryve - 1 freq
€œtrev - 1 freq
terrify - 1 freq
trrrrap - 1 freq
trrrap - 1 freq
trapp - 1 freq
trove - 3 freq
tarp - 1 freq
thrup - 1 freq
trope - 2 freq
€œthreap - 1 freq
thrupp - 1 freq
turbo - 2 freq
trhyf - 2 freq
therepy - 1 freq
twerp - 1 freq
terp - 2 freq
MetaPhone code - TRF
drove - 68 freq
trivia - 2 freq
drave - 61 freq
drive - 170 freq
tirravee - 3 freq
trev - 73 freq
tirrivee - 10 freq
turravee - 4 freq
turf - 13 freq
trophie - 1 freq
driv - 12 freq
dryve - 2 freq
dry'v - 1 freq
turriff - 3 freq
trough - 6 freq
trophy - 15 freq
drehve - 1 freq
tirivee - 3 freq
tyrefu - 1 freq
tirrivie - 4 freq
derf - 2 freq
tarf - 1 freq
dreef - 4 freq
dræv - 2 freq
trawphy - 1 freq
dhrive - 4 freq
dhriv - 5 freq
drehv - 1 freq
derive - 2 freq
trive - 1 freq
tryve - 1 freq
€œtrev - 1 freq
terrify - 1 freq
€œdrive - 1 freq
dreive - 4 freq
trove - 3 freq
druv - 1 freq
drf - 1 freq
trhyf - 2 freq
hwdrioaf - 1 freq
drfw - 1 freq
'dreef' - 1 freq
TARF
Time to execute Levenshtein function - 0.578209 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.599835 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028568 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.068597 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000910 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.