A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Curt_Kenobi

Basic Stats

Total words by this author in corpus - 525
Total unique words used by this author in corpus - 283
Ratio of total words to unique words - 1.855
Tagged as LAL (General Central) dialect.
Top ten most common words - ah, tae, a, the, me, it, ehs, fir, ay, sae,

List of texts in corpus

Mates
archiveofourown.org (2016-09-27) in Central dialect (LAL), categorised as prose (525 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
ehs8 15,238.10115.552
ah20 38,095.2444.223
ah'm6 11,428.5741.857
sick4 7,619.0533.928
fir7 13,333.3330.110
n6 11,428.5730.062
all5 9,523.8127.026
natural3 5,714.2923.007
compliment2 3,809.5220.907
blond2 3,809.5220.907
eh's6 11,428.57nan
shite3 5,714.2920.784
fucker2 3,809.5219.283
eh4 7,619.0518.613
ay7 13,333.3318.581
theory2 3,809.5218.134
kissed2 3,809.5217.892
long3 5,714.2916.470
ihm5 9,523.81nan
fuckin4 7,619.0516.120
it's5 9,523.8116.018
me9 17,142.8615.767
chest2 3,809.5215.766
oan6 11,428.5715.181
wonder2 3,809.5215.020
forward2 3,809.5214.908
almost2 3,809.5214.908
makes2 3,809.5214.204
jist6 11,428.5714.013
sae6 11,428.5713.543
when5 9,523.8113.305
ah've2 3,809.5212.955
birds2 3,809.5212.513
that's3 5,714.2912.089
the14 26,666.6711.592
colour2 3,809.5210.885
dinnae3 5,714.2910.758
go3 5,714.2910.006
mooth2 3,809.529.986
an4 7,619.059.476
boy2 3,809.529.416
fuck2 3,809.529.364
now2 3,809.529.040
since2 3,809.528.993
eyes2 3,809.528.786
across2 3,809.528.570
about2 3,809.527.954
do2 3,809.527.954
cunt2 3,809.527.798
hair2 3,809.527.323
am2 3,809.526.918
in2 3,809.526.821
masel2 3,809.526.473
git2 3,809.526.271
boy's2 3,809.52nan
though2 3,809.526.206
face2 3,809.524.699
are3 5,714.294.486
ken3 5,714.294.387
life2 3,809.524.031
still2 3,809.523.346
gie2 3,809.523.334
think2 3,809.523.213
but5 9,523.813.063
aboot4 7,619.052.645
us2 3,809.521.674
no3 5,714.291.474
like3 5,714.291.464
ma4 7,619.051.205
wi2 3,809.521.162
it9 17,142.861.156
tae15 28,571.431.138
his5 9,523.811.105
outta2 3,809.52nan
had2 3,809.521.548
or3 5,714.290.761
and2 3,809.520.732
back2 3,809.520.650
mair2 3,809.520.616
wis4 7,619.050.340
we3 5,714.290.326
him2 3,809.520.233
a14 26,666.670.138
be3 5,714.290.047
that5 9,523.810.007