A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Harvey, Lewis

Basic Stats

Total words by this author in corpus - 777
Total unique words used by this author in corpus - 289
Ratio of total words to unique words - 2.689
Tagged as MNB (Mid Northern B) dialect.
Top ten most common words - a, the, wis, and, her, tae, she, ma, me, hid,

List of texts in corpus

Crash
Scots Hoose (2020) in Doric (Forres) dialect (MNB), categorised as prose (777 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
a67 86,229.0958.692
wifie7 9,009.0142.694
skweel5 6,435.0141.322
her22 28,314.0339.698
and24 30,888.0336.908
usual5 6,435.0136.397
detention3 3,861.0034.986
hid11 14,157.0131.981
fit10 12,870.0131.872
ok4 5,148.0131.180
me15 19,305.0229.270
she18 23,166.0226.893
wis26 33,462.0326.634
wid9 11,583.0126.581
telt7 9,009.0124.507
ma15 19,305.0222.649
bawlin2 2,574.0022.138
well5 6,435.0121.435
jist9 11,583.0121.244
heidie2 2,574.0021.232
happened4 5,148.0120.410
lookit3 3,861.0017.055
bobbies2 2,574.0016.571
could6 7,722.0116.368
heided2 2,574.0015.304
fan4 5,148.0114.714
excuse2 2,574.0013.820
go4 5,148.0112.578
to6 7,722.0112.134
an7 9,009.0111.542
next3 3,861.0011.229
arrived2 2,574.0011.218
pulled2 2,574.0010.915
should3 3,861.0010.521
office2 2,574.0010.082
then5 6,435.019.799
gaen2 2,574.009.559
ca2 2,574.008.791
stick2 2,574.008.567
walk2 2,574.008.445
hiv3 3,861.008.387
wait2 2,574.008.184
road3 3,861.008.118
they9 11,583.017.750
thocht4 5,148.017.695
there7 9,009.017.604
car2 2,574.007.546
done2 2,574.007.453
seemed2 2,574.007.066
wint2 2,574.006.829
thing3 3,861.006.400
roond2 2,574.006.301
tell3 3,861.006.016
turn2 2,574.005.816
door3 3,861.005.418
get4 5,148.015.096
git2 2,574.004.860
ony3 3,861.004.833
help2 2,574.004.757
niver2 2,574.004.619
let2 2,574.004.148
them4 5,148.013.842
heid3 3,861.003.809
need2 2,574.003.786
fair2 2,574.003.728
the33 42,471.043.694
was3 3,861.003.680
through2 2,574.003.440
so3 3,861.003.431
weel3 3,861.002.966
fir2 2,574.002.743
o10 12,870.012.696
aff3 3,861.002.510
kent2 2,574.002.268
it5 6,435.012.258
if3 3,861.002.004
i2 2,574.001.800
see3 3,861.001.781
my2 2,574.001.753
s3 3,861.001.616
for2 2,574.001.612
at8 10,296.011.549
back3 3,861.001.013
aboot4 5,148.011.004
as7 9,009.010.923
awa2 2,574.000.902
ower3 3,861.000.762
said2 2,574.000.661
chik2 2,574.00nan
auld2 2,574.000.732
wi4 5,148.010.639
tae20 25,740.030.630
in10 12,870.010.402
noo2 2,574.000.364
intae2 2,574.000.319
d2 2,574.000.308
but4 5,148.010.290
be5 6,435.010.283
angrier2 2,574.00nan
up4 5,148.010.308
been2 2,574.000.218
day2 2,574.000.156
or2 2,574.000.135
that7 9,009.010.062
doon2 2,574.000.058
oot3 3,861.000.021
nae2 2,574.000.021