A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Forrest, Liam

Basic Stats

Total words by this author in corpus - 1,175
Total unique words used by this author in corpus - 462
Ratio of total words to unique words - 2.543
Tagged as SEC ((South) East Central) dialect.
Top ten most common words - the, eh, a, they, wurr, it, tae, in, he, like,

List of texts in corpus

Hittin the Toon
Scots Hoose (2020) in Central (Dunfermline) dialect (SEC), categorised as prose (1,175 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
eh37 31,489.36280.136
wiz19 16,170.2178.482
awbdy6 5,106.3872.651
n12 10,212.7757.679
they26 22,127.6648.081
wurr25 21,276.60nan
pals7 5,957.4548.498
looked9 7,659.5746.767
like21 17,872.3445.507
queue5 4,255.3241.423
wit6 5,106.3838.157
hud9 7,659.5736.540
group7 5,957.4535.707
their15 12,765.9634.739
club5 4,255.3233.070
lassies5 4,255.3231.313
gon3 2,553.1931.262
dresses3 2,553.1931.262
um5 4,255.3229.713
nuhin3 2,553.1929.370
av3 2,553.1924.627
bunch3 2,553.1924.627
er5 4,255.3224.380
who6 5,106.3824.264
boys4 3,404.2622.929
spice2 1,702.1321.667
dum2 1,702.1320.484
pal4 3,404.2619.737
surprising2 1,702.1319.578
tanned2 1,702.1319.578
guards2 1,702.1318.843
id2 1,702.1318.224
stawndin6 5,106.38nan
skirts2 1,702.1317.689
gonny2 1,702.1317.689
to8 6,808.5114.503
got7 5,957.4514.218
way4 3,404.2613.951
hair4 3,404.2613.832
blawn2 1,702.1313.159
shop3 2,553.1913.016
security2 1,702.1312.856
obviously2 1,702.1311.949
nixt3 2,553.1911.923
thurr5 4,255.32nan
front4 3,404.2613.921
aroond3 2,553.1911.884
wan5 4,255.3211.860
waws2 1,702.1311.835
git4 3,404.2611.739
gits2 1,702.1311.513
line3 2,553.1911.082
wearin2 1,702.1310.939
surprised2 1,702.1310.600
he22 18,723.4010.173
oan7 5,957.4510.061
erse2 1,702.139.608
ticht2 1,702.139.425
throw2 1,702.139.251
faces2 1,702.139.140
went4 3,404.268.527
them7 5,957.458.189
aboot10 8,510.648.021
pick2 1,702.137.818
his15 12,765.967.566
it25 21,276.607.430
flair2 1,702.137.158
morn2 1,702.137.095
fae9 7,659.577.034
since2 1,702.135.965
which3 2,553.195.888
cause2 1,702.135.834
chance2 1,702.135.687
there8 6,808.515.666
oot11 9,361.705.557
could4 3,404.265.397
two2 1,702.135.283
blue2 1,702.135.283
every2 1,702.135.212
turned2 1,702.134.877
skil3 2,553.19nan
just3 2,553.194.848
aff5 4,255.324.846
where2 1,702.134.722
try2 1,702.134.677
tryin2 1,702.134.589
lookin2 1,702.134.380
tweedle3 2,553.19nan
sat2 1,702.134.380
than4 3,404.264.002
dee2 1,702.133.955
when4 3,404.263.951
so4 3,404.263.860
well2 1,702.133.732
body2 1,702.133.596
came2 1,702.133.380
is3 2,553.193.219
men2 1,702.133.164
rain2 1,702.133.054
right2 1,702.132.956
hame3 2,553.192.793
year3 2,553.192.783
folk2 1,702.132.736
go2 1,702.132.522
booncers2 1,702.13nan
while2 1,702.132.406
fur6 5,106.382.231
up8 6,808.512.181
we2 1,702.132.037
bouncer2 1,702.13nan
how2 1,702.132.878
was3 2,553.191.987
s5 4,255.321.883
then3 2,553.191.804
wid3 2,553.191.716
as4 3,404.261.712
here3 2,553.191.630
re2 1,702.131.496
were3 2,553.191.472
aw5 4,255.321.452
afore3 2,553.191.396
been4 3,404.261.299
get3 2,553.191.132
but7 5,957.451.120
in23 19,574.471.117
even2 1,702.131.087
nicht2 1,702.130.928
muckle2 1,702.130.743
if3 2,553.190.714
be4 3,404.260.700
and10 8,510.640.607
at6 5,106.380.341
ye5 4,255.320.298
a37 31,489.360.182
that13 11,063.830.163
wi10 8,510.640.162
us2 1,702.130.135
auld2 1,702.130.123
wee2 1,702.130.093
doon3 2,553.190.079
tae24 20,425.530.064
aye2 1,702.130.044
nae3 2,553.190.037
by2 1,702.130.014
see2 1,702.130.008
no3 2,553.190.001
the68 57,872.340.000