A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Warwick, Matthew

Basic Stats

Total words by this author in corpus - 930
Total unique words used by this author in corpus - 397
Ratio of total words to unique words - 2.343
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - tha, a, an, ye, tae, o, wullie, they, at, feardie,

List of texts in corpus

Tha Feardie Geng hae tae bide at hame
Sensory Attachment Centre (13-05-2020) in Ulster (Ballymena) dialect (BUL), categorised as weans (531 words)

Fergie an Freens oan tha fairm
Ulster-Scots Community Network (2011 ) in Ulster (Rural Mid-Antrim) dialect (BUL), categorised as weans (399 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
tha72 77,419.35445.321
feardie9 9,677.42120.223
geng9 9,677.4286.865
wullie12 12,903.2378.179
docter6 6,451.6175.468
fergie7 7,526.8868.291
bug6 6,451.6160.052
sadie5 5,376.3459.800
saip4 4,301.0845.216
liz5 5,376.3442.690
toul5 5,376.3441.422
g4 4,301.0832.505
wash4 4,301.0829.375
weans5 5,376.3428.290
axed4 4,301.0827.571
ir6 6,451.6125.924
thaim9 9,677.4225.711
the'2 2,150.5424.328
hame8 8,602.1523.374
thon's3 3,225.8123.197
ye19 20,430.1122.964
lake3 3,225.8121.292
gye3 3,225.8121.132
wile3 3,225.8120.823
smit2 2,150.5420.514
aisy2 2,150.5420.514
bide5 5,376.3420.165
oan9 9,677.4220.086
loanen2 2,150.5419.779
thon7 7,526.8819.387
jock4 4,301.0818.464
hauns4 4,301.0817.907
smittal5 5,376.34nan
cannae5 5,376.3417.746
gether2 2,150.5417.350
hi2 2,150.5417.350
pepper2 2,150.5417.002
guid7 7,526.8815.703
crack3 3,225.8115.563
whut3 3,225.8115.443
bae3 3,225.8115.326
frae9 9,677.4214.657
reddin2 2,150.5414.247
cantie2 2,150.5414.247
forbye3 3,225.8113.856
thrie2 2,150.5413.781
wee9 9,677.4213.507
boady2 2,150.5412.871
reek2 2,150.5412.646
doag2 2,150.5411.947
why3 3,225.8111.922
risin2 2,150.5411.683
maks3 3,225.8111.500
gies3 3,225.8111.386
luk2 2,150.5411.356
need4 4,301.0811.159
bes2 2,150.5411.129
freens2 2,150.5411.056
dae6 6,451.6110.631
fur9 9,677.4210.580
ach2 2,150.5410.158
their7 7,526.8810.053
they11 11,827.969.787
yer7 7,526.889.507
naw3 3,225.818.840
in5 5,376.348.468
watter3 3,225.818.163
til4 4,301.088.110
dinnae3 3,225.817.597
yin4 4,301.087.549
gien3 3,225.817.531
hale3 3,225.817.518
buik2 2,150.546.920
ava2 2,150.546.738
siz3 3,225.81nan
hae7 7,526.886.628
caa'ed3 3,225.81nan
yersel2 2,150.545.914
it4 4,301.085.860
haein2 2,150.545.327
hoo2 2,150.544.549
gie3 3,225.814.216
comin2 2,150.544.119
it's3 3,225.814.020
that4 4,301.083.750
hoose3 3,225.813.487
at11 11,827.963.417
best2 2,150.543.387
cud2 2,150.543.270
fowk4 4,301.083.208
keep2 2,150.543.026
things2 2,150.542.857
then3 3,225.812.697
same2 2,150.542.536
wur2 2,150.542.510
ah2 2,150.542.411
o13 13,978.492.318
awa3 3,225.812.142
aa4 4,301.082.002
as3 3,225.811.558
aye3 3,225.811.465
wiz2 2,150.541.421
tak2 2,150.541.406
up6 6,451.611.359
sae3 3,225.811.248
noo3 3,225.811.136
heid2 2,150.540.991
oot6 6,451.610.974
haes2 2,150.540.965
bit4 4,301.080.910
lang2 2,150.540.896
tae16 17,204.300.869
she3 3,225.810.717
be3 3,225.810.693
wi5 5,376.340.613
s9 9,677.420.594
richt2 2,150.540.463
his4 4,301.080.366
ower3 3,225.810.363
said2 2,150.540.359
a30 32,258.060.265
aboot2 2,150.540.241
jist2 2,150.540.231
if2 2,150.540.230
totey2 2,150.54nan
by2 2,150.540.182
see2 2,150.540.161
him2 2,150.540.096
time2 2,150.540.069
gulders2 2,150.54nan
an27 29,032.260.382
da2 2,150.540.038
is6 6,451.610.012