A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Dee Har

Basic Stats

Total words by this author in corpus - 109
Total unique words used by this author in corpus - 76
Ratio of total words to unique words - 1.434
Tagged as GUL (General Ulster) dialect.
Top ten most common words - tha, as, a, tae, the, in, an, troot, he, fer,

List of texts in corpus

Tha Broon Troot
Facebook (2020-03-07) in Ulster dialect (GUL), categorised as poetry (109 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
tha8 73,394.5048.584
troot3 27,522.9435.656
flea2 18,348.6229.115
fer2 18,348.6212.504
as5 45,871.5611.640
get2 18,348.626.936
oan2 18,348.626.786
doon2 18,348.625.058
him2 18,348.624.339
the4 36,697.251.021
tae4 36,697.250.992
he2 18,348.620.880
in3 27,522.940.822
a4 36,697.250.191
an3 27,522.940.013