Skip to content

Style

The corpus can be investigated from a number of stylistical perspectives.

Vocabulary Richness (All Words)

The vocabulary richness of each text was calculated on the lemmatized work both with vanilla type-token ratio, but also with moving windows of size 10, 50, 500, and 1000.

Use of Terms on a Group and Individual Level

Vocabulary Richness (Noun, Adj, Verb)

The vocabulary richness of each fable was calculated on the lemmatized work both with vanilla type-token ratio, but also with moving windows of size 500 and 1000.

Use of Terms on a Group and Individual Level

Vocabulary Richness (Others)

The vocabulary richness of each fable was calculated on the lemmatized work both with vanilla type-token ratio, but also with moving windows of size 500 and 1000. POS-tags used were ADV, INTJ, ADP, CCONJ, SCONJ, DET, PART, and PRON.

Use of Terms on a Group and Individual Level

UPOS Tags

UPOS tags were tallied up in all texts without removal of any stop words or lemmatization.

Relative Frequencies of Nouns, Verbs and Adjectives in all Texts
Relative Frequencies of Nouns, Verbs and Adjectives in all Fables (3D)
Relative Frequencies of Function Word Categories
Relative Frequencies of Function Word Categories (Cconj, Sconj, Adv)
Relative Frequencies of Function Word Categories (Adv, Det, Part)
Wave Plot of UPOS Tag Distributions

The most frequent 3-grams of UPOS tags were also counted for each work.

Most Frequent N-grams of UPOS tags

The most frequent 5-grams of UPOS tags were also counted for each work.

Most Frequent N-grams of UPOS tags

The most frequent 7-grams of UPOS tags were also counted for each work.

Most Frequent N-grams of UPOS tags

Lengths

The length of texts (number of tokens), average length of tokens and mean sentence length were calculated for each work.

Lengths in all texts per Work

3d plot for exploration.

Lengths in all Texts per Group

καὶ

Number of tokens vs. occurrences of καὶ

The length of texts (number of tokens) and number of occurrences of καὶ were calculated for each work.

Lengths in all Texts per Group

καὶ Richness

Calculated on the lemmatized work both with vanilla καὶ-token ratio, but also with moving windows of size 10 and 50.

Use of καὶ on a Group and Individual Level

punct vs. punct + καὶ

Scatterplot of occurrences of punct vs. punct followed by καὶ