Style

The corpus can be investigated from a number of stylistic perspectives.

Vocabulary Richness

The vocabulary richness of each fable was calculated on the lemmatized text, both as a plain type-token ratio and as a moving-window type-token ratio with window sizes of 10 and 50.
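The two measures can be sketched as follows. This is a minimal illustration, not the code used for the corpus: the function names are hypothetical, the moving-window variant is assumed to be the mean of the type-token ratios of all contiguous windows, and the fallback for texts shorter than the window is likewise an assumption.

```python
from typing import Sequence


def type_token_ratio(lemmata: Sequence[str]) -> float:
    """Plain type-token ratio: distinct lemmata divided by total lemmata."""
    if not lemmata:
        return 0.0
    return len(set(lemmata)) / len(lemmata)


def moving_window_ttr(lemmata: Sequence[str], window: int) -> float:
    """Mean type-token ratio over every contiguous window of `window` lemmata."""
    if window <= 0:
        raise ValueError("window must be positive")
    if len(lemmata) < window:
        # Assumption: a text shorter than the window falls back to plain TTR.
        return type_token_ratio(lemmata)
    ratios = [
        type_token_ratio(lemmata[i:i + window])
        for i in range(len(lemmata) - window + 1)
    ]
    return sum(ratios) / len(ratios)
```

Averaging over windows makes the score less dependent on text length than the plain ratio, which tends to fall as a text gets longer.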

Use of Terms on a Group and Individual Level

UPOS Tags

UPOS tags were tallied across all fables, without stop-word removal or lemmatization.
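A tally of this kind can be sketched with a counter over (token, tag) pairs. The input format and the function name are assumptions made for illustration; the tagger that produced the UPOS tags is not specified here.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def upos_relative_frequencies(
    tagged_tokens: Iterable[Tuple[str, str]]
) -> Dict[str, float]:
    """Return the share of each UPOS tag among all tagged tokens.

    `tagged_tokens` is a hypothetical list of (token, upos) pairs; every
    token is counted, so stop words and inflected forms are kept as-is.
    """
    counts = Counter(upos for _, upos in tagged_tokens)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tag: n / total for tag, n in counts.items()}
```

Relative rather than absolute counts keep fables of different lengths comparable.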

Relative Frequencies of Nouns, Verbs and Adjectives in all Fables
Relative Frequencies of Function Word Categories
Wave Plot of UPOS Tag Distributions

The most frequent 2- to 4-grams of UPOS tags were also counted for each work.
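Counting tag n-grams amounts to sliding a window of length n over the tag sequence of a work. The sketch below assumes the tags are available as a flat list per work and that n-grams may span sentence boundaries; both the function names and the `top` cut-off are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Sequence, Tuple


def upos_ngrams(tags: Sequence[str], n: int) -> Counter:
    """Count every contiguous n-gram of UPOS tags in `tags`."""
    return Counter(tuple(tags[i:i + n]) for i in range(len(tags) - n + 1))


def most_frequent_upos_ngrams(
    tags: Sequence[str], n_min: int = 2, n_max: int = 4, top: int = 5
) -> Dict[int, List[Tuple[Tuple[str, ...], int]]]:
    """Return the `top` most frequent n-grams for each n in n_min..n_max."""
    return {
        n: upos_ngrams(tags, n).most_common(top)
        for n in range(n_min, n_max + 1)
    }
```

If n-grams should not cross sentence boundaries, the same functions can be applied per sentence and the counters summed.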

Most Frequent N-grams of UPOS tags

The most frequent 4-grams of UPOS tags were also counted for each work.

Most Frequent N-grams of UPOS tags

Lengths

The length of each fable (number of tokens), the average token length, and the mean sentence length were calculated for each work. Sentences were created by splitting the texts on sentence-final punctuation, defined as full stops (.) and Greek question marks (;). Commas and elevated dots were not treated as sentence boundaries. Metrical feet, metre, and stanzas were not taken into account.
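The three length measures can be sketched as below. Several details are assumptions rather than documented choices: tokens are taken to be whitespace-separated strings with surrounding punctuation stripped, token length is measured in characters, sentence length is measured in tokens, and both the ASCII semicolon and U+037E are accepted as the Greek question mark.

```python
import re
from typing import Dict, List

# Sentence boundaries: full stop, ASCII semicolon, Greek question mark (U+037E).
SENTENCE_END = re.compile("[.;\u037e]")
# Punctuation stripped from token edges, including comma and elevated dots
# (U+00B7, U+0387), which do not end a sentence.
STRIP_CHARS = ".,;\u037e\u00b7\u0387"


def split_sentences(text: str) -> List[str]:
    """Split on full stops and Greek question marks only."""
    return [s.strip() for s in SENTENCE_END.split(text) if s.strip()]


def tokenize(text: str) -> List[str]:
    """Whitespace tokenization with edge punctuation removed (an assumption)."""
    tokens = (t.strip(STRIP_CHARS) for t in text.split())
    return [t for t in tokens if t]


def length_stats(text: str) -> Dict[str, float]:
    """Return token count, mean token length, and mean sentence length."""
    tokens = tokenize(text)
    sentences = split_sentences(text)
    return {
        "length": len(tokens),
        "mean_token_length": (
            sum(len(t) for t in tokens) / len(tokens) if tokens else 0.0
        ),
        "mean_sentence_length": (
            len(tokens) / len(sentences) if sentences else 0.0
        ),
    }
```

For polytonic Greek, normalizing the text to NFC first keeps character counts consistent between precomposed and combining forms.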

Lengths in all Fables per Group
Total number of words (length), number of unique words (n_types), and number of unique lemmata (n_lemmata) in all Fables per Group

Vocabulary Richness (Noun, Adj, Verb)

The vocabulary richness of each fable was calculated on the lemmatized text, both as a plain type-token ratio and as a moving-window type-token ratio with window sizes of 10 and 50.
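Going by the section title, the measure here is presumably restricted to nouns, adjectives, and verbs. One way to sketch that is to group lemmata by UPOS tag and compute a ratio per category; the grouping, the input format, and the function name are assumptions, and only the plain ratio is shown.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Sequence, Tuple

CONTENT_UPOS = ("NOUN", "ADJ", "VERB")


def ttr_by_upos(
    lemma_tag_pairs: Iterable[Tuple[str, str]],
    categories: Sequence[str] = CONTENT_UPOS,
) -> Dict[str, float]:
    """Plain type-token ratio computed separately for each UPOS category.

    `lemma_tag_pairs` is a hypothetical list of (lemma, upos) pairs;
    categories without any tokens are left out of the result.
    """
    by_tag: Dict[str, List[str]] = defaultdict(list)
    for lemma, upos in lemma_tag_pairs:
        if upos in categories:
            by_tag[upos].append(lemma)
    return {
        tag: len(set(lemmata)) / len(lemmata)
        for tag, lemmata in by_tag.items()
    }
```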

Use of Terms on a Group and Individual Level