Published September 24, 2021 | Version 1.0.0
Presentation Open

A Hybrid Approach to Stanza Classification in Spanish Poetry

  • 1. UNED
  • 2. IE School of Human Sciences and Technology

Description

The creation and analysis of poetry have been commonly carried out by hand; with only a few computer-assisted approaches appearing over the years. In the Spanish context, the promise of machine learning is starting to pan out in specific tasks such as metrical annotation and rhythm extraction. Among the possible tasks that comprise the analysis of a poem, identifying the type of a stanza remains underexplored. The classification of the inner structures of verses in which a poem is built upon is an especially relevant task for poetry studies since it complements the structural information of a poem. In this work, we analyzed different computational approaches to stanza classification in the Spanish poetic tradition. We collected a corpus of 5005 stanzas of 46 different types, and created a baseline expert system on a set of rules defined by poetry scholars. We show that this task continues to be hard for computers systems even when leveraging the best performing embeddings. However, combining the knowledge of experts as prior to machine learning approaches yields rates of accuracy around 92%. We believe that this combination of approaches could improve many other tasks, as the rules that govern poetry are somewhat arbitrary and hard for computers to learn from examples.

Files

EADH2021 - Stanzas - presentation [20mins].pdf

Files (38.9 MB)

Name Size Download all
md5:8123938d34da81affb294ce6a4918c2d
34.5 MB Download
md5:fc54142b2e32b29caeb69fa3519c0ffa
1.3 MB Preview Download
md5:2ab033b26289962d371d87cd8abf2815
3.1 MB Download

Additional details

Funding

POSTDATA – Poetry Standardization and Linked Open Data 679528
European Commission