Presentation Open Access

A Hybrid Approach to Stanza Classification in Spanish Poetry

Javier de la Rosa; Álvaro Pérez Pozo; Laura Hernández; Mirella de Sisto; Salvador Ros; Elena Gonzálex-Blanco

The creation and analysis of poetry have been commonly carried out by hand; with only a few computer-assisted approaches appearing over the years. In the Spanish context, the promise of machine learning is starting to pan out in specific tasks such as metrical annotation and rhythm extraction. Among the possible tasks that comprise the analysis of a poem, identifying the type of a stanza remains underexplored. The classification of the inner structures of verses in which a poem is built upon is an especially relevant task for poetry studies since it complements the structural information of a poem. In this work, we analyzed different computational approaches to stanza classification in the Spanish poetic tradition. We collected a corpus of 5005 stanzas of 46 different types, and created a baseline expert system on a set of rules defined by poetry scholars. We show that this task continues to be hard for computers systems even when leveraging the best performing embeddings. However, combining the knowledge of experts as prior to machine learning approaches yields rates of accuracy around 92%. We believe that this combination of approaches could improve many other tasks, as the rules that govern poetry are somewhat arbitrary and hard for computers to learn from examples.

Files (38.9 MB)
Name Size
EADH2021 - Stanzas - presentation [20mins].mkv
34.5 MB Download
EADH2021 - Stanzas - presentation [20mins].pdf
1.3 MB Download
EADH2021 - Stanzas - presentation [20mins].pptx
3.1 MB Download
All versions This version
Views 8989
Downloads 4343
Data volume 98.2 MB98.2 MB
Unique views 8080
Unique downloads 3636


Cite as