A Hybrid Approach to Stanza Classification in Spanish Poetry

doi:10.5281/zenodo.5525767

Published September 24, 2021 | Version 1.0.0

Presentation Open


1. UNED
2. IE School of Human Sciences and Technology

The creation and analysis of poetry have commonly been carried out by hand, with only a few computer-assisted approaches appearing over the years. In the Spanish context, the promise of machine learning is starting to deliver in specific tasks such as metrical annotation and rhythm extraction. Among the possible tasks that make up the analysis of a poem, identifying the type of a stanza remains underexplored. Classifying the inner structures of verses upon which a poem is built is an especially relevant task for poetry studies, since it complements the structural information of a poem. In this work, we analyzed different computational approaches to stanza classification in the Spanish poetic tradition. We collected a corpus of 5005 stanzas of 46 different types, and created a baseline expert system based on a set of rules defined by poetry scholars. We show that this task remains hard for computer systems even when leveraging the best-performing embeddings. However, using the knowledge of experts as a prior for machine learning approaches yields accuracy of around 92%. We believe that this combination of approaches could improve many other tasks, as the rules that govern poetry are somewhat arbitrary and hard for computers to learn from examples.
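The idea of using expert rules as a prior for a learned classifier can be illustrated with a minimal sketch. This is not the authors' system: the rule table, the function names (rule_candidates, hybrid_predict), and the probability values are all hypothetical, and the rules are reduced to line count and rhyme scheme only. It assumes the rules narrow down the candidate stanza types and a model's scores pick among the survivors.

```python
from typing import Dict, Set, Tuple

# Hypothetical toy rule table: stanza type -> (number of lines, rhyme scheme).
# Real scholarly rules also consider metre, rhyme quality, and more.
TOY_RULES: Dict[str, Tuple[int, str]] = {
    "cuarteto": (4, "abba"),
    "redondilla": (4, "abba"),
    "serventesio": (4, "abab"),
    "cuarteta": (4, "abab"),
    "pareado": (2, "aa"),
}


def rule_candidates(n_lines: int, rhyme: str) -> Set[str]:
    """Return every stanza type whose toy rule matches the observed structure."""
    return {
        name
        for name, (k, scheme) in TOY_RULES.items()
        if k == n_lines and scheme == rhyme.lower()
    }


def hybrid_predict(model_probs: Dict[str, float], candidates: Set[str]) -> str:
    """Pick the most probable type among those the rules allow.

    Falls back to the model alone when the rules exclude every type
    the model scored (or when no rule matched at all).
    """
    allowed = {t: p for t, p in model_probs.items() if t in candidates}
    if not allowed or sum(allowed.values()) == 0:
        return max(model_probs, key=lambda t: model_probs[t])
    return max(allowed, key=lambda t: allowed[t])


if __name__ == "__main__":
    # Made-up scores standing in for the output of any trained classifier.
    probs = {"serventesio": 0.5, "cuarteto": 0.3, "redondilla": 0.2}
    print(hybrid_predict(probs, rule_candidates(4, "ABBA")))
```

In this sketch the rules cannot separate cuarteto from redondilla (both are four lines rhyming abba), so the model's scores decide between them, while the rules rule out serventesio even though the model alone would have preferred it.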

Files (38.9 MB)

Name	MD5	Size
EADH2021 - Stanzas - presentation [20mins].mkv	8123938d34da81affb294ce6a4918c2d	34.5 MB
EADH2021 - Stanzas - presentation [20mins].pdf	fc54142b2e32b29caeb69fa3519c0ffa	1.3 MB
EADH2021 - Stanzas - presentation [20mins].pptx	2ab033b26289962d371d87cd8abf2815	3.1 MB

Additional details

Funding: European Commission, POSTDATA – Poetry Standardization and Linked Open Data (grant 679528)

