Published September 25, 2020 | Version v1
Journal article Open

Rantanplan: Fast and Accurate Syllabification and Scansion of Spanish Poetry

Description

Automated analysis of Spanish poetry corpora lacks the richness of tools available for English. The existing options suffer from a number of issues: are limited to fixed-metre hendecasyllabic verses, are not publicly available, the syllabification procedure underneath is not thoroughly tested, and their speed is questionable. This paper introduces new methods to alleviate these concerns. For syllabification, we contribute with our own method and manually crafted corpus. For scansion, our approach is based on a heuristic for the application of rhetorical figures that alter metrical length. Experimental evaluation shows that both fixed-metre and mixed-metre poetry can be successfully analyzed, producing metrical patterns more accurately (increasing accuracy by 2% and 15%, respectively), and at a fraction of the time other methods need (running at least 100 times faster).

Files

SEPLN2020.pdf

Files (248.3 kB)

Name Size Download all
md5:c4fd887f3da9920519585f5e3c957302
248.3 kB Preview Download

Additional details

Funding

POSTDATA – Poetry Standardization and Linked Open Data 679528
European Commission