Published April 13, 2021 | Version 2
Dataset Open

Parish Memories with Named Entities

  • 1. University of Évora, CIDEHUS
  • 2. Polytechnics of Portalegre, CIDEHUS
  • 3. University of Évora, Dept. of Informatics

Description

The Parish Memories with Named Entities dataset consists of 366 transcribed texts from the original handwritten collection, the Parish Memories (1758-1761), where each text contains the description of a  Portuguese parish. The data consists of a) the transcribed texts, b) the texts annotated with person, location and organisation named entities,  c) the list of the extracted entities for each text, and d) a global list with all entities from the collection (all lists have frequency counts). The annotation was done automatically. Related publicationVieira, R., Olival, F., Cameron, H. F., Santos, J., Sequeira, O., & Santos, I. (2021). Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data7, 20. DOI: http://doi.org/10.5334/johd.43

 

Notes

Please Cite: Vieira, R., Olival, F., Cameron, H. F., Santos, J., Sequeira, O., & Santos, I. (2021). Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, 20. DOI: http://doi.org/10.5334/johd.43

Files

ParishMemoriesWithNE.zip

Files (13.7 MB)

Name Size Download all
md5:a0d0cd2490f39a47e197d405759374d7
6.7 MB Preview Download
md5:a640648d267edd0b9c7a68d2a2681f76
6.9 MB Preview Download

Additional details

References

  • Vieira, R., Olival, F., Cameron, H. F., Santos, J., Sequeira, O., & Santos, I. (2021). Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, 20. DOI: http://doi.org/10.5334/johd.43