Published October 27, 2025 | Version v1.0.0
Dataset Open

Code and data for paper 'Fine-grained Named-Entity Recognition for the East-India Company domain'

  • 1. ROR icon Vrije Universiteit Amsterdam
  • 2. ROR icon Huygens Institute for History and Culture of the Netherlands
  • 3. Universiteit van Amsterdam

Description

Code and data accompanying the paper presented at the Computational Humanities Research Conference 2025. 

Citation of the paper, published in the Anthology of Computers and the Humanities (Vol. 3):

@article{10.63744@DRbhWNTzqNzR,
  title = {Fine-grained Named-Entity Recognition for the East-India Company domain},
  author = {Sophie Arnoult and Brecht Nijman and Leon van Wissen},
  year = {2025},
  journal = {Anthology of Computers and the Humanities},
  volume = {3},
  pages = {953--967},
  editor = {Taylor Arnold, Margherita Fantoli, and Ruben Ros},
  doi = {10.63744/DRbhWNTzqNzR}
}

Notes

If you use this dataset, please cite the accompanying paper. 

Files

globalise-huygens/finegrained-hist-ner-v1.0.0.zip

Files (1.1 MB)

Additional details

Related works

Is documented by
Conference paper: 10.63744/DRbhWNTzqNzR (DOI)
Is supplement to
Dataset: https://github.com/globalise-huygens/finegrained-hist-ner/tree/v1.0.0 (URL)