{
  "DOI": "10.5281/zenodo.4422045",
  "abstract": "This repository contains the enrichments for the dataset The New York Times Annotated Corpus developed for the paper:\n\n\n\u201cMarco Ponza, Diego Ceccarelli, Paolo Ferragina, Edgar Meij, Sambhav Kothari. Contextualizing Trending Entities in News Stories. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining (WSDM 2021).\u201d\n\n\nIt includes a total of 149 trends constituted by 120K entities. The goal is to retrieve a set of entities ranked with respect to their usefulness in explaining why a given trending entity is actually trending.\n\n\nFormat\n\n\nThe repository contains the enrichments in JSON format.\n\n\nThe news stories of the New York Times from which these enrichments have been developed are available from LDC.\n\n\nData Splits\n\n\nWe perform two kinds of evaluation.\n\n\n\n\t\nUnsupervised evaluation, where we use the complete dataset of 149 trends as a benchmark.\n\t\nSupervised evaluation, where we train/tune our models on a training/development set and we test them on a test set.\n\n\n\n\n\t\nThe training set contains 50 trends constituted by 36.3K entities from 1996 to 2000.\n\t\nThe development set contains 34 trends constituted by 26.7K entities from 2000 to 2002.\n\t\nThe test set contains 65 trends constituted by 57K entities from 2002 to 2007.\n\n\n\nUse\n\n\nPlease cite the data set and the accompanying paper if you found the resources in this repository useful:\n\n\n@inproceedings{ponza2021,\n\u00a0\u00a0\u00a0\u00a0 Title = {Contextualizing Trending Entities in News Stories},\n\u00a0\u00a0\u00a0\u00a0 author = {Ponza, Marco and Ceccarelli, Diego and Ferragina, Paolo and Meij, Edgar and Kothari, Sambhav},\n\u00a0\u00a0\u00a0\u00a0 Booktitle = {Proceedings of the 14th ACM International Conference on Web Search and Data Mining},\n\u00a0\u00a0\u00a0\u00a0 Year = {2021},\n}",
  "author": [
    {
      "family": "Ponza",
      "given": "Marco"
    },
    {
      "family": "Ceccarelli",
      "given": "Diego"
    },
    {
      "family": "Ferragina",
      "given": "Paolo"
    },
    {
      "family": "Meij",
      "given": "Edgar"
    },
    {
      "family": "Kothari",
      "given": "Sambhav"
    }
  ],
  "event": "14th ACM International Conference on Web Search and Data Mining (WSDM 2021)",
  "id": "4422045",
  "issued": {
    "date-parts": [
      [
        "2021",
        "01",
        "06"
      ]
    ]
  },
  "language": "eng",
  "publisher": "Zenodo",
  "title": "Contextualizing Trending Entities in News Stories",
  "type": "dataset",
  "version": "v1"
}