{ "access": { "embargo": { "active": false, "reason": null }, "files": "public", "record": "public", "status": "open" }, "created": "2021-03-02T17:36:15.709512+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "files": { "count": 1, "enabled": true, "entries": { "v3.1-Guidelines-NewsEye-NER-EL-StD.pdf": { "checksum": "md5:6e0452aa0969a509c7de06f75b3f7c41", "ext": "pdf", "id": "177b8fe2-1245-425d-be4b-5ee3b10a52ff", "key": "v3.1-Guidelines-NewsEye-NER-EL-StD.pdf", "metadata": null, "mimetype": "application/pdf", "size": 310532 } }, "order": [], "total_bytes": 310532 }, "id": "4574199", "is_draft": false, "is_published": true, "links": { "access": "https://zenodo.org/api/records/4574199/access", "access_links": "https://zenodo.org/api/records/4574199/access/links", "access_request": "https://zenodo.org/api/records/4574199/access/request", "access_users": "https://zenodo.org/api/records/4574199/access/users", "archive": "https://zenodo.org/api/records/4574199/files-archive", "archive_media": "https://zenodo.org/api/records/4574199/media-files-archive", "communities": "https://zenodo.org/api/records/4574199/communities", "communities-suggestions": "https://zenodo.org/api/records/4574199/communities-suggestions", "doi": "https://doi.org/10.5281/zenodo.4574199", "draft": "https://zenodo.org/api/records/4574199/draft", "files": "https://zenodo.org/api/records/4574199/files", "latest": "https://zenodo.org/api/records/4574199/versions/latest", "latest_html": "https://zenodo.org/records/4574199/latest", "media_files": "https://zenodo.org/api/records/4574199/media-files", "parent": "https://zenodo.org/api/records/4574198", "parent_doi": "https://zenodo.org/doi/10.5281/zenodo.4574198", "parent_html": "https://zenodo.org/records/4574198", "requests": "https://zenodo.org/api/records/4574199/requests", "reserve_doi": "https://zenodo.org/api/records/4574199/draft/pids/doi", "self": "https://zenodo.org/api/records/4574199", "self_doi": "https://zenodo.org/doi/10.5281/zenodo.4574199", "self_html": "https://zenodo.org/records/4574199", "self_iiif_manifest": "https://zenodo.org/api/iiif/record:4574199/manifest", "self_iiif_sequence": "https://zenodo.org/api/iiif/record:4574199/sequence/default", "versions": "https://zenodo.org/api/records/4574199/versions" }, "media_files": { "count": 0, "enabled": false, "entries": {}, "order": [], "total_bytes": 0 }, "metadata": { "creators": [ { "affiliations": [ { "name": "University of La Rochelle" } ], "person_or_org": { "family_name": "Ahmed Hamdi", "identifiers": [ { "identifier": "0000-0002-8964-2135", "scheme": "orcid" } ], "name": "Ahmed Hamdi", "type": "personal" } }, { "affiliations": [ { "name": "University of La Rochelle" } ], "person_or_org": { "family_name": "Elvys Linhares Pontes", "name": "Elvys Linhares Pontes", "type": "personal" } }, { "affiliations": [ { "name": "University of La Rochelle" } ], "person_or_org": { "family_name": "Antoine Doucet", "name": "Antoine Doucet", "type": "personal" } } ], "description": "
We describe the NewsEye annotation guidelines for named entity recognition, entity linking and stance detection.
\nWhile the part of the guidelines on stance detection annotation is new, these guidelines are derived from Impresso NE annotation guidelines which are derived from Quaero guidelines. Originally designed for the annotation of “extended” named entities (i.e. more than the 3 or 4 traditional classes) in French speech transcriptions, Quaero guidelines have furthermore been used on historic press corpora. Impresso guidelines main’s difference with respect to Quaero’s is reduction: only a subset of Quaero entity types and components are considered, as well as a subset of linguistic units eligible as named entities. These adaptations result from what we deemed most relevant to annotate in our context, and from time and resource constraints. Despite these adaptations, impresso annotated corpora will mostly remain compatible with Quaero guidelines.
\n\n
These guidelines allowed building a multilingual dataset for named entity recognition, entity linking and stance detection in historical newpapers in French, German, Finnish and Swedish. The paper describing the guidelines and the dataset is available here. If you end up using the guidelines or the resource, please cite this paper:
\n\n@inproceedings{Hamdi2021multilingual,\n title={A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers},\n author={Hamdi, Ahmed and Boro{\\c{s}}, Emanuela and Pontes, Elvys Linhares and Nguyen, Thi Tuyet Hai and Hackl, G{\\\"u}nter and Moreno, Jose G and Doucet, Antoine},\n booktitle={Proceedings of the 44rd International ACM SIGIR Conference on Research and Development in Information Retrieval},\n year={2021}\n}
\n\nAcknowledgments
\nThis work has been supported by the European Union's Horizon 2020 research and innovation programme under grant 770299 [NewsEye](https://www.newseye.eu/).
all outputs related to the newseye projects, including entries from internal and external (project team members) alike.
\r\n", "page": "NewsEye, funded by the European Union’s Horizon 2020 research and innovation programme, is a research project advancing the state of the art and introducing new concepts, methods and tools for digital humanities by providing enhanced access to historical newspapers for a wide range of users. With the tools and methods created by Newseye, crucial user groups will be able to investigate views and perspectives on historical events and development and, as a consequence, the project will change the way European digital heritage data is (re)searched, accessed, used and analysed.
", "title": "NewsEye" }, "revision_id": 0, "slug": "newseye", "updated": "2019-12-16T10:55:58.970031+00:00" }, { "access": { "member_policy": "open", "members_visibility": "restricted", "record_policy": "open", "review_policy": "closed", "visibility": "public" }, "children": { "allow": true }, "created": "2022-11-23T15:53:29.436323+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "id": "f0a8b890-f97a-4eb2-9eac-8b8a712d3a6c", "links": {}, "metadata": { "curation_policy": "The EU Open Research Repository serves as a repository for research outputs (data, software, posters, presentations, publications, etc) which have been funded under an EU research funding programme such as Horizon Europe, Euratom or earlier Framework Programmes.
\nThe community is managed by CERN on behalf of the European Commission.
\nZenodo’s general policies and Terms of Use apply to all content.
\nThe EU Open Research Repository accepts all digital research objects which is a research output stemming from one of EU’s research and innovation funding programmes. The funding programmes currently include:
\nHorizon Europe (including ERC, MSCA), earlier Framework Programmes (eg Horizon 2020) as well as Euratom.
\nIn line with the principle as open as possible, as closed as necessary both public and restricted content is accepted. See note on how Zenodo handles restricted content.
\nEU programme beneficiaries are eligible to submit content to the community. The community supports three types of content submissions:
\nSubmission via an EU Project Community (through user interface or programmatic APIs).
\nSubmission directly to the EU Open Research Repository.
\nAutomated harvesting from existing Zenodo content.
\nA representative of an EU project may request an EU Project Community and invite other project participants as members of the community. The project community is linked to one or more European Commission grants. All records in the project community are automatically integrated into the EU Open Research Repository immediately upon acceptance into the project community.
\nAny user may submit a record directly to the EU Open Research Repository. The submission will be moderated by Zenodo staff for compliance with the minimal required metadata requirements and its correctness.
\nRecords found among Zenodo’s existing content will on a regular basis automatically be integrated if they are found to comply with the requirements. The submissions through this method are integrated into the EU Open Research Repository with delay in a fully automated way.
\nRecords in the EU Open Research Repository are required to comply with the following minimal metadata requirements:
\nVisibility: Both public and restricted (with or without embargo and/or access request)
\nResource types: All resource types.
\nLicenses: Public and embargoed records MUST specify a license.
\nFunding information: Records MUST specify at least one grant from the European Commission.
\nCreators: Creators SHOULD be identified with a persistent identifier (e.g. ORCID, GND, …), and affiliations SHOULD be identified with a persistent identifier (e.g. ROR, ISNI, …)
\nSubjects: Records SHOULD specify one or more fields of science from the European Science Vocabulary.
\nAll submissions will undergo automated curation checks for compliance with the policy. Submissions through project communities are reviewed by the project community. Submission directly to the EU Open Research Repository is reviewed by Zenodo staff.
\nCommunity curators may at any point edit metadata of the records in the community without notice through human or automated processing. The curators may at their sole discretion remove records from the community that are deemed not to comply with the content and curation policy or which are deemed of insufficient quality.
\nThe content and curation policy is subject to change by the community owner at any time and without notice, other than through updating this page.
", "description": "Open repository for EU-funded research outputs from Horizon Europe, Euratom and earlier Framework Programmes.", "organizations": [ { "id": "00k4n6c32" } ], "page": "The EU Open Research Repository is a Zenodo-community dedicated to fostering open science and enhancing the visibility and accessibility of research outputs funded by the European Union. The community is managed by CERN on behalf of the European Commission.
\nThe mission of the repository is to support the implementation of the EU's open science policy, providing a trusted and comprehensive space for researchers to share their research outputs such as data, software, reports, presentations, posters and more. The EU Open Research Repository simplifies the process of complying with open science requirements, ensuring that research outputs from Horizon Europe, Euratom, and earlier Framework Programmes are freely accessible, thereby accelerating scientific discovery and innovation.
\nThe EU Open Research Repository serves as a complementary platform to the Open Research Europe (ORE) publishing platform. Open Research Europe focuses on providing a publishing venue for peer-reviewed articles, ensuring that research meets rigorous academic standards. The EU Open Research Repository provides a space for all the other research outputs including data sets, software, posters, and presentations that are out of scope for ORE. This holistic approach enables researchers to not only publish their findings but also share the underlying data and materials that support their work, fostering transparency and reproducibility in the scientific process.
\nCurrently in its pilot phase and set to be fully operational during autumn 2024, the EU Open Research Repository is constantly evolving. Efforts are committed to integrating cutting-edge features, including automated curation checks and FAIR (Findable, Accessible, Interoperable, and Reusable) assistance, to further support the research community. The goal is to provide researchers with a simple goto solution for making their publicly funded research open and as FAIR as possible.
\nThe EU Open Research Repository is funded by the European Union under grant agreement no. 101122956(HORIZON-ZEN). For more information about the project see https://about.zenodo.org/projects/horizon-zen/.
", "title": "EU Open Research Repository (Pilot)", "type": { "id": "organization" }, "website": "https://research-and-innovation.ec.europa.eu" }, "revision_id": 16, "slug": "eu", "theme": { "brand": "horizon", "enabled": true, "style": { "font": { "family": "Arial, sans-serif", "size": "16px", "weight": 600 }, "mainHeaderBackgroundColor": "#FFFFFF", "primaryColor": "#004494", "primaryTextColor": "#FFFFFF", "secondaryColor": "#FFD617", "secondaryTextColor": "#000000", "tertiaryColor": "#e3eefd", "tertiaryTextColor": "#1c5694" } }, "updated": "2024-03-20T06:47:47.577483+00:00" } ], "ids": [ "62edd9a9-6173-4379-9408-21a4bff4269f", "f0a8b890-f97a-4eb2-9eac-8b8a712d3a6c" ] }, "id": "4574198", "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.4574198", "provider": "datacite" } } }, "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.4574199", "provider": "datacite" }, "oai": { "identifier": "oai:zenodo.org:4574199", "provider": "oai" } }, "revision_id": 5, "stats": { "all_versions": { "data_volume": 189424520.0, "downloads": 610, "unique_downloads": 547, "unique_views": 687, "views": 749 }, "this_version": { "data_volume": 188803456.0, "downloads": 608, "unique_downloads": 545, "unique_views": 679, "views": 741 } }, "status": "published", "updated": "2021-05-25T12:35:41.763481+00:00", "versions": { "index": 1, "is_latest": true } }