Published November 28, 2022 | Version v1
Dataset Restricted

GGPONC 2.0 (Molecular Entities)

Description

A subset of GGPONC 2.0 (2,000 sentences), manually annotated with 822 Gene / Protein mentions by a single annotator.

The dev and test splits can be used to reproduce the results from "GGTWEAK: Gene Tagging with Weak Supervision for German Clinical Text".

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

Instructions

Please describe your research project briefly to gain access to GGPONC.

 

Terms of Use

  • GGPONC may be used for non-commercial research activities only.
  • GGPONC may not be distributed by the corpus users to any other third party (including any project collaborators). All prospective users of the corpus must apply for access individually.
  • The copyright of the corpus is protected in all parts. Any use beyond what copyright law permits is prohibited without written permission of the German Guideline Program in Oncology (GGPO). No part of the corpus may be reproduced in any form without prior written permission of the GGPO.
  • GGPONC is provided free of charge.
  • The corpus comes with absolutely no warranties including (but not limited to) the correctness of the information provided in the text corpus itself. The latest version of the clinical guidelines used for the corpus can be found at: https://www.leitlinienprogramm-onkologie.de/english-language/
  • Contributions which are based on the corpus must cite the following publication:


Additional details

Related works

Is published in
10.1007/978-3-031-34344-5_22 (DOI)

Dates

Available
2022-11-28
