Published February 2026 | Version v1
Dataset Open

CPL-Gold-Entity-Link: annotated entity linking corpus between publications and executable code for Nextflow workflows

  • 1. Université Paris-Saclay, CNRS, LISN, 91400, Orsay, France
  • 2. Université Paris-Saclay, CEA, Institut LIST, 91191, Gif-sur-Yvette, France

Description

CPL-Gold-Entity-Link is an annotated entity linking corpus between publications and executable code for Nextflow workflows. The annotations are provided in YAML format.

This corpus comprises 15 workflows extracted from CPL-Article (https://doi.org/10.5281/zenodo.18526700) and CPL-Code (https://doi.org/10.5281/zenodo.18526760). It represents 190 potential cross-model links.
 

Repository organisation

This repository contains YAML files. Each workflow has two associated files: one for the article and one for the code.
Each file lists the bioinformatics tools mentioned in its source, together with their potential links in the associated code and any alternative formulations of the same tool within that source.
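
As a quick way to inspect this layout, the sketch below lists the YAML files stored in the archive using only the Python standard library. The function name list_yaml_members is hypothetical, and the accepted extensions (.yaml and .yml) are an assumption, since the record does not state the file naming convention.

```python
import zipfile
from pathlib import PurePosixPath


def list_yaml_members(archive_path):
    """Return the sorted names of YAML files stored in a zip archive.

    Assumption: YAML files end in .yaml or .yml (case-insensitive);
    the dataset record does not document its naming convention.
    """
    with zipfile.ZipFile(archive_path) as archive:
        return sorted(
            name
            for name in archive.namelist()
            if PurePosixPath(name).suffix.lower() in {".yaml", ".yml"}
        )
```

Calling list_yaml_members("CPL-Gold-Entity-Link.zip") on the downloaded archive should return 30 names if every one of the 15 workflows ships exactly two YAML files, as the description suggests. The internal structure of each file is not documented on this page, so it is best explored with a YAML parser after extraction.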

Contact


  • Clémence Sebe, clemence.sebe@universite-paris-saclay.fr

Funding


This work was supported by the French National Research Agency (ANR) under the France 2030 program, reference ANR-22-PESN-0007.

Files

CPL-Gold-Entity-Link.zip (15.0 kB)
md5:8488b65382db2236ca4d1b1b7c814ab2
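
The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch using Python's hashlib; the function names md5_of_file and matches_published_checksum are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib

# Checksum published on this record for CPL-Gold-Entity-Link.zip.
EXPECTED_MD5 = "8488b65382db2236ca4d1b1b7c814ab2"


def md5_of_file(path, chunk_size=65536):
    """Compute the hexadecimal MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_checksum(path, expected=EXPECTED_MD5):
    """Return True when the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, matches_published_checksum("CPL-Gold-Entity-Link.zip") returns True only when the local copy is byte-for-byte identical to the published file. MD5 is suitable here for detecting corrupted downloads, not as a security guarantee.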