Published February 2026
| Version v1
Dataset
Open
CPL-Gold-Entity-Link: annotated entity linking corpus between publications and executable code for Nextflow workflows
Authors/Creators
- 1. Université Paris-Saclay, CNRS, LISN, 91400, Orsay, France
- 2. Université Paris-Saclay, CEA, Institut LIST, 91191, Gif-sur-Yvette, France
Description
CPL-Gold-Entity-Link is an annotated entity linking corpus between publications and executable code for Nextflow workflows. These annotations are available in the YAML format.
This corpus is composed of 15 workflows extracted from CPL-Article (https://doi.org/10.5281/zenodo.18526700) and CPL-Code (https://doi.org/10.5281/zenodo.18526760). 190 potentials cross-model links are represented.
Repository organisation
This repository contains YAML files. For each workflow, two files are associated: one corresponding to the article and one to the code.
Each file includes a list of the bioinformatics tools mentioned, along with their potential links in the associated code and possible alternative tool formulations within the same source.
Contact
- Clémence Sebe, clemence.sebe@universite-paris-saclay.fr
Funding
This work received support from the National Research Agency under the France 2030 program, with reference to ANR-22-PESN-0007.
Files
CPL-Gold-Entity-Link.zip
Files
(15.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:8488b65382db2236ca4d1b1b7c814ab2
|
15.0 kB | Preview Download |