hybrid-doc-relevance-training

Ravinder, Rohitha; Geist, Lukas; Quiñones Virgen, Nelson David; Müller, Bernd; Solanki, Dhwani; Rebholz-Schuhmann, Dietrich; Castro, Leyla Jael

doi:10.5281/zenodo.15094064

Published March 13, 2026 | Version v1.0.0

Software Open

hybrid-doc-relevance-training

1. ZB MED - Information Centre for Life Sciences

This software corresponds to Hybrid-doc-relevance-training which provides a set of hybrid embedding approaches for literature-based document-to-document similarity, leveraging the RELISH corpus along with integrating semantic understanding. It implements three distinct algorithms: pre-annotation, post-annotation, and post-reduction annotation, utilizing models such as Doc2Vec, Word2Vec, and FastText. The software combines these models with ontology-based background knowledge to improve document similarity, relevance, and recommendation. Detailed documentation is provided, including input data preprocessing and execution instructions for easy integration and use.

This work used deNBI resources and therefore was supported by the de.NBI Cloud within the German Network for Bioinformatics Infrastructure (de.NBI) and ELIXIR-DE (Forschungszentrum Jülich and W-de.NBI-001, W-de.NBI-004, W-de.NBI-008, W-de.NBI-010, W-de.NBI-013, W-de.NBI-014, W-de.NBI-016, W-de.NBI-022).

Files

hybrid-doc-relevance-training-1.0.0.zip

Files (7.7 MB)

Name	Size	Download all
hybrid-doc-relevance-training-1.0.0.tar.gz md5:4a7c688f3185bd040a1d9044510be5a0	3.8 MB	Download
hybrid-doc-relevance-training-1.0.0.zip md5:fb9fb7df1f821373db764d773f93b289	3.9 MB	Preview Download

Additional details

Deutsche Forschungsgemeinschaft
STELLA II (Infrastructure for Living Labs) 407518790
Deutsche Forschungsgemeinschaft
NFDI4DS - NFDI for Data Science and Artificial Intelligence 460234259

Repository URL: https://github.com/zbmed-semtec/hybrid-doc-relevance-training
Programming language: Python
Development Status: Active

	All versions	This version
Views	36	36
Downloads	2	2
Data volume	7.7 MB	7.7 MB

hybrid-doc-relevance-training

Authors/Creators

Description

Files

hybrid-doc-relevance-training-1.0.0.zip

Files (7.7 MB)

Additional details

Funding

Software