Published January 27, 2025
| Version v2
Dataset
Open
Input data and outputs for the OCALM project
Description
This repository contain a zip file with input and output data for an experiment with OCALM (https://github.com/fanavarro/ocalm):
- input
- ontologies: ontologies used as input (FoodOn, LKIF, GeneOntology), together with their normalized form.
- text
- food_text: natural language text corpus about food, including the original and the processed text.
- gene_text: natural language text corpus about genetics, including the original and the processed text.
- legal_text: natural language text corpus about legal topics, including the original and the processed text.
- results: the results derived from comparing each ontology with each natural language text corpus by using OCALM.
- NCBO_Recommender_results: the results of the NCBO Recommender with the same experiment performed by OCALM.
- analysis.R: R script to get figures summarizing the results.
The SNOMED ontology and the medical text corpus used for input were not included due to licensing issues; however, the results are included in this repository.
Files
OntologyCoverageData.zip
Files
(26.2 MB)
Name | Size | Download all |
---|---|---|
md5:819402fb98754846a10de8f7e4094f26
|
26.2 MB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/fanavarro/ocalm
- Programming language
- Python