LLMs4OL 2024 Datasets: Toward Ontology Learning with Large Language Models

Babaei Giglou, Hamed; D'Souza, Jennifer; Sadruddin, Sameer; Auer, Sören

doi:10.5281/zenodo.13851373

Published September 27, 2024 | Version v1

Dataset Open

1. Technische Informationsbibliothek (TIB)

Ontology learning (OL) from unstructured data has evolved significantly, with recent work integrating large language models (LLMs) to enhance various aspects of the process. The LLMs4OL 2024 datasets were developed to benchmark and advance research in OL using LLMs. A key component of the LLMs4OL Challenge, the collection targets three primary OL tasks: Term Typing, Taxonomy Discovery, and Non-Taxonomic Relation Extraction. It encompasses seven domains, including lexosemantics and biological functions, offering a comprehensive resource for evaluating LLM-based OL approaches. Each task is designed to support both Few-Shot (FS) and Zero-Shot (ZS) evaluation scenarios, so that model performance can be assessed across different knowledge domains. The datasets thereby address a critical gap in the field by offering standardized benchmarks for the fair comparison of LLM applications in OL.

Files

Files (123.8 MB)

Name	Size	Checksum
LLMs4OL 2024 Challenge Datasets.zip	123.8 MB	md5:3efc9ede2fb08af3b876a115047c5098
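The MD5 value listed for the archive can be used to confirm that a download arrived intact. The following is a minimal sketch, assuming the archive was saved under its listed name in the current working directory; the function names md5_of_file and matches_listed_checksum are hypothetical helpers introduced here for illustration, not part of the dataset or its repository.

```python
import hashlib
from pathlib import Path

# MD5 value copied from the file listing on this record.
EXPECTED_MD5 = "3efc9ede2fb08af3b876a115047c5098"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that a large archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected hex string (case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path: the archive saved under its listed name.
    archive = Path("LLMs4OL 2024 Challenge Datasets.zip")
    if archive.exists():
        print("checksum OK" if matches_listed_checksum(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually points to an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.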

Additional details

Repository URL: https://github.com/HamedBabaei/LLMs4OL-Challenge-ISWC2024

	All versions	This version
Views	736	736
Downloads	103	103
Data volume	14.7 GB	14.7 GB
