Published June 16, 2023
| Version v1
Dataset
Open
A small dataset of short abstract from Wikipedia
Authors/Creators
- 1. University of Cagliari
- 2. GREYC- Universit ́e Caen Normandie
- 3. FIZ Karlsruhe – Leibniz Institute for Information Infrastructure, Germany
- 4. TIB Leibniz Information Centre for Science and Technology, Hannover, Germany
- 5. German Federal Institute for Risk Assessment, Germany
- 6. KU Leuven – Flanders Make@KULeuven – Leuven.AI, Leuven, Belgium
- 7. Research Manager, Group Head (Knowledge Technologies Group, Cefriel, Milano, Italy)
Description
This dataset has been collected during the ISWS2023 PhD School of Semantic Web at Bertinoro for the Project Work of the Dragon Team Research Group.
It contains a collection of 10 short abstract taken from Wikipedia.
The short abstract have been published in a date that is before the data of the pre-train of the gpt-3.5-turbo LLM so contains informations that are known by this model and can be useful to study the inference of triples in KG generation from texts process.
Files
isws_pw_wikipedia_dataset - Wikipedia Short Abstracts.pdf
Files
(54.5 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:f3d5ccdd500e17e1f8021d892a7af0c9
|
54.5 kB | Preview Download |