The Information Potential of Books

Kaplan, Frederic

doi:10.5281/zenodo.16098038

Published July 2, 2025 | Version v1

Presentation Open

Kaplan, Frederic¹

1. École Polytechnique Fédérale de Lausanne

For practical and legal reasons, Large Language Models are primarily trained on contemporary, web-based texts and not on the vast array of content found in published books. As a consequence, their competence does not capture the rich diversity of knowledge that libraries have worked to preserve and make accessible. Because of this epistemic gap, libraries can potentially play a crucial role in the development of future versions of these models. In this presentation, I will discuss a computational strategy designed to effectively quantify and utilize the knowledge contained within books, addressing the opportunities and challenges for libraries in this process.

Files

Name	Checksum	Size
Frédéric Kaplan_LIBER2025.pdf	md5:834c5374fde25a4ace9454143b468563	14.2 MB

Resource type

Presentation

Publisher

Zenodo

Conference

LIBER Annual Conference 2025, University of Lausanne (UNIL), Lausanne, Switzerland, 2-4 July 2025

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: July 18, 2025
Modified: July 18, 2025
