Gaze-based Personal Memory: Leveraging Eye Tracking to Improve Relevance in Text Retrieval Systems

Philipp, Gross; Ulges, Adrian; Weier, Martin

doi:10.5281/zenodo.18564970

Published February 13, 2026 | Version 1.0

Dataset Open

Gaze-based Personal Memory: Leveraging Eye Tracking to Improve Relevance in Text Retrieval Systems

1. RheinMain University of Applied Sciences

g-Rel-RANKER is a benchmark dataset for studying gaze-based boosting strategies in text retrieval and personal memory systems. It contains two complementary components: (1) an extension of the public g-Rel-READER corpus with additional background passages and refined relevance annotations, and (2) a user study corpus collected with a dedicated search prototype. Both components provide aligned text passages, gaze data, and passage-level relevance labels, enabling systematic evaluation of whether and how eye tracking signals can improve retrieval of information that users have actually read before.

The g-Rel-READER-based part comprises news-style documents originally used for relevance judgement experiments, enriched with passage segmentations, screen coordinates for each word, gaze fixations from 24 participants, and curated relevance labels for question–passage pairs. To approximate realistic search scenarios, this core is complemented with 2.6M background passages sampled from English Wikipedia, of which a pooled subset is labeled for relevance using a combination of LLM-based and expert judgement. This structure yields a mixed corpus containing (a) relevant and non-relevant passages with gaze (previously read “personal memory”) and (b) relevant and non-relevant passages without gaze (unseen or merely visible content).

The user study component captures a more natural visual-memory setting: nine participants read selected Wikipedia passages (German language), while gaze was recorded and passages were indexed in an Elasticsearch-based prototype. Afterwards, participants issued both self-formulated and predefined questions; for each query, they rated retrieved passages for relevance, which is again complemented with automatic LLM judgements. This second component mirrors the structure of g-Rel-RANKER (text, gaze, relevance) and allows testing the transfer of methods from controlled benchmark conditions to practical search scenarios.

Across both components, g-Rel-RANKER is designed to evaluate a family of gaze-based boosting functions on top of standard BM25 retrieval, including boolean gaze filters, distance-weighted gaze-point counts, reading intensity, and reading-velocity-based boosts. The dataset and accompanying benchmark code support common IR metrics (MRR, MAP, Hits@K) and separate analyses of viewed vs. non-viewed passages, making it possible to quantify trade-offs between prioritizing previously read content and maintaining recall for unseen but relevant material. Overall, g-Rel-RANKER provides a reproducible resource to assess the feasibility of using gaze data as an implicit personalization signal in information retrieval and personal memory systems.

Files

g-Rel-Ranker (Experiment 1).zip

Files (354.6 MB)

Name	Size	Download all
g-Rel-Ranker (Experiment 1).zip md5:66d654ea2fc97b13d710f530f8b26b0f	182.6 MB	Preview Download
README.md md5:959bb7bf203e5ccf359d54f210371f47	2.2 kB	Preview Download
user study (Experiment 2).zip md5:bd4f0c0ff7f94f4a45fe12377cf97f3d	172.0 MB	Preview Download

Additional details

Alternative title: g-Rel-RANKER

Is source of: Publication: 10.1145/3698204.3716474 (DOI); Dataset: https://osf.io/3bvtx/overview (URL)

Created: 2026-02-13

	All versions	This version
Views	12	12
Downloads	12	12
Data volume	847.7 MB	847.7 MB

g-Rel-Ranker (Experiment 1).zip

Files (354.6 MB)

Additional titles

Related works

Dates

Gaze-based Personal Memory: Leveraging Eye Tracking to Improve Relevance in Text Retrieval Systems

Authors/Creators

Description

Files

g-Rel-Ranker (Experiment 1).zip

Files (354.6 MB)

Additional details

Additional titles

Related works

Dates