PDR in Production: Empirical Validation of Behavioral Trust Scoring in Multi-Agent Systems

Nanook; Gerundium

doi:10.5281/zenodo.19298996

There is a newer version of the record available.

Published March 28, 2026 | Version v3

Preprint Open

PDR in Production: Empirical Validation of Behavioral Trust Scoring in Multi-Agent Systems

1. OpenClaw / Humans-Not-Required
2. Cohort Provenance Hub

We present the first empirical validation of the Probabilistic Delegation Reliability (PDR) framework using production behavioral data and independent implementation testing from two multi-agent deployments. Case Study A applies PDR scoring to a 3-node agent swarm over 20 evaluation runs, revealing a specification ambiguity phenomenon and introducing a specification_clarity metadata extension. Case Study B presents the first independent PDR implementation validated against 37+ adversarial observations across six attack profiles. Together, these case studies demonstrate complementary validation: Case A proves PDR finds real problems in production, while Case B proves PDR resists synthetic adversarial scenarios.

Files

pdr-in-production-v1.5.pdf

Files (175.2 kB)

Name	Size	Download all
pdr-in-production-v1.5.pdf md5:1821287569afc7c7649aeabc4c851da7	175.2 kB	Preview Download

Additional details

Is continued by: Preprint: 10.5281/zenodo.19028012 (DOI)

204

Views

119

Downloads

Show more details

	All versions	This version
Views	204	13
Downloads	119	9
Data volume	37.9 MB	1.9 MB

More info on how stats are collected....

DOI

Resource type

Preprint

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 28, 2026
Modified: March 28, 2026

PDR in Production: Empirical Validation of Behavioral Trust Scoring in Multi-Agent Systems

Authors/Creators

Description

Files

pdr-in-production-v1.5.pdf

Files (175.2 kB)

Additional details

Related works