SICS Human-State Proxy Benchmark Track: Scientific Rationale and Research Value v0.1
Description
This document explains the scientific rationale and research value of the SICS Human-State Proxy Benchmark Track.
The SICS Human-State Proxy Benchmark Track is a research-stage, non-diagnostic, non-therapeutic, non-clinical, non-surveillance, non-coercive benchmark support track for Human-State-Aware AI Interaction, Human-State Cost, multimodal proxy benchmarking, measurement-layer simplification, and future Sal-Meter A/B comparison.
This document is designed to accompany, but not replace, the SICS Human-State Proxy Benchmark Track: Public Boundary and Program Charter v0.1.
The central claim is that AI should not be evaluated only by what it produces. It should also be evaluated by what it leaves in the human being.
The document argues that conventional AI evaluation metrics such as accuracy, speed, completion quality, benchmark score, productivity gain, computational efficiency, and user engagement are necessary but incomplete. As AI systems become conversational, persuasive, adaptive, emotionally expressive, and embedded in daily life, a second evaluation layer becomes necessary: measurable human-state impact.
This document introduces Human-State Cost as a non-diagnostic benchmark construct for comparing measurable proxy burdens left during or after interaction with an AI system. Human-State Cost is not a medical score, psychiatric score, clinical score, consciousness score, CAIS output, Sal-Meter output, certified benchmark, or human-ranking measure.
This document does not redefine CAIS. It does not redefine Sal-Meter. It does not grant CAIS compliance. It does not validate Sal-Meter. It does not certify any benchmark, dataset, model, dashboard, laboratory, implementation, or technology.
The contribution claimed here is structural: organizing existing human-state proxy modalities into a synchronized, leakage-controlled, non-diagnostic benchmark layer for AI performance versus human-state impact evaluation, measurement-layer simplification, and future Sal-Meter I/G-channel A/B comparison.
Files
SICS Human-State Proxy Benchmark Track — Scientific Rationale and Research Value v0.1.pdf
Files
(308.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:3eee20663ac3d74a914327c77d292104
|
308.7 kB | Preview Download |
Additional details
Related works
- Is supplement to
- Report: 10.5281/zenodo.19837423 (DOI)
Dates
- Issued
-
2026-04-28
References
- SICS Human-State Proxy Benchmark Track: Public Boundary and Program Charter v0.1. Zenodo. https://doi.org/10.5281/zenodo.19837423
- MIT Media Lab Affective Computing Group. https://www.media.mit.edu/groups/affective-computing/overview/
- Fang, C. M., Liu, A. R., Danry, V., Lee, E., Chan, S. W. T., Pataranutaporn, P., Maes, P., Phang, J., Lampe, M., Ahmad, L., and Agarwal, S. How AI and Human Behaviors Shape Psychosocial Effects of Chatbot Use: A Longitudinal Controlled Study. MIT Media Lab, 2025.
- WESAD: Wearable Stress and Affect Detection Dataset. Dataset DOI: 10.24432/C57K5T. https://doi.org/10.24432/C57K5T
- NIST Artificial Intelligence Risk Management Framework (AI RMF 1.0). DOI: 10.6028/NIST.AI.100-1. https://doi.org/10.6028/NIST.AI.100-1