Resource-Aware Goal-Driven Policy Reinforcement Learning (RAGP-RL)

Syuaib

doi:10.5281/zenodo.18414902

Published January 23, 2026 | Version v3

Report Open

Resource-Aware Goal-Driven Policy Reinforcement Learning (RAGP-RL)

Syuaib

By early 2026, global AI data center energy consumption is projected to reach 1,050 TWh, creating significant sustainability challenges for the implementation of large-scale intelligent agents. This paper proposes Resource-Aware Goal-Driven Policy Reinforcement Learning (RAGP-RL), a formal framework that explicitly integrates computational power constraints into the agent's objective function.

Unlike traditional Reinforcement Learning (RL) algorithms, RAGP-RL introduces the Imagination variable (I) as an internal generative process constrained by an energy budget (C). Through a Primal–Dual Lagrangian optimization formulation, it is shown how an agent can modulate its cognitive intensity based on the urgency of the situation (c) to achieve metabolic efficiency that mimics biological systems. Validation is proposed through Red vs. Blue adversarial simulations to measure the systemic efficiency and robustness of agents under resource-constrained conditions.

The RAGP-RL framework is defined by six key variables:

Computational Power (C) : the energy capacity or processing resources available to the system.
Imagination (I) : a generative stochastic process based on a world model to simulate future trajectories without direct interaction with the environment.
Reality (R) : data resulting from actual interactions with the environment.
Urgency Cost (c) : an adaptive scalar function that reflects the criticality of a situation.
Decrease Function (d) : the rate of resource depletion or degradation.
Direction (D) : a global goal priority vector that constrains the policy space when an agent is in a critical energy state.

Files

RAGP_2026-01-29_161808.pdf

Files (167.8 kB)

Name	Size	Download all
RAGP_2026-01-29_161808.pdf md5:bf4b515ee66e699dd138b13ad886c5ce	167.8 kB	Preview Download

Additional details

Repository URL: https://github.com/syuaibsyuaib/RAGP-RL
Programming language: Python
Development Status: Active

	All versions	This version
Views	174	31
Downloads	124	26
Data volume	29.0 MB	6.4 MB

Resource-Aware Goal-Driven Policy Reinforcement Learning (RAGP-RL)

Authors/Creators

Description

Files

RAGP_2026-01-29_161808.pdf

Files (167.8 kB)

Additional details

Software