Code for the rfPG algorithm and the experiments in the paper: "Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs" (IJCAI 2025 main track)

Galesloot, Maris

doi:10.5281/zenodo.17038491

Published 2025 | Version v2

Software Open

Code for the rfPG algorithm and the experiments in the paper: "Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs" (IJCAI 2025 main track)

Galesloot, Maris¹

1. Radboud University Nijmegen

This repository contains code (in the .zip) and a virtual image (in the .tar) in the form of a Docker image to run the experiments in the paper.

For future reference and potential updates to rfPG, please refer to the GitHub repository used (https://github.com/marisgg/synthesis/tree/ijcai). An extended and standalone version may be added (or pointers to it). For example, you can now run rfPG for optimizing reachability probabilities (in addition to reachability rewards). Note that it is not well-tested.

EDIT:

Use the following command to execute the code after loading the Docker image from the .tar:

docker run  -v "$(pwd):/opt/payntdev" -v "/opt/payntdev/payntbind/" --name YOURCONTAINERNAMEHERE -it localhost/rfpg:ijcai python3 entrypoint.py

Files

rfpg-ijcai-25-code.zip

Files (5.2 GB)

Name	Size	Download all
rfpg-ijcai-25-code.zip md5:dc038b1a7ddebe62de5a84eaa6e3d45a	11.0 MB	Preview Download
rfpg.tar md5:1b27d2554cf72b143d844b32fbcd4c6f	5.2 GB	Download

Additional details

European Commission
DEUCE - Data-Driven Verification and Learning Under Uncertainty 101077178
European Commission
VASSAL - Verification and Analysis for Safety and Security of Applications in Life 101160022

	All versions	This version
Views	97	63
Downloads	42	32
Data volume	150.4 GB	129.6 GB

Code for the rfPG algorithm and the experiments in the paper: "Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs" (IJCAI 2025 main track)

Authors/Creators

Description

Files

rfpg-ijcai-25-code.zip

Files (5.2 GB)

Additional details

Funding