Journal article Open Access

Generic reinforcement learning codebase in TensorFlow

Li, Bryan; Cowen-Rivers, Alexander; Kozakowski, Piotr; Tao, David; Kamalakara, Siddhartha; Rajkumar, Nitarshan; Sezhiyan, Hariharan; Huang, Sicong; Gomez, Aidan

Vast reinforcement learning (RL) research groups, such as DeepMind and OpenAI, have their internal (private) reinforcement learning codebases, which enable quick prototyping and comparing of ideas to many SOTA methods. We argue the five fundamental properties of a sophisticated research codebase are; modularity, reproducibility, many RL algorithms pre-implemented, speed and ease of running on different hardware/ integration with visualization packages. Currently, there does not exist any RL codebase, to the author's knowledge, which contains all the five properties, particularly with TensorBoard logging and abstracting away cloud hardware such as TPU's from the user. The codebase aims to help distil the best research practices into the community as well as ease the entry access and accelerate the pace of the field. More detailed documentation can be found here.

Files (2.3 MB)
Name Size
2.3 MB Download
All versions This version
Views 4116
Downloads 83
Data volume 18.5 MB6.9 MB
Unique views 2914
Unique downloads 53


Cite as