kengz/SLM-Lab: Resume mode, Plotly and PyTorch update, OnPolicyCrossEntropy memory

Wah Loon Keng; Laura Graesser; Pierre TASSEL; allan-avatar1; Snyk bot; Sean Gillen; Rahim16; Milan Cvitkovic; Michael Schock; Angel Ayala

doi:10.5281/zenodo.3751787

Published April 14, 2020 | Version v4.2.0

Software Open

kengz/SLM-Lab: Resume mode, Plotly and PyTorch update, OnPolicyCrossEntropy memory

1. University of Klagenfurt
2. @Snyk
3. @opleinc

Resume mode

#455 adds train@ resume mode and refactors the enjoy mode. See PR for detailed info.

train@ usage example

Specify train mode as train@{predir}, where {predir} is the data directory of the last training run, or simply uselatest` to use the latest. e.g.:

python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole train
# terminate run before its completion
# optionally edit the spec file in a past-future-consistent manner

# run resume with either of the commands:
python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole train@latest
# or to use a specific run folder
python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole train@data/reinforce_cartpole_2020_04_13_232521

enjoy mode refactor

The train@ resume mode API allows for the enjoy mode to be refactored. Both share similar syntax. Continuing with the example above, to enjoy a train model, we now use:

python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole enjoy@data/reinforce_cartpole_2020_04_13_232521/reinforce_cartpole_t0_s0_spec.json

Plotly and PyTorch update

#453 updates Plotly to 4.5.4 and PyTorch to 1.3.1.
#454 explicitly shuts down Plotly orca server after plotting to prevent zombie processes

PPO batch size optimization

#453 adds chunking to allow PPO to run on larger batch size by breaking up the forward loop.

New OnPolicyCrossEntropy memory

#446 adds a new OnPolicyCrossEntropy memory class. See PR for details. Credits to @ingambe.

Files

kengz/SLM-Lab-v4.2.0.zip

Files (416.7 kB)

Name	Size	Download all
kengz/SLM-Lab-v4.2.0.zip md5:46163e1453543f5fdd4a1529c693f19f	416.7 kB	Preview Download

Additional details

Is supplement to: https://github.com/kengz/SLM-Lab/tree/v4.2.0 (URL)

	All versions	This version
Views	1,315	73
Downloads	342	17
Data volume	171.8 MB	7.1 MB

kengz/SLM-Lab: Resume mode, Plotly and PyTorch update, OnPolicyCrossEntropy memory

Authors/Creators

Description

Files

kengz/SLM-Lab-v4.2.0.zip

Files (416.7 kB)

Additional details

Related works