Published April 14, 2020
| Version v4.2.0
Software
Open
kengz/SLM-Lab: Resume mode, Plotly and PyTorch update, OnPolicyCrossEntropy memory
Authors/Creators
- 1. University of Klagenfurt
- 2. @Snyk
- 3. @opleinc
Description
Resume mode
- #455 adds
train@resume mode and refactors theenjoymode. See PR for detailed info.
train@ usage example
Specify train mode as train@{predir}, where {predir} is the data directory of the last training run, or simply uselatest` to use the latest. e.g.:
python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole train
# terminate run before its completion
# optionally edit the spec file in a past-future-consistent manner
# run resume with either of the commands:
python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole train@latest
# or to use a specific run folder
python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole train@data/reinforce_cartpole_2020_04_13_232521
enjoy mode refactor
The train@ resume mode API allows for the enjoy mode to be refactored. Both share similar syntax. Continuing with the example above, to enjoy a train model, we now use:
python run_lab.py slm_lab/spec/benchmark/reinforce/reinforce_cartpole.json reinforce_cartpole enjoy@data/reinforce_cartpole_2020_04_13_232521/reinforce_cartpole_t0_s0_spec.json
Plotly and PyTorch update
- #453 updates Plotly to 4.5.4 and PyTorch to 1.3.1.
- #454 explicitly shuts down Plotly orca server after plotting to prevent zombie processes
- #453 adds chunking to allow PPO to run on larger batch size by breaking up the forward loop.
- #446 adds a new
OnPolicyCrossEntropymemory class. See PR for details. Credits to @ingambe.
Files
kengz/SLM-Lab-v4.2.0.zip
Files
(416.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:46163e1453543f5fdd4a1529c693f19f
|
416.7 kB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/kengz/SLM-Lab/tree/v4.2.0 (URL)