GitHub - janEbert/sb3-ppg: Phasic policy gradient algorithm for stable-baselines3

Implementation of the phasic policy gradient (PPG) algorithm for stable-baselines3.

The CNN policy with an auxiliary head is currently missing, so you can only use the AuxMlpPolicy.

To initialize the policy with the paper's initialization values, uncomment the code for init_weights in ./ppg/aux_ac_policy.py.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
ppg		ppg
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback