Expected Policy Gradient

This repo aims to replicate the results of Expected Policy Gradient (EPG) with PyTorch.

Dependencies

Python
PyTorch (tested on 1.8.1+cpu and 1.9.0+cpu)
OpenAI Gym
MuJoCo (Warning: MuJoCo is not supported on Apple Silicon)
numpy
numdifftools (only for epg_rb_target_numdifftools.py and epg_vanilla.py)
matplotlib and pandas (for graphing)

Install the dependencies:

pip install -r requirements.txt
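
After installing, it can help to confirm that the packages are visible to the interpreter before starting a long training run. The snippet below is a minimal sketch, not part of this repo: the import names (`torch`, `gym`, `mujoco_py`, and so on) are assumptions about how the listed dependencies are imported, and `missing_modules` is a hypothetical helper.

```python
import importlib.util

# Assumed import names for the dependencies listed above
# (e.g. PyTorch imports as "torch"; "mujoco_py" is a guess at the MuJoCo binding in use).
REQUIRED = ["torch", "gym", "mujoco_py", "numpy"]
# Only needed by some scripts or for graphing.
OPTIONAL = ["numdifftools", "matplotlib", "pandas"]


def missing_modules(names):
    """Return the module names that cannot be located, without importing them."""
    return [name for name in names if importlib.util.find_spec(name) is None]


print("missing required:", missing_modules(REQUIRED))
print("missing optional:", missing_modules(OPTIONAL))
```

An empty list on both lines suggests the environment is ready; any name printed is a package to install or an import name to adjust.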

Learning Environment

MuJoCo

InvertedPendulum-v2
HalfCheetah-v2
Reacher-v2
Walker2d-v2
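
The four environment IDs above can be created through Gym's `gym.make`. The following is a rough sketch under the assumption that Gym and a working MuJoCo installation are available; `ENV_IDS` and `make_env` are hypothetical names, and the training scripts in this repo may construct their environments differently.

```python
# Environment IDs listed in this README.
ENV_IDS = ["InvertedPendulum-v2", "HalfCheetah-v2", "Reacher-v2", "Walker2d-v2"]


def make_env(env_id):
    """Create one of the listed MuJoCo environments (hypothetical helper)."""
    if env_id not in ENV_IDS:
        raise ValueError(f"unsupported environment: {env_id}")
    # Imported lazily so the ID check works even where Gym is not installed.
    import gym
    return gym.make(env_id)


# Example usage (requires Gym and MuJoCo):
# env = make_env("HalfCheetah-v2")
# print(env.observation_space.shape, env.action_space.shape)
```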

How to run

Train model

python [spg.py|ddpg.py|epg_*.py]
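
The bracketed form above reads as a choice of one script rather than literal shell syntax. As a small sketch, the helper below lists the file names in the current directory that match those three patterns and prints one command per script; `PATTERNS` and `training_scripts` are hypothetical names, not part of this repo.

```python
import fnmatch
import os

# The three alternatives from the command above.
PATTERNS = ["spg.py", "ddpg.py", "epg_*.py"]


def training_scripts(filenames):
    """Return the sorted file names that match any training-script pattern."""
    return sorted(
        name for name in filenames
        if any(fnmatch.fnmatchcase(name, pattern) for pattern in PATTERNS)
    )


# Print one ready-to-copy command per matching script in the current directory.
for script in training_scripts(os.listdir(".")):
    print(f"python {script}")
```

Run from the repository root, this prints a `python <script>` line for each training script it finds; each command is then run on its own.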

Generate a graph with the existing data

python figure.py

Generate a comparison graph of variation4 and DDPG on HalfCheetah-v2

python variation4/test_curve.py
