RLPy issue #25: Including policy gradient methods

Issue #25 resolved

Pierre-Luc Bacon created an issue 2013-10-08

The project description suggests that RLPy is mainly about value-function-based algorithms. However, I think it would be nice to add Will Dabney's implementation of some of the popular policy gradient methods:

https://github.com/amarack/python-rl/blob/master/pyrl/agents/policy_gradient.py

Comments (4)

cdann@cdann.de
We totally agree with you. This is definitely a near-future goal for RLPy. Which specific method do you suggest we address first?

Btw: There is an implementation of Natural Actor Critic in RLPy, but unfortunately it has been tested very little so far (cf. the simple example in examples/gridworld/nac.py).
- 2013-10-09T17:25:32+00:00
Pierre-Luc Bacon reporter
I think that all of Will's code should be included!

Having an implementation of REINFORCE would also be a useful baseline.
- 2013-10-10T02:01:55+00:00
Alborz Geramifard
Thanks Pierre. I sent an email to Will about this.
- 2015-07-18T15:24:42+00:00
Alborz Geramifard
- changed status to resolved
- 2015-07-18T15:38:26+00:00

Assignee: Will Dabney

Type: enhancement

Priority: major

Status: resolved

Votes: 0

Watchers: 5
