rlpy / RLPy / issues / #13 - Value Function Prediction Code

Issue #13 on hold

cdann@cdann.de created an issue 2013-09-24

The current policy evaluation code seems to be broken. Replace the old code with a ValuePredictionExperiment (maybe also a LinearValuePredictionExperiment) as a clean, proper way of doing policy evaluation.

Comments (3)

Will Dabney
Could someone familiar with the original Policy Evaluation code add a comment to this issue (or the other Policy Evaluation issue) which outlines the desired behavior of the class?
- 2013-09-28T14:49:57+00:00
cdann@cdann.de reporter
I didn't write the code, but from what I understand it is pretty experimental and hacky. I believe we did not announce policy evaluation as a feature, so I would mark it as unfinished work and not worry about it at the moment.

I am probably going to do some policy evaluation work again soon. Then I will code up a proper setup for policy evaluation with a (Linear)ValueEstimationExperiment. I may transfer parts of my code at https://bitbucket.org/chrodan/tdlearn into RLPy.
- 2013-09-28T18:30:41+00:00
cdann@cdann.de reporter
- changed status to on hold
- 2015-06-24T21:44:07+00:00

Assignee: –

Type: bug

Priority: minor

Status: on hold

Votes: 0

Watchers: 2
