CNN Policy #8

bbalaji-ucsd · 2018-09-12T01:45:48Z

Can you please add an example of a CNN policy? All the code is oriented towards MLP policies.

yanglixiaoshen · 2018-10-25T08:49:49Z

I found that the observation dimension of Hopper-v1/v2 is 11, so it's not an image as a input for network. I think the MLP policy is the suitable method in fixing such gym game. Surely, if you work on another different domain, you can have a try for CNN policy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CNN Policy #8

CNN Policy #8

bbalaji-ucsd commented Sep 12, 2018

yanglixiaoshen commented Oct 25, 2018

CNN Policy #8

CNN Policy #8

Comments

bbalaji-ucsd commented Sep 12, 2018

yanglixiaoshen commented Oct 25, 2018