ValueError: Shapes (31, 1) and (31, 2) are incompatible #28

Sanketd420 · 2020-05-26T14:02:53Z

loss = update_network(network, rewards, states, actions, num_actions)

loss = network.train_on_batch(states, discounted_rewards)

Joe-Darling · 2020-06-21T05:58:06Z

I also had this issue. If you are on TensorFlow 2.2, downgrading to 2.0 or 2.1 may fix it; it did for me at least. However, after fixing that, the model never converges: after thousands of episodes it is still stuck at a reward of about 10-20 and an average loss in the negative thousands. Not sure how to fix this.

asokraju · 2020-09-10T16:57:03Z

adding these lines will fix the issue...
target_actions = np.array([[1 if a==i else 0 for i in range(2)] for a in actions])
loss = network.train_on_batch(states, target_actions, sample_weight=discounted_rewards)
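
For anyone wondering why this works, here is a rough NumPy-only sketch of what the two lines above appear to do. It assumes the network ends in a 2-unit softmax trained with categorical cross-entropy (the (31, 2) in the error message suggests this, but the thread does not confirm it). The helper names `one_hot_actions` and `weighted_cross_entropy` are made up for illustration and are not part of the repo or of Keras, and the exact reduction Keras applies to `sample_weight` may differ between versions.

```python
import numpy as np

def one_hot_actions(actions, num_actions):
    """Turn integer action indices of shape (T,) into one-hot rows of shape (T, num_actions)."""
    actions = np.asarray(actions, dtype=int)
    return np.eye(num_actions)[actions]

def weighted_cross_entropy(probs, targets, weights):
    """Per-sample categorical cross-entropy, scaled by a per-sample weight, then averaged.

    Assumption: this mirrors how a per-sample weight is commonly applied to a loss;
    it is not a copy of the Keras implementation.
    """
    eps = 1e-7  # avoid log(0)
    per_sample = -np.sum(targets * np.log(np.clip(probs, eps, 1.0)), axis=-1)
    return float(np.mean(weights * per_sample))

# Toy episode with 3 steps and 2 possible actions (made-up numbers).
actions = np.array([0, 1, 1])
probs = np.array([[0.5, 0.5],      # hypothetical softmax outputs of the policy network
                  [0.25, 0.75],
                  [0.9, 0.1]])
discounted_rewards = np.array([1.0, 2.0, 3.0])

target_actions = one_hot_actions(actions, 2)   # shape (3, 2), matches the network output
loss = weighted_cross_entropy(probs, target_actions, discounted_rewards)
print(target_actions.shape, loss)
```

With one-hot targets, the cross-entropy for each step reduces to -log(probability of the action actually taken), and the weight multiplies that by the step's discounted reward, which is the usual REINFORCE-style objective. The targets also now have the same shape as the network output, so the shape check that raised the ValueError has nothing to complain about.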

asokraju mentioned this issue Sep 10, 2020

Bugfix policy gradinet reinforce tf2 #29

Open
