Policy Gradient Issue: ValueError: Shapes (20, 1) and (20, 2) are incompatible #27

danisch-khurshid-creator · 2020-05-21T10:59:46Z

Hi.
The code Code is not working with this line: loss = network.train_on_batch(states, discounted_rewards).

The text was updated successfully, but these errors were encountered:

asokraju · 2020-09-10T17:10:11Z

Try this... it should work...
target_actions = np.array([[1 if a==i else 0 for i in range(2)] for a in actions]) loss = network.train_on_batch(states,target_actions, sample_weight=discounted_rewards)

asokraju mentioned this issue Sep 10, 2020

Bugfix policy gradinet reinforce tf2 #29

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Policy Gradient Issue: ValueError: Shapes (20, 1) and (20, 2) are incompatible #27

Policy Gradient Issue: ValueError: Shapes (20, 1) and (20, 2) are incompatible #27

danisch-khurshid-creator commented May 21, 2020 •

edited

Loading

asokraju commented Sep 10, 2020

Policy Gradient Issue: ValueError: Shapes (20, 1) and (20, 2) are incompatible #27

Policy Gradient Issue: ValueError: Shapes (20, 1) and (20, 2) are incompatible #27

Comments

danisch-khurshid-creator commented May 21, 2020 • edited Loading

asokraju commented Sep 10, 2020

danisch-khurshid-creator commented May 21, 2020 •

edited

Loading