You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I also had this issue. If you are on tensorflow 2.2 then downgrading to either 2.0 or 2.1 may fix the issue, it did for me at least. However after fixing that issue the model never converges and after thousands of episodes still gets stuck on a reward value of about 10-20 and an avg loss in the negative thousands... Not sure how to fix this
adding these lines will fix the issue... target_actions = np.array([[1 if a==i else 0 for i in range(2)] for a in actions]) loss = network.train_on_batch(states,target_actions, sample_weight=discounted_rewards)
loss = update_network(network, rewards, states, actions, num_actions)
The text was updated successfully, but these errors were encountered: