Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DDPG #1

Open
MayeZhang opened this issue May 8, 2020 · 0 comments
Open

DDPG #1

MayeZhang opened this issue May 8, 2020 · 0 comments

Comments

@MayeZhang
Copy link

肖大神您好,我最近在做毕业设计用的您的DDPG代码,环境改成了我们专业的通信的场景。但是我发现reward已经没有那种抖动上升的趋势了,非常困惑,分析了一圈感觉是critic_evaluate_net这个网络效果不好,没有得出理想的loss_tensor,这之后应该又对actor_evaluate_net的更新有了影响,所以后面得到的action并不好。有时候还会卡在动作取值的边界,请问您有啥好的办法没啊?非常感谢!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant