We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
每次update_V的时候后面调用的compute_V时的策略是基于当前V的,而不是上一次策略提升后得到的策略,这不就相当于是值迭代了,并没有体现出策略评估和策略提升的两步分别进行。 有没有大佬帮我看一下。
The text was updated successfully, but these errors were encountered:
Sorry, something went wrong.
No branches or pull requests
每次update_V的时候后面调用的compute_V时的策略是基于当前V的,而不是上一次策略提升后得到的策略,这不就相当于是值迭代了,并没有体现出策略评估和策略提升的两步分别进行。
有没有大佬帮我看一下。
The text was updated successfully, but these errors were encountered: