Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat: unified gae #1129

Open
wants to merge 14 commits into
base: develop
Choose a base branch
from
Open

Feat: unified gae #1129

wants to merge 14 commits into from

Conversation

SimonDuToit
Copy link
Contributor

@SimonDuToit SimonDuToit commented Nov 7, 2024

Makes all PPO systems use a single shared function for calculating the GAE. This is made possible by making the feed forward systems use the dones from the previous step, bringing them in line with the recurrent systems. Benchmark results.

Copy link
Contributor

@sash-a sash-a left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks much nicer! Just some minor nitpicks. Of course we should wait for the final benchmarks, but if that's all good them I'm happy.

Please make an issue for doing this to sable/mat unless you're planning on doing this now (which would be great)

mava/utils/gae.py Outdated Show resolved Hide resolved
mava/utils/gae.py Outdated Show resolved Hide resolved
mava/utils/gae.py Outdated Show resolved Hide resolved
mava/utils/gae.py Outdated Show resolved Hide resolved
mava/utils/gae.py Outdated Show resolved Hide resolved
Copy link
Contributor

@sash-a sash-a left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My bad, should be N instead of A for number of agents

mava/utils/multistep.py Outdated Show resolved Hide resolved
mava/utils/multistep.py Outdated Show resolved Hide resolved
SimonDuToit and others added 2 commits November 8, 2024 11:37
Co-authored-by: Sasha Abramowitz <[email protected]>
@OmaymaMahjoub OmaymaMahjoub added benchmark required Docker images get pushed if PR has this label. benchmark complete and removed benchmark required Docker images get pushed if PR has this label. labels Nov 13, 2024
Copy link
Contributor

@OmaymaMahjoub OmaymaMahjoub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@SimonDuToit if you can update MAT and Sable to use gae from this util file 🙏

Copy link
Contributor

@OmaymaMahjoub OmaymaMahjoub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @SimonDuToit for the changes, just if you can fix the format and type issues that i mentioned 🙏

mava/utils/multistep.py Show resolved Hide resolved
mava/systems/ppo/types.py Show resolved Hide resolved
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants