Feat: Support latest Jumanji version #1134

WiemKhlifi · 2024-11-11T16:48:52Z

What?

Upgrade to the latest Jumanji version of 1.0.1 instead of 0.3.1 and pin to the original and latest Jumanji and Matrax.

How?

Change the requirements.txt to use original versions instead of a fork.
Adapt all the wrappers and systems to use cached specs similar to jumanji wrappers and envs.

Extra:

Note that this PR will be merged after pushing this PR into jumanji for Connector env updates.
Note that in some wrappers, since the specs outputs will be cached, some attributes can't be retrieved if defined after the super().__init__(env) ( The self.__getattr__(env,name) in parent class can't get the attribute from env if it's defined with different name in the env wrapper class).
For example:

# Should work:
super().__init__(env)
self.time_limit = self._env.time_limit 

# Shouldn't work:
super().__init__(env)
self.time_limit = self._env.max_episode_length

RuanJohn

Thanks for this @WiemKhlifi! Just a few questions, but it looks mostly good to me.
As a sanity check can you please do a few test runs to just check that the system performance is unaffected?

mava/configs/env/connector.yaml

mava/wrappers/jumanji.py

sash-a

Thanks Wiem, couple small things, mostly removing the git stuff from requirements.txt where possible

requirements/requirements.txt

mava/wrappers/jumanji.py

mava/systems/sable/anakin/ff_sable.py

mava/systems/sable/anakin/rec_sable.py

RuanJohn

Thanks @WiemKhlifi. Some suggestions from my side.

mava/configs/env/vector-connector.yaml

mava/wrappers/jumanji.py

RuanJohn · 2024-11-26T10:39:51Z

mava/wrappers/jumanji.py

-            # The environment returns a list of individual rewards and these are used as is.
-            return timestep.replace(observation=modified_observation)
+        # Whether or not aggregate the list of individual rewards.
+        reward = aggregate_rewards(timestep.reward, self.num_agents, self._use_individual_rewards)


I must admit I am not a massive fan of the not here. But I prefer it over having the conditional in the aggregation function. What do you think of just having the config option be aggregate_rewards instead of use_individual_rewards? Then we could change the conditional here to if self._aggregate_rewards.

Suggested change

reward = aggregate_rewards(timestep.reward, self.num_agents, self._use_individual_rewards)

if not self._use_individual_rewards:

reward = aggregate_rewards(timestep.reward, self.num_agents)

eh either way I prefer this 😄

I chose the second option to use aggregate_rewards instead of the not which is less confusing 😅

mava/wrappers/jumanji.py

WiemKhlifi added 5 commits November 7, 2024 16:27

feat: update jumanji version

c82e954

feat: Remove Maconnector and use connector from main jumanji

4dd56f5

feat: add cached_prop decorator to all specs

6bfe221

fix: fix connector shape bug

9fc2c97

feat: fully support recent jumanji!

5df0176

WiemKhlifi self-assigned this Nov 11, 2024

WiemKhlifi requested review from arnupretorius, DriesSmit, RuanJohn, jcformanek, siddarthsingh1, sash-a, OmaymaMahjoub and ulricharmel as code owners November 11, 2024 16:48

pull-request-size bot added the size/L label Nov 11, 2024

WiemKhlifi added 2 commits November 11, 2024 17:57

chore: remove extra space

7bf1a5d

Merge branch 'develop' into feat/update_juamnji

8b13962

RuanJohn requested changes Nov 12, 2024

View reviewed changes

mava/configs/env/connector.yaml Show resolved Hide resolved

mava/wrappers/jumanji.py Outdated Show resolved Hide resolved

mava/wrappers/jumanji.py Show resolved Hide resolved

WiemKhlifi added 2 commits November 12, 2024 16:26

chore: remove uneeded comments

a2dd12c

feat: use multi-agent connector instead

6527a98

WiemKhlifi requested review from SimonDuToit and Louay-Ben-nessir as code owners November 13, 2024 15:38

WiemKhlifi removed request for arnupretorius, DriesSmit, jcformanek, siddarthsingh1 and ulricharmel November 14, 2024 17:47

WiemKhlifi added 3 commits November 19, 2024 19:22

Merge branch 'develop' into feat/update_juamnji

0b59f53

chore: pin to latest jumanji now

2a37b67

revert: add mpe to configs

e4a9008

fix: use cached action_spec

8880dc6

sash-a requested changes Nov 20, 2024

View reviewed changes

WiemKhlifi and others added 3 commits November 20, 2024 15:23

ci: remove git path from requirements

c7cf8c7

Merge branch 'develop' into feat/update_juamnji

dd50ed8

Merge branch 'develop' into feat/update_juamnji

ac56b73

RuanJohn requested changes Nov 26, 2024

View reviewed changes

chore: cleaning based on review

59c53f3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: Support latest Jumanji version #1134

Feat: Support latest Jumanji version #1134

WiemKhlifi commented Nov 11, 2024 •

edited

Loading

RuanJohn left a comment

sash-a left a comment

RuanJohn left a comment

RuanJohn Nov 26, 2024

sash-a Nov 26, 2024

WiemKhlifi Nov 28, 2024

	reward = aggregate_rewards(timestep.reward, self.num_agents, self._use_individual_rewards)
	if not self._use_individual_rewards:
	reward = aggregate_rewards(timestep.reward, self.num_agents)

Feat: Support latest Jumanji version #1134

Are you sure you want to change the base?

Feat: Support latest Jumanji version #1134

Conversation

WiemKhlifi commented Nov 11, 2024 • edited Loading

What?

How?

Extra:

RuanJohn left a comment

Choose a reason for hiding this comment

sash-a left a comment

Choose a reason for hiding this comment

RuanJohn left a comment

Choose a reason for hiding this comment

RuanJohn Nov 26, 2024

Choose a reason for hiding this comment

sash-a Nov 26, 2024

Choose a reason for hiding this comment

WiemKhlifi Nov 28, 2024

Choose a reason for hiding this comment

WiemKhlifi commented Nov 11, 2024 •

edited

Loading