Changes for using dynamic orders in harl #71

bmielnicki · 2021-03-11T10:39:28Z

Changes that are needed so my harl changes HumanCompatibleAI/human_aware_rl#16 can work.
Includes changes from #38 and #57 and:

- ending the episode after fulfilling n orders
- a maximum number of orders active at the same time
- GreedyHumanModel can be pedagogical in the way it picks ingredients; it is generally improved and compatible with dynamic orders (Dynamic orders #57)
- SlowedDownAgent (an agent that takes the wait action a fraction of the time)
- fix of SampleAgent
- option for mlam to ignore chosen dispensers/counters
- multiple typo fixes
- moving all of the preprocessing/encoding/featurization functions and state spaces to OvercookedGridworld/OvercookedEnv (used in the harl code, More flexible training human_aware_rl#16)

micahcarroll · 2022-06-10T19:06:01Z

Given that this is a lower-priority issue right now, I'll temporarily close this PR for bookkeeping – we can re-open it in the future if we are interested in exploring this direction further.

bmielnicki added 30 commits July 26, 2020 20:01

bump canvas to 2.6 because npm throws an error when installing 2.5

8f6ecd7

add visualization folder with event visualization code

d01ccd7

fill in method for event visualization

e9c3f9b

merge master to trajectory visualization branch

583310e

deleting currently not used require.js from package-lock.json

edd4800

agent evaluator introduction tutorial notebook

e715fa1

undo unwanted change done in merge

176ae86

minor tutorial notebook change

4640f91

add basic documentation to visualization code

10b19d9

delete empty file

cfb54bb

better look of the code in markdown

11882c4

Merge remote-tracking branch 'overcooked_ai_remote/master' into trajectory_visualization_merged

359a8b0

add support for ep infos

819b95c

adding line identifying file is pasted successfully

ff7bb85

changed naming convention from item to object, add line identifying file, add potting and delivery event support

b3178a1

add identifying file line, add generating box_id (in case of multiple graphs in single notebook)

4835b23

add json dumping of np arrays

0a69b18

visualization tests

38da97b

push extraction of most events into overcooked_mdp - add object_id to every object, add events_list field in trajectory

4a97a21

add visualization tests to full suite

7edba54

undo updating canvas version

2ae5e7d

merge

a3a2161

adding visualization tests to proper directory

d9f3241

cleaning unused files after merge

82ac6c4

Merge branch 'master' into trajectory_visualization_merged

b75ef32

fix test to not raise error on any dollar sign (e.g. jquery code)

fc9383f

__eq__ and __hash__ now uses object_id

71e97d8

renamed one letter variables

8f258d8

Merge remote-tracking branch 'upstream/master' into trajectory_visualization_merged

e6111b3

undo using object_id in __eq__ and __hash__ - this made tests fail

9216142

bmielnicki added 20 commits February 23, 2021 22:29

Safe overwriting of mlam params in AgentEvaluator

6a17943

Fix saving/loading trajectories

55771c0

- Configuring recipes if needed (as they are used inside orders)
- Fixing the save_trajectories method
- Changing an outdated comment/variable name inside save_traj_as_json

Fix sample agent (reset in init)

73ee449

SlowedDownAgent

d15735f

An agent that is a slowed-down version of another agent. It can be used to produce an asymmetry of power between agents.
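The wrapper idea can be sketched as follows. This is a minimal illustration, not the code from this PR: the `action(state)` interface, the `STAY` constant, the `slowdown_rate` parameter, the injected `rng`, and the `ConstantAgent` helper are all assumptions made for the example.

```python
import random

STAY = (0, 0)  # hypothetical "wait" action used only in this sketch


class ConstantAgent:
    """Toy agent that always returns the same action (illustration only)."""

    def __init__(self, action):
        self._action = action

    def action(self, state):
        return self._action


class SlowedDownAgentSketch:
    """Wraps an agent and waits instead of acting a fraction of the time."""

    def __init__(self, agent, slowdown_rate=0.5, rng=None):
        if not 0.0 <= slowdown_rate <= 1.0:
            raise ValueError("slowdown_rate must be in [0, 1]")
        self.agent = agent
        self.slowdown_rate = slowdown_rate
        self.rng = rng if rng is not None else random.Random()

    def action(self, state):
        # Query the wrapped agent first so any internal state it keeps
        # is updated on every timestep (a design choice of this sketch).
        chosen = self.agent.action(state)
        if self.rng.random() < self.slowdown_rate:
            return STAY
        return chosen
```

With `slowdown_rate=0.0` the wrapper is transparent, and with `slowdown_rate=1.0` it always waits; values in between give the weaker partner described above.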

GreedyHumanModel pedagogical updates

4a6364f

- Prioritization of pots that have fewer ingredients left to complete the ideal recipe.
- Choosing the next ingredient in a way that reveals the intention for the entire recipe.
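The pot prioritization can be illustrated with a small sketch. It is not the GreedyHumanModel code: the function names, the representation of a pot as a list of ingredient names, and the dictionary keyed by pot position are assumptions chosen for the example.

```python
from collections import Counter


def missing_ingredients(recipe, pot_contents):
    """Count how many ingredients the pot still needs to match the recipe."""
    return sum((Counter(recipe) - Counter(pot_contents)).values())


def pick_pot(recipe, pots):
    """Return the position of the unfinished pot closest to completing `recipe`.

    `pots` maps a pot position to the list of ingredients already in it.
    Returns None when every pot already holds the full recipe.
    """
    unfinished = {
        pos: missing_ingredients(recipe, contents)
        for pos, contents in pots.items()
        if missing_ingredients(recipe, contents) > 0
    }
    if not unfinished:
        return None
    # The pot with the fewest missing ingredients gets priority.
    return min(unfinished, key=unfinished.get)
```

For a recipe of two onions and a tomato, a pot that already holds two onions would be preferred over an empty pot, since it needs only one more ingredient.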

Additional tests for agents

a4cf5b5

- Tests for slowed-down agents
- Tests for pedagogical ingredient picking
- Testing both GreedyHumanModel and SimpleGreedyHumanModel

Recipe changes

fce5d54

- Standardization of the ingredients list (sorted list of tuples) as a method
- ingredients_diff (used in GreedyHumanModel)
- More tests
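A rough sketch of what standardizing and diffing ingredient lists could look like is given below. The signatures are assumptions, and the canonical form used here (a sorted tuple of ingredient names) is a simplification for illustration; the PR's own representation may differ.

```python
from collections import Counter


def standardized_ingredients(ingredients):
    """Return the ingredients in a canonical, order-independent form."""
    return tuple(sorted(ingredients))


def ingredients_diff(target, current):
    """Return the ingredients in `target` that are still missing from `current`.

    Counter subtraction keeps only positive counts, so extra ingredients
    in `current` are ignored rather than reported as negative.
    """
    missing = Counter(target) - Counter(current)
    return standardized_ingredients(missing.elements())
```

Because both functions return the canonical form, two ingredient collections can be compared with `==` regardless of the order in which they were assembled.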

OrdersList - max number of orders; store fulfilled orders

35926ea

end episode after fulfilling selected number of orders

c0b37b4
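The two order-related changes above (a cap on simultaneous orders and ending the episode after a chosen number of fulfilled orders) can be sketched together. This is a hypothetical stand-in, not the OrdersList class from this PR: the class name, `max_orders`, `orders_to_end_episode`, and the method names are assumptions.

```python
class OrdersListSketch:
    """Tracks active and fulfilled orders with a cap on active ones."""

    def __init__(self, max_orders=2, orders_to_end_episode=None):
        self.max_orders = max_orders
        self.orders_to_end_episode = orders_to_end_episode
        self.active = []
        self.fulfilled = []

    def add(self, order):
        """Add an order unless the list is full; return whether it was added."""
        if len(self.active) >= self.max_orders:
            return False
        self.active.append(order)
        return True

    def fulfill(self, order):
        """Move an active order to the fulfilled list."""
        self.active.remove(order)
        self.fulfilled.append(order)

    @property
    def episode_done(self):
        """True once enough orders were fulfilled (never, if no limit is set)."""
        if self.orders_to_end_episode is None:
            return False
        return len(self.fulfilled) >= self.orders_to_end_episode
```

An environment loop could check `episode_done` after each delivery and terminate the episode early instead of waiting for the horizon.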

More reward shaping options

da13159

ids_and_reward_shaping_independent_equal

469c6fc

More state encoding options; gym obs space from mdp

338e2b5

- one_hot_joint_action_encoding
- sparse_categorical_joint_action_encoding
- multi_hot_orders_encoding
- check in featurize_state whether the layout can be featurized
- gym space for every encoding method (besides the action ones)
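To make the three encoding names concrete, here is a plain-Python sketch of what such encodings typically compute. The action list, the function signatures, and the choice to concatenate one one-hot vector per agent are assumptions for illustration; the methods added in this PR may lay out their outputs differently.

```python
# Hypothetical action set; the real environment defines its own.
ACTIONS = ["north", "south", "east", "west", "stay", "interact"]


def sparse_categorical_joint_action(joint_action):
    """Encode each agent's action as its integer index."""
    return [ACTIONS.index(action) for action in joint_action]


def one_hot_joint_action(joint_action):
    """Concatenate one one-hot vector per agent."""
    encoding = []
    for action in joint_action:
        one_hot = [0] * len(ACTIONS)
        one_hot[ACTIONS.index(action)] = 1
        encoding.extend(one_hot)
    return encoding


def multi_hot_orders(active_orders, all_orders):
    """Mark with 1 every known order type that is currently active."""
    active = set(active_orders)
    return [1 if order in active else 0 for order in all_orders]
```

The sparse form suits losses that expect class indices, while the one-hot and multi-hot forms have a fixed length and therefore map directly onto a fixed-shape observation or action space.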

updated test jsons

8dfe031

merge and regenerating test files after the merge

2dd94fa

Correction of minor typos

0c62172

Current version of expected.json and dummy.json

26225e8

Probably nothing changed besides order_ids. This version was used in the last tests.

more typo fixes

8789696

less strict test_interval_schedule

14a9a70

Without this change, the test sometimes reports false positives.

delete layouts added by accident

7688c16

fix indent size

fa9bd9d

bmielnicki mentioned this pull request Mar 11, 2021

More flexible training HumanCompatibleAI/human_aware_rl#16

Closed

bmielnicki requested review from mesutyang97, micahcarroll and nathan-miller23 and removed request for mesutyang97 and micahcarroll March 11, 2021 11:24

micahcarroll closed this Jun 10, 2022
