Tutorial Google Colab Notebook #48
Comments
There is an upcoming IPython notebook on the trajectory visualization branch that could be a good starting point for this task.
This seems good to me! I think we could even just have one notebook (we should put it in Colab), and have a table of contents.
Some non-obvious things that come to mind that the tutorial should address:
Later, we probably also want a tutorial for the human_aware_rl repo that shows how to train, load, and evaluate simple agents.
Thanks for the suggestions! As my next task, I will create a tutorial notebook for the current master branch code that covers the suggested points. I'm not familiar with human_aware_rl yet, so I will leave it for now.
I've written a simple introduction notebook that works with the current master branch. https://colab.research.google.com/github/bmielnicki/overcooked_ai/blob/introduction_notebook/introduction.ipynb Some things worth mentioning:
Questions:
Thanks Bartek, this is looking good! I do think it's worth merging the native Python visualizations first so we can add a little bit of color here too! (I'll check on the native viz PR just after this.) Regarding your questions:
Thanks for the reply!
Yeah, I think this might be due to the MDP dynamics having changed. I thought @mesutyang97 might have fixed the planner behind the greedy human model, so I think it should be a relatively quick fix if you want to investigate the cause yourself (Bartek). If this seems too complicated, no worries!
Hi Bartek, thanks for putting this IPython notebook together. Wow! I suspect it is because the BC agent is not pressing "cook" in the updated dynamics (in the old dynamics, a soup would automatically start cooking once it had 3 items). I will look into this more. But thanks a lot for the work!
You are right: agents are not pressing cook. I'm already close to pushing the fix for this bug.
Could you let me know how you are planning to fix this? I had a discussion with Micah yesterday about this but couldn't come to a good conclusion. It would be nice to get your opinion.
Given that GreedyHumanModel only handles the case where all_orders has a length of 1, we can just cook any soup of the selected length. Inside ml_action I've changed this
to this
An additional improvement would be to also cook any soup larger than next_order, or soups with ingredients that do not match the next order (e.g. soups that have a tomato when the only order is …). BTW, the state featurization tests (test_state_featurization and test_lossless_state_featurization_shape) failed after this change.
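For readers following along, the rule described in this comment can be sketched roughly as below. This is not the actual diff from the PR (the changed lines are not shown in this thread); `PotState` and `should_press_cook` are hypothetical names made up for illustration, and the only assumption taken from the discussion is that a soup is cooked when its size matches the single order, regardless of ingredients.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PotState:
    """Hypothetical stand-in for a pot's contents, not a class from the repo."""
    ingredients: List[str]
    is_cooking: bool = False


def should_press_cook(pot: PotState, next_order: List[str]) -> bool:
    """Decide whether the agent should explicitly start cooking this pot.

    Under the updated dynamics discussed above, a soup no longer starts
    cooking on its own, so the agent has to trigger it.
    """
    if pot.is_cooking or not pot.ingredients:
        return False
    # Match on soup size only; ingredient types are deliberately ignored,
    # mirroring "cook any soup of the selected length".
    return len(pot.ingredients) == len(next_order)
```

The suggested improvement (also cooking soups that are larger than the order or contain non-matching ingredients) would amount to relaxing the final comparison in this sketch.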
That looks like a good fix, thanks Bartek! I think we can keep it for … That is unfortunate. Could you share how it is failing?
Features were not equal
@mesutyang97 Do you think it is okay to merge this code? I'm not sure how much of a concern those failed featurization tests are. I'm not familiar with featurization at all, so it is hard for me to judge how my code changed it and whether that is a bug or a feature. After overwriting the pickled files, all tests passed, including featurization.
I think it should be good. Could you double-check that the updated pickled files contain the correct information?
EDIT: I have not double-checked the updated pickled files, but if they were right before, I assume they are right now (nothing in the featurization code changed, and there is a simple explanation for why the tests failed before).
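As a rough illustration of why overwriting the pickled files makes such tests pass again: a test of this kind compares freshly computed features against a stored snapshot, so any change in agent behavior changes the inputs and breaks the comparison even when the featurization code is untouched. The sketch below assumes that setup; `featurize` and `check_against_pickle` are hypothetical names, not functions from the repo.

```python
import pickle
from pathlib import Path


def featurize(state):
    """Hypothetical featurizer standing in for the real featurization code."""
    return [len(state), sum(state)]


def check_against_pickle(path, states, regenerate=False):
    """Compare features of `states` with a pickled snapshot at `path`.

    If `regenerate` is True, or no snapshot exists yet, the snapshot is
    (over)written with the current features and the check trivially passes.
    """
    actual = [featurize(s) for s in states]
    path = Path(path)
    if regenerate or not path.exists():
        with path.open("wb") as f:
            pickle.dump(actual, f)
        return True
    with path.open("rb") as f:
        expected = pickle.load(f)
    return actual == expected
```

Because regenerating the snapshot always passes, it is still worth inspecting the new expected values by hand, as requested above.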
Minor point: @mesutyang97 I think you were confusing BC agents and GreedyHumanModels: what Bartek was fixing was GreedyHumanModels (hardcoded agents that use planner logic), rather than BC agents. I'll be checking out the PR itself soon.
Updated version with python visualizations: Some notes:
I've made a v0 in the README which is currently working (not sure whether the Colab notebook above is stale): https://colab.research.google.com/drive/1AAVP2P-QQhbx6WTOnIG54NXLXFbO7y6n#scrollTo=Z1RBlqADnTDw. To actually address this issue, we should merge the notebook above with mine linked here. We should also figure out an easy way to maintain and test these kinds of notebooks. Primarily, we would want:
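One lightweight way to catch a stale notebook, offered only as a sketch and not as what the maintainers settled on, is a smoke test that runs every code cell in order and fails on the first exception. The helper below uses only the standard library and a hypothetical name, `run_notebook_code_cells`; it skips IPython magics and shell escapes, so a dedicated tool such as nbconvert's notebook execution would be the more complete option.

```python
import json


def run_notebook_code_cells(path):
    """Execute all code cells of a .ipynb file in one shared namespace.

    Raises whatever exception a cell raises, which makes it usable as a
    simple smoke test. Returns the final namespace for further assertions.
    """
    with open(path, encoding="utf-8") as f:
        notebook = json.load(f)
    namespace = {}
    for index, cell in enumerate(notebook.get("cells", [])):
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        if isinstance(source, list):  # the notebook format allows a list of lines
            source = "".join(source)
        # Drop IPython magics and shell escapes, which plain exec cannot run.
        lines = [
            line for line in source.splitlines()
            if not line.lstrip().startswith(("%", "!"))
        ]
        code = compile("\n".join(lines), f"{path}:cell{index}", "exec")
        exec(code, namespace)
    return namespace
```

Calling `run_notebook_code_cells("introduction.ipynb")` from a test (the filename here is only an example) would then fail whenever a cell stops working against the current code.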
This issue is stale because it has been open for 30 days with no activity. It will be closed in 14 days unless there is some new activity.
Creating a tutorial Google Colab notebook on how to use the environment, visualize rollouts, etc. Most useful after python visualizations #45 are completed.