-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
tianzikang
committed
Aug 30, 2022
1 parent
a8c1c74
commit ca7da80
Showing
4 changed files
with
54 additions
and
11 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,8 +1,46 @@ | ||
# QMIXRNN | ||
Referring to pymarl, qmix is implemented with RNN to cope with SMAC environment | ||
Referring to pymarl, qmix is implemented clearly with RNN to cope with SMAC environment. | ||
This clear implementation can help you figure out how does QMIX work | ||
|
||
## Run | ||
`python main.py --map-name=3s5z` | ||
Note: --optimizer=0/1 means that optimizer `Adam` and `RMSprop` is good on this scenario, please just select one of both when running | ||
`python main.py --map-name=3s5z --optimizer=0/1` | ||
`python main.py --map-name=1c3s5z --optimizer=0/1` | ||
`python main.py --map-name=2s3z --optimizer=0/1` | ||
`python main.py --map-name=8m --optimizer=0/1` | ||
`python main.py --map-name=2s_vs_1sc --optimizer=0` | ||
`python main.py --map-name=3m --optimizer=0` | ||
`python main.py --map-name=10m_vs_11m --optimizer=0` | ||
|
||
## TODO | ||
Now this code can deal with some easy scenarios like 2s3z, 3s5z, 3m, 8m, and I'm trying to approach the result of pymarl. At the same time, I'm also trying to achieve some tricks on this code like multi step TD target and so on. | ||
Now this code can do very good on part of easy scenarios like 1c3s5z, 2s3z, 3s5z and 8m, | ||
and relative good on easy scenarios like 2s_vs_1sc and 3m, | ||
but not good on easy scenarios 10m_vs_11m. | ||
|
||
I'm trying to approach the result of pymarl. At the same time, I'm also trying to achieve some tricks on this code like multi step TD target and so on. | ||
|
||
## Reference | ||
@inproceedings{rashid2018qmix, | ||
title={Qmix: Monotonic value function factorisation for deep multi-agent reinforcement learning}, | ||
author={Rashid, Tabish and Samvelyan, Mikayel and Schroeder, Christian and Farquhar, Gregory and Foerster, Jakob and Whiteson, Shimon}, | ||
booktitle={International conference on machine learning}, | ||
pages={4295--4304}, | ||
year={2018}, | ||
organization={PMLR} | ||
} | ||
|
||
@article{samvelyan19smac, | ||
title = {{The} {StarCraft} {Multi}-{Agent} {Challenge}}, | ||
author = {Mikayel Samvelyan and Tabish Rashid and Christian Schroeder de Witt and Gregory Farquhar and Nantas Nardelli and Tim G. J. Rudner and Chia-Man Hung and Philiph H. S. Torr and Jakob Foerster and Shimon Whiteson}, | ||
journal = {CoRR}, | ||
volume = {abs/1902.04043}, | ||
year = {2019}, | ||
} | ||
|
||
@article{samvelyan19smac, | ||
title = {{The} {StarCraft} {Multi}-{Agent} {Challenge}}, | ||
author = {Mikayel Samvelyan and Tabish Rashid and Christian Schroeder de Witt and Gregory Farquhar and Nantas Nardelli and Tim G. J. Rudner and Chia-Man Hung and Philiph H. S. Torr and Jakob Foerster and Shimon Whiteson}, | ||
journal = {CoRR}, | ||
volume = {abs/1902.04043}, | ||
year = {2019}, | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters