-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reproduce the results in the paper #1
Comments
Hi @RetiaAdolf I'm facing a similar issue replicating the results in the paper. I ran the experiment with 8m_vs_9m using the default configuration, but the performance lags behind QMIX quite a lot. After 500k steps, the win-rate is only around 0.1, whereas the paper reports over 80% at this point. Could you share the complete configuration files used in the paper? Additionally, I'm curious as to why the code does not support parallel running, given that the number of parallel threads is also an important hyperparameter which can significantly impact performance (see the |
I'm trying to reproduce your results, but the win rate do not increase in the Corridor scenario.
(I think "dfop" means MACPF)
I haven't changed your code at all, and in the code of the version you uploaded, Alpha and Alpha_{i} are fixed to 0.001, so it appears to be the same as in the setting of Corridor scenario you described in the paper.
Do I have an additional parameter for the Corridor scenario?
The figures attached are the results of Corridor scenario that I reproduced and the config file for MACPF (dfop in the code).
The text was updated successfully, but these errors were encountered: