Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Strange score distribution in the paper. #33

Open
erow opened this issue Feb 1, 2021 · 1 comment
Open

Strange score distribution in the paper. #33

erow opened this issue Feb 1, 2021 · 1 comment

Comments

@erow
Copy link

erow commented Feb 1, 2021

image
This is Figure 7 MIG distribution on dSprites. The MIG score should be non-negative, and the scores are low. I analyze the pre-trained models and get the following results.
image

I don't know why they are so different, and the other metrics are different too.

@Near32
Copy link

Near32 commented Sep 29, 2021

Hello,
The MIG computation function (and the Modularity one too) makes use of scikit-learn's mutual_info_score function (see here).
I suspect scikit-learn's mutual_info_score function having changed throughout the different versions is the issue here, and this is likely to be the root cause of discrepancies between the results you computed in February 2021 (I assume you were using scikit-learn version 0.24.0) and the original results of the paper (published on ArXiv around Nov. 2018, when the latest scikit-learn version was 0.20.0).

I am experiencing a lot of discrepancies with the Modularity score on my end (with a code adapted to another framework though).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants