API: summary mode=None
keeps current mode (#331)
#337
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Closes #331.
Changes the behaviour of
mode=None
insummary
.Before: used global default
Mode.EVAL
Now: keeps current model mode.
Note that this changes the default behaviour.
I couldn't pass
tests/torchinfo_xl_test.py::test_flan_t5_small
because my proxy didn't allow model downloading. Also, I didn't test on GPU so GPU tests were not ran.Side questions: While running the tests, I noticed that depending on the mode (
"train"
or"eval"
), the summary would sometimes show or not show the top level number of parameters:Is there a specific reason to this behaviour?
Furthermore, in this case, the top level shows 6.4M params, while the total count shows 13M params and Google claims around 6.xM params. Any lead to understand this?
Thanks!