Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why does the model not overfit a single image? #127

Open
karkidaju opened this issue Oct 10, 2022 · 0 comments
Open

Why does the model not overfit a single image? #127

karkidaju opened this issue Oct 10, 2022 · 0 comments

Comments

@karkidaju
Copy link

Hi! I made some changes to the network for multiclass detection. As I tried to train it with a single image for a long time, I expected the loss to converge. But strangely, it first goes down to almost a zero value then spikes suddenly. Especially for regression it goes above the starting loss (but this is random, can be less too). I thought this was introduced by my changes. But then I reverted back to the original code and observed similar behavior.

This is the tensorboard log result with the original code. The model is being trained for 50 epochs on one image.

image

Is there a general concept in deep learning that I am missing? I thought theoretically the model would overfit the single image but the loss going up says otherwise.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant