-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Early stopping: overfit prevention #4996
Comments
@StrikerRUS: It is not the same, but is indeed not completely unrelated. Both ideas fight overfit. #4580 seems easier to implement, but trickier to apply in practice (a good value for The idea in this post uses the ratio of train and valid performance to decide whether overfitting is getting too strong. I tried to draw it in the favourite data science tool "Excel" ;). As a user, I can simply wish: "I don't want to have more than x% overfit on my chosen metric(s)". We don't need to start with this one, but maybe a logical order could look like this:
I will of course help with the changes, but I am not sure if bullet point 1 is on the roadmap or not? |
@mayer79 Ah, I got the difference now, thanks for the detailed explanation with example! This idea looks good to me. Is there something similar in other Python/R packages we can check as a reference?
It is: #2479. I guess we can start from this point. |
Sounds like a plan! |
I'm working on this for Python but I have a concern for the functionality:
A typical train and validation curve might look like this: both train and validation improves with big initial gaps, the gap narrows then widens as validation plateaus while train continues to improve. My concern is overfit tolerance might be unknowingly triggered for regions where the model is actually underfitting
Early stopping would be triggered on iteration 1 instead of 24. Any ideas to mitigate this?
|
Currently, "early stopping" monitors validation loss and stops after some unsucessful rounds. This is often used together with gridsearchCV to select a best model. Sometimes, the best performing model shows quite some overfit and one might prefer a model with slightly worse performance but less overfit, depending on the situation.
To actively control for overfit, I would love to see a modification of early stopping. It would stop the booster if after a couple rounds, the validation score is more than "overfit_tolerance" worse than the training score.
It could be used e.g. like this
callbacks=[lgb.early_stopping(20, overfit_tolerance=1.1)]
This would stop the boosting process if after 20 rounds, either the performance stopped improving or the ratio of validation to train performance became >1.1.
The text was updated successfully, but these errors were encountered: