Generalizable Training Algorithms for Deep Learning-Based Image Classification
In this project, we review and compare existing neural network training algorithms. We find that all of them can successfully minimize the training loss, but they perform differently on unseen data. Our experiments confirm that Adam and other adaptive moment methods minimize the training cost function faster than Stochastic Gradient Descent with Momentum (SGDM) on image classification tasks; however, Adam's test accuracy is significantly worse than SGDM's. We also reproduce Padam, a recently proposed algorithm that introduces a partial adaptive parameter to interpolate between Adam and SGDM and thereby obtain the best of both.
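To make the role of the partial adaptive parameter concrete, the following is a minimal NumPy sketch of a Padam-style update. It assumes the commonly stated form of the rule, in which the second-moment term is raised to an exponent p in (0, 1/2]: p = 1/2 recovers an Adam/AMSGrad-style step and p close to 0 approaches SGDM. Bias correction and the learning-rate schedule used in the original work are omitted for brevity, and the function and variable names are illustrative only.

```python
import numpy as np

def padam_step(theta, grad, m, v, v_hat,
               lr=0.1, beta1=0.9, beta2=0.999, p=0.125, eps=1e-8):
    """One Padam-style update with partial adaptive parameter p (sketch).

    All array arguments are NumPy arrays with the same shape as theta.
    p = 0.5 behaves like an Adam/AMSGrad update; p -> 0 approaches SGDM.
    """
    m = beta1 * m + (1 - beta1) * grad        # first moment (momentum)
    v = beta2 * v + (1 - beta2) * grad ** 2   # second moment
    v_hat = np.maximum(v_hat, v)              # AMSGrad-style running maximum
    theta = theta - lr * m / (v_hat ** p + eps)  # partial adaptivity via exponent p
    return theta, m, v, v_hat
```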
Inspired by these existing algorithms, we propose a new class of algorithms that combine Adam and SGDM. This class includes Adam Switch SGDM and the Linear Combination of SGDM and Adam (LCSA). LCSA further comprises LCSA with Constant Weighting (LCSA-CW), LCSA with Discontinuous Dynamic Weighting (LCSA-DDW), and LCSA with Continuous Dynamic Weighting (LCSA-CDW).
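The abstract does not state the exact update rule, so the sketch below is only one plausible reading of "linear combination": blend the SGDM and Adam update directions with a weight lambda, where lambda = 1 gives pure SGDM and lambda = 0 gives pure Adam. The function name, signature, and hyperparameter defaults are hypothetical and may differ from the project's actual formulation.

```python
import numpy as np

def lcsa_step(theta, grad, buf, m, v, t, lam,
              lr=0.01, mom=0.9, beta1=0.9, beta2=0.999, eps=1e-8):
    """Hypothetical LCSA step: weighted blend of SGDM and Adam directions.

    t is the 1-based iteration count; lam in [0, 1] is the blending weight.
    """
    # SGDM direction: heavy-ball momentum buffer
    buf = mom * buf + grad
    sgdm_dir = buf

    # Adam direction: bias-corrected adaptive moments
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad ** 2
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    adam_dir = m_hat / (np.sqrt(v_hat) + eps)

    # Linear combination of the two directions
    theta = theta - lr * (lam * sgdm_dir + (1 - lam) * adam_dir)
    return theta, buf, m, v
```

Under this reading, LCSA-CW would keep lam fixed throughout training, LCSA-DDW would change it in discrete jumps (e.g., at chosen epochs), and LCSA-CDW would vary it smoothly over the course of training; these interpretations follow the variant names rather than any detail given in the abstract.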
We demonstrate experimentally that, with proper hyperparameter tuning, LCSA-DDW can achieve better test accuracy than SGDM while maintaining a training convergence rate as fast as Adam's. In addition, LCSA-CW and LCSA-CDW can achieve better test accuracy than Adam while also maintaining fast training convergence.