Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Page about HPO #37

Draft
wants to merge 2 commits into
base: sprint_1
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
167 changes: 167 additions & 0 deletions docs/Handbook_hpo.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,167 @@
Hyperparameter Optimization
===========================

Why it matters
--------------




RL paper
Are GANs created equal? a large-scale study. Lucic et al (2017)
On the state of the art of evaluation in neural language models. Melis et al (2017)
Knowledge Base Completion: Baselines Strike Back. Kadlec et al (2017)
Repro paper
Accounting for variance in ML Benchmarks
Optimizer benchmark (DeepOBS?)

Survey





Defining the search space
-------------------------

- Avoid turning discrete values into categories
- Use logarithmic distribution if the effect of hyperparameter on the objective is logarithmic (ex:
learning rates)
- Inverse logarithmic for hyperparameters behaving the other way around (ex: gamma of exponential learning rate schedule) (todo: get back equation from slides)

Learning-rate
^^^^^^^^^^^^^


Learning-rate schedule
^^^^^^^^^^^^^^^^^^^^^^


Momentum
^^^^^^^^

Weight decay
^^^^^^^^^^^^

Mini-batch size
^^^^^^^^^^^^^^^

Size of layers
^^^^^^^^^^^^^^

Number of layers
^^^^^^^^^^^^^^^^


Choosing the budget
-------------------


.. image:: _static/dim_1.png
:width: 300
:alt: Alternative text
:align: center

.. image:: _static/dim_2.png
:width: 300
:alt: Alternative text
:align: center

.. image:: _static/dim_3.png
:width: 300
:alt: Alternative text
:align: center

.. image:: _static/easy_vs_hard.png
:width: 600
:alt: Alternative text
:align: center




Selecting HPO algorithms
------------------------


.. image:: _static/handbook_hpo_algo_selection.png
:width: 800
:alt: Alternative text
:align: center


How many dimensions?
^^^^^^^^^^^^^^^^^^^^


.. image:: _static/grid_vs_random.png
:width: 500
:alt: Alternative text
:align: center

Computational time per trial
^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Late learners
^^^^^^^^^^^^^

How many trials?
^^^^^^^^^^^^^^^^

Are all dimensions continuous?
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^



Visualizations
--------------


.. image:: _static/regret_thumbnail.png
:width: 500
:alt: Alternative text
:align: center


.. image:: _static/parallel_coordinates_select.gif
:width: 500
:alt: Alternative text
:align: center


.. image:: _static/par_dep_thumbnail.png
:width: 500
:alt: Alternative text
:align: center





Frameworks
----------

Oríon
^^^^^

- Developped at Mila

Ray-Tune
^^^^^^^^

- Advantage if using Ray
- Many algorithms


Optuna
^^^^^^

- Good TPE implementation
- Less algorithms




References
----------
AutoML book
Binary file added docs/_static/dim_1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/dim_2.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/dim_3.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/easy_vs_hard.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/grid_vs_random.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/handbook_hpo_algo_selection.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/par_dep_thumbnail.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/parallel_coordinates_select.gif
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/_static/regret_thumbnail.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.