Hyperparameter optimization that enables researchers to experiment, visualize, and scale quickly.

License: GPL-3.0 (see LICENSE)

sherpa-ai/sherpa

SHERPA: A Python Hyperparameter Optimization Library

SHERPA is a Python library for hyperparameter tuning of machine learning models. It provides:

  • hyperparameter optimization for machine learning researchers
  • compatibility with any Python machine learning library, such as Keras, TensorFlow, PyTorch, or scikit-learn
  • a choice of hyperparameter optimization algorithms, such as Bayesian optimization via GPyOpt (example notebook), Asynchronous Successive Halving (related to Hyperband) (example notebook), and Population Based Training (example notebook)
  • parallel computation that can be fitted to the user's needs
  • a live dashboard for the exploratory analysis of results.

Clone from GitHub to get the latest version or install via pip install parameter-sherpa. The documentation at http://parameter-sherpa.readthedocs.io/ provides tutorials on using the different optimization algorithms and installation instructions for parallel hyperparameter optimizations. Take a look at the demo video by clicking on the image below or read on to find out more.

We would love to hear what you think of Sherpa! Tell us how we can improve via our Feedback-Form.

http://img.youtube.com/vi/-exnF3uv0Ws/0.jpg

If you use SHERPA in your research please cite:

@article{hertel2020sherpa,
   title={Sherpa: Robust Hyperparameter Optimization for Machine Learning},
   author={Lars Hertel and Julian Collado and Peter Sadowski and Jordan Ott and Pierre Baldi},
   journal={SoftwareX},
   volume={},
   number={},
   pages={},
   year={2020},
   note={In press. Also arXiv:2005.04048. Software available at: https://github.com/sherpa-ai/sherpa},
   publisher={}
}

From Keras to Sherpa in 30 seconds

This example shows how to adapt a minimal Keras script so that it can be used with SHERPA. As a starting point we use the "getting started in 30 seconds" tutorial from the Keras webpage.

We start out with this piece of Keras code:

from keras.models import Sequential
from keras.layers import Dense
model = Sequential()
model.add(Dense(units=64, activation='relu', input_dim=100))
model.add(Dense(units=10, activation='softmax'))
model.compile(loss='categorical_crossentropy',
          optimizer='sgd',
          metrics=['accuracy'])

We want to tune the number of hidden units. To do that, we define one parameter of type Discrete with the range [50, 200]. As the search algorithm we use BayesianOptimization with a maximum of 50 trials.

import sherpa
parameters = [sherpa.Discrete('num_units', [50, 200])]
alg = sherpa.algorithms.BayesianOptimization(max_num_trials=50)

We use these objects to create a SHERPA Study:

study = sherpa.Study(parameters=parameters,
                     algorithm=alg,
                     lower_is_better=True)

We obtain trials by iterating over the study. Each trial has a parameters attribute, a dictionary that contains the num_units value suggested for that trial. We can use that value to create our model.

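for trial in study:
    # Build the model with the number of hidden units suggested for this trial.
    model = Sequential()
    model.add(Dense(units=trial.parameters['num_units'],
                    activation='relu', input_dim=100))
    model.add(Dense(units=10, activation='softmax'))
    model.compile(loss='categorical_crossentropy',
                  optimizer='sgd',
                  metrics=['accuracy'])

    # Keras only reports val_loss when validation data is available, so hold
    # out part of the training data (x_train and y_train are assumed to be loaded).
    model.fit(x_train, y_train, epochs=5, batch_size=32,
              validation_split=0.2,
              callbacks=[study.keras_callback(trial, objective_name='val_loss')])
    study.finalize(trial)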

During training, the callback adds objective values to the SHERPA study. At the end of training, study.finalize completes the trial, which means that no more observations will be added to it.
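The loop above follows a suggest, observe, finalize pattern. The following is a minimal, library-free sketch of that pattern using plain random search. It is not SHERPA's implementation: `ToyStudy`, `ToyTrial`, and `fake_validation_loss` are hypothetical names made up for illustration, and the objective is a made-up function rather than a trained model.

```python
import random
from dataclasses import dataclass, field


@dataclass
class ToyTrial:
    """Hypothetical stand-in for a trial: an id plus suggested parameters."""
    id: int
    parameters: dict
    observations: list = field(default_factory=list)
    finalized: bool = False


class ToyStudy:
    """Hypothetical, simplified study that suggests trials by random search."""

    def __init__(self, low, high, max_num_trials, lower_is_better=True, seed=0):
        self.low, self.high = low, high
        self.max_num_trials = max_num_trials
        self.lower_is_better = lower_is_better
        self.rng = random.Random(seed)
        self.trials = []

    def __iter__(self):
        # Yield one new trial at a time until the trial budget is spent.
        for trial_id in range(1, self.max_num_trials + 1):
            params = {'num_units': self.rng.randint(self.low, self.high)}
            trial = ToyTrial(trial_id, params)
            self.trials.append(trial)
            yield trial

    def add_observation(self, trial, iteration, objective):
        # Observations may only be added while the trial is still open.
        if trial.finalized:
            raise RuntimeError('trial is already finalized')
        trial.observations.append((iteration, objective))

    def finalize(self, trial):
        trial.finalized = True

    def best_trial(self):
        # Compare finalized trials by their last recorded objective value.
        done = [t for t in self.trials if t.finalized and t.observations]
        pick = min if self.lower_is_better else max
        return pick(done, key=lambda t: t.observations[-1][1])


def fake_validation_loss(num_units, epoch):
    """Made-up objective with its minimum at 128 units; not a real model."""
    return abs(num_units - 128) / 100.0 + 1.0 / epoch


study = ToyStudy(low=50, high=200, max_num_trials=20)
for trial in study:
    for epoch in range(1, 6):
        loss = fake_validation_loss(trial.parameters['num_units'], epoch)
        study.add_observation(trial, iteration=epoch, objective=loss)
    study.finalize(trial)

best = study.best_trial()
print(best.parameters, best.observations[-1][1])
```

In the Keras example, the callback plays the role of the inner `add_observation` calls, reporting one objective value per epoch, and `lower_is_better=True` corresponds to picking the trial with the smallest loss.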

When the Study is created, SHERPA will display the dashboard address. If you put the address into your browser you will see the dashboard as shown below. As a next step you can take a look at this example of optimizing a Random Forest in sherpa/examples/randomforest.py.

SHERPA Dashboard.

Installation from PyPI

pip install parameter-sherpa

Installation from GitHub

Clone from GitHub:

git clone https://github.com/LarsHH/sherpa.git
export PYTHONPATH=$PYTHONPATH:`pwd`/sherpa

Install dependencies:

pip install pandas
pip install numpy
pip install scipy
pip install scikit-learn
pip install flask
pip install enum34  # if on < Python 3.4

You can run an example to verify SHERPA is working:

cd sherpa/examples/
python simple.py
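If the example fails with an import error, a quick way to narrow it down is to check whether the `sherpa` package is visible on the current Python path. The sketch below uses only the standard library; the helper name `module_is_importable` is made up for illustration and is not part of SHERPA.

```python
import importlib.util


def module_is_importable(module_name='sherpa'):
    """Return True if the named top-level module can be located on the Python path."""
    return importlib.util.find_spec(module_name) is not None


if module_is_importable():
    print('sherpa found on the Python path')
else:
    print('sherpa not found; check the PYTHONPATH export or the pip install')
```

This only confirms that the package can be located. It does not import the package or check that its dependencies are installed.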

Note that running hyperparameter optimizations in parallel with SHERPA requires MongoDB to be installed. Further instructions can be found in the Parallel Installation section of the documentation.