Adds several HDBO methods (#22)

* Brings all the solvers from the hdbo repo * Adds a test env for BAxUS inside tox and its GitHub action * Skips baxus in the usual tests * Updates readme * Adds tests for all Ax-related solvers, and their GitHub action * Updates the test action for Ax * Skips the ax tests in the base testenv * Updates poli import * Skips ax solvers by checking whether the module was installed * Adds docs for the Ax-related solvers * Adds testing for probabilistic reparametrization * Adds tests for Bounce * Updates the env path for bounce * Adds Bounce to the table in the readme
MachineLearningLifeScience · May 3, 2024 · da3e725 · da3e725
1 parent f3319ce
commit da3e725
Show file tree

Hide file tree

Showing 38 changed files with 1,250 additions and 23 deletions.
diff --git a/.github/workflows/python-tox-testing-ax.yml b/.github/workflows/python-tox-testing-ax.yml
@@ -0,0 +1,26 @@
+name: Ax (py3.10 in conda)
+
+on: [push]
+
+jobs:
+  build-linux:
+    runs-on: ubuntu-latest
+    strategy:
+      max-parallel: 5
+
+    steps:
+    - uses: actions/checkout@v3
+    - name: Set up Python 3.9
+      uses: actions/setup-python@v3
+      with:
+        python-version: '3.9'
+    - name: Add conda to system path
+      run: |
+        # $CONDA is an environment variable pointing to the root of the miniconda directory
+        echo $CONDA/bin >> $GITHUB_PATH
+    - name: Install dependencies
+      run: |
+        python -m pip install tox
+    - name: Test Ax-related solvers with tox
+      run: |
+        tox -e poli-ax-base-py39
diff --git a/.github/workflows/python-tox-testing-baxus.yml b/.github/workflows/python-tox-testing-baxus.yml
@@ -0,0 +1,26 @@
+name: BAxUS (py3.10 in conda)
+
+on: [push]
+
+jobs:
+  build-linux:
+    runs-on: ubuntu-latest
+    strategy:
+      max-parallel: 5
+
+    steps:
+    - uses: actions/checkout@v3
+    - name: Set up Python 3.9
+      uses: actions/setup-python@v3
+      with:
+        python-version: '3.9'
+    - name: Add conda to system path
+      run: |
+        # $CONDA is an environment variable pointing to the root of the miniconda directory
+        echo $CONDA/bin >> $GITHUB_PATH
+    - name: Install dependencies
+      run: |
+        python -m pip install tox
+    - name: Test BAxUS with tox
+      run: |
+        tox -e poli-baxus-base-py39
diff --git a/.github/workflows/python-tox-testing-bounce.yml b/.github/workflows/python-tox-testing-bounce.yml
@@ -0,0 +1,26 @@
+name: Bounce (py3.10 in conda)
+
+on: [push]
+
+jobs:
+  build-linux:
+    runs-on: ubuntu-latest
+    strategy:
+      max-parallel: 5
+
+    steps:
+    - uses: actions/checkout@v3
+    - name: Set up Python 3.9
+      uses: actions/setup-python@v3
+      with:
+        python-version: '3.9'
+    - name: Add conda to system path
+      run: |
+        # $CONDA is an environment variable pointing to the root of the miniconda directory
+        echo $CONDA/bin >> $GITHUB_PATH
+    - name: Install dependencies
+      run: |
+        python -m pip install tox
+    - name: Test bounce with tox
+      run: |
+        tox -e poli-bounce-base-py39
diff --git a/.github/workflows/python-tox-testing-pr.yml b/.github/workflows/python-tox-testing-pr.yml
@@ -0,0 +1,26 @@
+name: Prob. Rep. (py3.10 in conda)
+
+on: [push]
+
+jobs:
+  build-linux:
+    runs-on: ubuntu-latest
+    strategy:
+      max-parallel: 5
+
+    steps:
+    - uses: actions/checkout@v3
+    - name: Set up Python 3.9
+      uses: actions/setup-python@v3
+      with:
+        python-version: '3.9'
+    - name: Add conda to system path
+      run: |
+        # $CONDA is an environment variable pointing to the root of the miniconda directory
+        echo $CONDA/bin >> $GITHUB_PATH
+    - name: Install dependencies
+      run: |
+        python -m pip install tox
+    - name: Test PR with tox
+      run: |
+        tox -e poli-pr-base-py39
diff --git a/.github/workflows/python-tox-testing.yml b/.github/workflows/python-tox-testing.yml
@@ -1,4 +1,4 @@
-name: Test (conda, python 3.9)
+name: Base (python 3.9 in conda)
 
 on: [push]
 

diff --git a/.gitignore b/.gitignore
@@ -23,4 +23,14 @@ examples/ignore*
 
 # BAxUS related stuff
 conf_dict.json
-projectors.txt.xz
+projectors.txt.xz
+length_history.txt.xz
+final_target_dim.txt.xz
+lengthscales.txt.xz
+split_points.txt.xz
+splitting_dims.txt.xz
+tr_centers.txt.xz
+model_history.zip
+
+# bounce results
+bounce_results/
diff --git a/pyproject.toml b/pyproject.toml
@@ -40,4 +40,6 @@ Homepage = "https://github.com/MachineLearningLifeScience/poli-baselines"
 [tool.pytest.ini_options]
 markers = [
     "slow: marks tests as slow (deselect with '-m \"not slow\"')",
+    "baxus: marks test that can be run in the BAxUS environment",
+    "ax_solvers: marks test that can be run in the Ax environment",
 ]
diff --git a/readme.md b/readme.md
@@ -3,10 +3,12 @@
 [![Test (conda, python 3.9)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml)
 
 > [!WARNING]  
-> This package is a work in progress. Some solvers have not been properly tested. For an authoritative list of the stable solvers, check [our documentation](https://machinelearninglifescience.github.io/poli-docs/#black-box-optimization-algorithms).
+> This package is a work in progress. Some solvers have not been properly tested. For an authoritative list of the stable solvers, check below or [our documentation](https://machinelearninglifescience.github.io/poli-docs/#black-box-optimization-algorithms).
 
 `poli-baselines` is a collection of **black box optimization algorithms**, aimed mostly at optimizing discrete sequences. These optimization algorithms are meant to optimize objective functions defined using [`poli`](https://github.com/MachineLearningLifeScience/poli), a tool for instantiating complex, difficult-to-query functions.
 
+If the dependencies get too specific, we provide replicable conda environments for each solver.
+
 ## Installation
 
 Create a fresh conda environment by running
@@ -36,6 +38,21 @@ After this, you could test you installation by running (inside your `poli-baseli
 python -c "import poli_baselines ; print('Everything went well!')"
 ```
 
+## Solvers available
+
+| Name | Status | Reference |
+|----------|----------|----------|
+| **Random Mutations**   |  [![Test (conda, python 3.9)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml)  | N/A   |
+| **Random hill-climbing**   |  [![Test (conda, python 3.9)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml)  | N/A   |
+| **CMA-ES**   |  [![Test (conda, python 3.9)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml)  | [pycma](https://github.com/CMA-ES/pycma)   |
+| **(Fixed-length) Genetic Algorithm**   |  [![Test (conda, python 3.9)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing.yml)  | [pymoo's implementation](https://pymoo.org/algorithms/soo/ga.html)  |
+| **Hvarfner's Vanilla BO** |  [![Ax (py3.10 in conda)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml)  | [Hvarfner et al. 2024](https://arxiv.org/abs/2402.02229) |
+| **Bounce** |  [![Ax (py3.10 in conda)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml)  | [Papenmeier et al. 2023](https://arxiv.org/abs/2307.00618) |
+| **BAxUS** |  [![BAxUS (py3.10 in conda)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-baxus.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-baxus.yml)  | [Papenmeier et al. 2023](https://arxiv.org/abs/2304.11468) |
+| **Probabilistic Reparametrization** |  [![Prob. Rep. (py3.10 in conda)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-pr.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-pr.yml)  | [Daulton et al. 2022](https://arxiv.org/abs/2210.10199) |
+| **SAASBO** |  [![Ax (py3.10 in conda)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml)  | [Eriksson and Jankowiak 2021](https://arxiv.org/abs/2103.00349) |
+| **ALEBO** |  [![Ax (py3.10 in conda)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml/badge.svg)](https://github.com/MachineLearningLifeScience/poli-baselines/actions/workflows/python-tox-testing-ax.yml)  | [Lentham et al. 2020](https://proceedings.neurips.cc/paper/2020/file/10fb6cfa4c990d2bad5ddef4f70e8ba2-Paper.pdf) |
+
 ## Your first optimization using `poli-baselines`
 
 As mentioned above, this library interoperates well with the discrete objective functions defined in [`poli`](https://github.com/MachineLearningLifeScience/poli). One such objective function is the ALOHA problem, in which we search the space of 5-letter sequences of the word "ALOHA". The following is a simple example of how one could use the `RandomMutation` solver inside `poli-baselines` to solve this problem:

diff --git a/src/poli_baselines/core/utils/ax/__init__.py b/src/poli_baselines/core/utils/ax/__init__.py
diff --git a/src/poli_baselines/core/utils/ax/ax_solver.py b/src/poli_baselines/core/utils/ax/ax_solver.py
@@ -0,0 +1,111 @@
+from __future__ import annotations
+
+from typing import Tuple
+import uuid
+
+from numpy import ndarray
+import numpy as np
+
+from poli.objective_repository import ToyContinuousBlackBox
+from poli.core.abstract_black_box import AbstractBlackBox
+
+from poli_baselines.core.abstract_solver import AbstractSolver
+
+from poli_baselines.core.utils.ax.interface import (
+    define_search_space,
+)
+
+from ax.service.ax_client import AxClient, ObjectiveProperties
+from ax.modelbridge.generation_strategy import GenerationStep, GenerationStrategy
+
+
+class AxSolver(AbstractSolver):
+    def __init__(
+        self,
+        black_box: AbstractBlackBox,
+        x0: ndarray,
+        y0: ndarray,
+        generation_strategy: GenerationStrategy,
+        bounds: list[tuple[float, float]] | None = None,
+        noise_std: float = 0.0,
+    ):
+        super().__init__(black_box, x0, y0)
+        self.noise_std = noise_std
+
+        if bounds is None:
+            assert isinstance(black_box, ToyContinuousBlackBox)
+            bounds_ = [black_box.function.limits] * x0.shape[1]
+        else:
+            # If bounds is (lb, up), then we build the bounds
+            # for the user
+            if len(bounds) == 2:
+                assert isinstance(bounds[0], (int, float))
+                assert isinstance(bounds[1], (int, float))
+                bounds_ = [bounds] * x0.shape[1]
+            else:
+                bounds_ = bounds
+
+            assert len(bounds) == x0.shape[1]
+            assert all(len(bound) == 2 for bound in bounds)
+
+        ax_client = AxClient(generation_strategy=generation_strategy)
+        exp_id = f"{uuid.uuid4()}"[:8]
+
+        search_space = define_search_space(x0=x0, bounds=bounds_)
+
+        ax_client.create_experiment(
+            name=f"experiment_on_{black_box.info.name}_{exp_id}",
+            parameters=[
+                {
+                    "name": param.name,
+                    "type": "range",
+                    "bounds": [param.lower, param.upper],
+                    "value_type": "float",
+                }
+                for param in search_space.parameters.values()
+            ],
+            objectives={black_box.info.name: ObjectiveProperties(minimize=False)},
+        )
+
+        def evaluate(
+            parametrization: dict[str, float]
+        ) -> dict[str, tuple[float, float]]:
+            x = np.array([[parametrization[f"x{i}"] for i in range(x0.shape[1])]])
+            y = black_box(x)
+            return {black_box.info.name: (y.flatten()[0], self.noise_std)}
+
+        self.evaluate = evaluate
+
+        # Run initialization with x0 and y0
+        for x, y in zip(x0, y0):
+            params = {f"x{i}": float(x_i) for i, x_i in enumerate(x)}
+            _, trial_index = ax_client.attach_trial(params)
+            ax_client.complete_trial(
+                trial_index=trial_index,
+                raw_data={black_box.info.name: (y[0], self.noise_std)},
+            )
+
+        print(ax_client.get_trials_data_frame())
+        self.ax_client = ax_client
+
+    def solve(
+        self,
+        max_iter: int = 100,
+        verbose: bool = False,
+    ) -> Tuple[np.ndarray, np.ndarray]:
+        for i in range(max_iter):
+            parameters, trial_index = self.ax_client.get_next_trial()
+            val = self.evaluate(parameters)
+            self.ax_client.complete_trial(
+                trial_index=trial_index,
+                raw_data=val,
+            )
+            # df = self.ax_client.get_trials_data_frame()
+
+            if verbose:
+                print(
+                    f"Iteration: {i}, Value in iteration: {val[self.black_box.info.name][0]:.3f}, Best so far: {self.ax_client.get_trials_data_frame()[self.black_box.info.name].max():.3f}"
+                )
+
+        # TODO: fix this return
+        return self.ax_client.get_trials_data_frame()  # type: ignore
diff --git a/src/poli_baselines/core/utils/ax/environment.ax.yml b/src/poli_baselines/core/utils/ax/environment.ax.yml
@@ -0,0 +1,14 @@
+name: poli__ax
+channels:
+  - conda-forge
+  - defaults
+dependencies:
+  - python=3.10
+  - pip
+  - pip:
+    - scikit-learn
+    - botorch
+    - ax-platform
+    - numpy
+    - "git+https://github.com/MachineLearningLifeScience/poli.git@dev"
+    - "git+https://github.com/MachineLearningLifeScience/poli-baselines@main"