forked from JasperSnoek/spearmint
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.txt
222 lines (191 loc) · 11.2 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
Spearmint is a package to perform Bayesian optimization according to the
algorithms outlined in the paper:
Practical Bayesian Optimization of Machine Learning Algorithms
Jasper Snoek, Hugo Larochelle and Ryan P. Adams
Advances in Neural Information Processing Systems, 2012
This code is designed to automatically run experiments (thus the code
name 'spearmint') in a manner that iteratively adjusts a number of
parameters so as to minimize some objective in as few runs as
possible.
DEPENDENCIES
This package requires:
Python 2.7
Numpy version 1.6.1+
http://www.numpy.org/
On Ubuntu linux you can install this package using the command:
apt-get install python-numpy
Scipy version 0.9.0+
http://www.scipy.org/
On Ubuntu linux you can install this package using the command:
apt-get install python-scipy
Google Protocol Buffers (for the fully automated code)
https://developers.google.com/protocol-buffers/
On Ubuntu linux you can install this package using the command:
apt-get install python-protobuf
This package has been tested on Ubuntu linux (versions 11.0+) and
Mac-OSX.
The code consists of several parts. It is designed to be modular to
allow swapping out various 'driver' and 'chooser' modules. The
'chooser' modules are implementations of acquisition functions such as
expected improvement, UCB or random. The drivers determine how
experiments are distributed and run on the system. As the code is
designed to run experiments in parallel (spawning a new experiment as
soon a result comes in), this requires some engineering. The current
implementations of these are in 'spearmint.py', 'spearmint_sync.py'
and 'spearmint-lite.py'.
Spearmint.py is designed to run on a system with Sun Grid Engine and
uses SGE to distribute experiments on a multi-node cluster in parallel
using a queueing system in a fault-tolerant way. It is particularly
well suited to the Amazon EC2 system. Using StarCluster
(http://star.mit.edu/cluster/) will allow you to set up a large
cluster and start distributing experiments within minutes.
Spearmint_sync.py is designed to run on a single machine with
potentially many cores. This driver simply spawns a new process on
the current machine to run a new experiment. This does not allow you
to distribute across multiple machines, however.
Spearmint-lite.py is the 'bare-bones' stripped version of the code.
This version is simply driven by a flat-file and does not
automatically run experiments. Instead, it proposes new experiments
(potentially multiple at a time) and requires that the user fill in
the result. This is well suited to the case where writing a wrapper
around the code doesn't make sense (e.g. if the experiments don't
involve code at all) or if the user desires full control of the
process. Also, the dependency on Google protocol buffers is replaced
with JSON.
RUNNING THE AUTOMATED CODE - Spearmint and Spearmint_sync
The simplest way to get to know the code is probably to look at an
example. In order to start a new experiment, you must create a new
subdirectory in the spearmint directory and include a wrapper script.
We have created one simple example for you that optimizes the
'Braninhoo' benchmark in the subdirectory 'braninpy'. Take a look at
config.pb. It contains the specifications of the algorithm in
protocol buffer format. In order to specify your optimization, you
have to fill in the variables 'language' (e.g. PYTHON or MATLAB) and
'name' (the name of the wrapper function you want to optimize).
Followed by these is a list of 'variables', which specifies the name,
type and size of the variables you wish to optimize over. Each
variable must be either a FLOAT, INT or ENUM type, corresponding to
continous real valued parameters, integer sequences and categorical
variables respectively. MAX and MIN specify the bounds of the
variables over which to optimize and SIZE is the number of variables
of this type with these bounds. Spearmint will call your wrapper
function with a dictionary type (in python) containing each of your
variables in a vector of size 'size', which you can access using the
name specified. Now take a look at branin.py (the wrapper which was
specified in the 'name' variable at the top of config.pb). You will
see the file has a function 'main(job_id, params)'. Your wrapper must
include this function, which spearmint will call passing in a job_id
(which is probably not interesting to you) and a dictionary, 'params',
containing the parameter vectors of the next experiment spearmint
wants to run. The main function should take as input these parameters
and return a single real valued number representing the observed
function value (that is being optimized) at these inputs.
To run spearmint, go back into the top-level directory and type
'python spearmint_sync.py --method=GPEIOptChooser
--method-args=noiseless=1 braninpy'. This will run spearmint according
to the GP-EI MCMC strategy. The code will sequentially spawn
processes that call the wrapper function and it will poll for results.
You will see that the code prints out the current best (i.e. lowest)
observation seen thus far and sequences of numbers corresponding to GP
hyperparameter samples and candidates it is optimizing over. The
'method' argument specifies the chooser module (acquisition function)
to use and 'method-args' specifies chooser specific arguments. In
this case, as braninhoo is an analytic function we tell the GP
hyperparameter sampling routine to not try to estimate noise.
If you let it run for a while you will see that the current-best
decreases, eventually reaching the minimum at ~0.39. You can kill the
process (ctrl-c) at any time and you can restart from where it left off simply
by rerunning the spearmint command.
If you go back in to the braninpy directory you will see a number of
new files that spearmint uses to do bookkeeping. Of particular
interest are 'trace.csv' and the 'output' directory. trace.csv
contains a record of the experiments run so far and the best result
(and which experiment it came from) as a series over time. Each line
of trace.csv contains the following in csv format: a timestamp, the
best value observed up to that timestamp, the job-id of the best value
observed, the number of potential candidates left, the number of
pending (currently running) experiments, the number of experiments
completed thus far. The output directory contains a text file for
each job-id, containing the output of that job. So if you want to
see, e.g. what the output (i.e. standard out and standard error) was
for the best job (as obtained from trace.csv) you can look up
job-id.txt in the output directory. If you are debugging your code,
or the code is crashing for some reason, it's a good idea to look at
these files. Finally for ease of use, spearmint also prints out at
each iteration a file called 'best_job_and_result.txt' that contains the
best result observed so far, the job-id it came from and a dump of
the names and values of all of the parameters corresponding to that result.
A script, cleanup.sh, is provided to completely restart an experiment
and delete all the results and intermediate files. Simply run
cleanup.sh <experiment_dir>
Matlab code can also be optimized using this package. To do so, you
must specify in config.pb "type: Matlab" and use a matlab wrapper with
the "name" specified in config.pb. The matlab wrapper must have a
function with the same name as the file name of the following form:
"function result = braninhoo(job_id, params)" Above we assume the file
name is "braninhoo.m". Spearmint will pass in to this wrapper a
job_id and a matlab struct 'params' where the fields are given by the
variables specified in config.pb. See the subdirectory "braninhoo"
for a matlab example matching that of python 'braninpy' described
above.
To run multiple jobs in parallel, pass to spearmint the argument:
--max-concurrent=<#jobs>
RUNNING THE BASIC CODE - Spearmint-lite
Spearmint-lite is designed to be simple. To run an experiment in
spearmint-lite, create a subdirectory as explained above. Again, the
braninpy directory is provided as a demonstration. In this case, the
experiment specification, which must be provided in config.json, is in
JSON format. You must specify your problem as a sequence of JSON
objects. As in the protocol buffer format above, each object must
have a name, a type (float, int or enum), a 'min', a 'max' and a
'size'. Nothing else needs to be specified.
Go back to the top-level directory and run:
python spearmint-lite.py braninpy
Spearmint-lite will run one iteration of Bayesian
optimization and write out to a file named results.dat in the braninpy
subdirectory. results.dat will contain a white-space delimited line
for each experiment, of the format:
<result> <time-taken> <list of parameters in the same order as config.json>
Spearmint will propose new experiments and append them to results.dat each
time it is run. Each proposed experiment will have a 'pending' result and
time-taken, indicated by the letter P. The user must then run the experiment
and fill in these values. Note that the time can safely be set to an arbitrary
value if the chooser module does not use it (only GPEIperSecChooser currently
does). Spearmint will condition on the pending experiments when proposing new
ones, so any number of experiments can be conducted in parallel.
A script, cleanup.sh, is provided to completely clean up all the intermediate
files and results in an experimental directory and restart the
experiment from scratch.
CHOOSER MODULES:
The chooser modules implement functions that tell spearmint which next
job to run. Some correspond to 'acquisition functions' in the
Bayesian optimization literature. Spearmint takes as an argument
--method=ChooserModule which allows one to easy swap out acquisition
functions. Choosers may optionally include parameters that can be
passed to spearmint using the argument
method-args=argument1,argument2,etc. These include, for example, the
number of GP hyperparameter samples to use. See the comments in
chooser files for chooser dependent arguments. Below are described
the choosers provided in this package:
SequentialChooser: Chooses the next experiment using a high
discrepancy Sobol sequence. Experiments are taken sequentially from a
course-to-fine grid.
RandomChooser: Experiments are sampled randomly from the unit hypercube.
GPEIOptChooser: The GP EI MCMC algorithm from the paper. Jobs
are first sampled densely from a dense grid on the unit hypercube
and then the best candidates are optimized 'fine-tuned' according
to EI.
GPEIperSecChooser: The GP EI per Second algorithm from the paper.
Similar to GPEIOptChooser except points are optimized and evaluated
based on the EI per Second criterion, where each job is weighted
by the expected running time of the experiment.
GPEIChooser: Points are densely sampled on the unit hypercube and the
best is returned according to the EI criterion. This is considerably
faster than the GPEIOptChooser and works well in low-dimensional cases
where the grid covers the search space densely.
IMPORTANT!
When estimating noise, the Gaussian process prior over noise assumes
that the noise level is 'low' (e.g. between -1 to 1). If this is not
the case, make sure to rescale your function to make this true
(e.g. to approximately be between -1 and 1). Otherwise, the algorithm
will find a bad high-noise mode that will result in bad performance.