More flexible parameter initialization. #786

null-a · 2017-02-24T14:27:58Z

This extends param to take a function specifying how a parameter should be initialized. For example:

param({dims: [2, 1], init: function(dims) { return ones(dims); }})
// => Vector([1,1])

The motivation for this change is that it makes it much easier to add alternate weight initialization strategies to webppl-nn.

It was difficult to implement this new version of param in terms of the old register method, because register took a JS function specifying initialization, whereas here we want to use a webppl function for that. By splitting register into fetch and create, neither needs to take a function as an argument, so it's straightforward to work with them from both webppl and JS.
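To illustrate the fetch/create split, here is a minimal plain-JavaScript sketch. The names `ParamStore`, `getOrInit` and `onesInit`, and the `{dims, data}` value layout, are hypothetical and not taken from the webppl source. The point is only that neither store method takes a callback, so the caller decides how the initial value is computed.

```javascript
// Hypothetical in-memory parameter table; not webppl's actual implementation.
class ParamStore {
  constructor() {
    this.table = new Map();
  }

  // Return the stored value, or undefined if the name is unknown.
  fetch(name) {
    return this.table.get(name);
  }

  // Store an already-computed initial value under a new name.
  create(name, value) {
    if (this.table.has(name)) {
      throw new Error('Parameter "' + name + '" already exists.');
    }
    this.table.set(name, value);
    return value;
  }
}

// The caller owns initialization: init runs only when the name is new.
function getOrInit(store, name, dims, init) {
  const existing = store.fetch(name);
  if (existing !== undefined) {
    return existing;
  }
  return store.create(name, init(dims));
}

// Example initializer: a tensor-like object filled with ones.
function onesInit(dims) {
  const size = dims.reduce(function(n, d) { return n * d; }, 1);
  return { dims: dims.slice(), data: new Array(size).fill(1) };
}
```

Because `fetch` and `create` only exchange values, the same two calls could be driven from either side of a webppl/JS boundary.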

I was nervous about the fact that it's possible to call sample from this initialization function. (This seems like a pretty odd thing to do, since the structure of the model would depend on the state of the parameter table.) And this could conceivably happen through a mistaken attempt to specify a random initialization for a parameter. To guard against this, we apply the function in a coroutine where calling sample or factor generates an error.
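The guard described above can be sketched as follows. This is a synchronous simplification with assumed names (`activeCoroutine`, `initGuard`, `applyGuarded`); webppl's real coroutines are written in continuation-passing style and are more involved.

```javascript
// Simplified stand-in for the currently installed coroutine (assumed name).
let activeCoroutine = {
  sample: function(dist) { return dist.sample(); },
  factor: function(score) { return undefined; }
};

// Global entry points dispatch to whichever coroutine is active.
function sample(dist) { return activeCoroutine.sample(dist); }
function factor(score) { return activeCoroutine.factor(score); }

// Coroutine that rejects any attempt to sample or add a factor.
const initGuard = {
  sample: function() {
    throw new Error('sample cannot be used during parameter initialization.');
  },
  factor: function() {
    throw new Error('factor cannot be used during parameter initialization.');
  }
};

// Run init(dims) with the guard installed, then restore the previous coroutine.
function applyGuarded(init, dims) {
  const previous = activeCoroutine;
  activeCoroutine = initGuard;
  try {
    return init(dims);
  } finally {
    activeCoroutine = previous;
  }
}
```

A deterministic initializer runs unchanged under the guard, while one that calls `sample` or `factor` fails immediately with a message naming the problem.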

This PR also includes a commit that adds some extra runtime checks to param. Specifically, we now generate an error when the shape of the parameter value we're about to return does not match the dims argument given to param.

This can happen when a parameter name is reused and the dims arguments don't match up. It could also happen across executions if the program code changes when using a persistent parameter store.

The intention is that by checking for this explicitly, we'll get more informative errors than we'd get if we waited until some later part of the program ran into trouble.

It's possible that this will generate an error in a program that would otherwise work, but I think this is a win overall, since it will be useful to know that the dims in the code reflect the actual shape of the parameter.
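A minimal sketch of this kind of shape check, in plain JavaScript. The helper names `dimsEqual` and `checkDims`, the `{dims, data}` value layout, and the error text are assumptions for illustration, not the code added in this PR.

```javascript
// Compare two dims arrays element by element.
function dimsEqual(a, b) {
  if (a.length !== b.length) {
    return false;
  }
  for (let i = 0; i < a.length; i++) {
    if (a[i] !== b[i]) {
      return false;
    }
  }
  return true;
}

// Return the value unchanged when its shape matches the requested dims,
// otherwise fail early with a message that names the parameter.
function checkDims(name, value, dims) {
  if (!dimsEqual(value.dims, dims)) {
    throw new Error(
      'Parameter "' + name + '" has dims [' + value.dims.join(', ') +
      '] but [' + dims.join(', ') + '] was requested.');
  }
  return value;
}
```

Failing at the point where the parameter is returned ties the error to the parameter name, rather than to whichever later tensor operation first trips over the unexpected shape.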

stuhlmueller · 2017-02-26T01:01:25Z

Looks good.

Should we document that, to use randomness in init, people need to use dist.sample(), not sample(dist)? I imagine that many custom parameter initialization strategies will need randomness.

null-a · 2017-02-27T09:35:18Z

> Should we document that, to use randomness in init, people need to use dist.sample(), not sample(dist)?

Yeah, probably. Done.

One further thought I had was that we could run these init. functions in something similar to the forward sampling coroutine, so that we can write sample(dist) and still not have a random choice added to the model. Not having to mention dist.sample() seems like a win, since mentioning it could cause confusion.

On the other hand, perhaps we're better off sticking with the current approach, where we're effectively reserving the sample(dist) notation until we decide what we want it to mean (#788), which avoids switching its meaning on people in the future?
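To make the forward-sampling idea concrete, here is a simplified JavaScript sketch. It contrasts a handler that records every sample call as a random choice with one that only draws a value. All names (`makeTracingHandler`, `forwardHandler`, `uniform`, `randomInit`) are hypothetical, and passing the handler explicitly is a simplification of how a coroutine would be installed.

```javascript
// Inference-style handler: every sample call is recorded as a random choice.
function makeTracingHandler(trace) {
  return {
    sample: function(dist) {
      const value = dist.sample();
      trace.push(value);
      return value;
    }
  };
}

// Forward-sampling handler: draw a value and record nothing.
const forwardHandler = {
  sample: function(dist) { return dist.sample(); }
};

// Hypothetical distribution object exposing only a sample method.
function uniform(a, b) {
  return { sample: function() { return a + (b - a) * Math.random(); } };
}

// Random initializer written against whichever handler it is given.
function randomInit(dims, handler) {
  const size = dims.reduce(function(n, d) { return n * d; }, 1);
  const data = [];
  for (let i = 0; i < size; i++) {
    data.push(handler.sample(uniform(-0.1, 0.1)));
  }
  return { dims: dims.slice(), data: data };
}
```

Running `randomInit` with `forwardHandler` yields random initial values without adding anything to a trace, which is the behaviour wanted for parameter initialization.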

stuhlmueller · 2017-02-27T18:02:23Z

> One further thought I had was that we could run these init. functions in something similar to the forward sampling coroutine, so that we can write sample(dist) and still not have a random choice added to the model.

I like that even better. My hunch is that, whatever meaning we use for sampling during initialization in the context of meta-inference/optimization, it's likely to add up to plain forward sampling in basic contexts. (But if you want to keep using dist.sample(), that's fine with me, too.)

null-a · 2017-02-27T18:10:09Z

> I like that even better.

OK, great, I'll make the necessary changes to this PR.

ngoodman · 2017-02-28T17:11:21Z

it bothers me to have dist.sample used anywhere in user code. this seems to break the abstraction barrier that we've tried to keep in place. maybe add it but leave undocumented until we get the abstractions right? or do only the sample(dist) version?

null-a · 2017-02-28T17:20:56Z

> this seems to break the abstraction barrier that we've tried to keep in place

@ngoodman Couldn't agree more. That's exactly what motivated the idea of doing the work to make sample(dist) work here. That change will be made before this is merged.

null-a · 2017-03-03T14:04:49Z

> I like that even better.
>
> OK, great, I'll make the necessary changes to this PR.

I've completed this.

null-a added 4 commits February 24, 2017 10:38

Support passing init. function to param.

713004c

Check for dimension mismatch in param.

6133cd5

Don't allow sample/factor during parameter init.

d27b1af

Use applyd for drift kernels.

4ab9c7f

null-a requested a review from stuhlmueller February 24, 2017 14:29

Mention how to implement random init.

7c4aec0

null-a added 4 commits March 3, 2017 13:37

Refactor ForwardSample.

74a74d8

Allowing the coroutine to be used without collecting return values in a distribution.

Run parameter init. in forward sampling coroutine.

0f14add

This allows `sample(dist)` to be used for random initialization, rather than `dist.sample()`.

Revert "Mention how to implement random init."

bb56b2e

This reverts commit 7c4aec0.

Tweak error text.

35526be

stuhlmueller merged commit 3bd0d8e into probmods:dev Mar 3, 2017

null-a deleted the param-init branch March 3, 2017 16:59

hawkrobe mentioned this pull request Mar 6, 2017

Add an init option to sample #609

Open
