Posterior model of random forest #254

sgbaird · 2021-12-10T11:24:54Z

Are there facilities for sampling from the posterior distribution of the random forest? (e.g. for integration with Ax/BoTorch).

bfolie · 2021-12-13T18:18:30Z

Currently there is no method to sample from an RF distribution, though one could be easily created.

A BoTorch model only needs to produce a posterior, which only needs to implement rsample. The prediction of a random forest model in Lolo implements getUncertainty, which can reasonably be considered the standard deviation of a normal distribution, so you could draw samples in that way.
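As a rough illustration of that idea, here is a minimal sketch of drawing independent normal samples from a per-point mean and standard deviation. The arrays `mean` and `std` are made-up placeholders for the values a Lolo prediction would supply (the standard deviation standing in for the output of getUncertainty), and the helper name `sample_independent_normal` is hypothetical; nothing here is part of Lolo or BoTorch.

```python
import numpy as np


def sample_independent_normal(mean, std, n_samples, seed=None):
    """Draw samples treating each prediction as an independent normal.

    Hypothetical helper: `mean` and `std` are 1-D arrays with one entry
    per input point. Returns an array of shape (n_samples, n_points).
    """
    rng = np.random.default_rng(seed)
    mean = np.asarray(mean, dtype=float)
    std = np.asarray(std, dtype=float)
    # Reparameterized draw: mean + std * eps, with eps ~ N(0, 1)
    eps = rng.standard_normal((n_samples, mean.shape[0]))
    return mean + std * eps


# Placeholder values standing in for RF predictions and their uncertainties
mean = np.array([1.0, 2.5, 0.3])
std = np.array([0.1, 0.4, 0.2])
samples = sample_independent_normal(mean, std, n_samples=1000, seed=0)
```

Writing the draw as `mean + std * eps` mirrors the reparameterization that an rsample-style method typically relies on, but wrapping this in an actual BoTorch posterior is left out of the sketch.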

However, this might not be sufficient for your needs, because you'd be treating a potentially multivariate distribution as a product of several independent univariate distributions. This is in contrast to Gaussian process regression, which naturally produces a rich posterior that takes covariance into account.

We are about to release functionality inspired by some studies into correlations between random forest predictions in a multi-output setting. A similar approach (correlation over trees) would likely work to estimate the correlation coefficient between predictions made by the same RF model at distinct input points. That way, you could construct a covariance matrix and sample from the corresponding multivariate normal. However, this is not implemented or even thoroughly studied at this time.
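A sketch of what the correlation-over-trees idea might look like, under several assumptions: that per-tree predictions at each input point are available as an array `tree_preds` of shape (n_trees, n_points), that the correlation across trees is a usable estimate of the correlation between predictions, and that the per-point standard deviations come from something like getUncertainty. The data below is synthetic, and this is not an implemented or validated Lolo feature.

```python
import numpy as np

rng = np.random.default_rng(0)

# Placeholder predictive means and standard deviations at 3 input points
mean = np.array([1.0, 2.5, 0.3])
std = np.array([0.1, 0.4, 0.2])

# Synthetic per-tree predictions, shape (n_trees, n_points). A shared
# component makes the points correlated across trees (illustration only).
n_trees = 200
shared = rng.standard_normal((n_trees, 1))
noise = rng.standard_normal((n_trees, mean.shape[0]))
tree_preds = mean + 0.3 * shared + 0.1 * noise

# Correlation between input points, estimated over trees
corr = np.corrcoef(tree_preds, rowvar=False)

# Covariance matrix: scale the correlations by the per-point uncertainties
cov = corr * np.outer(std, std)

# Joint samples from the corresponding multivariate normal
samples = rng.multivariate_normal(mean, cov, size=500)
```

Using the correlation matrix rather than the raw covariance over trees keeps the diagonal equal to the squared per-point uncertainties, so the marginals match the independent-normal case while the off-diagonal terms carry the estimated dependence.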

sgbaird · 2021-12-14T11:26:32Z

@bfolie thank you for the quick response and thorough reply! Treating it as a sample from the normal distribution could work, though I agree that GPR is "richer" in terms of accounting for covariance. That is interesting to hear about the multi-output study.

sgbaird mentioned this issue Dec 15, 2021

Suggestions for implementing a composition-based optimization (i.e. fractional portion of ingredients) facebook/Ax#727

Closed
