Replace old Gibbs sampler with the experimental one. #2328

mhauru · 2024-09-23T13:29:08Z

Closes #2318.

Work in progress.

codecov · 2024-09-23T15:10:26Z

Codecov Report

Attention: Patch coverage is 87.45098% with 32 lines in your changes missing coverage. Please review.

Project coverage is 85.39%. Comparing base (2707d12) to head (96f8dd4).
Report is 1 commits behind head on master.

Files with missing lines	Patch %	Lines
src/mcmc/gibbs.jl	87.67%	26 Missing ⚠️
src/mcmc/repeat_sampler.jl	80.00%	4 Missing ⚠️
src/mcmc/Inference.jl	75.00%	1 Missing ⚠️
src/mcmc/particle_mcmc.jl	66.66%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2328      +/-   ##
==========================================
- Coverage   86.30%   85.39%   -0.92%     
==========================================
  Files          22       21       -1     
  Lines        1577     1588      +11     
==========================================
- Hits         1361     1356       -5     
- Misses        216      232      +16

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

coveralls · 2024-09-23T16:11:59Z

Pull Request Test Coverage Report for Build 12400670488

Details

223 of 255 (87.45%) changed or added relevant lines in 11 files are covered.
13 unchanged lines in 3 files lost coverage.
Overall coverage increased (+6.9%) to 85.39%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/mcmc/Inference.jl	3	4	75.0%
src/mcmc/particle_mcmc.jl	2	3	66.67%
src/mcmc/repeat_sampler.jl	16	20	80.0%
src/mcmc/gibbs.jl	185	211	87.68%

Files with Coverage Reduction	New Missed Lines	%
src/mcmc/Inference.jl	1	86.39%
src/mcmc/ess.jl	1	94.64%
src/mcmc/particle_mcmc.jl	11	86.75%

Totals
Change from base Build 12397554649:	6.9%
Covered Lines:	1356
Relevant Lines:	1588

💛 - Coveralls

HISTORY.md

mhauru · 2024-09-26T10:16:33Z

@torfjelde, if you have a moment to take a look at the one remaining test failure, would be interested in your thoughts. We are sampling for a model with two vector variables, m and z, and we seem to somehow end up with a case where there's a VarInfo with only z in it, but the sampler is looking for m too. I wonder if it's something about the interaction between particle sampling with Libtask and how the new Gibbs does things with the local varinfos. The test that fails is this one:

    @testset "dynamic model" begin
        @model function imm(y, alpha, ::Type{M}=Vector{Float64}) where {M}
            N = length(y)
            rpm = DirichletProcess(alpha)

            z = zeros(Int, N)
            cluster_counts = zeros(Int, N)
            fill!(cluster_counts, 0)

            for i in 1:N
                z[i] ~ ChineseRestaurantProcess(rpm, cluster_counts)
                cluster_counts[z[i]] += 1
            end

            Kmax = findlast(!iszero, cluster_counts)
            m = M(undef, Kmax)
            for k in 1:Kmax
                m[k] ~ Normal(1.0, 1.0)
            end
        end
        model = imm(Random.randn(100), 1.0)
        # https://github.com/TuringLang/Turing.jl/issues/1725
        # sample(model, Gibbs(MH(:z), HMC(0.01, 4, :m)), 100);
        sample(model, Gibbs(; z=PG(10), m=HMC(0.01, 4; adtype=adbackend)), 100)
    end

torfjelde · 2024-09-26T14:25:46Z

Will have a look at this in a bit @mhauru (just need to do some grocery shopping 😬 )

mhauru · 2024-09-26T14:29:42Z

Collecting links to old relevant PRs so I don't have to look for them again: #2231, #2099

torfjelde · 2024-09-26T18:22:53Z

Think I found the error: if the number of m increases, say, from length(m) = 2 to length(m) = 3 during the PG step, then the lines

Turing.jl/src/mcmc/gibbs.jl

Lines 57 to 65 in 6f9679a

    
           if has_conditioned_gibbs(context, vn) 
        
               value = get_conditioned_gibbs(context, vn) 
        
               return value, logpdf(right, value), vi 
        
           end 
        
           # Otherwise, falls back to the default behavior. 
        
           return DynamicPPL.tilde_assume( 
        
               rng, DynamicPPL.childcontext(context), sampler, right, vn, vi 
        
           )

doesn't hit the Gibbs branch since @varname(m[3]) is not present in the GibbsContext 😕

torfjelde · 2024-09-27T10:12:01Z

doesn't hit the Gibbs branch since @varname(m[3]) is not present in the GibbsContext

I'm a bit uncertain how we should best handle this @yebai @mhauru

The first partially viable idea that comes to mind is to subset the varinfo to make sure that it only contains the correct variables. If we do this, then m[3] will just be "ignored" (in the varinfos) until we're actually sampling the m variables, in which case it would be captured correctly.

But this would not quite be equivalent to the current implementation of Gibbs, which, AFAIK, keeps the very first occurence of m around rather than resampling everytime. And naively, I would expect this to be incorrect.

Another way is to explicitly add the varinfos to the GibbsContext itself, and then, when we encounter a value that should in fact go into a different varinfo, we add it there. But this has a few issues:

Requires the VarInfo to be mutable.
Requires the VarInfo to have a container that can keep the new incoming value m[3].
Implementation of Gibbs does end up being more complicated than the current approach. However, it might be worth it.

Thoughts?

yebai · 2024-09-27T10:22:10Z

Another way is to explicitly add the varinfos to the GibbsContext itself, and then, when we encounter a value that should in fact go into a different varinfo, we add it there. But this has a few issues:

Requires the VarInfo to be mutable.
Requires the VarInfo to have a container that can keep the new incoming value m[3].
Implementation of Gibbs does end up being more complicated than the current approach. However, it might be worth it.

I lean towards the above approach and (maybe later) provide explicit APIs to inference algorithms. This will enable us to handle reversible jumps (varying model dimensions) in MCMC more flexibly. At the moment, this is only possible in particle Gibbs; if it happens in HMC/MH, inference will likely fail (silently)

EDIT: we can keep VarInfos immutable by default, and requires inference developers to hook into specific APIs to mutate VarInfos.

torfjelde · 2024-09-27T10:34:51Z

This does however complicate the new Gibbs sampling procedure quite drastically 😕

And it makes me bring up a question I really didn't think I'd be asking: is it then actually preferable to the current Gibbs with keeping it all in a single VarInfo with a flag to specify whether it should be sampled or not? 😬

I guess we should first have a go at implementing this for the new Gibbs and then we can see 👍

Another point to add to the conversation that @mhauru brought to my attention the other day: we also want to support stuff like Gibbs(@varname(m) => NUTS(), @varname(m) => HMC()), i.e. multiple samplers targeting the same variables. This adds a few "complications" (beyond addressing the growing model problem discussed above):

Need to determine which varinfo to pick from varinfos based on the varnames present / targeted.
A naive implementation will result in duplicated entries in varinfos. We can however address this if we really feel like it's worth it, so probably a non-issue atm.

So all in all, immediate things we need to address with Gibbs:

Support changing dimensions.
Support picking a varinfo to condition on based on the varnames present rather than based on ===.

mhauru · 2024-10-10T15:27:02Z

I've been trying to think of a way to fix this, that would also fix the problem where different Gibbs subsamplers can't sample the same variables (e.g. you can't first sample x and y using one sampler, and then y and z with a different one). My best thought at the moment is the following design:

There is only one, global VarInfo, call it vi.
make_conditional takes that vi and a list of VarNames that the current subsampler samples. It hijacks the tilde pipeline to condition all other variables to their current values in vi.
vi may have some variables linked, some not.
Every time we call a subsampler we can hand it vi as the VarInfo. It won’t mess with any of the variables it’s not supposed to touch, because the tilde pipeline hijack from point 2.

Point 3. is maybe undesirable, but I think it’s minor compared to all the Selector/gibbsid stuff, which we would still get rid of.

The only problem I see with this is combining the local state from the previous iteration of the current subsampler with the global vi. Somehow we would need to join up-to-date information from the global vi with state-information from the previous iteration, specific to this subsampler. The right way to do this depends on the state, which is a different type of object for different subsamplers. EDIT: Actually, maybe this is okay, because we seem to already assume that every state object has a field called state.vi , we could just reset that.

The great benefit of sticking to one, global VarInfo is never having to worry about moving data between the local VarInfos. That would have to happen in both cases, when a new variable is introduced by one sampler (the failing test in this PR) and when two samplers sample the same variable. It sounds like a pain to implement.

mhauru · 2024-10-10T15:45:23Z

I can imagine two different philosophies to implementing a Gibbs sampler:

Every subsampler is doing its own sampling process on a low-dimensional model (a conditioned version of the full model), independent of the others. The logprobability function it's sampling from just keeps changing between iterations, because the other variables change and thus the conditioned model changes, but otherwise it's blind to the existence of the variables it isn't sampling. This is what the new Gibbs sampler does.
Every subsampler is working with the same, full model, with all the variables, but only makes the changes to a subset of those variables. It still "sees" the whole model. This is what the old Gibbs sampler did.

My above proposal would essentially be doing 2., but using code that's very much like the new sampler, where the information about which sampler modifies which variables is in the sampler/GibbsContext, and not in VarInfo like it was in the old Gibbs.

The reason I'm leaning towards 2. is that 1. seems to run to some fundamental issues in cases where either

Variables appear and disappear based on values of other variables,
Two samplers want to modify the value of the same variable.

Both of those situations quite deeply violate the idea that the different subsamplers can operate mostly independently of each other.

Any thoughts very welcome, I'm still very much trying to understand the landscape of the problem.

yebai · 2024-10-10T16:36:24Z

Thanks, @mhauru, for the excellent summary of the problem and proposals. Storing conditioned variables in a context, like GibbsContext as you suggested, is very sensible. The consequence is that VarInfo and Context will have overlapped model parameters, e.g. conditioned variables will be found in both VarInfo and Context, which is fine.

In addition, it's worth mentioning that we currently have two mechanisms for passing observations to models, i.e.

(1) via model arguments, e.g. gdemo(x, y).
(2) via condition API, e.g. condition(model, (x=1,y=2)).

Among these options, (1) will hardcode observation information directly in the model while (2) stores them in a context. You could look at the DynamicPPL codebase for a more detailed picture of how it works. We want to unify these options, perhaps towards using (2) only.

This Gibbs refactoring could be an excellent starting point for a design_notes repo to record these thoughts and discussions.

torfjelde · 2024-10-10T20:19:53Z

Every subsampler is working with the same, full model, with all the variables, but only makes the changes to a subset of those variables. It still "sees" the whole model. This is what the old Gibbs sampler did.

Overall, I'm also in favour of this @mhauru 👍 I think your reasoning is solid here.

The only other "option" I'm seeing is to keep track of which variables correpond to which varinfos (with each varinfo only containing the relevant information), but then we're effectively just re-implementing a lot of the functionality that is already provided in varinfo 😕

The only "issue" is that this does mean we have to support this "link / transform only part of the varinfo, which does mean we need something "equivalent" to all the getindex(varinfo, sampler) stuff that we've been trying to move away from (since we need a way to extract the vectorized part relevant only for the specific sampler we're going to use in that particular step) 😕

Doulby however, I think we can make this much nicer than the current approach by simply making all these getindex(varinfo, sampler) instead take the relevant varnames instead of the samplers themselves, which should make it all less painful.

But yeah, don't see how we can take approach (1) in a "nice" way, and so I'm also in favour of just trying to make (2) as painless as possible to maintain.

mhauru · 2024-10-11T08:17:52Z

Thanks for the comments both, this is very helpful.

Doulby however, I think we can make this much nicer than the current approach by simply making all these getindex(varinfo, sampler) instead take the relevant varnames instead of the samplers themselves, which should make it all less painful.

Yeah, I think this is the way to go.

…-sampler

Co-authored-by: Tor Erlend Fjelde <[email protected]>

…-sampler

mhauru · 2024-11-29T10:55:21Z

I'm done making the changes I had in mind. I may still experiment with some performance improvements, but not sure if any will make it in here. I'll also try to reduce the iteration counts in some tests to make them faster, the only CI failure is because one job just timed out at 6h.

Since both Tor and I seem to be happy, I'm gonna ping others in case they want to take a look: @penelopeysm, @willtebbutt, @sunxd3, @yebai. I think we can rely on @torfjelde giving an expert review, everyone else can judge for themselves how thorough a look they want to take, but I think everyone should be at least aware that this, somewhat major, change is happening. If you want to give this PR a review but haven't yet had time, self-request a review and we'll make sure to wait before merging.

For help in reviewing: This PR does a few things:

Deletes the old src/mcmc/gibbs.jl, and the related src/mcmc/gibbs_conditional.jl.
Moves src/experimental/gibbs.jl to be the new src/mcmc/gibbs.jl, and merges test/experimental/gibbs.jl and test/mcmc/gibbs.jl.
Makes a lot of edits to the experimental/new Gibbs to accommodate dynamic models and some other things.
Adds more, new tests to test/mcmc/gibbs.jl.
Introduces RepeatSampler and its tests. This has to be done in the same PR because the old Gibbs had repeat functionality built-in, whereas the new Gibbs doesn't.
Makes a bunch of small changes to various samplers to accommodate the new Gibbs.

Points 4-6 one can reviewed like usual, as a diff of a few hundred lines. Points 2-3 I think are better viewed as a new Gibbs sampler from scratch. The changes in point 3 are so extensive that reading it as a diff doesn't make much sense unless you know the old code really well.

penelopeysm · 2024-11-29T14:48:42Z

I'm happy to take a look next week, but doubt I'll get to it today as my head is already several layers deep in DynamicPPL stuff 😄

mhauru · 2024-11-29T17:01:10Z

I managed to decrease the iteration counts on a lot of the heaviest tests, the total runtime should be reduced substantially now. They seem to still pass somewhat robustly, i.e. I tried at least two random seeds.

Also did some quick checks of performance overheads, and the previous large overheads are gone in my example cases. Now, rather than being e.g. 100-500% slower than the old Gibbs we are more like 0-50% slower. This for models dominated by overheads from outside model evaluation, i.e. fast models where performance is not a big deal.

…-sampler

mhauru · 2024-12-02T16:49:30Z

The Mooncake stack overflows are something @willtebbutt is aware of and knows the reason for, so we can ignore them for now. Would still hold off from merging until they are fixed.

yebai · 2024-12-16T15:18:43Z

@penelopeysm, can you help resolve the merge conflicts so we can try to merge this before the new year?

penelopeysm · 2024-12-16T15:20:05Z

@yebai Sure! Are we happy otherwise with the PR, i.e. if conflicts are fixed and CI passes we can merge?

yebai · 2024-12-16T15:23:40Z

I think so.

penelopeysm · 2024-12-19T02:59:18Z

@yebai CI pretty much passes fine, apart from:

Some numerical tests fail on x86 by a fairly small amount. I can't quite tell why – as far as I can tell, everything has been seeded correctly.

One of them is in the Gibbs tests, on the dynamic Chinese restaurant process model. This test is slightly dubious anyway imo

Turing.jl/test/mcmc/gibbs.jl

Lines 475 to 485 in 96f8dd4

    
           # The below are regression tests. The values we are comparing against are from 
        
           # running the above model on the "old" Gibbs sampler that was in place still on 
        
           # 2024-11-20. The model was run 5 times with 10_000 samples each time. The values 
        
           # to compare to are the mean of those 5 runs, atol is roughly estimated from the 
        
           # standard deviation of those 5 runs. 
        
           # TODO(mhauru) Could we do something smarter here? Maybe a dynamic model for which 
        
           # the posterior is analytically known? Doing 10_000 samples to run the test suite 
        
           # is not ideal 
        
           # Issue ref: https://github.com/TuringLang/Turing.jl/issues/2402 
        
           @test isapprox(mean(num_ms), 8.6087; atol=0.8) 
        
           @test isapprox(std(num_ms), 1.8865; atol=0.02)

dynamic model: Test Failed at /home/runner/work/Turing.jl/Turing.jl/test/mcmc/gibbs.jl:484
  Expression: isapprox(mean(num_ms), 8.6087; atol = 0.8)
   Evaluated: isapprox(9.8377, 8.6087; atol = 0.8)

The other one is in ESS:

MoGtest_default with CSMC + ESS: Test Failed at /home/runner/work/Turing.jl/Turing.jl/test/test_utils/numerical_tests.jl:55
  Expression: ≈(E, val, atol = atol, rtol = rtol)
   Evaluated: 3.88348278598424 ≈ 4.0 (atol=0.1, rtol=0.0)

There's some weird behaviour in that Gibbs test suite runs much, much slower on 1.11 than 1.10. It doesn't affect the outcome though.

Personally I don't think that either of these are serious enough to prevent us from merging this PR. I reckon that both should be tracked via new issues. If you agree, feel free to hit the button 😄

yebai · 2024-12-19T11:02:00Z

There's some weird behaviour in that Gibbs test suite runs much, much slower on 1.11 than 1.10. It doesn't affect the outcome though.

This is likely a Libtask issue on Julia 1.11. Hopefully, we will resolve this in #2427. cc @willtebbutt

EDIT: it is slightly odd that Gibbs runs faster on the master branch for Julia 1.11 branch before this PR.

@penelopeysm can you open issues to track the other minor numerical issues on X86? This is likely due to an insufficient number of MCMC iterations.

yebai · 2024-12-19T11:02:48Z

Many thanks to @mhauru, @torfjelde, @penelopeysm, and all who helped!

sunxd3 · 2024-12-19T11:46:15Z

🎉🎉

mhauru added 3 commits September 23, 2024 14:28

Replace old Gibbs sampler with the experimental one.

5948253

Remove dead references to experimental

5a3e4a6

Remove mention of experimental from JuliaFormatter conf

09c739d

mhauru added 3 commits September 24, 2024 10:05

Add tests for deprecated constructor

58ebb25

Fix deprecated Gibbs constructors. Add HISTORY entry.

7715732

Bump version to 0.35.0

672f7d9

yebai reviewed Sep 24, 2024

View reviewed changes

HISTORY.md Outdated Show resolved Hide resolved

mhauru added 3 commits September 24, 2024 15:39

Add Gibbs constructor test for repeat samplers

7bf5abe

Fix typo in test/mcmc/ess.jl

85bcfa5

Use provided rng to initialise VarInfo in Gibbs

6f9679a

torfjelde mentioned this pull request Oct 4, 2024

Add some interface functions to support the new Gibbs sampler in Turing TuringLang/AbstractMCMC.jl#144

Closed

This was referenced Oct 4, 2024

More autoformatting #2359

Merged

Update to tilde overloads in mh.jl #2360

Merged

Fix a typo in GibbsContext

f247ad9

Merge remote-tracking branch 'origin/master' into mhauru/change-gibbs…

a790363

…-sampler

mhauru mentioned this pull request Oct 14, 2024

For VarInfo, fix merge and allow push!!ing new Symbols TuringLang/DynamicPPL.jl#690

Merged

mhauru and others added 2 commits November 29, 2024 09:57

Simplify is_target_varname

38ac128

Add suggestions from code review

7d6a983

Co-authored-by: Tor Erlend Fjelde <[email protected]>

mhauru mentioned this pull request Nov 29, 2024

Fix new Gibbs sampler for cases where only some variables need to be linked #2401

Open

Add a couple of issue references

39c2f21

mhauru mentioned this pull request Nov 29, 2024

Allow Gibbs sampler to have non-identity lenses for target variables #2403

Open

Merge remote-tracking branch 'origin/master' into mhauru/change-gibbs…

fa81f83

…-sampler

Restructure Gibbs inference tests and reduce iteration counts

ff09591

Reduce another iter count in Gibbs tests

1f5432f

mhauru added 2 commits December 2, 2024 15:01

Merge remote-tracking branch 'origin/master' into mhauru/change-gibbs…

74f6ac7

…-sampler

Add an info print to Gibbs tests

5f21c84

Use StableRNG, relax test tolerance

a15ce2f

torfjelde mentioned this pull request Dec 3, 2024

Remove (duplicate) samplers being defined explicitly in Turing.jl #2413

Open

3 tasks

penelopeysm self-assigned this Dec 17, 2024

penelopeysm added 2 commits December 18, 2024 18:43

Merge branch 'master' into mhauru/change-gibbs-sampler

5cf43b2

Fix a kwarg

96f8dd4

yebai merged commit 9e5467a into master Dec 19, 2024
59 of 62 checks passed

yebai deleted the mhauru/change-gibbs-sampler branch December 19, 2024 11:02

penelopeysm mentioned this pull request Dec 19, 2024

Gibbs performance regression on Julia v1.11 #2445

Closed

yebai mentioned this pull request Dec 19, 2024

Significantly improve the Libtask library using ideas from Mooncake / ReverseDiff #2427

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace old Gibbs sampler with the experimental one. #2328

Replace old Gibbs sampler with the experimental one. #2328

mhauru commented Sep 23, 2024

codecov bot commented Sep 23, 2024 •

edited

Loading

coveralls commented Sep 23, 2024 •

edited

Loading

mhauru commented Sep 26, 2024

torfjelde commented Sep 26, 2024

mhauru commented Sep 26, 2024

torfjelde commented Sep 26, 2024

torfjelde commented Sep 27, 2024

yebai commented Sep 27, 2024 •

edited

Loading

torfjelde commented Sep 27, 2024

mhauru commented Oct 10, 2024

mhauru commented Oct 10, 2024

yebai commented Oct 10, 2024 •

edited

Loading

torfjelde commented Oct 10, 2024

mhauru commented Oct 11, 2024

mhauru commented Nov 29, 2024

penelopeysm commented Nov 29, 2024

mhauru commented Nov 29, 2024

mhauru commented Dec 2, 2024

yebai commented Dec 16, 2024

penelopeysm commented Dec 16, 2024

yebai commented Dec 16, 2024

penelopeysm commented Dec 19, 2024 •

edited

Loading

yebai commented Dec 19, 2024 •

edited

Loading

yebai commented Dec 19, 2024

sunxd3 commented Dec 19, 2024

Replace old Gibbs sampler with the experimental one. #2328

Replace old Gibbs sampler with the experimental one. #2328

Conversation

mhauru commented Sep 23, 2024

codecov bot commented Sep 23, 2024 • edited Loading

Codecov Report

coveralls commented Sep 23, 2024 • edited Loading

Pull Request Test Coverage Report for Build 12400670488

Details

💛 - Coveralls

mhauru commented Sep 26, 2024

torfjelde commented Sep 26, 2024

mhauru commented Sep 26, 2024

torfjelde commented Sep 26, 2024

torfjelde commented Sep 27, 2024

yebai commented Sep 27, 2024 • edited Loading

torfjelde commented Sep 27, 2024

mhauru commented Oct 10, 2024

mhauru commented Oct 10, 2024

yebai commented Oct 10, 2024 • edited Loading

torfjelde commented Oct 10, 2024

mhauru commented Oct 11, 2024

mhauru commented Nov 29, 2024

penelopeysm commented Nov 29, 2024

mhauru commented Nov 29, 2024

mhauru commented Dec 2, 2024

yebai commented Dec 16, 2024

penelopeysm commented Dec 16, 2024

yebai commented Dec 16, 2024

penelopeysm commented Dec 19, 2024 • edited Loading

yebai commented Dec 19, 2024 • edited Loading

yebai commented Dec 19, 2024

sunxd3 commented Dec 19, 2024

codecov bot commented Sep 23, 2024 •

edited

Loading

coveralls commented Sep 23, 2024 •

edited

Loading

yebai commented Sep 27, 2024 •

edited

Loading

yebai commented Oct 10, 2024 •

edited

Loading

penelopeysm commented Dec 19, 2024 •

edited

Loading

yebai commented Dec 19, 2024 •

edited

Loading