Add interface functions to allow replacing the log density function and replacing AD wrapper type #33
Conversation
cc @devmotion, @tpapp, @torfjelde, @yebai, @miguelbiron
Also see comment in discussion.
It is unfortunate that the AD backend is not recoverable from the gradient wrapper; the cleanest solution would be ADgradient(get_AD(ℓ), new_ℓ) and just implement get_AD.
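A minimal sketch of that idea, assuming get_AD (proposed here, not existing API) and a hypothetical new_gradient helper:

```julia
using LogDensityProblemsAD
using LogDensityProblemsAD: ADGradientWrapper  # abstract type of the gradient wrappers

# Proposed accessor: each concrete wrapper type would implement a method
# returning the backend specification it was constructed from
# (a Symbol/Val or an ADTypes.jl struct).
function get_AD end

# With that single accessor, replacing the log density needs no further
# interface; `new_gradient` is just an illustrative helper name.
new_gradient(∇ℓ::ADGradientWrapper, new_ℓ) = ADgradient(get_AD(∇ℓ), new_ℓ)
```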
It is cleaner. We can opt for it after the PR is merged.
But then we would have to change the interface again... I am inclined to go with the get_AD approach, but will wait to hear from @devmotion.
My impression from TuringLang/Turing.jl#2231 (comment) and related comments in Turing.jl was that there's no clear need for such an API currently? One reason for such an API would be a case where calling ADgradient again is expensive.

Regarding the implementation: couldn't we achieve this functionality by overloading an existing function?
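For concreteness, the expensive case is something like ReverseDiff with a compiled tape, where rebuilding the wrapper recompiles the tape. A toy sketch (ToyProblem is made up, and the compile keyword is assumed to be the ReverseDiff extension's tape-compilation option):

```julia
using LogDensityProblems, LogDensityProblemsAD, ReverseDiff

# Minimal toy problem implementing the LogDensityProblems interface.
struct ToyProblem end
LogDensityProblems.logdensity(::ToyProblem, x) = -sum(abs2, x) / 2
LogDensityProblems.dimension(::ToyProblem) = 10
LogDensityProblems.capabilities(::Type{ToyProblem}) =
    LogDensityProblems.LogDensityOrder{0}()

# Building the wrapper with a compiled tape pays a one-time compilation cost.
∇ℓ = ADgradient(:ReverseDiff, ToyProblem(); compile = Val(true))

# Today, "replacing" the log density means calling ADgradient again from
# scratch, which recompiles the tape -- the cost a replacement API would avoid.
∇ℓ2 = ADgradient(:ReverseDiff, ToyProblem(); compile = Val(true))
```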
Does the ADTypes.jl extension not effectively solve this? Or are there some kwargs that are still missing from the ADTypes.jl structs?
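For reference, the ADTypes.jl route looks roughly like this (reusing ToyProblem from the sketch above, and assuming the ADTypes extension forwards AutoReverseDiff's compile flag):

```julia
using ADTypes, LogDensityProblemsAD, ReverseDiff

# The backend object carries its own options (e.g. tape compilation), so a
# caller only needs to keep `backend` around to rebuild the wrapper.
backend = ADTypes.AutoReverseDiff(; compile = true)
∇ℓ = ADgradient(backend, ToyProblem())

# Rebuilding around a new log density is then just ADgradient(backend, new_ℓ),
# i.e. effectively ADgradient(get_AD(∇ℓ), new_ℓ) once a get_AD accessor exists.
```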
I have a new proposal: add an interface function get_AD that returns the AD backend of a gradient wrapper.

EDIT: just realized this is exactly what @tpapp was suggesting 👍

The motivation is that I don't think ...
Sorry for the late responses, I am on holiday with limited net access.

@torfjelde: the problem is that not all of the API uses ADTypes.

@sunxd3: yes, the cleanest solution would be that, see my comment above. But we need to clean up the API first.

I am not sure how pressing the need for this solution is. We could introduce something interim that solves the problem for Turing, with the understanding that it is internal and would be removed once we solve this.
Gotcha 👍
We have a workaround on our side, so I think it's less pressing atm.
Ref #32 (comment)
Brief summary:
- added a replace_ℓ interface function
- ADgradient can now take in an ADGradientWrapper and recreate a new gradient wrapper with its log density function

I only added some implementations for ReverseDiff.

This is very much a draft right now; everything is open to modification.
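A rough sketch of those two pieces, as one possible reading of the draft rather than its actual code; recovering the wrapped log density via Base.parent is an assumption about LogDensityProblemsAD internals.

```julia
using LogDensityProblemsAD
using LogDensityProblemsAD: ADGradientWrapper

# (1) replace_ℓ(∇ℓ, new_ℓ): return a wrapper of the same kind as ∇ℓ but
#     wrapping new_ℓ. Concrete methods would live in the backend extensions
#     (only ReverseDiff in this draft); here only the generic function is declared.
function replace_ℓ end

# (2) ADgradient(kind, ∇ℓ::ADGradientWrapper): take an existing wrapper, pull
#     out its log density, and wrap it with the requested AD backend.
function LogDensityProblemsAD.ADgradient(kind::Symbol, ∇ℓ::ADGradientWrapper)
    return ADgradient(Val(kind), parent(∇ℓ))
end
```

Inside the package this second method would sit next to the existing ADgradient(kind::Symbol, ℓ) method, so plain construction keeps working unchanged.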