Merge LPGD into diffcp #67

a-paulus · 2024-08-26T10:46:10Z

Pull request for enabling LPGD differentiation of the conic program in diffcp.

LPGD info

LPGD computes informative replacements for the true derivatives in degenerate cases as efficient finite differences.
For the forward derivatives this implementation just computes standard finite differences (with an additional optional regularization term).
For adjoint derivatives we compute finite differences between gradients of the conic program Lagrangian, evaluated at the original solution and a perturbed solution, requiring only one (two if double-sided) additional solver evaluations. See the paper for a detailed derivation of the LPGD adjoint derivatives as the gradient of an envelope function to the linearized loss.
Note that in the limit of small perturbations tau, LPGD computes the true derivatives (if they exist). For larger tau the computed derivatives do not match the true derivatives but can provide more informative signal.

Code

LPGD can be enabled with the mode=LPGD argument of solve_and_derivative. It also requires passing the perturbation strength tau (and optionally the regularization strength rho) with derivative_kwargs=dict(tau=0.1, rho=0.1). Alternatively the derivative kwargs can be passed directly, e.g. adjoint_derivative(dx, dy, ds, tau=0.1, rho=0.1)

In the code the main addition are the methods derivative_lpgd/adjoint_derivative_lpgd in cone_program.py. These methods internally call compute_perturbed_solution/ compute_adjoint_perturbed_solution to get the solution to a perturbed optimization problem, and then return the derivatives as finite differences.

For testing, the existing diffcp examples are included as modified versions using LPGD differentiation.

Note on implementation: If activated, the optional regularization requires solving a quadratic cone problem, i.e. setting P!=0. For this reason we added an optional P=None kwarg to solve_internal which is passed to the solver if quadratic objectives are supported.

SteveDiamond · 2024-08-29T18:37:50Z

This is great! I'm not sure what the failing builds are about.

PTNobel · 2024-08-31T04:20:36Z

The CI issues are a symptom of the master branch CI being broken. Fixing it is on my goals for the long weekend. I'd love a chance to review this PR carefully before merging, but on a quick pass it looks great Anselm! Is there any timeline you need this merged by?

a-paulus · 2024-08-31T08:45:58Z

Glad to hear you like it! There is no rush to merge it from my side, please take your time to carefully review it.

PTNobel

Looks great! A few small questions. Sorry for the long delay on the review.

PTNobel · 2024-12-08T03:22:02Z

diffcp/cone_program.py

        raise ValueError("Unsupported mode {}; the supported modes are "
-                         "'dense', 'lsqr' and 'lsmr'".format(mode))
+                         "'dense', 'lsqr', 'lsmr', 'lpgd', 'lpgd_right' and 'lpgd_left'".format(mode))
    if np.isnan(A.data).any():


Shouldn't we check here if P is None for the dense, lsqr, lsmr cases? They don't support quadratic objectives in my understanding.

Yes that makes sense, I will add this.

PTNobel · 2024-12-08T03:23:41Z

diffcp/cone_program.py


        return dA, db, dc
+
+    def derivative_lpgd(dA, db, dc, tau, rho):


Do you have quadratic objective support? I noticed you weren't banning them in construction above.

Yes this is probably good to discuss. The LPGD method supports derivatives and adjoint derivatives for all parameters, including the quadratic P term here. The reason I didn't include it was to not break some compatibility by changing the arguments/outputs of the exposed derivative and adjoint_derivative methods (and their batched versions). I guess the derivative would be simple to resolve by just adding an optional dP=None argument, but for the adjoint derivative one would need to change the number of outputs. I was thinking about adding a request_dP flag? If you think this makes sense I can come up with some proposal changes.

I added the differentiation w.r.t. P now, see the latest commits. I added two examples but have not tested this extensively

PTNobel · 2024-12-08T03:27:13Z

diffcp/utils.py

+        if solve_method == "ECOS":
+            warm_start = None
+        else:
+            warm_start = (np.hstack([x, s]), np.hstack([y, y]), np.hstack([s, s]))


Does Clarabel use a warmstart?

It does not seem like they support it, currently in diffcp the warm_start is also not passed to the Clarabel solver so it definitely is not used at the moment. Probably better to make this explicit, I will add a check to throw an error when trying to use warmstarteing with Clarabel.

PTNobel · 2024-12-08T03:28:02Z

examples/batch_ecos.py

    xs, ys, ss, D_batch, DT_batch = diffcp.solve_and_derivative_batch(As, bs, cs, Ks,
-                                                                      n_jobs_forward=1, n_jobs_backward=n_jobs, solver="ECOS", verbose=False)
+                                                                      n_jobs_forward=1, n_jobs_backward=n_jobs, solve_method="ECOS", verbose=False)


PTNobel · 2024-12-08T03:28:31Z

examples/dual_example.py

@@ -9,7 +9,7 @@
 # defined as a product of a 3-d fixed cone, 3-d positive orthant cone,


Suggested change

# defined as a product of a 3-d fixed cone, 3-d positive orthant cone,

# defined as a product of a 3-d zero cone, 3-d positive orthant cone,

PTNobel · 2024-12-08T03:28:47Z

examples/ecos_example.py

@@ -9,7 +9,7 @@
 # defined as a product of a 3-d fixed cone, 3-d positive orthant cone,


Suggested change

# defined as a product of a 3-d fixed cone, 3-d positive orthant cone,

# defined as a product of a 3-d zero cone, 3-d positive orthant cone,

PTNobel · 2024-12-08T03:29:00Z

examples/ecos_example_lpgd.py

+
+
+# We generate a random cone program with a cone
+# defined as a product of a 3-d fixed cone, 3-d positive orthant cone,


Suggested change

# defined as a product of a 3-d fixed cone, 3-d positive orthant cone,

# defined as a product of a 3-d zero cone, 3-d positive orthant cone,

a-paulus and others added 14 commits July 29, 2024 11:19

Update README.md

20fd10c

Add LPGD

26727e1

Update Readme

8b0f8ac

simplify argument passing

6e761ce

add more lpgd examples

e4f8e82

Catch error if perturbed problem infeasible

1fa3d05

Give better error message on infeasibility

133d30d

fix docstrings

761b8ba

minor changes

ca0eb64

cosmetic changes

c73b116

Merge branch 'cvxgrp:master' into master

476faf2

update readme

ca78344

move perturbed solution computation to utils

3d1235c

cosmetics

ca1be3f

PTNobel approved these changes Dec 8, 2024

View reviewed changes

a-paulus added 4 commits December 13, 2024 22:37

resolve domments on pull request + minor edits

1f4e4bf

add differentiation w.r.t. P

d2aada1

add examples for differentiating w.r.t. P

dcfa6ba

minor cosmetic change

8a9069c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge LPGD into diffcp #67

Merge LPGD into diffcp #67

a-paulus commented Aug 26, 2024

SteveDiamond commented Aug 29, 2024

PTNobel commented Aug 31, 2024

a-paulus commented Aug 31, 2024

PTNobel left a comment •

edited

Loading

PTNobel Dec 8, 2024

a-paulus Dec 13, 2024

PTNobel Dec 8, 2024

a-paulus Dec 13, 2024

a-paulus Dec 13, 2024

PTNobel Dec 8, 2024

a-paulus Dec 13, 2024

PTNobel Dec 8, 2024

a-paulus Dec 13, 2024

PTNobel Dec 8, 2024

PTNobel Dec 8, 2024

PTNobel Dec 8, 2024

		@@ -9,7 +9,7 @@
		# defined as a product of a 3-d fixed cone, 3-d positive orthant cone,

	# defined as a product of a 3-d fixed cone, 3-d positive orthant cone,
	# defined as a product of a 3-d zero cone, 3-d positive orthant cone,



		# We generate a random cone program with a cone
		# defined as a product of a 3-d fixed cone, 3-d positive orthant cone,

Merge LPGD into diffcp #67

Are you sure you want to change the base?

Merge LPGD into diffcp #67

Conversation

a-paulus commented Aug 26, 2024

LPGD info

Code

SteveDiamond commented Aug 29, 2024

PTNobel commented Aug 31, 2024

a-paulus commented Aug 31, 2024

PTNobel left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PTNobel left a comment •

edited

Loading