feat: add `PETScSNES` #482

avik-pal · 2024-10-25T18:02:36Z

PETSc 0.2 resolves old compatibility issues we were having with wrapping this library.

Warning

I am still seeing occasional segfaults with PETSc. Very common in 1.11. And for some reason PETSc hijacks Ctrl + C and trying to use it leads to a segfault even when no code is running

fixes #160

TODOs

Sparsity Handling?
Testing
Rewrite one of the examples from PETSc.jl

avik-pal · 2024-10-25T18:47:36Z

This 150 lines of code https://github.com/JuliaParallel/PETSc.jl/blob/main/examples/SNES_ex2.jl translates to 30 lines in nonlinearsolve with automatic sparsity detection

# This implements src/snes/examples/tutorials/ex2.c from PETSc using the PETSc.jl package, using SNES
#
# This solves the equations sequentially
# 
# Newton method to solve u'' + u^{2} = f, sequentially.
using NonlinearSolve, PETSc, LinearAlgebra, SparseConnectivityTracer

n = 21
u0 = fill(0.5, n)

function form_residual!(resid, x, _)
    n = length(x)
    xp = LinRange(0.0, 1.0, n)
    F = 6xp .+ (xp .+ 1e-12) .^ 6

    dx = 1 / (n - 1)
    resid[1] = x[1]
    for i in 2:(n - 1)
        resid[i] = (x[i - 1] - 2x[i] + x[i + 1]) / dx^2 + x[i] * x[i] - F[i]
    end
    resid[n] = x[n] - 1

    return
end

nlfunc = NonlinearFunction{true}(form_residual!; sparsity = TracerSparsityDetector())
nlprob = NonlinearProblem(nlfunc, u0)

solve(nlprob, NewtonRaphson())
solve(nlprob, PETScSNES())

avik-pal · 2024-10-25T21:15:58Z

PETSc is faster than us in the sparse case at least on my local computer. Once all the refactoring is done we will have to revisit the benchmarks

avik-pal · 2024-10-25T22:15:33Z

Looking at the profile all of the time goes in the UMFPACK

docs/src/api/petsc.md

ChrisRackauckas · 2024-10-26T04:59:53Z

docs/src/tutorials/snes_ex2.md

+## Runtimes
+
+### Dense Jacobian
+
+```@example snes_ex2
+@benchmark solve($(nlprob_dense), $(NewtonRaphson()); abstol = 1e-8)
+nothing # hide
+```
+
+```@example snes_ex2
+@benchmark solve($(nlprob_dense), $(PETScSNES()); abstol = 1e-8)
+nothing # hide
+```
+
+### Sparse Jacobian
+
+```@example snes_ex2
+@benchmark solve($(nlprob_sparse), $(NewtonRaphson()); abstol = 1e-8)
+nothing # hide
+```
+
+```@example snes_ex2
+@benchmark solve($(nlprob_sparse), $(PETScSNES()); abstol = 1e-8)
+nothing # hide


What's the result? Just curious

https://html-preview.github.io/?url=https://raw.githubusercontent.com/SciML/NonlinearSolve.jl/refs/heads/gh-pages/previews/PR482/tutorials/snes_ex2/index.html#Sparse-Jacobian

not by much on CI. On my laptop it is 300us vs 260us (PETSc being faster).

How optimized is our sparse solver selection for LinearSolve?

Relatively optimal? What sparse linear solver is PETSc using here? Try all of UMFPACK, KLU, and MKLPardiso. Other thing to try is using MKL with Pardiso. Then note that there's a new sparse solver ParU that @rayegun is getting wrapped SciML/LinearSolve.jl#394.

One thing to double check is what the extra time is spent in. Is it only doing one symbolic factorization?

Let's get this in, since it is non-blocking. I anyways need to redo benchmarks for the paper, so I can revisit it after the splitting is done.

avik-pal force-pushed the ap/petsc branch from 4f5fbca to 8364408 Compare October 25, 2024 18:03

feat: add PETScSNES

8fe5ee6

avik-pal force-pushed the ap/petsc branch from 8364408 to 8fe5ee6 Compare October 25, 2024 18:03

avik-pal added 3 commits October 25, 2024 14:48

feat: support automatic sparsity detection for PETSc

12baf76

test: add PETScSNES to the wrapper tests

9e63995

docs: add PETSc example

370aa68

avik-pal requested a review from ChrisRackauckas October 25, 2024 21:16

avik-pal force-pushed the ap/petsc branch from 897c326 to 7248f64 Compare October 25, 2024 22:47

test: skip PETSc tests on windows

05bbc79

avik-pal force-pushed the ap/petsc branch from 7248f64 to 05bbc79 Compare October 25, 2024 23:05

ChrisRackauckas reviewed Oct 26, 2024

View reviewed changes

docs/src/api/petsc.md Show resolved Hide resolved

ChrisRackauckas reviewed Oct 26, 2024

View reviewed changes

docs: print the benchmark results

c143b5c

ChrisRackauckas merged commit 28c0189 into master Oct 27, 2024
16 of 39 checks passed

ChrisRackauckas deleted the ap/petsc branch October 27, 2024 10:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add `PETScSNES` #482

feat: add `PETScSNES` #482

avik-pal commented Oct 25, 2024 •

edited

Loading

avik-pal commented Oct 25, 2024

avik-pal commented Oct 25, 2024

avik-pal commented Oct 25, 2024

ChrisRackauckas Oct 26, 2024

avik-pal Oct 26, 2024

avik-pal Oct 26, 2024

ChrisRackauckas Oct 26, 2024

avik-pal Oct 26, 2024

feat: add PETScSNES #482

feat: add PETScSNES #482

Conversation

avik-pal commented Oct 25, 2024 • edited Loading

TODOs

avik-pal commented Oct 25, 2024

avik-pal commented Oct 25, 2024

avik-pal commented Oct 25, 2024

ChrisRackauckas Oct 26, 2024

Choose a reason for hiding this comment

avik-pal Oct 26, 2024

Choose a reason for hiding this comment

avik-pal Oct 26, 2024

Choose a reason for hiding this comment

ChrisRackauckas Oct 26, 2024

Choose a reason for hiding this comment

avik-pal Oct 26, 2024

Choose a reason for hiding this comment

feat: add `PETScSNES` #482

feat: add `PETScSNES` #482

avik-pal commented Oct 25, 2024 •

edited

Loading