Add Chebyshev Iteration #1289
base: develop
Conversation
Do you intend to add an eigenvalue estimation? I think that would be very helpful, because most of the time users don't have that available. I think PETSc also does that. I think this is the GMRES version used by PETSc to compute the estimate: https://petsc.org/release/docs/manualpages/KSP/KSPAGMRES/
@MarcelKoch thanks for providing these references. I do not think I will put them in this pull request. From https://doi.org/10.1137/0907057, they introduce the algorithm. From https://petsc.org/release/docs/manualpages/KSP/KSPCHEBYSHEV/#kspchebyshev: for GMRES, we need to get the Hessenberg matrix out and compute the eigenvalues on it.
Besides the comments left below, I want to mention the num_keep and num_generated mechanic. That needs some explanation, because right now I don't understand what it is for. It seems like some sort of restart mechanic, but I might be wrong there. Also, exposing this to the user seems confusing to me.
/**
 * The number of scalar to keep
 */
int GKO_FACTORY_PARAMETER_SCALAR(num_keep, 2);
It is not clear what this parameter is for.
Perhaps this means: construct Chebyshev polynomials up to degree num_keep, and after that just do IR with these polynomials?
No, num_keep is used to keep the generated scalars in storage. alpha and beta change with each iteration, but they are fixed by the given bounds. It's used for fixed-iteration runs, because then we do not need to refill these scalars for different apply calls.
But shouldn't we then just keep all scalars?
When using it as a normal solver, we may not have the maximum iteration information. Allocating one big dense matrix and then moving the data to another one when it is full may not be a good approach. We could also extend the workspace so that it can handle std::vector, which would be more flexible for an uncertain size.
I cannot assume that there is an iteration criterion; the combined criterion can also contain a residual norm or time criterion. If we assume that, we should provide a standalone iteration parameter instead. Users do not need to know the implementation details. The parameter simply states how many scalars Chebyshev keeps to avoid the refill overhead. I can introduce this usage later when we have the use case. I think it should help in situations where kernel launch overhead is noticeable.
You are right, this is very useful to negate the overhead. Still, I think we can assume that there is an iteration criterion somewhere in the combined one and use that. If not, we would just throw.
You could set the default value to unspecified, which you define beforehand. When the solver is generated, you can check if the parameter is unspecified and then try to extract an iteration criterion from the criteria. If that doesn't work, you throw with a message to either pass an iteration criterion or set this parameter to something else.
Or alternatively, just replace the criteria parameter with an iterations parameter and move the class into the preconditioner namespace. I think that is also a fine option, since that would be the main use case anyway.
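A minimal sketch of what the first option could look like, assuming the criteria are stored as a vector of criterion factories as in other Ginkgo solvers; the helper name `extract_iteration_count` is made up for illustration:

```cpp
#include <memory>
#include <vector>

#include <ginkgo/ginkgo.hpp>

// Sketch only: look for an Iteration criterion among the configured criteria
// and return its max_iters; throw if none is found, mirroring the suggestion
// to either pass an iteration criterion or set the parameter explicitly.
gko::size_type extract_iteration_count(
    const std::vector<std::shared_ptr<const gko::stop::CriterionFactory>>&
        criteria)
{
    for (const auto& factory : criteria) {
        if (auto iter_factory = std::dynamic_pointer_cast<
                const gko::stop::Iteration::Factory>(factory)) {
            return iter_factory->get_parameters().max_iters;
        }
    }
    GKO_NOT_SUPPORTED(criteria);
}
```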
I have moved it to be based on the iteration count given by the stopping criterion. It is only increased after creating the object. I am also wondering whether staying at a fixed number is enough or not.
Thanks, I think it should be fine the way it is now.
/**
 * Chebyshev iteration is an iterative method that uses another coarse
 * method to approximate the error of the current solution via the current
 * residual. It has another term for the difference of solution. Moreover, this
* residual. It has another term for the difference of solution. Moreover, this
* residual. The solution is then updated using the Chebyshev polynomials. Moreover, this
Is that what the sentence is trying to say? Anyway, I think it should be mentioned somewhere that this uses these polynomials.
the solution x_i is also based on the
TBH I don't see the relevance of that. Does that have any effect for the user?
No, I am only trying to explain the algorithm's difference from IR. IR uses
I still think this has to be rephrased
I have tried to rephrase it again. Could you take a look?
I am fine with merging this PR once the corrections to the parts of the documentation pointed out in the comments are in place. I could understand what the num_keep variable was used for, but perhaps a more descriptive name would be better.
LGTM, but I would like to resolve some of my earlier issues. Other than that only minor nits.
/**
 * Chebyshev iteration is an iterative method that uses another coarse
 * method to approximate the error of the current solution via the current
 * residual. It has another term for the difference of solution. Moreover, this
I still think this has to be rephrased
Co-authored-by: Marcel Koch <[email protected]>
There are a few questions around data recycling and mutability I would like to discuss before merging this
* Chebyshev iteration is an iterative method that can solving nonsymeetric
* problems. Chebyshev Iterations avoids the inner products for computation
* which may be the bottleneck for distributed system. Chebyshev Iteration is
* developed based on the Chebyshev polynomials of the first kind. Moreover,
* this method requires knowledge about the spectrum of the preconditioned
* matrix. This implementation follows the algorithm in "Templates for the
* Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd
* Edition".
I would rearrange this slightly (the fact that we need to know the spectrum is important, so we should mention it early), and try to be consistent with capitalization of iteration.
* Chebyshev iteration is an iterative method for solving nonsymmetric
* problems based on some knowledge of the spectrum of the (preconditioned)
* system matrix. It avoids the computation of inner products, which may be a
* performance bottleneck for distributed systems. Chebyshev iteration is
* developed based on Chebyshev polynomials of the first kind.
* This implementation follows the algorithm in "Templates for the
* Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd
* Edition".
mutable size_type num_generated_scalar_ = 0;
// num_max_generation_ is the number of generated scalar kept in the
// workspace.
mutable size_type num_max_generation_ = 3;
I'm a bit wary of mutable members beyond well-controlled cases like the workspace. They might make future work on thread-safety more complicated. Not that I have a good alternative for it, but I want to bring it up.
Isn't the workspace handled in a similar way?
We don't have any mutability in our LinOps right now beyond vector caches and workspaces; most of the mutable uses are in loggers etc., so the mutability is constrained to a single class. That's not the case here.
In general, if I write code, I expect a const object to not change internally, and the only reasons we don't follow this requirement fully right now seem to be the need for cache data structures to avoid reallocation, and the fact that loggers are const and can't change themselves without mutable.
auto chebyshev_factory =
    gko::solver::Chebyshev<value_type>::build()
        .with_criteria(
            gko::stop::Iteration::build().with_max_iters(2u).on(ref))
this entire file can use deferred factories
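For reference, a sketch of the deferred-factory form of the quoted snippet above, assuming `value_type` and `ref` from the surrounding test; the nested criterion factory is passed without `.on(...)`, and the executor is attached when the outer factory is built:

```cpp
// Sketch only: `value_type` and `ref` are taken from the quoted test above.
auto chebyshev_factory =
    gko::solver::Chebyshev<value_type>::build()
        // deferred: no `.on(ref)` on the nested stopping-criterion factory
        .with_criteria(gko::stop::Iteration::build().with_max_iters(2u))
        .on(ref);
```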
return SimpleSolverTest<gko::solver::Chebyshev<solver_value_type>>::
    build(exec, iteration_count, check_residual)
        .with_preconditioner(
            precond_type::build().with_max_block_size(1u).on(exec));
deferred?
namespace gko {
namespace solver {
The naming solver is a bit of a stretch here - maybe even more so than IR. It makes sense for us that solvers are iterative algorithms that use a preconditioner, but from the user perspective, it's confusing. Maybe we need a concept in-between, like smoother, but that could also include classical preconditioners like Gauß-Seidel.
}

TYPED_TEST(Chebyshev, SolvesTriangularSystemUsingAdvancedApplyMixed)
Is this actually mixed precision? It doesn't look like it to me at first glance.
}

TYPED_TEST(Chebyshev, SolvesTriangularSystemUsingAdvancedApplyMixedComplex)
Is this mixed precision?
this->template log<log::Logger::iteration_complete>(
    this, dense_b, dense_x, iter, residual_ptr, nullptr, nullptr,
    solver, dense_b, dense_x, iter, residual_ptr, nullptr, nullptr,
Here solver == this, so I would suggest removing the capture.
auto old_num_max_generation = num_max_generation_;
// Use the scalar first
// get the iteration information from stopping criterion.
visit_criteria(
I think this should be a separate parameter, not derived from the stopping criterion. The stopping criteria are not really intended to be queried this way.
It was, but @MarcelKoch suggested getting it from the stopping criterion to reduce the number of parameters, I think.
If we put it as a separate parameter, then we should not allow for a stopping criteria parameter. This would be fine with me, but I think it causes other issues, as this can't be considered a solver anymore.
The smoother doesn't have much purpose with norm-based stopping criteria, because that would negate the benefit of avoiding reductions, right? And can it run correctly when it uses more iterations than the number of stored scalar values? If yes, then we could have two separate parameters, iterations and subspace_dimension (or similar); if no, we could have just a single parameter iterations and internally create an Iteration stopping criterion.
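To make that alternative concrete, a purely hypothetical sketch of the single-parameter interface; `with_iterations` and `with_foci` are invented names (not part of this PR), and `exec` stands for an existing executor:

```cpp
// Hypothetical interface sketch only: the smoother would create the
// gko::stop::Iteration criterion internally instead of exposing a criteria
// parameter. `with_foci` is an assumed name for the eigenvalue-bound input.
auto smoother_factory =
    gko::solver::Chebyshev<double>::build()
        .with_foci(0.1, 4.0)    // assumed lower/upper bounds of the spectrum
        .with_iterations(3u)    // replaces .with_criteria(...)
        .on(exec);
```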
if (num_generated_scalar_ < num_max_generation_) {
    alpha_scalar->fill(alpha_ref);
    // unused beta for first iteration, but fill zero
    beta_scalar->fill(zero<ValueType>());
    num_generated_scalar_++;
}
Am I understanding correctly that this is potentially re-using alpha and beta values from previous iterations? If so, that seems like it is using the workspace for something it was not originally intended for - it's intended for avoiding reallocation, not for recycling values from previous iterations. That also relates to the mutable members I would prefer to avoid.
@upsj To avoid the mutable members and reusing values from the workspace, I think we can move this into generation and pre-fill all alpha and beta before apply. I think it will also give better performance, because we do not need to do these tiny fills up to the limit during apply.
If those are just static coefficients independent of the matrix, that sounds very reasonable. I tend to agree with what I believe @MarcelKoch said, which is that we shouldn't consider changeable stopping criteria here. We also currently don't seem to have a way to propagate information from other parts of the solver hierarchy to compute appropriate focus values.
It adds the Chebyshev iteration from https://en.wikipedia.org/wiki/Chebyshev_iteration
The second-order Richardson method uses a similar formula, but its scalars are constant over all iterations. Chebyshev iteration updates the scalars from the previous ones (the scalars stay the same if the upper/lower eigenvalue bounds do not change).
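For reference, a self-contained sketch of the scalar recurrence from "Templates for the Solution of Linear Systems" (2nd ed.), which the documentation cites; `lambda_min`/`lambda_max` stand for the assumed eigenvalue bounds, and the function name is made up for illustration:

```cpp
#include <cstdio>
#include <utility>
#include <vector>

// Sketch only: generate the (alpha, beta) pairs for a fixed number of
// iterations. With fixed eigenvalue bounds the whole sequence is fixed,
// which is what makes precomputing and caching the scalars possible.
std::vector<std::pair<double, double>> chebyshev_scalars(double lambda_min,
                                                         double lambda_max,
                                                         int num_iterations)
{
    const double d = (lambda_max + lambda_min) / 2.0;  // center of spectrum
    const double c = (lambda_max - lambda_min) / 2.0;  // half-width of spectrum
    std::vector<std::pair<double, double>> scalars;
    double alpha = 0.0;
    double beta = 0.0;
    for (int i = 1; i <= num_iterations; ++i) {
        if (i == 1) {
            alpha = 1.0 / d;
            beta = 0.0;  // beta is unused in the first iteration
        } else {
            beta = (c * alpha / 2.0) * (c * alpha / 2.0);
            alpha = 1.0 / (d - beta / alpha);
        }
        // The update then uses p_i = z_{i-1} + beta * p_{i-1} and
        // x_i = x_{i-1} + alpha * p_i, in contrast to second-order Richardson,
        // where alpha and beta stay constant over all iterations.
        scalars.emplace_back(alpha, beta);
    }
    return scalars;
}

int main()
{
    for (const auto& [alpha, beta] : chebyshev_scalars(0.1, 4.0, 5)) {
        std::printf("alpha = %f, beta = %f\n", alpha, beta);
    }
}
```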