`Sparse::updateData` reads nnz from `nnz_expanded_` if the `is_expanded_` flag is set #176

superwhiskers · 2024-07-10T14:40:29Z

when determining size of the buffers to copy data to, Sparse::updateData reads the number of nonzero values within the matrix from the nnz_expanded_ field if the matrix is marked as expanded, when it should be using nnz_ in all cases, for if the matrix is expanded, nnz_ should be set to nnz_expanded_ anyway. this is confusing behavior, and that branch should be removed in all copies of the Sparse::updateData source to remediate it, which should not fail on correctly behaving code. this branch is present in all copies of this method

The text was updated successfully, but these errors were encountered:

superwhiskers · 2024-07-10T19:11:17Z

this issue is also related to the klu_klu_test failure still present in #175 and #174, as the failing assertion results from an assumption that nnz_ contains the used portion of the allocated space regardless of the value of the is_expanded_. the assumption of the old coo2csr and of the Csr::updateFromCoo method is that nnz_ always contains the number of indices needed by the unexpanded matrix (or the matrix as-is if not symmetric) and that nnz_expanded_ is used if the matrix is symmetric. the old coo2csr will update these fields with the used value counts after deduplication of the main diagonal, which does introduce a potential test failure with unmodified code under the edge case in which the second input matrix used to make the csr matrix has duplicate values on the main diagonal

the newer code in #175 and #174 instead treats nnz_ as always representing the number of indices needed by the matrix in its current state, and ignores nnz_expanded_. this is the cause of the test failure. to fix this issue we'd need to decide upon a convention for these fields that all code must be updated to uphold

my proposal for this is the following:

get rid of nnz_expanded_ and replace it with nnz_unexpanded_ which, if the matrix is symmetric, will contain the number of indices used when unexpanded if it is currently expanded and is undefined otherwise
guarantee that is_expanded_ is reliable (as an invariant) so that redundant checking of upper or lower triangularity in functions such as matrix::expand and matrix::coo2csr is unnecessary
thoroughly document all of this through doxygen comments

the rationale is the following: it's more efficient to have a guaranteed single source of truth for the current number of used indices than it is to have two separate locations that you need to branch on to get the number of used indices. additionally, this is the usual case anyway, it doesn't seem to me that most code will care about the number of indices used in the smallest possible representation (the effective behavior of the current code with respect to nnz_) storing the unexpanded size (if there is one) makes this efficient to access should code care about this, but i doubt it's even necessary to do this

one variant of this proposal is to instead merge the functionality of is_expanded_ and nnz_unexpanded_ into a single field which is some sentinel value, such as 0 or -1 if the matrix is not expanded and the number of indices used if unexpanded otherwise. i'm not too sure how i feel about this option, but it's an alternative that should be considered at the least

pelesh · 2024-09-13T13:47:22Z

CC @kswirydo

pelesh · 2024-09-18T17:25:19Z

There is really no need for nnz_expanded_ or its unexpanded counterpart. It will be removed altogether. It is just a leftover from an experimental code. Thank you for bringing this issue to the forefront.

superwhiskers added bug Something isn't working documentation Improvements or additions to documentation invalid This doesn't seem right labels Jul 10, 2024

superwhiskers removed the invalid This doesn't seem right label Jul 10, 2024

pelesh mentioned this issue Sep 13, 2024

Rework of coo2csr conversion #174

Closed

pelesh mentioned this issue Sep 17, 2024

Refactor matrix file I/O #184

Merged

4 tasks

pelesh self-assigned this Sep 18, 2024

pelesh closed this as completed in #184 Sep 20, 2024

pelesh closed this as completed in b326fbd Sep 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`Sparse::updateData` reads nnz from `nnz_expanded_` if the `is_expanded_` flag is set #176

`Sparse::updateData` reads nnz from `nnz_expanded_` if the `is_expanded_` flag is set #176

superwhiskers commented Jul 10, 2024 •

edited

Loading

superwhiskers commented Jul 10, 2024 •

edited

Loading

pelesh commented Sep 13, 2024

pelesh commented Sep 18, 2024

Sparse::updateData reads nnz from nnz_expanded_ if the is_expanded_ flag is set #176

Sparse::updateData reads nnz from nnz_expanded_ if the is_expanded_ flag is set #176

Comments

superwhiskers commented Jul 10, 2024 • edited Loading

superwhiskers commented Jul 10, 2024 • edited Loading

pelesh commented Sep 13, 2024

pelesh commented Sep 18, 2024

`Sparse::updateData` reads nnz from `nnz_expanded_` if the `is_expanded_` flag is set #176

`Sparse::updateData` reads nnz from `nnz_expanded_` if the `is_expanded_` flag is set #176

superwhiskers commented Jul 10, 2024 •

edited

Loading

superwhiskers commented Jul 10, 2024 •

edited

Loading