[Containers] use OrderedDict as the data structure for SparseAxisArray #3681

odow · 2024-02-17T03:07:23Z

Closes #3678
Closes #3680

The issue is still really that SparseAxisArray is not a regular n-dimensional Array. Each slice can have a different length with different axes. It doesn't make sense to support a lot of operations with it.

The only thing that we should support are vectors.

This PR has the problem that the current iteration order is the transpose of what it "should" be (row-major, instead of column-major).

This is needed because the second and subsequent dimensions can depend on the first and prior. So it isn't really meaningful to iterate "down" the first index holding all else constant.

I can see this being a constant source of future problems.

src/Containers/SparseAxisArray.jl

codecov · 2024-02-17T04:08:44Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (a21e616) 98.33% compared to head (57eb71c) 98.36%.
Report is 1 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3681      +/-   ##
==========================================
+ Coverage   98.33%   98.36%   +0.03%     
==========================================
  Files          43       43              
  Lines        5696     5698       +2     
==========================================
+ Hits         5601     5605       +4     
+ Misses         95       93       -2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

src/Containers/SparseAxisArray.jl

joaquimg · 2024-02-17T11:16:49Z

Shouldnt the iteration ordering comment be part of docs?

odow · 2024-02-17T21:13:27Z

Yeah. I guess my first question is: should we do this?

If we do, we need more docs on the specifics of the order and slicing, and when things can go wrong.

Also: https://github.com/jump-dev/JuMP.jl/actions/runs/7944311147

joaquimg · 2024-02-17T22:28:01Z

The slicing issues you showed seem pretty dangerous.

The only downside would be the performance penalty? Did you benchmark it?

odow · 2024-02-18T04:36:52Z

The only downside would be the performance penalty? Did you benchmark it?

I didn't benchmark it, but I think the risk of incorrect usage outweighs the cost.

It's probably slower because creating an ordered dict is slower. But it shouldn't be too bad.

joaquimg · 2024-02-18T13:26:14Z

I agree correctness must come first.

Is there any other downside?

odow · 2024-02-18T20:00:10Z

I added a warning.

I guess the main downside is that existing iteration orders may change. But that could have happened anyway, because we used a Dict.

blegat

The change of order of eachindex is indeed sub-optimal since we want SparseAxisArrays to match the behavior of Array it is dense and it's indices are Base.OneTo. However, it doesn't seem that this could be done in general and it's still a strict improvement to the behavior before this PR.

docs/src/manual/containers.md

odow · 2024-02-22T19:47:01Z

Before merging, we should check if we can trigger issues like
JuliaCollections/OrderedCollections.jl#87

We should also audit open issues in OrderedCollections for other potential problems.

odow · 2024-02-22T22:51:47Z

I audited all the open issues. I think we're safe from the delete segfault because you cannot delete elements in a SparseAxisArray.

The most up for debate issue is: JuliaCollections/OrderedCollections.jl#82

julia> Containers.@container(x[k in 1:2], k, container = SparseAxisArray)
JuMP.Containers.SparseAxisArray{Int64, 1, Tuple{Int64}} with 2 entries:
  [1]  =  1
  [2]  =  2

julia> Containers.@container(y[k in 2:-1:1], k, container = SparseAxisArray)
JuMP.Containers.SparseAxisArray{Int64, 1, Tuple{Int64}} with 2 entries:
  [2]  =  2
  [1]  =  1

julia> x == y
true

But this coincidentally just reproduces the behavior of the current JuMP:

julia> Containers.@container(x[k in 1:2], k, container = SparseAxisArray)
JuMP.Containers.SparseAxisArray{Int64, 1, Tuple{Int64}} with 2 entries:
  [1]  =  1
  [2]  =  2

julia> Containers.@container(y[k in 2:-1:1], k, container = SparseAxisArray)
JuMP.Containers.SparseAxisArray{Int64, 1, Tuple{Int64}} with 2 entries:
  [1]  =  1
  [2]  =  2

julia> x == y
true

mlubin · 2024-02-29T00:47:54Z

JuliaCollections/OrderedCollections.jl#82 isn't too concerning to me.

odow · 2024-02-29T03:05:21Z

So good to merge then?

[Containers] use OrderedDict as the data structure for SparseAxisArray

554574f

odow commented Feb 17, 2024

View reviewed changes

src/Containers/SparseAxisArray.jl Show resolved Hide resolved

odow added 2 commits February 17, 2024 16:21

Update

69a264c

Update

245bfab

odow commented Feb 17, 2024

View reviewed changes

src/Containers/SparseAxisArray.jl Outdated Show resolved Hide resolved

odow added 2 commits February 17, 2024 17:16

Update src/Containers/SparseAxisArray.jl

1786691

Update SparseAxisArray.jl

052db1b

Add warning

0859b92

odow mentioned this pull request Feb 18, 2024

[breaking] throw error if Containers.SparseAxisArray used in constraint function #3680

Closed

odow requested review from blegat and mlubin February 18, 2024 20:46

blegat approved these changes Feb 19, 2024

View reviewed changes

mlubin reviewed Feb 19, 2024

View reviewed changes

docs/src/manual/containers.md Outdated Show resolved Hide resolved

Update containers.md

57eb71c

mlubin approved these changes Feb 29, 2024

View reviewed changes

odow merged commit 1195695 into master Feb 29, 2024
11 checks passed

odow deleted the od/sparse-order branch February 29, 2024 03:21

Ethan-Russell mentioned this pull request Sep 26, 2024

Deserialization no longer backwards compatible due to JuMP.SparseAxisArray internal data structure change e4st-dev/E4ST.jl#307

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Containers] use OrderedDict as the data structure for SparseAxisArray #3681

[Containers] use OrderedDict as the data structure for SparseAxisArray #3681

odow commented Feb 17, 2024

codecov bot commented Feb 17, 2024 •

edited

Loading

joaquimg commented Feb 17, 2024

odow commented Feb 17, 2024

joaquimg commented Feb 17, 2024

odow commented Feb 18, 2024

joaquimg commented Feb 18, 2024

odow commented Feb 18, 2024

blegat left a comment

odow commented Feb 22, 2024

odow commented Feb 22, 2024

mlubin commented Feb 29, 2024

odow commented Feb 29, 2024

[Containers] use OrderedDict as the data structure for SparseAxisArray #3681

[Containers] use OrderedDict as the data structure for SparseAxisArray #3681

Conversation

odow commented Feb 17, 2024

codecov bot commented Feb 17, 2024 • edited Loading

Codecov Report

joaquimg commented Feb 17, 2024

odow commented Feb 17, 2024

joaquimg commented Feb 17, 2024

odow commented Feb 18, 2024

joaquimg commented Feb 18, 2024

odow commented Feb 18, 2024

blegat left a comment

Choose a reason for hiding this comment

odow commented Feb 22, 2024

odow commented Feb 22, 2024

mlubin commented Feb 29, 2024

odow commented Feb 29, 2024

codecov bot commented Feb 17, 2024 •

edited

Loading