Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: dependent array type #813

Open
wants to merge 30 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 26 commits
Commits
Show all changes
30 commits
Select commit Hold shift + click to select a range
b98d7fa
feat: dependent array type
fgdorais May 27, 2024
0eca0e0
feat: experimental implementation
fgdorais May 27, 2024
c632cf5
chore: docs and tests
fgdorais May 27, 2024
2e64898
feat: use inductive model definition
fgdorais May 27, 2024
a59a726
chore: cleanup
fgdorais May 27, 2024
62a6444
chore: cleanup
fgdorais May 27, 2024
90a1e93
chore: cleanup
fgdorais May 27, 2024
c2ef604
chore: more tests
fgdorais May 28, 2024
e91f84c
fix: typos
fgdorais May 28, 2024
8bd44e0
feat: add `recOn` and `casesOn`
fgdorais May 28, 2024
ee049a5
chore: cleanup and docs
fgdorais May 28, 2024
45446e7
chore: time for `mk`
fgdorais May 28, 2024
ddff5df
fix: docs
fgdorais May 28, 2024
419a4c4
feat: add `Inhabited` instance
fgdorais May 28, 2024
88e32c9
chore: add todo for lean4#2292
fgdorais May 28, 2024
11d3587
fix: use `NonScalar` instead of `Unit`
fgdorais May 28, 2024
bf511fd
feat: add `uget`
fgdorais May 28, 2024
f794ffa
fix: remove `recOn`
fgdorais May 28, 2024
b581611
fix: typo
fgdorais May 28, 2024
3f815fe
feat: add `uset`
fgdorais May 28, 2024
10d0530
feat: add `modify` and `umodify`
fgdorais May 28, 2024
c8d001c
feat: add `push` and `pop`
fgdorais May 28, 2024
332e74d
feat: lemmas for `modify`
fgdorais May 28, 2024
c665f52
fix: make `modify` and `umodify` semireducible
fgdorais May 28, 2024
f25aaa8
chore: split Basic and Lemmas
fgdorais May 28, 2024
41c015d
chore: merge main
fgdorais Oct 1, 2024
77eb2a2
chore: merge main
fgdorais Nov 15, 2024
5347980
chore: merge main
fgdorais Dec 20, 2024
d3a45c1
chore: merge main
fgdorais Dec 23, 2024
8510de1
fix: use fget and fset
fgdorais Dec 23, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions Batteries.lean
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,7 @@ import Batteries.Data.BinomialHeap
import Batteries.Data.ByteArray
import Batteries.Data.ByteSubarray
import Batteries.Data.Char
import Batteries.Data.DArray
import Batteries.Data.DList
import Batteries.Data.Fin
import Batteries.Data.FloatArray
Expand Down
2 changes: 2 additions & 0 deletions Batteries/Data/DArray.lean
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
import Batteries.Data.DArray.Basic
import Batteries.Data.DArray.Lemmas
157 changes: 157 additions & 0 deletions Batteries/Data/DArray/Basic.lean
Original file line number Diff line number Diff line change
@@ -0,0 +1,157 @@
/-
Copyright (c) 2024 François G. Dorais. All rights reserved.
Released under Apache 2.0 license as described in the file LICENSE.
Authors: François G. Dorais
-/

namespace Batteries

/-!
# Dependent Arrays

`DArray` is a heterogenous array where the type of each item depends on the index. The model
for this type is the dependent function type `(i : Fin n) → α i` where `α i` is the type assigned
to items at index `i`.

The implementation of `DArray` is based on Lean's dynamic array type. This means that the array
values are stored in a contiguous memory region and can be accessed in constant time. Lean's arrays
also support destructive updates when the array is exclusive (RC=1).

### Implementation Details

Lean's array API does not directly support dependent arrays. Each `DArray n α` is internally stored
as an `Array NonScalar` with length `n`. This is sound since Lean's array implementation does not
record nor use the type of the items stored in the array. So it is safe to use `UnsafeCast` to
convert array items to the appropriate type when necessary.
-/

/-- `DArray` is a heterogenous array where the type of each item depends on the index. -/
-- TODO: Use a structure once [lean4#2292](https://github.com/leanprover/lean4/pull/2292) is fixed.
inductive DArray (n) (α : Fin n → Type _) where
fgdorais marked this conversation as resolved.
Show resolved Hide resolved
/-- Makes a new `DArray` with given item values. `O(n*g)` where `get i` is `O(g)`. -/
| mk (get : (i : Fin n) → α i)

namespace DArray

section unsafe_implementation

private unsafe abbrev data : DArray n α → Array NonScalar := unsafeCast

private unsafe def mkImpl (get : (i : Fin n) → α i) : DArray n α :=
unsafeCast <| Array.ofFn fun i => (unsafeCast (get i) : NonScalar)

private unsafe def getImpl (a : DArray n α) (i) : α i :=
unsafeCast <| a.data.get ⟨i.val, lcProof⟩

private unsafe def ugetImpl (a : DArray n α) (i : USize) (h : i.toNat < n) : α ⟨i.toNat, h⟩ :=
unsafeCast <| a.data.uget i lcProof

private unsafe def setImpl (a : DArray n α) (i) (v : α i) : DArray n α :=
unsafeCast <| a.data.set ⟨i.val, lcProof⟩ <| unsafeCast v

private unsafe def usetImpl (a : DArray n α) (i : USize) (h : i.toNat < n) (v : α ⟨i.toNat, h⟩) :
DArray n α := unsafeCast <| a.data.uset i (unsafeCast v) lcProof

private unsafe def modifyFImpl [Functor f] (a : DArray n α) (i : Fin n)
(t : α i → f (α i)) : f (DArray n α) :=
let v := unsafeCast <| a.data.get ⟨i.val, lcProof⟩
-- Make sure `v` is unshared, if possible, by replacing its array entry by `box(0)`.
let a := unsafeCast <| a.data.set ⟨i.val, lcProof⟩ (unsafeCast ())
setImpl a i <$> t v

private unsafe def umodifyFImpl [Functor f] (a : DArray n α) (i : USize) (h : i.toNat < n)
(t : α ⟨i.toNat, h⟩ → f (α ⟨i.toNat, h⟩)) : f (DArray n α) :=
let v := unsafeCast <| a.data.uget i lcProof
-- Make sure `v` is unshared, if possible, by replacing its array entry by `box(0)`.
let a := unsafeCast <| a.data.uset i (unsafeCast ()) lcProof
usetImpl a i h <$> t v

private unsafe def pushImpl (a : DArray n α) (v : β) :
DArray (n+1) fun i => if h : i.val < n then α ⟨i.val, h⟩ else β :=
unsafeCast <| a.data.push <| unsafeCast v

private unsafe def popImpl (a : DArray (n+1) α) : DArray n fun i => α i.castSucc :=
unsafeCast <| a.data.pop

private unsafe def copyImpl (a : DArray n α) : DArray n α :=
unsafeCast <| a.data.extract 0 n

end unsafe_implementation

attribute [implemented_by mkImpl] DArray.mk

instance (α : Fin n → Type _) [(i : Fin n) → Inhabited (α i)] : Inhabited (DArray n α) where
default := mk fun _ => default

/-- Gets the `DArray` item at index `i`. `O(1)`. -/
@[implemented_by getImpl]
protected def get : DArray n α → (i : Fin n) → α i
| mk get => get

@[simp, inherit_doc DArray.get]
protected abbrev getN (a : DArray n α) (i) (h : i < n := by get_elem_tactic) : α ⟨i, h⟩ :=
a.get ⟨i, h⟩

/-- Gets the `DArray` item at index `i : USize`. Slightly faster than `get`; `O(1)`. -/
@[implemented_by ugetImpl]
protected def uget (a : DArray n α) (i : USize) (h : i.toNat < n) : α ⟨i.toNat, h⟩ :=
a.get ⟨i.toNat, h⟩

private def casesOnImpl.{u} {motive : DArray n α → Sort u} (a : DArray n α)
(h : (get : (i : Fin n) → α i) → motive (.mk get)) : motive a :=
h a.get

attribute [implemented_by casesOnImpl] DArray.casesOn

/-- Sets the `DArray` item at index `i`. `O(1)` if exclusive else `O(n)`. -/
@[implemented_by setImpl]
protected def set (a : DArray n α) (i : Fin n) (v : α i) : DArray n α :=
mk fun j => if h : i = j then h ▸ v else a.get j

/--
Sets the `DArray` item at index `i : USize`.
Slightly faster than `set` and `O(1)` if exclusive else `O(n)`.
-/
@[implemented_by usetImpl]
protected def uset (a : DArray n α) (i : USize) (h : i.toNat < n) (v : α ⟨i.toNat, h⟩) :=
a.set ⟨i.toNat, h⟩ v

@[simp, inherit_doc DArray.set]
protected abbrev setN (a : DArray n α) (i) (h : i < n := by get_elem_tactic) (v : α ⟨i, h⟩) :=
a.set ⟨i, h⟩ v

/-- Modifies the `DArray` item at index `i` using transform `t` and the functor `f`. -/
@[implemented_by modifyFImpl]
protected def modifyF [Functor f] (a : DArray n α) (i : Fin n)
(t : α i → f (α i)) : f (DArray n α) := a.set i <$> t (a.get i)

/-- Modifies the `DArray` item at index `i` using transform `t`. -/
@[inline]
protected def modify (a : DArray n α) (i : Fin n) (t : α i → α i) : DArray n α :=
a.modifyF (f:=Id) i t

/-- Modifies the `DArray` item at index `i : USize` using transform `t` and the functor `f`. -/
@[implemented_by umodifyFImpl]
protected def umodifyF [Functor f] (a : DArray n α) (i : USize) (h : i.toNat < n)
(t : α ⟨i.toNat, h⟩ → f (α ⟨i.toNat, h⟩)) : f (DArray n α) := a.uset i h <$> t (a.uget i h)

/-- Modifies the `DArray` item at index `i : USize` using transform `t`. -/
@[inline]
protected def umodify (a : DArray n α) (i : USize) (h : i.toNat < n)
(t : α ⟨i.toNat, h⟩ → α ⟨i.toNat, h⟩) : DArray n α :=
a.umodifyF (f:=Id) i h t

/-- Copies the `DArray` to an exclusive `DArray`. `O(1)` if exclusive else `O(n)`. -/
@[implemented_by copyImpl]
protected def copy (a : DArray n α) : DArray n α := mk a.get

/-- Push an element onto the end of a `DArray`. `O(1)` if exclusive else `O(n)`. -/
@[implemented_by pushImpl]
protected def push (a : DArray n α) (v : β) :
DArray (n+1) fun i => if h : i.val < n then α ⟨i.val, h⟩ else β :=
mk fun i => if h : i.val < n then dif_pos h ▸ a.get ⟨i.val, h⟩ else dif_neg h ▸ v

/-- Delete the last item of a `DArray`. `O(1)`. -/
@[implemented_by popImpl]
protected def pop (a : DArray (n+1) α) : DArray n fun i => α i.castSucc :=
mk fun i => a.get i.castSucc
75 changes: 75 additions & 0 deletions Batteries/Data/DArray/Lemmas.lean
Original file line number Diff line number Diff line change
@@ -0,0 +1,75 @@
/-
Copyright (c) 2024 François G. Dorais. All rights reserved.
Released under Apache 2.0 license as described in the file LICENSE.
Authors: François G. Dorais
-/

import Batteries.Data.DArray.Basic

namespace Batteries.DArray

@[ext]
protected theorem ext : {a b : DArray n α} → (∀ i, a.get i = b.get i) → a = b
| mk _, mk _, h => congrArg _ <| funext fun i => h i

@[simp]
theorem get_mk (i : Fin n) : DArray.get (.mk init) i = init i := rfl

theorem set_mk {α : Fin n → Type _} {init : (i : Fin n) → α i} (i : Fin n) (v : α i) :
DArray.set (.mk init) i v = .mk fun j => if h : i = j then h ▸ v else init j := rfl

@[simp]
theorem get_set (a : DArray n α) (i : Fin n) (v : α i) : (a.set i v).get i = v := by
simp only [DArray.get, DArray.set, dif_pos]

theorem get_set_ne (a : DArray n α) (v : α i) (h : i ≠ j) : (a.set i v).get j = a.get j := by
simp only [DArray.get, DArray.set, dif_neg h]

@[simp]
theorem set_set (a : DArray n α) (i : Fin n) (v w : α i) : (a.set i v).set i w = a.set i w := by
ext j
if h : i = j then
rw [← h, get_set, get_set]
else
rw [get_set_ne _ _ h, get_set_ne _ _ h, get_set_ne _ _ h]

theorem get_modifyF [Functor f] [LawfulFunctor f] (a : DArray n α) (i : Fin n) (t : α i → f (α i)) :
(DArray.get . i) <$> a.modifyF i t = t (a.get i) := by
simp [DArray.modifyF, ← comp_map]
conv => rhs; rw [← id_map (t (a.get i))]
congr; ext; simp

@[simp]
theorem get_modify (a : DArray n α) (i : Fin n) (t : α i → α i) :
(a.modify i t).get i = t (a.get i) := get_modifyF (f:=Id) a i t

theorem get_modify_ne (a : DArray n α) (t : α i → α i) (h : i ≠ j) :
(a.modify i t).get j = a.get j := get_set_ne _ _ h

@[simp]
theorem set_modify (a : DArray n α) (i : Fin n) (t : α i → α i) (v : α i) :
(a.set i v).modify i t = a.set i (t v) := by
ext j
if h : i = j then
cases h; simp
else
simp [h, get_modify_ne, get_set_ne]

@[simp]
theorem uget_eq_get (a : DArray n α) (i : USize) (h : i.toNat < n) :
a.uget i h = a.get ⟨i.toNat, h⟩ := rfl

@[simp]
theorem uset_eq_set (a : DArray n α) (i : USize) (h : i.toNat < n) (v : α ⟨i.toNat, h⟩) :
a.uset i h v = a.set ⟨i.toNat, h⟩ v := rfl

@[simp]
theorem umodifyF_eq_modifyF [Functor f] (a : DArray n α) (i : USize) (h : i.toNat < n)
(t : α ⟨i.toNat, h⟩ → f (α ⟨i.toNat, h⟩)) : a.umodifyF i h t = a.modifyF ⟨i.toNat, h⟩ t := rfl

@[simp]
theorem umodify_eq_modify (a : DArray n α) (i : USize) (h : i.toNat < n)
(t : α ⟨i.toNat, h⟩ → α ⟨i.toNat, h⟩) : a.umodify i h t = a.modify ⟨i.toNat, h⟩ t := rfl

@[simp]
theorem copy_eq (a : DArray n α) : a.copy = a := rfl
28 changes: 28 additions & 0 deletions test/darray.lean
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
import Batteries.Data.DArray

open Batteries

def foo : DArray 3 fun | 0 => String | 1 => Nat | 2 => Array Nat :=
.mk fun | 0 => "foo" | 1 => 42 | 2 => #[4, 2]

def bar := foo.set 0 "bar"

#guard foo.get 0 == "foo"
#guard foo.get 1 == 42
#guard foo.get 2 == #[4, 2]

#guard (foo.set 1 1).get 0 == "foo"
#guard (foo.set 1 1).get 1 == 1
#guard (foo.set 1 1).get 2 == #[4, 2]

#guard bar.get 0 == "bar"
#guard (bar.set 0 (foo.get 0)).get 0 == "foo"
#guard ((bar.set 0 "baz").set 1 1).get 0 == "baz"
#guard ((bar.set 0 "baz").set 0 "foo").get 0 == "foo"
#guard ((bar.set 0 "foo").set 0 "baz").get 0 == "baz"

def Batteries.DArray.head : DArray (n+1) α → α 0
| mk f => f 0

#guard foo.head == "foo"
#guard bar.head == "bar"