Environment optimization #1035
Conversation
// Allocate the image tag if it hasn't been allocated before,
// create the full image pointer and add it to bound allocations
let img_tag = ctx.global_allocator.alloc_tag_cloned(&mut cs, &Env);
Is this 'maybe' allocation deterministic within the circuit, or does it depend on the dynamic data? If the latter, then we risk inconsistent circuits. For this to be okay, the calls to `alloc_tag_cloned` need to always happen in the same order, with the same arguments. This is a larger question than just this usage.
I assume it's okay, since the design of the global allocator (and these lazy-allocating methods) assumes the above. I'm just mentioning it so you can tell me it's fine everywhere. If in any doubt, we should convince ourselves.
I believe that all constants, including tags, are allocated beforehand, so when you call this, it only amounts to retrieving the previously allocated constant. This does not depend on dynamic data, so it's fine.
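To illustrate the point, here is a minimal sketch of why such a call can be circuit-deterministic. The types and the simplified `alloc_tag_cloned` signature below are hypothetical, not the actual Lurk `GlobalAllocator` API: every tag constant is allocated once, up front, in a fixed order, so later calls are pure cache lookups and the circuit shape never depends on the data being evaluated.

```rust
use std::collections::HashMap;

#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
enum Tag {
    Env,
    Fun,
    Sym,
}

#[derive(Clone, Debug)]
struct AllocatedConst(u64); // stand-in for an allocated circuit variable

#[derive(Default)]
struct GlobalAllocator {
    consts: HashMap<Tag, AllocatedConst>,
}

impl GlobalAllocator {
    /// Pre-allocate every tag constant once, in a fixed order that does not
    /// depend on the input being evaluated.
    fn preallocate(&mut self) {
        for (i, tag) in [Tag::Env, Tag::Fun, Tag::Sym].into_iter().enumerate() {
            self.consts.insert(tag, AllocatedConst(i as u64));
        }
    }

    /// Simplified stand-in for `alloc_tag_cloned`: a pure cache lookup, so the
    /// circuit shape is the same regardless of the dynamic data.
    fn alloc_tag_cloned(&self, tag: Tag) -> AllocatedConst {
        self.consts
            .get(&tag)
            .cloned()
            .expect("all tag constants are allocated before synthesis")
    }
}

fn main() {
    let mut allocator = GlobalAllocator::default();
    allocator.preallocate();
    // Same call, same argument, same result, independent of what is evaluated.
    println!("{:?}", allocator.alloc_tag_cloned(Tag::Env));
}
```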
!gpu-benchmark
!gpu-bench
Force-pushed from ad82b4b to 55128a3
!gpu-benchmark
Benchmark for 73badc6
Excellent. I'm obviously in favor of this change in principle. I looked at the code, and everything looks good. I would say that some LEM details are unfamiliar enough (I have not stayed entirely up to speed on all the internals) that having another set of eyes on the 'code review', rather than the 'design review', would be useful.
Force-pushed from 55128a3 to 540d07a
`letrec` will now add thunk expressions which add themselves to the environment before evaluating.
This is so that we can delay hashing, much like `Ptr`s delay hashing.
Now `letrec` works as before, but updated to the new environment representation. The reason for reverting is that the iteration count increased in a bunch of tests. For thunks to work properly, we will need the memoset implementation of eval.
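As a conceptual illustration of the delayed hashing mentioned above (hypothetical types and a placeholder hash, not the actual Lurk `Ptr`/`Store` implementation), the sketch below interns pairs by store index and only computes, and caches, a digest when it is actually requested:

```rust
use std::collections::HashMap;

type F = u64; // stand-in for a field element

#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
enum Ptr {
    Atom(F),
    // Index into the store's tuple arena; nothing has been hashed yet.
    Tuple(usize),
}

// Placeholder for the real 2-ary hash.
fn hash2(a: F, b: F) -> F {
    a.wrapping_mul(31).wrapping_add(b)
}

#[derive(Default)]
struct Store {
    tuples: Vec<(Ptr, Ptr)>,
    digest_cache: HashMap<usize, F>,
}

impl Store {
    /// Interning a pair is just a push into the arena; no hash is computed here.
    fn intern_pair(&mut self, car: Ptr, cdr: Ptr) -> Ptr {
        self.tuples.push((car, cdr));
        Ptr::Tuple(self.tuples.len() - 1)
    }

    /// The digest is only computed when it is actually needed, then cached.
    fn digest(&mut self, ptr: Ptr) -> F {
        match ptr {
            Ptr::Atom(f) => f,
            Ptr::Tuple(idx) => {
                if let Some(&d) = self.digest_cache.get(&idx) {
                    return d;
                }
                let (car, cdr) = self.tuples[idx];
                let d = hash2(self.digest(car), self.digest(cdr));
                self.digest_cache.insert(idx, d);
                d
            }
        }
    }
}

fn main() {
    let mut store = Store::default();
    let pair = store.intern_pair(Ptr::Atom(1), Ptr::Atom(2));
    // Hashing happens only here, not at interning time.
    println!("{}", store.digest(pair));
}
```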
Force-pushed from 719ce89 to 55b5baa
This PR optimizes environments. They are now specialized data structures with their own tag, `Env`. Empty environments are 0-valued, and otherwise an environment is a hash of size 4: the first element is the symbol payload, the second and third elements are the corresponding value's tag and payload, and the fourth element is the tail environment's payload. Doing this means lookups now take a single hash4, which allows us to do 8 lookups per step!

In the earlier PR, we simplified recursive envs by tagging symbols with a special tag to signal that they came from a letrec. Now, because we don't have tags in symbols, some other strategy is needed. This is why I'm now tagging functions with a special recursive tag. It's important to note that this tag only appears in functions inside environments: when you fetch them, they become normal functions with an extended environment. This is effectively the same behaviour as Lurk, although the internal representation is different.
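To make the layout concrete, here is a minimal sketch, using hypothetical types and a placeholder in place of the real hash4, of how such environments could be built and traversed; each lookup frame costs exactly one hash4 preimage fetch:

```rust
use std::collections::HashMap;

type F = u64; // stand-in for a field element

#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
struct EnvDigest(F);

const EMPTY_ENV: EnvDigest = EnvDigest(0);

// Placeholder for the real 4-ary hash; any 4-to-1 compression works for the sketch.
fn hash4(preimage: [F; 4]) -> F {
    preimage
        .iter()
        .fold(17u64, |acc, x| acc.wrapping_mul(31).wrapping_add(*x))
}

#[derive(Default)]
struct Store {
    // Maps an environment digest back to its preimage so lookups can walk the tail.
    envs: HashMap<EnvDigest, [F; 4]>,
}

impl Store {
    /// Extend `tail` with the binding `sym -> (val_tag, val_payload)`.
    fn push_binding(&mut self, sym: F, val_tag: F, val_payload: F, tail: EnvDigest) -> EnvDigest {
        let preimage = [sym, val_tag, val_payload, tail.0];
        let digest = EnvDigest(hash4(preimage));
        self.envs.insert(digest, preimage);
        digest
    }

    /// Look up `sym`: each frame of the environment costs one hash4 preimage fetch.
    fn lookup(&self, sym: F, mut env: EnvDigest) -> Option<(F, F)> {
        while env != EMPTY_ENV {
            let [bound_sym, val_tag, val_payload, tail] = self.envs[&env];
            if bound_sym == sym {
                return Some((val_tag, val_payload));
            }
            env = EnvDigest(tail);
        }
        None
    }
}

fn main() {
    let mut store = Store::default();
    let env = store.push_binding(1, 10, 100, EMPTY_ENV);
    let env = store.push_binding(2, 20, 200, env);
    assert_eq!(store.lookup(1, env), Some((10, 100)));
    assert_eq!(store.lookup(3, env), None);
}
```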
In earlier commits I gave another solution to letrecs, which would solve issue #434 and actually give proper letrec semantics, but the issue is that it would recompute recursive values and you'd end up with more iterations. This can only be solved with memoization, which will give the proper operational semantics to thunks, and can be done after we've integrated the memoset into Lurk. Until then, we will have to settle for the other solution.