Add statistical test for regex-guided generation #77

dpsimpson · 2024-10-17T13:39:04Z

I added a simple test that checks that the average length of a regex-guided generation is correct, using a simple grammar, a simple language model, and the fairly simple regex described in #73. For simplicity, the test uses a fixed seed; however, there is enough information in the issue to turn it into a stochastic test if someone has the stomach for that.

It would be possible to make this test run faster with fewer Monte Carlo samples. I have verified it on much larger sample sizes and it's consistently within MC-standard error of the correct solution. I have also varied parameters in the language model.
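One way to read the "within MC standard error" check is sketched below. It is a minimal illustration, not the test in this PR: the regex `a+`, the continuation probability `p_continue`, and the helper names are all assumptions, chosen so that the exact mean length, 1 / (1 - p_continue), is easy to derive.

```python
import math
import random


def sample_length(rng, p_continue):
    """Length of one sample from a toy guided process for the regex `a+`."""
    length = 1  # the regex forces at least one "a" before the sequence may end
    while rng.random() < p_continue:
        length += 1
    return length


def mc_mean_and_stderr(n_samples, p_continue, seed):
    """Monte Carlo estimate of the mean length and its standard error."""
    rng = random.Random(seed)  # fixed seed keeps the check deterministic
    lengths = [sample_length(rng, p_continue) for _ in range(n_samples)]
    mean = sum(lengths) / n_samples
    var = sum((x - mean) ** 2 for x in lengths) / (n_samples - 1)
    return mean, math.sqrt(var / n_samples)


p = 0.6  # hypothetical continuation probability
exact_mean = 1.0 / (1.0 - p)
mean, stderr = mc_mean_and_stderr(20_000, p, seed=0)
z = (mean - exact_mean) / stderr  # error in units of the MC standard error
```

Asserting on `abs(z)` rather than on a hard-coded value is what would let the seed be dropped later: the threshold then fixes an approximate false-failure rate instead of depending on one particular random stream.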

I had to kludge together a generate function so as to not depend on outlines, but I think I'm using the guides properly.
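For readers unfamiliar with what such a stand-alone generate function has to do, here is a rough sketch. Everything in it is hypothetical: the hand-written DFA for `ab*c`, the `allowed_tokens` helper, and the constant-logit `toy_logits` model are stand-ins, and none of it uses or mirrors the library's actual guide classes.

```python
import math
import random
import re

# Hypothetical stand-in for a regex guide: a tiny DFA for the regex `ab*c`.
# States: 0 = start, 1 = after "a" (looping on "b"), 2 = accepting.
TRANSITIONS = {(0, "a"): 1, (1, "b"): 1, (1, "c"): 2}
FINAL_STATES = {2}


def allowed_tokens(state):
    """Tokens the guide permits in `state`; end-of-sequence only when accepting."""
    tokens = [tok for (s, tok) in TRANSITIONS if s == state]
    if state in FINAL_STATES:
        tokens.append("<eos>")
    return tokens


def toy_logits(prefix):
    """Toy language model: the same logits regardless of the prefix."""
    return {"a": 0.0, "b": 1.0, "c": 0.0, "<eos>": 0.5}


def generate(rng, max_tokens=1000):
    """Sample one string by masking disallowed tokens and renormalizing."""
    state, out = 0, []
    for _ in range(max_tokens):
        logits = toy_logits(out)
        allowed = allowed_tokens(state)
        weights = [math.exp(logits[t]) for t in allowed]  # softmax over allowed only
        token = rng.choices(allowed, weights=weights, k=1)[0]
        if token == "<eos>":
            break
        out.append(token)
        state = TRANSITIONS[(state, token)]
    return "".join(out)


rng = random.Random(0)
samples = [generate(rng) for _ in range(5000)]
all_match = all(re.fullmatch(r"ab*c", s) for s in samples)
mean_length = sum(len(s) for s in samples) / len(samples)
```

In this toy setup the number of `b` tokens is geometric with continuation probability e / (e + 1), so the exact mean length is 2 + e, which gives a closed-form target for a statistical assertion.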


brandonwillard

We usually use subdirectories in a way that mirrors package sub-modules, so, to match that, let's go with something like tests/test_[statistics|sampling|...].py.

Other than that and similarly small things, this test looks good to add!

tests/statistical/test_generate.py

… the computation of the exact distributions easier. Tests of the variance and a Kolmogorov-Smirnov test for the distribution have been added. The asserted values were computed for n=250; if we want the test to run faster, we can reduce this number. All tests have been run with n=1000000 to verify that the Monte Carlo error goes to zero, and with multiple seeds.
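A variance check and a Kolmogorov-Smirnov-style check could look roughly like the sketch below. It is only an illustration under assumptions: it reuses a toy `a+` process with a hypothetical continuation probability instead of the model in this PR, and it compares the KS statistic against the Dvoretzky-Kiefer-Wolfowitz bound rather than a KS p-value, because that bound also holds for discrete distributions.

```python
import math
import random


def sample_lengths(n_samples, p_continue, seed):
    """Lengths from a toy `a+` process: 1 plus a geometric number of extra tokens."""
    rng = random.Random(seed)
    lengths = []
    for _ in range(n_samples):
        length = 1
        while rng.random() < p_continue:
            length += 1
        lengths.append(length)
    return lengths


def exact_cdf(k, p_continue):
    """P(length <= k) for the toy process; zero below the minimum length of 1."""
    return 1.0 - p_continue**k if k >= 1 else 0.0


def ks_statistic(lengths, p_continue):
    """Largest gap between the empirical and exact CDFs, checked at each integer."""
    n = len(lengths)
    counts = {}
    for x in lengths:
        counts[x] = counts.get(x, 0) + 1
    worst, cumulative = 0.0, 0
    for k in range(1, max(lengths) + 1):
        cumulative += counts.get(k, 0)
        worst = max(worst, abs(cumulative / n - exact_cdf(k, p_continue)))
    return worst


def dkw_bound(n, alpha):
    """Dvoretzky-Kiefer-Wolfowitz bound: P(KS statistic > bound) <= alpha."""
    return math.sqrt(math.log(2.0 / alpha) / (2.0 * n))


p, n = 0.6, 20_000  # hypothetical parameters
lengths = sample_lengths(n, p, seed=1)
mean = sum(lengths) / n
sample_var = sum((x - mean) ** 2 for x in lengths) / (n - 1)
exact_var = p / (1.0 - p) ** 2  # variance of the geometric part
ks = ks_statistic(lengths, p)
```

Because both CDFs are step functions that jump only at integers, evaluating the gap at each integer up to the largest observed length is enough to find the supremum.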

tests/fsm/test_statistical.py

brandonwillard

It looks like we could generalize some of this code and move it outside of the test function. That would make it easier to separate the Markov and non-Markov tests so that they could be run separately/in parallel (e.g. locally you can use pytest-xdist and fly through the tests without changes to the code).

This isn't necessary, though, so no worries.

Other than that, we can squash and merge after the above two edits.
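The suggested split could be sketched as follows: a shared helper outside the test functions, plus one test per model so that a runner can schedule them independently. The two toy samplers and the helper name `z_score` are assumptions made for illustration; they are not the Markov and non-Markov models from this PR.

```python
import math
import random


def markov_length(rng):
    """Toy Markov model: constant continuation probability 0.6 after the first token."""
    length = 1
    while rng.random() < 0.6:
        length += 1
    return length


def non_markov_length(rng):
    """Toy non-Markov model: continuation probability 1 / (length + 1),
    which depends on how many tokens were generated so far."""
    length = 1
    while rng.random() < 1.0 / (length + 1):
        length += 1
    return length


def z_score(sample_fn, exact_mean, n_samples=20_000, seed=0):
    """Shared helper: error of the Monte Carlo mean in standard-error units."""
    rng = random.Random(seed)
    xs = [sample_fn(rng) for _ in range(n_samples)]
    mean = sum(xs) / n_samples
    var = sum((x - mean) ** 2 for x in xs) / (n_samples - 1)
    return (mean - exact_mean) / math.sqrt(var / n_samples)


def test_markov_mean_length():
    assert abs(z_score(markov_length, 1.0 / (1.0 - 0.6))) < 5.0


def test_non_markov_mean_length():
    # P(length > k) = 1 / (k + 1)!, so the exact mean is e - 1.
    assert abs(z_score(non_markov_length, math.e - 1.0)) < 5.0
```

With pytest-xdist installed, running `pytest -n auto` would distribute the two test functions across worker processes without any change to the test code.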

dpsimpson · 2024-10-21T08:47:13Z

I agree this can be generalized. Will save that work for when the next version is added though.

dpsimpson · 2024-10-21T09:49:00Z

@brandonwillard Foolishly made the changes myself rather than accepting yours, so I can't merge. Sorry.

dpsimpson added 5 commits October 16, 2024 17:51

done for the day. need to work out the logic properly

e2b8831

This commit adds a statistical test to check the expected length of a…

47f5215

… simple regex. It has a frozen seed to avoid flakiness, but there is information in there to make it a proper probabilistic test. It's also possible to reduce the run length by taking fewer samples.

Merge branch 'main' into add_statistical_test

5735bda

Black formatted

169c4bb

fixed pre-commit. sorry

636a607

brandonwillard requested changes Oct 17, 2024

tests/statistical/test_generate.py (4 review threads, outdated and resolved)

dpsimpson added 2 commits October 18, 2024 11:55

Gotta add the files

9a873eb

brandonwillard reviewed Oct 18, 2024

tests/fsm/test_statistical.py (2 review threads, outdated and resolved)

brandonwillard reviewed Oct 18, 2024

brandonwillard added the enhancement label Oct 18, 2024

brandonwillard linked an issue Oct 18, 2024 that may be closed by this pull request

Add statistical sampling tests #73

Closed

removed unnecessary code as per review

e4ba24e

dpsimpson enabled auto-merge (squash) October 21, 2024 09:44

dpsimpson disabled auto-merge October 21, 2024 09:44

brandonwillard merged commit 9db9927 into main Oct 21, 2024
8 checks passed

brandonwillard deleted the add_statistical_test branch October 21, 2024 16:36
