Fast, High-Fidelity LLM Decoding with Regex Constraints

This repository includes resources meant to complement this blog post entitled Fast, High-Fidelity LLM Decoding with Regex Constraints:

A technical appendix formalizing and proving the algorithms introduced in the blog post;
A Python notebook with implementations of DirectMerge and CartesianMerge, along with additional empirical results.

Please note that the code in the Python notebook was tested only with the tokenizer of Mistral-7B and takes into account some of its specificities, in particular the fact that a blank space is prepended to the strings to tokenize. The code should be adapted to work with other tokenizers.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
README.md		README.md
implementation_and_experiments.ipynb		implementation_and_experiments.ipynb
technical_appendix.pdf		technical_appendix.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast, High-Fidelity LLM Decoding with Regex Constraints

About

Languages

License

vivien000/regex-constrained-decoding

Folders and files

Latest commit

History

Repository files navigation

Fast, High-Fidelity LLM Decoding with Regex Constraints

About

Resources

License

Stars

Watchers

Forks

Languages