Create Rust-only index construction function #9

brandonwillard · 2024-08-21T15:53:09Z

We need a pure Rust function—call it build_regex_index—that only takes a regex string and a vocabulary or tokenizer object and (possibly) returns a HashMap version of the current Python index.

This is a step towards #1.

There are few things that need to be converted in order to do this:

Determine the exact form of the return value.
What works best for returning a Python object and future Rust-only use (i.e. Rust at run-time)?
Determine if a vocabulary or tokenizer is better as input.
Do we need to allow both?
We eventually want tighter integration with tokenizers; should we start there? Also, could we get a Rust-based tokenizer from a Python-created transformers tokenizer object (that uses tokenizers under the hood, of course)?
Handle regex-to-FSM conversion in Rust #10
Start by finding a suitable Rust crate for this functionality (e.g. regex-automata).
Convert reduced_vocabulary.

The text was updated successfully, but these errors were encountered:

brandonwillard added enhancement New feature or request help wanted Extra attention is needed labels Aug 21, 2024

brandonwillard assigned kc611 Aug 21, 2024

brandonwillard mentioned this issue Aug 21, 2024

Separate Python bindings code from Rust-only code #11

Closed

5 tasks

brandonwillard self-assigned this Aug 21, 2024

brandonwillard added the TGI Related to the integration with `text-generation-inference` label Aug 21, 2024

kc611 mentioned this issue Aug 26, 2024

Rust-only index construction function #16

Closed

brandonwillard mentioned this issue Aug 26, 2024

Make PyO3 bindings an optional feature #14

Merged

1 task

torymur linked a pull request Dec 20, 2024 that will close this issue

Build Index from regex #125

Draft

12 tasks

torymur removed the help wanted Extra attention is needed label Dec 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create Rust-only index construction function #9

Create Rust-only index construction function #9

brandonwillard commented Aug 21, 2024 •

edited

Loading

Create Rust-only index construction function #9

Create Rust-only index construction function #9

Comments

brandonwillard commented Aug 21, 2024 • edited Loading

brandonwillard commented Aug 21, 2024 •

edited

Loading