You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! I have a technical question to better understand how Guidance works. Let's imagine that we are interested in generating a string satisfying a certain constraint. Let S be the set of strings satisfying this constraint.
Are the token sequences potentially generated by Guidance exactly:
those that can be decoded as a string of S (ie. the preimage of S through the tokenizer's decoding function);
those are the image of a string of S through the tokenization
(ie. the image of S through the tokenizer's encoding function) ?
For example, if a regex is "^[a-z]{10}$", can Guidance return:
any token sequence whose total number of letters is 10;
only those that actually correspond to the tokenization of an a-z string of length 10?
I understand the answer is 1) but I would appreciate if this can be confirmed.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi! I have a technical question to better understand how Guidance works. Let's imagine that we are interested in generating a string satisfying a certain constraint. Let S be the set of strings satisfying this constraint.
Are the token sequences potentially generated by Guidance exactly:
(ie. the image of S through the tokenizer's encoding function) ?
For example, if a regex is "^[a-z]{10}$", can Guidance return:
I understand the answer is 1) but I would appreciate if this can be confirmed.
Beta Was this translation helpful? Give feedback.
All reactions