Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
benchmark: add guided decoding script
Signed-off-by: Aaron Pham <[email protected]> chore: add warmup args Signed-off-by: Aaron Pham <[email protected]> chore: run format accordingly Signed-off-by: Aaron Pham <[email protected]> chore: add @mgoin's suggestion Signed-off-by: Aaron Pham <[email protected]> chore: run format Signed-off-by: Aaron Pham <[email protected]> Add xgrammar similiar dataset Signed-off-by: Chendi Xue <[email protected]> Add grammar, regex, choice, json - json support using file path Signed-off-by: Chendi Xue <[email protected]> Asycn engine save results Signed-off-by: Chendi Xue <[email protected]> Add latency for first and Next token - async engine Signed-off-by: Chendi Xue <[email protected]> Add correctness check Signed-off-by: Chendi Xue <[email protected]> Add mix request using guided_decoding_ratio Signed-off-by: Chendi Xue <[email protected]> only check with format Signed-off-by: Chendi Xue <[email protected]> Fix correctness check error when no json detected Signed-off-by: Chendi Xue <[email protected]>
- Loading branch information