Fix inference tests in CI #1225

goliaro · 2023-11-05T04:00:40Z

Description of changes:

PR #1219 changed the prompt used in the CI tests, resulting in the generated text starting at the very first line of the output. The bash files checking the alignment between FlexFlow and Huggingface, however, still assume that the generated text starts at the second line of the output, so the first line of output is not checked. This PR fixes this bug.

In addition, this PR fixes several latent issues (which we had postponed for a long time) related to the llama tokenizer. And we go back to using the original JackFram/llama-160m (instead of JackFram/llama-160m-base) after aligning the configs with the official LLAMA model (see here)

Related Issues:

Linked Issues:

Issue #

Issues closed by this PR:

Closes #

This change is

goliaro added 4 commits November 5, 2023 03:56

updated diffs in tests

86a63e8

manually add BOS token in LLAMA

ba6bd47

shellcheck

eae5aea

fix

fdf58ed

goliaro enabled auto-merge (squash) November 5, 2023 04:10

goliaro added 3 commits November 5, 2023 20:01

align tokenizer with llama2

d2e9acf

update cmake

38564bf

fix

9615be2

goliaro merged commit b0fe522 into inference Nov 6, 2023
45 checks passed

goliaro deleted the fix_ci branch November 6, 2023 02:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix inference tests in CI #1225

Fix inference tests in CI #1225

goliaro commented Nov 5, 2023 •

edited

Loading

Fix inference tests in CI #1225

Fix inference tests in CI #1225

Conversation

goliaro commented Nov 5, 2023 • edited Loading

goliaro commented Nov 5, 2023 •

edited

Loading