Add reference code for mixtral-8x7b in axs #49
Comments
1. Download dataset recipe
2. Download checkpoint model recipe
3. Run the recipe for copying tokenizer files:
In the checkpoint model directory:
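The recipe command itself is not shown here. As a rough illustration of what step 3 amounts to, the following is a minimal Python sketch that copies tokenizer files into a checkpoint directory. It is not the axs recipe: the function name, the directory arguments, and the file list are assumptions made for illustration, and the list should be adjusted to the files the model repository actually ships.

```python
import shutil
from pathlib import Path

# Hypothetical file list (assumption): adjust to the files actually present.
TOKENIZER_FILES = (
    "tokenizer.json",
    "tokenizer.model",
    "tokenizer_config.json",
    "special_tokens_map.json",
)


def copy_tokenizer_files(src_dir, checkpoint_dir, names=TOKENIZER_FILES):
    """Copy the tokenizer files found in src_dir into checkpoint_dir.

    Returns the names that were copied; files missing from src_dir are skipped.
    """
    src, dst = Path(src_dir), Path(checkpoint_dir)
    dst.mkdir(parents=True, exist_ok=True)
    copied = []
    for name in names:
        candidate = src / name
        if candidate.is_file():
            shutil.copy2(candidate, dst / name)  # copy2 also keeps timestamps
            copied.append(name)
    return copied
```

The returned list makes it easy to check which files were actually copied before starting an accuracy run.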
4. Run Accuracy (short run, without downloaded dataset, model by
5. Run Accuracy (full run, without downloaded dataset, model by
6. Short runs
After the update in the MLCommons inference repo (mlcommons/inference#1754)
Accuracy script:
7. Performance run, Offline (short run)
user.conf:
Set
Also if
Then
If set
Then
So we can't do a short run for the Offline scenario in Performance mode.
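To reason about which value applies when both mlperf.conf and user.conf set a key, the sketch below resolves the effective settings for one model and scenario. It assumes the `model.scenario.key = value` line format with `*` wildcards and `#` comments, that later matching lines override earlier ones, and that user.conf is applied after mlperf.conf. The function names are hypothetical, and the values in the usage example are placeholders, not the ones used in this thread.

```python
def apply_conf(settings, text, model, scenario):
    """Apply every matching 'model.scenario.key = value' line, in file order."""
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if "=" not in line:
            continue
        lhs, value = (part.strip() for part in line.split("=", 1))
        parts = lhs.split(".")
        if len(parts) != 3:
            continue
        conf_model, conf_scenario, key = parts
        if conf_model in ("*", model) and conf_scenario in ("*", scenario):
            settings[key] = value  # a later matching line wins
    return settings


def effective_settings(mlperf_text, user_text, model, scenario):
    """Assumption: mlperf.conf is read first, then user.conf overrides it."""
    settings = apply_conf({}, mlperf_text, model, scenario)
    return apply_conf(settings, user_text, model, scenario)
```

For example, with placeholder values:

```python
mlperf_conf = "*.Offline.min_duration = 600000\n"
user_conf = "mixtral-8x7b.Offline.min_duration = 1000  # placeholder\n"
print(effective_settings(mlperf_conf, user_conf, "mixtral-8x7b", "Offline"))
```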
8. Accuracy run for Server scenario
We need to fix this issue in https://github.com/mlcommons/inference/blob/master/language/mixtral-8x7b/main.py#L105
then
Accuracy:
9. Performance run for Server scenario (short run)
If set in mlperf.conf
Changed in mlperf.conf
Now in mlperf.conf:
Then
But
If set:
mlperf.conf copied to created entry.
mlperf.conf
user.conf:
Useful updates:
Recipe for downloading checkpoint model with patch.
Added "patch", "abs_patch_path".
Then
We have this issue:
We also have the same issue when we run it locally
But it is installed
Solution:
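The error output and the actual fix are not shown here. When a package is reported as missing although it is installed, one common cause is that it was installed for a different Python interpreter than the one running the script. The following diagnostic sketch (an assumption about the cause, not the fix applied in this thread; `describe_package` is a hypothetical helper) reports which interpreter is running and whether that interpreter can find the package.

```python
import importlib.util
import sys


def describe_package(name):
    """Report whether the running interpreter can locate the package `name`."""
    try:
        spec = importlib.util.find_spec(name)
    except (ImportError, ValueError):
        spec = None  # e.g. the parent package of a dotted name is missing
    return {
        "package": name,
        "interpreter": sys.executable,
        "found": spec is not None,
        "location": getattr(spec, "origin", None),
    }
```

Running `describe_package("<package name>")` both inside the axs entry environment and in the local shell shows whether the two resolve to the same interpreter and installation.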
All Python packages for accuracy calculation
According to the latest updates in mlcommons/inference#1782 (comment), we don't need a patch for downloading the checkpoint model.
So remove patch support:
Last version of useful commands:
All commits in
Add reference code for mixtral-8x7b (https://github.com/mlcommons/inference/tree/master/language/mixtral-8x7b) in axs.
To do the following steps, use the mixtral-dev branch in axs2mlperf.