-
Notifications
You must be signed in to change notification settings - Fork 310
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #945 from AI-Hypercomputer:msingh-trillium
PiperOrigin-RevId: 682461471
- Loading branch information
Showing
3 changed files
with
119 additions
and
66 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
# The format of this config file is 'key = value'. | ||
# The key has the format 'model.scenario.key'. Value is mostly int64_t. | ||
# Model maybe '*' as wildcard. In that case the value applies to all models. | ||
# All times are in milli seconds | ||
|
||
# Set performance_sample_count for each model. | ||
llama2-70b.*.performance_sample_count_override = 24576 | ||
*.Offline.min_duration = 600000 | ||
|
||
|
||
# In Offline scenario, we always have one query. But LoadGen maps this to | ||
# min_sample_count internally in Offline scenario. If the dataset size is larger | ||
# than 24576 we limit the min_query_count to 24576 and otherwise we use | ||
# the dataset size as the limit | ||
llama2-70b.Offline.min_query_count = 24576 | ||
|
||
# These fields should be defined and overridden by user.conf. | ||
*.Offline.target_qps = 5.0 |