Validate context length for generation and validation #33

sfc-gh-jhilgart · 2024-05-08T20:25:14Z

No description provided.

sfc-gh-nsehrawat · 2024-05-08T20:40:45Z

semantic_model_generator/validate/context_length.py

@@ -0,0 +1,11 @@
+_MODEL_CONTEXT_LENGTH = 7000  # We use 7k so that we can reserve 1k for response tokens.


Shouldn't we account for instruction tokens in the prompt?

This is pretty loose as is. Instruction tokens, even for llama3, are only ~20 tokens.

I can add an additional buffer here though!

semantic_model_generator/validate/context_length.py

Validate context length for generation and validation

4e81047

sfc-gh-jhilgart requested review from sfc-gh-nsehrawat, sfc-gh-rehuang, sfc-gh-nlimtiaco and sfc-gh-dasilva as code owners May 8, 2024 20:25

fix test

9cb9419

sfc-gh-nsehrawat reviewed May 8, 2024

View reviewed changes

sfc-gh-jhilgart added 2 commits May 8, 2024 17:24

instr token buffer

9c60980

lint

07684bd

sfc-gh-dasilva approved these changes May 9, 2024

View reviewed changes

semantic_model_generator/validate/context_length.py Outdated Show resolved Hide resolved

sfc-gh-jhilgart added 2 commits May 8, 2024 17:33

feedback

360b249

black

20f73e9

sfc-gh-jhilgart merged commit 940de24 into main May 9, 2024
3 checks passed

Provide feedback

		@@ -0,0 +1,11 @@
		_MODEL_CONTEXT_LENGTH = 7000 # We use 7k so that we can reserve 1k for response tokens.