Replies: 1 comment
-
Hi, Yeah you can get the raw probabilities in the sample method. Currently it does argmax to pick the top value based on the temperature but it should be simple to make this method pluggable (PRs welcome!) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi all,
I've read this interesting article that shows how to use Keras and the Gemma model to peek under the hood and seen the next token probabilities.
I'm curious if 1) it's possible to do something like that when running Gemma with Jlama, and 2) if yes, how can you drive Jlama to do that?
Guillaume
Beta Was this translation helpful? Give feedback.
All reactions