How to get hidden_state value? #13
The size of the hidden_states matrix is |
Thanks for the reply! But I have another question. As far as I know, Hugging Face's Trainer uses a fixed seed of 42 (I even fixed the seeds separately in code). But in this project, the loss value changes every time I run it, so the metric results change every time as well. Could you tell me why? Same code, different loss & lr results: checkpoint-8856/trainer_state.json
case 2
case 3
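One way to narrow this down is to find the first logged step at which the two runs disagree. The sketch below assumes the usual `log_history` list that the Hugging Face Trainer writes into `trainer_state.json`, with entries carrying `step`, `loss`, and `learning_rate`. The function names `load_log_history` and `first_divergence` are hypothetical helpers, and the demo numbers are made up for illustration, not taken from the runs above.

```python
import json
import math


def load_log_history(path):
    """Read the log_history list from a trainer_state.json file."""
    with open(path) as f:
        return json.load(f)["log_history"]


def first_divergence(history_a, history_b, rel_tol=1e-9):
    """Return (step, key, value_a, value_b) for the first logged value
    that differs between two runs, or None if all shared values match."""
    logs_b = {entry["step"]: entry for entry in history_b if "step" in entry}
    for entry_a in history_a:
        entry_b = logs_b.get(entry_a.get("step"))
        if entry_b is None:
            continue  # this step was not logged in the second run
        for key in ("loss", "learning_rate"):
            if key in entry_a and key in entry_b:
                if not math.isclose(entry_a[key], entry_b[key], rel_tol=rel_tol):
                    return entry_a["step"], key, entry_a[key], entry_b[key]
    return None


# Made-up log entries standing in for two runs of the same script.
run_a = [
    {"step": 500, "loss": 2.91, "learning_rate": 9.4e-05},
    {"step": 1000, "loss": 2.50, "learning_rate": 8.9e-05},
]
run_b = [
    {"step": 500, "loss": 2.91, "learning_rate": 9.4e-05},
    {"step": 1000, "loss": 2.47, "learning_rate": 8.9e-05},
]

print(first_divergence(run_a, run_b))
```

With real files, the two histories would come from something like `load_log_history("checkpoint-8856/trainer_state.json")` for each run. If the first mismatch is in `learning_rate` rather than `loss`, that would point to a difference in the schedule (total steps, warmup, batch size) rather than to random seeding, since the schedule itself is a deterministic function of the step.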
Thanks for the great paper!
I am curious how to get the hidden_state value.
Also, why is its sequence dimension more than twice the size I expected? I thought it would be (b, 50, 768), not (b, 140, 768).
Could you explain why that is?
Thanks!!!
gpt2.py, line 87: https://github.com/RitaRamo/smallcap/blob/513f4f795950328129014eb37f011d686ab6ed24/src/gpt2.py#L87C13-L87C13