You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello author, after delving into your implementation code, I found that the quantize method in GEMM_W4A4 does not align with what is presented in the paper. I used smoothed(x) @ lora_down and the un-smoothed version x @ lora_down, and the results differ from qact.lora_act. Could you please explain this?
Thank you.
The text was updated successfully, but these errors were encountered:
Hello author, after delving into your implementation code, I found that the
quantize
method inGEMM_W4A4
does not align with what is presented in the paper. I usedsmoothed(x) @ lora_down
and the un-smoothed versionx @ lora_down
, and the results differ fromqact.lora_act
. Could you please explain this?Thank you.
The text was updated successfully, but these errors were encountered: