[Feature Request]. Support Reward Model #634

liziniu · 2024-10-20T02:13:48Z

Hi,
May I know whether the framework can support to quantize reward models like Qwen/Qwen2.5-Math-RM-72B and Llama-3.1-Nemotron-70B-Reward-HF? These models are usually used at inference stage but requires extensive GPU memory. Thus quantization is valuable.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request]. Support Reward Model #634

[Feature Request]. Support Reward Model #634

liziniu commented Oct 20, 2024

[Feature Request]. Support Reward Model #634

[Feature Request]. Support Reward Model #634

Comments

liziniu commented Oct 20, 2024