Length Penalty only affects beam search #1143

jacob-mink-1996 · 2024-11-04T21:27:46Z

The docs specify that length_penalty applies only to beam search, which means that in multinomial sampling length_penalty has no effect on generation.

openvino.genai/src/cpp/include/openvino/genai/generation_config.hpp

Line 51 in 7954d82

    
            * @param length_penalty exponential penalty to the length that is used with beam-based generation. It is applied as an exponent to
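For context, here is a minimal sketch of how a length penalty is commonly applied when ranking beam hypotheses. It follows the convention used by Hugging Face Transformers, where the summed log-probability is divided by the sequence length raised to `length_penalty`. The function name `length_normalized_score` and the sample values are hypothetical, and this is not taken from the OpenVINO GenAI sources.

```python
def length_normalized_score(token_logprobs, length_penalty=1.0):
    """Score a hypothesis as sum(log-probs) / (length ** length_penalty).

    Assumption: this mirrors the common beam search convention, not
    necessarily the exact formula used inside OpenVINO GenAI.
    """
    if not token_logprobs:
        raise ValueError("token_logprobs must not be empty")
    return sum(token_logprobs) / (len(token_logprobs) ** length_penalty)


# Two hypothetical hypotheses: a short one and a longer one.
short_seq = [-0.5, -0.5]             # summed log-prob -1.0, length 2
long_seq = [-0.4, -0.4, -0.4, -0.4]  # summed log-prob about -1.6, length 4

# With length_penalty=0.0 the raw sums are compared, so the short one ranks higher.
# With length_penalty=1.0 the per-token average is compared, so the long one ranks higher.
for penalty in (0.0, 1.0):
    print(penalty,
          length_normalized_score(short_seq, penalty),
          length_normalized_score(long_seq, penalty))
```

Because the penalty only changes how finished hypotheses are ranked against each other, it has nothing to act on when a single sequence is sampled token by token.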

It would be useful if length_penalty could be used in combination with max_length or max_new_tokens to scale the importance of the eos token during multinomial sampling. That would help prompt results end more naturally as the max_length limit approaches.
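One way the request above could look, as a rough sketch only: raise the eos logit as the number of generated tokens approaches the limit, then sample as usual. Everything here is hypothetical, including the names `boost_eos_logit` and `eos_boost` and the quadratic ramp; none of it is an existing OpenVINO GenAI API or an agreed design.

```python
import math
import random


def boost_eos_logit(logits, eos_token_id, generated_len, max_new_tokens, eos_boost=5.0):
    """Return a copy of logits with the eos logit raised as the limit nears.

    Hypothetical helper: the boost is 0 at the start of generation and
    reaches eos_boost when generated_len == max_new_tokens.
    """
    if max_new_tokens <= 0:
        raise ValueError("max_new_tokens must be positive")
    progress = min(max(generated_len / max_new_tokens, 0.0), 1.0)
    adjusted = list(logits)
    adjusted[eos_token_id] += eos_boost * progress ** 2
    return adjusted


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, rng):
    """Multinomial sampling: draw one token id according to softmax(logits)."""
    probs = softmax(logits)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Toy vocabulary of three tokens where id 2 plays the role of eos.
toy_logits = [1.0, 1.0, 1.0]
rng = random.Random(0)
late_logits = boost_eos_logit(toy_logits, eos_token_id=2, generated_len=9, max_new_tokens=10)
next_token = sample_token(late_logits, rng)
```

A ramp like this leaves early sampling untouched and only nudges the distribution toward eos near the end, which is the behavior the request describes. How length_penalty itself would map onto such a boost is left open.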

Wovchena · 2024-11-05T10:02:00Z

@sbalandi, is length_penalty applied to greedy and multinomial sampling now that the implementation has switched to the Sampler? If so, please update the doc strings for C++ and Python.

sbalandi · 2024-11-05T12:45:54Z

No, length_penalty is currently supported for beam search only.

ilya-lavrenov assigned sbalandi Nov 5, 2024

sbalandi mentioned this issue Nov 14, 2024

Move beam search in case of chat scenario to sampler.cpp #1215

Open
