Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about length bias mitigation in pairwise evaluation #20

Open
liamjxu opened this issue Nov 13, 2024 · 0 comments
Open

Question about length bias mitigation in pairwise evaluation #20

liamjxu opened this issue Nov 13, 2024 · 0 comments

Comments

@liamjxu
Copy link

liamjxu commented Nov 13, 2024

Hi, thanks for open-sourcing the code, it looks great.

I have a question about the length bias mitigation part in the WB reward metric.

It seems that the code is shortening both reference output and model output to a fixed word count.

I'm curious about how to implement

converting outcomes of “slightly better/worse” to “tie” if the winner’s response exceeds the loser’s by more than K characters.

as mentioned in the paper. Or am I missing something here?

Thank you in advance!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant