From e62edcec0508e975448959262fed7e4222b72e61 Mon Sep 17 00:00:00 2001 From: Tim Li Date: Fri, 19 Apr 2024 15:24:39 -0700 Subject: [PATCH] small fixes --- blog/2024-04-19-arena-hard.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/blog/2024-04-19-arena-hard.md b/blog/2024-04-19-arena-hard.md index 1391b945..ec8d3ac6 100644 --- a/blog/2024-04-19-arena-hard.md +++ b/blog/2024-04-19-arena-hard.md @@ -95,7 +95,7 @@ li::before { -

Figure 1: Comparison between MT-bench and Arena Hard v0.1. The latter offers significantly better separability between models and tighter confidence intervals. Note: We do not include GPT-4-Turbo in the plot due to potential self-bias. GPT-4-0314 has no variance in Arena-hard-v0.1 because it’s used as the anchor model.

+

Figure 1: Comparison between MT-bench and Arena Hard v0.1. The latter offers significantly better separability between models and tighter confidence intervals. Note: We do not include GPT-4-Turbo in the plot due to potential bias towards itself. Also, GPT-4-0314 has no variance in Arena-hard-v0.1 because it's used as the anchor model.

Links: - Evaluate your model on Arena-Hard-v0.1: [Link](https://github.com/lm-sys/arena-hard) @@ -810,7 +810,7 @@ We hope to study deeper into the above limitations and biases in the later techn ## Acknowledgment -We thank Matei Zaharia, Yann Dubois, Anastasios Angelopoulos, Joey Gonzalez, Lianmin Zheng, Lewis Tunstall, Nathan Lambert, Xuechen Li, Naman Jain, Ying Sheng, Maarten Grootendorst for their valuable feedback. We thank Microsoft [AFMR](https://www.microsoft.com/en-us/research/collaboration/accelerating-foundation-models-research/) for Azure OpenAI credits support. We also thank Together.ai & Anyscale for open model endpoint support. +We thank Matei Zaharia, Yann Dubois, Anastasios Angelopoulos, Lianmin Zheng, Lewis Tunstall, Nathan Lambert, Xuechen Li, Naman Jain, Ying Sheng, Maarten Grootendorst for their valuable feedback. We thank Microsoft [AFMR](https://www.microsoft.com/en-us/research/collaboration/accelerating-foundation-models-research/) for Azure OpenAI credits support. We also thank Together.ai & Anyscale for open model endpoint support. ## Citation ```