Skip to content

Commit

Permalink
Merge pull request #42 from andy-yang-1/decontaminator-typo
Browse files Browse the repository at this point in the history
add chatbot arena
  • Loading branch information
infwinston authored Nov 14, 2023
2 parents 03c5664 + 46c0538 commit 17ab36b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion blog/2023-11-14-llm-decontaminator.md
Original file line number Diff line number Diff line change
Expand Up @@ -108,7 +108,7 @@ The following command builds a top-k similar database based on sentence bert and
## **Conclusion**

In this blog, we show that contamination is still poorly understood. With our proposed decontamination method, we reveal significant previously unknown test overlap in real-world datasets. We encourage the community to rethink benchmark and contamination in LLM context, and adopt stronger decontamination tools when evaluating LLMs on public benchmarks.
We call for the community to actively develop fresh one-time exams to accurately evaluate LLMs.
Moreover, we call for the community to actively develop fresh one-time exams to accurately evaluate LLMs. Learn more about our ongoing effort on live LLM eval at [Chatbot Arena](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard)!


## **Acknowledgment**
Expand Down

0 comments on commit 17ab36b

Please sign in to comment.