Popular repositories
-
HarmBench (Public, forked from centerforaisafety/HarmBench)
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Jupyter Notebook
-
llm-past-tense (Public, forked from tml-epfl/llm-past-tense)
Does Refusal Training in LLMs Generalize to the Past Tense? [NeurIPS 2024 Safe Generative AI Workshop (Oral)]
Python