Change the repository type filter
All
Repositories list
10 repositories
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
stanford_alpaca
PublicCode and documentation to train Stanford's Alpaca models, and generate the data.- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
opinions_qa
Publicgpt_paper_assistant
Publicfast_imagenet_ffcv
Publicmlm_inductive_bias
Public