Actions: tatsu-lab/alpaca_eval

Format leaderboard

123 workflow runs

Add TOA to AlpacaEval (#428)
Format leaderboard #161: Commit 2898342 pushed by YannDubs
December 27, 2024 20:20 2m 42s main
Add FuseChat-3.0 models to AlpacaEval (#426)
Format leaderboard #160: Commit 8bb6e57 pushed by YannDubs
December 27, 2024 20:16 4m 37s main
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval (#416)
Format leaderboard #159: Commit 6976988 pushed by YannDubs
November 11, 2024 07:16 2m 8s main
Add NullModel to AlpacaEval (#414)
Format leaderboard #158: Commit 3c6ae8f pushed by YannDubs
October 23, 2024 17:00 2m 36s main
Add GPO-Llama-3-8B-Instruct-GPM-2B and SPPO-Llama-3-8B-Instruct-GPM-2…
Format leaderboard #157: Commit 9d8e91d pushed by YannDubs
October 19, 2024 21:48 4m 22s main
add Self-taught-llama3.1-70B-dpo as a evaluator (#412)
Format leaderboard #156: Commit d96bcbd pushed by YannDubs
September 26, 2024 15:37 2m 47s main
Fix the float number & Add SelfMoA_gemma-2-9b-it-SimPO, SelfMoA_gemm…
Format leaderboard #155: Commit b759c8d pushed by YannDubs
September 25, 2024 21:13 2m 58s main
Add Llama-3-8B-Instruct-SkillMix to AlpacaEval (#405)
Format leaderboard #154: Commit c4d4b01 pushed by YannDubs
September 15, 2024 17:35 4m 29s main
Add REBEL-Llama-3-8B-Instruct-Armo to AlpacaEval (#403)
Format leaderboard #153: Commit 54a52ec pushed by YannDubs
August 29, 2024 23:49 2m 22s main
Add Shopee-SlimMoA-v1 to AlpacaEval (#398)
Format leaderboard #152: Commit 4da8a95 pushed by YannDubs
August 26, 2024 21:38 2m 20s main
Add evaluator weighted_alpaca_eval_gpt-4o-mini-2024-07-18 (#401)
Format leaderboard #151: Commit 9136c7f pushed by YannDubs
August 26, 2024 21:30 2m 28s main
Added Llama3-PBM-Nova-70B model (#395)
Format leaderboard #150: Commit fe99c17 pushed by YannDubs
August 24, 2024 00:45 2m 19s main
Add blendaxai-gm-l6-vo31 to AlpacaEval (#399)
Format leaderboard #149: Commit 76bfa78 pushed by YannDubs
August 23, 2024 22:06 2m 35s main
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini (#393)
Format leaderboard #148: Commit 4e3b281 pushed by YannDubs
August 17, 2024 23:22 2m 6s main
Format leaderboard
Format leaderboard #147: Manually run by YannDubs
August 17, 2024 00:19 2m 8s main
Add blendaxai-gm-l3-v35 to AlpacaEval (#389)
Format leaderboard #146: Commit 9c46d20 pushed by YannDubs
August 16, 2024 23:45 2m 16s main
Change the name of the Infinity-Instruct-7M-0729-Models to Infinity-I…
Format leaderboard #145: Commit 5021996 pushed by YannDubs
August 13, 2024 04:24 2m 11s main
Format leaderboard
Format leaderboard #144: Manually run by YannDubs
August 12, 2024 23:00 2m 10s main
Add gemma-2-9b-it-WPO-HB to AlpacaEval (#384)
Format leaderboard #143: Commit 9062094 pushed by YannDubs
August 9, 2024 08:48 2m 12s main
Add Infinity-Instruct-7M-0729-Llama3_1-70B, Infinity-Instruct-7M-0729…
Format leaderboard #142: Commit f49b647 pushed by YannDubs
August 8, 2024 17:53 2m 24s main
[ENH] add llama 3.1 (#378)
Format leaderboard #141: Commit 42189a2 pushed by YannDubs
July 26, 2024 01:06 2m 16s main
Add Llama-3-Instruct-8B-WPO-HB-v2 to AlpacaEval (#377)
Format leaderboard #140: Commit 394f340 pushed by YannDubs
July 25, 2024 23:31 2m 6s main
[ENH] add the code to comptue instructio (#371)
Format leaderboard #139: Commit 1c22f12 pushed by YannDubs
July 18, 2024 16:24 2m 12s main
[ENH] add model rank to website
Format leaderboard #138: Commit 4232055 pushed by YannDubs
July 18, 2024 11:21 2m 15s main
Add gemma-2-9b-it-SimPO and gemma-2-9b-it-DPO to AlpacaEval (#368)
Format leaderboard #137: Commit 783c4b5 pushed by YannDubs
July 18, 2024 11:15 1m 57s main