Skip to content

Actions: tatsu-lab/alpaca_eval

test format leaderboard

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
249 workflow runs
249 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

"Add Mistral-7B+RAHF-DUAL+LoRA to AlpacaEval"
test format leaderboard #175: Pull request #307 synchronize by LiuAmber
May 11, 2024 07:24 2m 2s LiuAmber:main
May 11, 2024 07:24 2m 2s
"Add Mistral-7B+RAHF-DUAL+LoRA to AlpacaEval"
test format leaderboard #174: Pull request #307 synchronize by LiuAmber
May 11, 2024 07:14 2m 35s LiuAmber:main
May 11, 2024 07:14 2m 35s
"Add Mistral-7B+RAHF-DUAL+LoRA to AlpacaEval"
test format leaderboard #173: Pull request #307 synchronize by LiuAmber
May 11, 2024 05:29 2m 6s LiuAmber:main
May 11, 2024 05:29 2m 6s
"Add Mistral-7B+RAHF-DUAL+LoRA to AlpacaEval"
test format leaderboard #172: Pull request #307 opened by LiuAmber
May 11, 2024 05:26 2m 37s LiuAmber:main
May 11, 2024 05:26 2m 37s
Add <Mistral-7B+RAHF-DUAL+LoRA> to AlpacaEval
test format leaderboard #171: Pull request #306 opened by LiuAmber
May 11, 2024 05:13 2m 20s LiuAmber:main
May 11, 2024 05:13 2m 20s
Add Yi-Large Preview to AlpacaEval
test format leaderboard #170: Pull request #304 opened by HyperdriveHustle
May 8, 2024 07:44 2m 2s HyperdriveHustle:main
May 8, 2024 07:44 2m 2s
Add ExPO results to AlpacaEval
test format leaderboard #169: Pull request #299 opened by chujiezheng
May 5, 2024 06:04 2m 0s chujiezheng:main
May 5, 2024 06:04 2m 0s
Add SPPO-Mistral7B-PairRM to AlpacaEval
test format leaderboard #168: Pull request #298 opened by Edward-Sun
May 4, 2024 03:52 2m 1s Edward-Sun:SPPO
May 4, 2024 03:52 2m 1s
Add Storm-7B to AlpacaEval
test format leaderboard #167: Pull request #294 synchronize by yifan123
May 2, 2024 11:47 1m 59s yifan123:main
May 2, 2024 11:47 1m 59s
Add Storm-7B to AlpacaEval
test format leaderboard #166: Pull request #294 synchronize by yifan123
May 2, 2024 11:31 2m 12s yifan123:main
May 2, 2024 11:31 2m 12s
Add Storm-7B to AlpacaEval
test format leaderboard #165: Pull request #294 synchronize by yifan123
May 2, 2024 03:51 2m 9s yifan123:main
May 2, 2024 03:51 2m 9s
Add Storm-7B to AlpacaEval
test format leaderboard #164: Pull request #294 synchronize by yifan123
May 2, 2024 03:47 2m 6s yifan123:main
May 2, 2024 03:47 2m 6s
Add Storm-7B to AlpacaEval
test format leaderboard #163: Pull request #294 opened by yifan123
April 28, 2024 08:01 2m 7s yifan123:main
April 28, 2024 08:01 2m 7s
[ENH] verifying all the qwens
test format leaderboard #162: Pull request #292 synchronize by YannDubs
April 27, 2024 00:22 1m 56s yann/qwen
April 27, 2024 00:22 1m 56s
[ENH] verifying all the qwens
test format leaderboard #161: Pull request #292 synchronize by YannDubs
April 27, 2024 00:09 2m 17s yann/qwen
April 27, 2024 00:09 2m 17s
[ENH] verifying all the qwens
test format leaderboard #160: Pull request #292 synchronize by YannDubs
April 26, 2024 23:54 2m 9s yann/qwen
April 26, 2024 23:54 2m 9s
[ENH] verifying all the qwens
test format leaderboard #159: Pull request #292 synchronize by YannDubs
April 26, 2024 23:47 2m 3s yann/qwen
April 26, 2024 23:47 2m 3s
add Qwen1.5-110B-Chat self-report results
test format leaderboard #158: Pull request #291 synchronize by Lukeming-tsinghua
April 26, 2024 04:34 2m 1s Lukeming-tsinghua:main
April 26, 2024 04:34 2m 1s
Add Ghost 7B Alpha to AlpacaEval
test format leaderboard #157: Pull request #288 opened by lh0x00
April 22, 2024 11:13 2m 10s lh0x00:main
April 22, 2024 11:13 2m 10s
Add the evaluation result for our latest model
test format leaderboard #156: Pull request #286 synchronize by hendrydong
April 21, 2024 00:26 2m 23s hendrydong:main
April 21, 2024 00:26 2m 23s
Add the evaluation result for our latest model
test format leaderboard #155: Pull request #286 opened by hendrydong
April 20, 2024 14:06 2m 20s hendrydong:main
April 20, 2024 14:06 2m 20s
[ENH] llama3
test format leaderboard #154: Pull request #285 synchronize by YannDubs
April 19, 2024 04:10 2m 1s yann/llama3
April 19, 2024 04:10 2m 1s
[ENH] llama3
test format leaderboard #153: Pull request #285 opened by YannDubs
April 19, 2024 04:05 2m 13s yann/llama3
April 19, 2024 04:05 2m 13s
[BUG] revert to GPT4 preview 1106
test format leaderboard #152: Pull request #283 opened by YannDubs
April 18, 2024 07:26 2m 10s yann/revert_preview
April 18, 2024 07:26 2m 10s
Add Nanbeige-Plus-Chat-v0.1 to AlpacaEval
test format leaderboard #151: Pull request #279 opened by yuani114
April 16, 2024 06:58 2m 15s main
April 16, 2024 06:58 2m 15s