Optimize attention kernel v2 1.0, use Gemm replace GemmStridedBatch #2662
This run and associated checks have been archived and are scheduled for deletion.
gpu-ci.yml
on: pull_request
GPU CI Concierge: 17s
Check Python Interface: 23m 32s
Single Machine, Multiple GPUs Tests: 0s
Annotations
1 error
Inference Tests: Process completed with exit code 1.
Artifacts
Produced during runtime
Name | Size
---|---
output (Expired) | 4.87 KB