Skip to content

Optimize attention kernel v2 1.0, use Gemm replace GemmStridedBatch #2662

Optimize attention kernel v2 1.0, use Gemm replace GemmStridedBatch

Optimize attention kernel v2 1.0, use Gemm replace GemmStridedBatch #2662

Triggered via pull request October 12, 2023 17:25
Status Failure
Total duration 2h 1m 53s
Artifacts 1
This run and associated checks have been archived and are scheduled for deletion. Learn more about checks retention

gpu-ci.yml

on: pull_request
GPU CI Concierge
17s
GPU CI Concierge
Inference Tests
1h 33m
Inference Tests
Check Python Interface
23m 32s
Check Python Interface
Single Machine, Multiple GPUs Tests
0s
Single Machine, Multiple GPUs Tests
Fit to window
Zoom out
Zoom in

Annotations

1 error
Inference Tests
Process completed with exit code 1.

Artifacts

Produced during runtime
Name Size
output Expired
4.87 KB