Skip to content

Commit

Permalink
add pragma header for fp16
Browse files Browse the repository at this point in the history
  • Loading branch information
clamchowder committed Oct 5, 2023
1 parent 4afdd78 commit 7230935
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions GpuMemLatency/instruction_rate_fp16_kernel.cl
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
#pragma OPENCL EXTENSION cl_khr_fp16 : enable
#define rate_local_mem_test_size 256
__kernel void fp16_add_rate_test(__global half8 *A, int count, __global half8 *ret) {
int tid = get_local_id(0);
Expand Down

0 comments on commit 7230935

Please sign in to comment.