Add CPU kernel builders #25

maxhgerlach · 2021-02-18T16:52:17Z

Currently, it is not possible to add NVTX tracing to ops that may be executed on CPU rather than GPU. In that case one would run into exceptions like
tensorflow.python.framework.errors_impl.InvalidArgumentError: Cannot assign a device for operation .../NvtxStart: Could not satisfy explicit device specification '/device:CPU:0' because no supported kernel for CPU devices is available.

This can be fixed in a straight-forward manner by registering CPU kernels for NvtxStart and NvtxEnd. As far as I can tell, NVTX tracing should work fine for non-CUDA code, so this feels generally useful to me.

add cpu kernel builders

6317e25

maxhgerlach mentioned this pull request Feb 18, 2021

Register Bypass CPU OPs #11

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CPU kernel builders #25

Add CPU kernel builders #25

maxhgerlach commented Feb 18, 2021

Add CPU kernel builders #25

Are you sure you want to change the base?

Add CPU kernel builders #25

Conversation

maxhgerlach commented Feb 18, 2021