Is there any way to get an offline asm dump for OpenCL kernels?
We have a convolution shader generator that we use to generate kernels for everything from iOS Metal to Android OpenCL.
The kernels work with reasonable perf on NVIDIA, AMD, Intel, Apple and Adreno…