Is there any way to get an offline asm dump for OpenCL kernels?
We have a convolution shader generator that we use to generate kernels for everything from iOS Metal to Android OpenCL.
The kernels run with reasonable perf on NVIDIA, AMD, Intel, Apple and Adreno, but something weird happens on Mali GPUs: performance is absolutely terrible. Trying to figure out why.
The offline compiler shows no register spills, but it doesn't give much more info. I was hoping to see an asm dump or some other generated output.
Just looking for any feedback. Would be appreciated.
Sample with low perf on Mali-T830 MP2: https://play.google.com/store/apps/details?id=com.advancedkernels.betect
--Hashem