Offline asm dump

Is there any way to get an offline asm dump for OpenCL kernels? 

We have a convolution shader generator that we use to generate for everything from iOS Metal to Android OpenCL.

The kernels work with reasonable perf on NVidia, AMD, Intel, Apple and Adreno, but something weird happens on Mali gfx. Performas absolutely terrible. Trying to figure out what.

The offline compiler shows no reg spills, but it doesn't give a lot more info. Was hoping to see an asm dump or something generated.

Just looking for any feedback. Would be appreciated.

Sample low perf on Mali-T830 MP2: :