1、I found a strange problem, I tested the following two kernels, the first kernel shows in picture one is shorter than the second kernel shows in picture two.Test platform is Mali -T864.GlobalWorkSize=10000000(10M),The first takes 15ms and the second takes 20ms.
2、I use mali_offline_compiler to profile them,the two are same shows in pic 3,how to get Instructions Emmited and Path Cycles？Why Instructions Emmited is twice than Longest Path Cycles ?And in my opinion, the L/S operation should be 3 times,Why four times here?
Please repost to the correct forum (linked below). This forum relates to C and C++ compilers, and so it is unlikely to be tracked by users with experience of the Mali tools.https://community.arm.com/developer/tools-software/graphics/f/discussions
Thanks for your reply,I've asked the question again.
So please mark this one as resolved:
View all questions in Arm Compilers forum