Hi, I'm optimizing my compute shader by streamline. I have some question about the counters:
1. My GPU is mali-G78 mp24, it has 24 cores, so the GPU Cycles and Mali Core Cycles is the sum of all 24 cores ???
2. The gpu cycles is 400k, the exectuion core active cycles is 288k, does it means the exectuion core has (400 - 288) 112k cycle idle ??? What's the exectuion core waiting for in 112k cycles (memory fetch or some other things??)
3. If the execution core is waiting for memory data return, does the Load/Store Cycles contain the waiting cycle ??? Does the execution core active cycle contains the waiting cycle ???
Thank you!
Hi, thank you for reply.
Does the Load/Store Cycles also include the waiting memory cycles ??
Peter Harris said:Yes, execution core active will increment any cycle that a thread is live in the core, even if that thread is blocked waiting on memory.
Annie w said:Does the Load/Store Cycles also include the waiting memory cycles ??
No. The load/store cycles are the actual data cache access cycles.