To get ALU/invocation:ALU Count: CVT + SFU + FMA instructions * 11 (11 cores)
invocations: $MaliCoreNonFragmentWarps * 16
Is the above calculation Right?