Hi Forum,
As per perf counter guide, counter $MaliCoreInstructionsNarrowInstructions increments for every 8-bit or 16-bit instruction execution. To understand counter further, when tried with simple medium precision test observed part of 16-bit instructions are reported in $MaliALUInstructionsFMAInstructions as well. is it expected?
the following test executed on G720 and collected data using streamline version 8.9.0
precision mediump float;
in vec4 in1, in2;
out vec4 col;
main( ) { col = in1 * in2; }
the total ALUs are 4. Narrow should be included to get 4, formula: ((FMA+Narrow)*4)/Warps
when tried the same test in high precision, data is as expected
Thanks,
Venkatesh.
Never checked specifically for the trancendental operations, but my understanding is that it should be all 16-bit ops.