Please note: We are aware of an issue affecting replies on the Arm Community forums, which may not be loading as expected.
We apologize for any inconvenience and appreciate your patience while we investigate and work to resolve the issue.
Thank you for your understanding.
Hi Forum,
As per perf counter guide, counter $MaliCoreInstructionsNarrowInstructions increments for every 8-bit or 16-bit instruction execution. To understand counter further, when tried with simple medium precision test observed part of 16-bit instructions are reported in $MaliALUInstructionsFMAInstructions as well. is it expected?
the following test executed on G720 and collected data using streamline version 8.9.0
precision mediump float;
in vec4 in1, in2;
out vec4 col;
main( ) { col = in1 * in2; }
the total ALUs are 4. Narrow should be included to get 4, formula: ((FMA+Narrow)*4)/Warps
when tried the same test in high precision, data is as expected
Thanks,
Venkatesh.
Thank you for the immediate response. one follow-up question. does 16-bit math operations (ex: sqrt, log, floor, abs, etc..) are also reported as vec2 SIMD? or only 16-bit ALUs are packed into vec2 SIMD?
Never checked specifically for the trancendental operations, but my understanding is that it should be all 16-bit ops.