Hi team
Basically, I am inferencing CNN model using ARMNN framework and captured profiling on streamline.
In below example running model for 10 iterations on HW with armcore(clock frequency 1.8GHz)
The runtime showing for inferencing the model in second say for example in 2nd frame it 4.02ms.
But the corresponding cycles in cortexA78AE is 712 kilo cycles and it will not match with runtime of 4ms.