RealView Profiler does not provide cycle-accurate simulation.
The Cortex-R4 has a built-in PMU (Performance Monitoring Unit), which you can use to measure cycles accurately. Sample code ships with the DS-5 installer: it includes an example (Optimization3) whose code you can reuse to measure, from within the software itself, the cycles consumed by a subroutine.
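If you do not have the DS-5 example at hand, the idea can be sketched roughly as follows. This is not the Optimization3 code, only a minimal illustration that assumes GCC-style inline assembly, privileged mode, and the ARMv7-R CP15 encodings for PMCR, PMCNTENSET and PMCCNTR. The names `TARGET_CORTEX_R4`, `pmu_start_cycle_counter`, `pmu_read_cycles`, `measure_cycles` and `sample_routine` are made up for this example, and the `#else` branch is only a stand-in so the file also builds on a host PC.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* TARGET_CORTEX_R4 is a hypothetical build flag for this sketch,
 * not a macro defined by ARM or DS-5. */
#if defined(TARGET_CORTEX_R4)

/* Enable the PMU and reset the cycle counter (needs privileged mode). */
static void pmu_start_cycle_counter(void)
{
    uint32_t pmcr;
    __asm__ volatile("mrc p15, 0, %0, c9, c12, 0" : "=r"(pmcr));      /* read PMCR */
    pmcr |= (1u << 0) | (1u << 2);  /* E: enable counters, C: reset cycle counter */
    pmcr &= ~(1u << 3);             /* D = 0: count every cycle, not every 64th */
    __asm__ volatile("mcr p15, 0, %0, c9, c12, 0" : : "r"(pmcr));     /* write PMCR */
    __asm__ volatile("mcr p15, 0, %0, c9, c12, 1" : : "r"(1u << 31)); /* PMCNTENSET: cycle counter */
}

/* Read the 32-bit cycle counter (PMCCNTR). */
static uint32_t pmu_read_cycles(void)
{
    uint32_t cycles;
    __asm__ volatile("mrc p15, 0, %0, c9, c13, 0" : "=r"(cycles) : : "memory");
    return cycles;
}

#else

/* Host stand-in: NOT a real measurement. Each read advances a fake
 * counter by 100 so the surrounding logic can be exercised off-target. */
static uint32_t fake_cycles;
static void pmu_start_cycle_counter(void) { fake_cycles = 0; }
static uint32_t pmu_read_cycles(void) { fake_cycles += 100u; return fake_cycles; }

#endif

/* Return the cycles spent in routine(), including call and read overhead. */
static uint32_t measure_cycles(void (*routine)(void))
{
    assert(routine != NULL);
    pmu_start_cycle_counter();
    uint32_t start = pmu_read_cycles();
    routine();
    uint32_t end = pmu_read_cycles();
    return end - start;  /* unsigned subtraction tolerates a single wraparound */
}

/* Placeholder workload; substitute the subroutine you want to profile. */
static void sample_routine(void)
{
    volatile uint32_t sum = 0;
    for (uint32_t i = 0; i < 1000u; ++i) {
        sum += i;
    }
}
```

On the target you would call `measure_cycles(sample_routine)` and print or log the result. The figure includes the overhead of the call and of the two counter reads, so measuring an empty routine first and subtracting that value gives a cleaner number. The counter is 32 bits wide, so for long-running routines you may need to set the D bit in PMCR or handle overflow.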