ARM provides cycle accurate simulation using ARM Cycle Models.
Cycles models gather all of the PMU events in a non-intrusive way for doing performance optimization.
You may also want to have a look at: ARM Cycle Models IP Home Page
Please take a look at community articles on the topic such as: System Performance Analysis and the ARM Performance Monitor Unit (PMU)