ETM implementations typically provide only instruction trace. They do not usually provide any means of seeing data accesses, and where they do, they show only external memory accesses, not register accesses. What you are asking for is not useful for normal software development, so I'm not sure who would have a solution for it in hardware (it would be a waste of gates, and very power and bandwidth hungry). I think you are stuck with an invasive methodology implemented in software for this ...
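
To give a rough idea of what an invasive software approach could look like, here is a minimal sketch in C. It assumes you can modify the code under test and call a logging function at the points you care about. Every name in it (`trace_record`, `trace_get`, `TRACE_CAPACITY`, the `id` labels) is made up for illustration and is not part of any ETM or vendor API. It keeps the most recent samples in a small ring buffer in RAM that you can read back from a debugger afterwards.

```c
#include <stdint.h>

/* Hypothetical buffer size; it must be a power of two for the index mask. */
#define TRACE_CAPACITY 64u

typedef struct {
    uint32_t id;    /* label you choose for the register or variable sampled */
    uint32_t value; /* the value passed in at the instrumentation point */
} trace_entry_t;

static trace_entry_t trace_buf[TRACE_CAPACITY];
static uint32_t trace_count; /* total number of records ever written */

/* Store one sample, overwriting the oldest entry once the buffer is full.
 * Not interrupt-safe as written: guard it if it is called from ISRs too. */
static void trace_record(uint32_t id, uint32_t value)
{
    trace_entry_t *e = &trace_buf[trace_count & (TRACE_CAPACITY - 1u)];
    e->id = id;
    e->value = value;
    trace_count++;
}

/* Number of valid entries currently held in the buffer. */
static uint32_t trace_size(void)
{
    return trace_count < TRACE_CAPACITY ? trace_count : TRACE_CAPACITY;
}

/* Return the i-th oldest entry (i = 0 is the oldest one still held). */
static trace_entry_t trace_get(uint32_t i)
{
    /* Once the buffer has wrapped, the oldest entry sits at the slot
     * that the next write would overwrite. */
    uint32_t start = trace_count < TRACE_CAPACITY ? 0u : trace_count;
    return trace_buf[(start + i) & (TRACE_CAPACITY - 1u)];
}
```

You would then sprinkle calls such as `trace_record(1u, some_value);` through the code at the places of interest. Bear in mind that this changes timing and code size, which is exactly what makes the approach invasive, and getting at actual core register values rather than C variables would need compiler- or architecture-specific code that is not shown here.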