Assuming you can run Linux on the platform you are using, I'd recommend using LMBench - it can provide a nice set of data for latency and bandwidth for the various levels in the memory system.