I am looking at the performance of OpenMP program, specifically cache and memory performance. I have found guidelines while back ago how to analyze performance with Vtune that mentioned which counters to watch out for. However now cannot seem to find the manual.
If you know which manual I have in question or if you know the counters/events, please let me know. Also if you have other techniques for analyzing multithreaded memory performance, please share if you can
Thanks
Here is an article discussion this topic.
The most common counters to examine are L2 cache misses and branch prediction misses.
Note that, in VS2010, you can use the concurrency visualizer in the new profiling tools to directly see this. It does a great job of helping you analyze this information, including directly showing you how your code lays out, showing you misses, blocks, and many other useful tools for debugging and profiling concurrent apps.