  • Profiling OpenMP with Arm MAP 5.0
    A whirlwind tour of Arm MAP's new OpenMP profiling capabilities. We're going to see what Arm MAP 5.0 can do by profiling three versions of a simple PI calculator program with some added I/O for good...
  • Debugging and Profiling HPC Applications while Working Remotely
    The ongoing impact of the COVID-19 pandemic means that more and more scientific research is being conducted by teams working remotely. While remote access to compute resources is nothing new, visual...
  • Profiling and Tuning Linpack: A Step-by-Step Guide
    This year we're proud to be sponsoring the Student Cluster Competition at SC15. One of the key codes teams will have to optimize for their systems is the classic Linpack benchmark. I decided to have a...
  • Profiling Python and compiled code with Arm Forge – and a performance surprise
    If you are developing HPC applications, there is a good chance that you have been in contact with Python these days. Whether you use Python to orchestrate large workflows, to quickly put together small...
  • CUDA Debugger and Profiler - Advanced Debugging and Performance Optimization Tools for CUDA and OpenACC
    Debugging and Optimizing CUDA and OpenACC: Arm Forge is a development tool suite for developing, debugging and optimizing CUDA and OpenACC codes - from GeForce to Tesla and the Kepler K80. Forge includes...