Arm Community
Site
Search
User
Site
Search
User
Arm Community blogs
Servers and Cloud Computing blog
Blogs
Mentions
Sub-Groups
Tags
Jump...
Cancel
Servers and Cloud Computing blog
Tags
Subscribe by email
More
Cancel
By date
By view count
By comment count
Descending
Ascending
Detecting I/O contention in HPC code using Arm Forge Pro GPFS metrics
Chris January
I/O contention is a frustrating problem to solve. An application run may be taking longer than expected, but how do you know if it’s due to I/O contention?
July 26, 2017
Arm HPC Case Study: University of Cambridge
Darren Cepulis
The University of Cambridge examines the readiness and potential of Arm-based platforms for High Performance Computing.
July 13, 2017
ISC17: Arm HPC Community Rising
Darren Cepulis
With deployments of Arm-based servers for HPC now on the near horizon, ISC17 in Frankfurt was a prime opportunity for partners and end-users to begin talking more openly about their plans and intentions…
June 28, 2017
Advancing scientific codes with Arm Fortran Compiler
Ashok Bhat
Alongside the open source compilers for porting scientific applications to Armv8-A, for the first time sits a new commercial solution from Arm, promising regular and targeted updates made specifically…
June 13, 2017
“Going ARM”: ISC (International Supercomputing Conference) High Performance - 2017 Workshop
Berenice Mann
Join us at ISC 2017 in Frankfurt, Germany as ARM hosts its “Going ARM” Workshop on Thursday, June 22 from 09:00-13:00. This workshop is all about sharing experiences and knowledge. Attendees will gain…
May 12, 2017
Arm acquires Allinea: The exciting road ahead
David Lecomber
It’s with great excitement that we’re announcing that Allinea is now a part of Arm.
December 16, 2016
Deep Learning Episode 4: Supercomputer vs Pong II
Mark O'Connor
It looks like DeepMind-level performance is within our grasp! I took a quick look to see how our 90-core cloud run looks under a profiler
October 20, 2016
Deep Learning Episode 3: Supercomputer vs Pong
Mark O'Connor
This spring my attention was caught by another interesting result: using a new approach they were able to beat Atari Pong in just 2 hours of training time on a single node by parallelizing the work into…
October 19, 2016
Technology Update: Scalable Vector Extension (SVE) for Armv8-A
Nigel Stephens
I had the opportunity to present the latest update to our Armv8-A architecture, known as the Scalable Vector Extension or SVE. In this post I run through the latest update.
August 22, 2016
Deep Learning Episode 2: Scaling TensorFlow over multiple EC2 GPU nodes
Mark O'Connor
Not everybody has access to an Infiniband-backed supercomputer, but anyone can launch a fleet of EC2 GPU instances for a few dollars per hour. Can we get similar speedups in the cloud? I decided to find…
August 3, 2016
Deep Learning Episode 1: Optimizing DeepMind's A3C on Torch
Mark O'Connor
In February, a new paper from Google's DeepMind team appeared on arxiv. This one was interesting – they showed dramatically improved performance and training time of their Atari-playing Deep Q-Learning…
June 3, 2016
Writing a MAP Custom Metric: PAPI IPC
Mark O'Connor
Arm MAP isn't just a lightweight profiler to help you optimize your code. It also lets you add your own metrics with just a couple of lines of code.
April 21, 2016
Tips for Debugging Fortran
David Lecomber
We look at debugging tips for Fortran and F90 developers to show why and how to use a debugger for some typical bugs.
March 5, 2016
Advanced Memory Debugger and Memory Leak Detection for C++, C and F90 Applications
Mark O'Connor
The memory debugger in Arm DDT assists in fixing a number of common memory usage errors with C, C++ and Fortran codes on Linux.
February 11, 2016
Fixing Dangling Pointers
Mark O'Connor
Dangling pointers are pointers whose memory has been freed but which have not been set to null (or 0x0). This allows a particularly tricky class of bug to arise...
December 22, 2015
Profiling and Tuning Linpack: A Step-by-Step Guide
Mark O'Connor
I expected a very straightforward tuning process – after all, this code has been a staple of the HPC community for decades – but as it turns out I was in for a surprise!
October 21, 2015
Optimizing Discovar - Part 2: Running in the cloud on Amazon EC2
Mark O'Connor
In Part 1 I ran Discovar, a life sciences genome assembly code, on one of our internal systems and optimized it to run the benchmark code 7% faster. Of course, physical hardware often performs very differently…
August 18, 2015
How to debug and profile those mixed Python and Fortran codes
David Lecomber
Python is pretty commonplace in scientific computing these days. It is easy to code and powerful - but numerical computation is not a strength that Python has.
February 11, 2015
Profiling OpenMP with Arm MAP 5.0
Mark O'Connor
We're going to see what Arm MAP 5.0 can do by profiling three versions of a simple PI calculator program with some added I/O for good fun
February 3, 2015
CUDA Debugger and Profiler - Advanced Debugging and Performance Optimization Tools for CUDA and OpenACC
Mark O'Connor
Arm Forge is a development tool suite for developing, debugging and optimizing CUDA and OpenACC codes - from GeForce to Tesla and the Kepler K80.
November 25, 2014
Boosting OpenFOAM behavior with Arm Performance Reports
Florent Lebeau
OpenFOAM, developed by ESI-OpenCFD is one of the most popular tools for developing CFD (Computational Fluid Dynamics) applications, along with ANSYS Fluent or CD-Adapco Star-CCM+.
November 3, 2014
<
>