Arm Community
Site
Search
User
Site
Search
User
Groups
Education Hub
Distinguished Ambassadors
Open Source Software and Platforms
Research Collaboration and Enablement
Forums
AI and ML forum
Architectures and Processors forum
Arm Development Platforms forum
Arm Development Studio forum
Arm Virtual Hardware forum
Automotive forum
Compilers and Libraries forum
Graphics, Gaming, and VR forum
High Performance Computing (HPC) forum
Infrastructure Solutions forum
Internet of Things (IoT) forum
Keil forum
Morello forum
Operating Systems forum
SoC Design and Simulation forum
SystemReady Forum
Blogs
AI and ML blog
Announcements
Architectures and Processors blog
Automotive blog
Graphics, Gaming, and VR blog
High Performance Computing (HPC) blog
Infrastructure Solutions blog
Internet of Things (IoT) blog
Operating Systems blog
SoC Design and Simulation blog
Tools, Software and IDEs blog
Support
Arm Support Services
Documentation
Downloads
Training
Arm Approved program
Arm Design Reviews
Community Help
More
Cancel
Arm Community blogs
High Performance Computing (HPC) blog
Blogs
Mentions
Sub-Groups
Tags
Jump...
Cancel
High Performance Computing (HPC) blog
Tags
Subscribe by email
More
Cancel
By date
By view count
By comment count
Descending
Ascending
Advancing scientific codes with Arm Fortran Compiler
Ashok Bhat
Alongside the open source compilers for porting scientific applications to Armv8-A, for the first time sits a new commercial solution from Arm, promising regular and targeted updates made specifically…
June 13, 2017
“Going ARM”: ISC (International Supercomputing Conference) High Performance - 2017 Workshop
Berenice Mann
Join us at ISC 2017 in Frankfurt, Germany as ARM hosts its “Going ARM” Workshop on Thursday, June 22 from 09:00-13:00. This workshop is all about sharing experiences and knowledge. Attendees will gain…
May 12, 2017
Arm acquires Allinea: The exciting road ahead
David Lecomber
It’s with great excitement that we’re announcing that Allinea is now a part of Arm.
December 16, 2016
Deep Learning Episode 4: Supercomputer vs Pong II
Mark O'Connor
It looks like DeepMind-level performance is within our grasp! I took a quick look to see how our 90-core cloud run looks under a profiler
October 20, 2016
Deep Learning Episode 3: Supercomputer vs Pong
Mark O'Connor
This spring my attention was caught by another interesting result: using a new approach they were able to beat Atari Pong in just 2 hours of training time on a single node by parallelizing the work into…
October 19, 2016
Technology Update: Scalable Vector Extension (SVE) for Armv8-A
Nigel Stephens
I had the opportunity to present the latest update to our Armv8-A architecture, known as the Scalable Vector Extension or SVE. In this post I run through the latest update.
August 22, 2016
Deep Learning Episode 2: Scaling TensorFlow over multiple EC2 GPU nodes
Mark O'Connor
Not everybody has access to an Infiniband-backed supercomputer, but anyone can launch a fleet of EC2 GPU instances for a few dollars per hour. Can we get similar speedups in the cloud? I decided to find…
August 3, 2016
Deep Learning Episode 1: Optimizing DeepMind's A3C on Torch
Mark O'Connor
In February, a new paper from Google's DeepMind team appeared on arxiv. This one was interesting – they showed dramatically improved performance and training time of their Atari-playing Deep Q-Learning…
June 3, 2016
Writing a MAP Custom Metric: PAPI IPC
Mark O'Connor
Arm MAP isn't just a lightweight profiler to help you optimize your code. It also lets you add your own metrics with just a couple of lines of code.
April 21, 2016
Tips for Debugging Fortran
David Lecomber
We look at debugging tips for Fortran and F90 developers to show why and how to use a debugger for some typical bugs.
March 5, 2016
Advanced Memory Debugger and Memory Leak Detection for C++, C and F90 Applications
Mark O'Connor
The memory debugger in Arm DDT assists in fixing a number of common memory usage errors with C, C++ and Fortran codes on Linux.
February 11, 2016
Fixing Dangling Pointers
Mark O'Connor
Dangling pointers are pointers whose memory has been freed but which have not been set to null (or 0x0). This allows a particularly tricky class of bug to arise...
December 22, 2015
Profiling and Tuning Linpack: A Step-by-Step Guide
Mark O'Connor
I expected a very straightforward tuning process – after all, this code has been a staple of the HPC community for decades – but as it turns out I was in for a surprise!
October 21, 2015
Optimizing Discovar - Part 2: Running in the cloud on Amazon EC2
Mark O'Connor
In Part 1 I ran Discovar, a life sciences genome assembly code, on one of our internal systems and optimized it to run the benchmark code 7% faster. Of course, physical hardware often performs very differently…
August 18, 2015
How to debug and profile those mixed Python and Fortran codes
David Lecomber
Python is pretty commonplace in scientific computing these days. It is easy to code and powerful - but numerical computation is not a strength that Python has.
February 11, 2015
Profiling OpenMP with Arm MAP 5.0
Mark O'Connor
We're going to see what Arm MAP 5.0 can do by profiling three versions of a simple PI calculator program with some added I/O for good fun
February 3, 2015
CUDA Debugger and Profiler - Advanced Debugging and Performance Optimization Tools for CUDA and OpenACC
Mark O'Connor
Arm Forge is a development tool suite for developing, debugging and optimizing CUDA and OpenACC codes - from GeForce to Tesla and the Kepler K80.
November 25, 2014
Boosting OpenFOAM behavior with Arm Performance Reports
Florent Lebeau
OpenFOAM, developed by ESI-OpenCFD is one of the most popular tools for developing CFD (Computational Fluid Dynamics) applications, along with ANSYS Fluent or CD-Adapco Star-CCM+.
November 3, 2014
<
>