• Game cost budgeting, Valhall shader profiling, and more with Mobile Studio 2020.2

    Peter Harris
    Peter Harris

    We have recently released Arm Mobile Studio 2020.2, the latest version of our performance analysis tool suite for Android applications. This release includes a number of new features, as well as some major quality-of-life improvements to make using the…

    • 2 months ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • GPU Benchmark Tools Recommendation

    Vikash
    Vikash

    Hi There,

    Could you please suggest some recommended GPU benchmark tools for OpenGL ES 3.2 and Vulkan 1.1 for Linux and Android based devices?

    I would like to check those tools on https://developer.arm.com/ip-products/graphics-and-multimedia/mali-gpus…

    • Answered
    • 5 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Unleashing Developer Potential Through Total Compute

    Ian Smythe
    Ian Smythe

    In my first blog about Total Compute, I discussed how this new approach would accelerate compute performance. This is being achieved through a solutions-based approach across the entire SoC focusing on digital immersion use-cases and experiences on mobile…

    • 5 months ago
    • Innovation
    • Innovation blog
  • Maximizing the System Efficiency of Augmented Reality Devices

    Chris Szabo
    Chris Szabo

    Future augmented reality (AR) devices execute workloads that create immersive experiences - a synergy between real and virtual worlds. Just like our current mobile devices, it is not enough to perform just a single task efficiently, so a collection of…

    • 6 months ago
    • Innovation
    • Innovation blog
  • Top-Down microarchitectural analysis

    YHuerta
    YHuerta

    Hi,

    I was wondering if there are any documentations on how to analyze the ARM pipeline. I have access to thunderx2 nodes, and i'd like to make bottleneck analysis like can be done on intel chips.  i can get the formulas to get compute the different…

    • 6 months ago
    • High Performance Computing
    • HPC forum
  • Performance Advisor crash

    WildMaN
    WildMaN

    Can't compile the Performance Advisor report with the following error. Any tips appreciated!

    !ENTRY org.eclipse.osgi 4 0 2020-04-09 00:37:49.570
    !MESSAGE Application error
    !STACK 1
    java.lang.AssertionError
    at com.arm.meerkat.application.capturedata.Frame…

    • Answered
    • 7 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Automated Performance Advice for Android Games

    Peter Harris
    Peter Harris

    Mobile gaming is advancing rapidly in terms of sophistication, with increasing numbers of successful high-fidelity 3D games exploiting the performance capabilities of the latest smartphones. As the complexity of content rises, the traditional approach…

    • 8 months ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • Streamline中基于统计的性能分析功能介绍

    章政
    章政

    原文参考

    ARM性能分析器(Streamline) 7.0版开始支持基于SPE(统计性能分析扩展)的性能分析。SPE是Armv8.2-A架构中的一个可选扩展特性,它可以以很低的代价对处理器的流水线进行采样。Streamline可以从Arm Development Studio和Arm Mobile Stuido获取。

    为什么需要SPE?

    Streamline前序版本只能通过硬件计数器和软中断中采样PC信息获取CPU的信息。硬件计数器只记录触发事件加总数,不可能知道具体哪条指令触发了事件,它只能大致提供应用程序哪块代码触发了这个计数器增加…

    • 10 months ago
    • 中文社区
    • 中文社区博客
  • [ArmV8] [Cortex-A53] [PMU] PM_CCNTR to measure cpuload

    Amr Fawzy
    Amr Fawzy

    Dear Experts

    I am working on a target that contains quad A53 cores operating at 1GHz. The operating system idle loop contains WFI inline assembly instruction. I know that the Core Clock halts during the WFI instruction which can be seen on the PM_CCNTR…

    • 10 months ago
    • Processors
    • Cortex-A / A-Profile forum
  • Introduction to Statistical Profiling Support in Streamline

    Ben Gainey
    Ben Gainey

    Starting with version 7.0, Arm Streamline Performance Analyzer (Streamline) supports profiling using the Statistical Profiling Extension (SPE). SPE is an optional extension to the Armv8.2-A architecture that allows low probe effect hardware sampling of…

    • 10 months ago
    • Software Tools
    • Tools, Software and IDEs blog
  • Accelerate your shaders with Mali Offline Compiler 7.0

    Peter Harris
    Peter Harris

    Shader programs for OpenGL ES and Vulkan are one of the most important inputs an application provides to render a scene because they define the processing operations executed by the GPU shader core hardware. They are also one of the hardest aspects of…

    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • Arm Allinea Studio 19.2: building on Libraries and Arm Compiler for Linux performance

    Patrick Wohlschlegel
    Patrick Wohlschlegel

    Arm Allinea Studio 19.2 is now available. This new major release includes valuable updates to the Arm Performance Libraries (Arm PL) and the Arm Compiler for Linux. This new version includes our first attempt at the Arm Opt Report and the introduction…

    • over 1 year ago
    • High Performance Computing
    • HPC blog
  • Arm Forge 19.1: Introducing "Forge Ultimate" edition and region profiling capabilities

    Patrick Wohlschlegel
    Patrick Wohlschlegel

    Arm Forge 19.1 is now available. This new major version includes the launch of a new Arm Forge Ultimate edition and the introduction of "region profiling", leveraging LLNL's work on Caliper.

    Introduction of Arm Forge Ultimate

    By popular…

    • over 1 year ago
    • High Performance Computing
    • HPC blog
  • Count Main TLB miss

    scribnote5
    scribnote5

    Hello, experts:

    My platform is a Cortex-A9 MPCore CPU, Sabre Lite(i.mx6).

    I tried to count TLB miss so I implemented PMUEVENT to check micro TLB miss.

    But PMUEVENT doesn't support the main TLB miss event.

    In cortex-a9 RFP, "If there is a miss in main…

    • Answered
    • over 1 year ago
    • Processors
    • Cortex-A / A-Profile forum
  • How could I find the critical functions when dual core cpu is fully loading?

    Myles
    Myles

    It can see almost 100% CPU0 usage during the timeline 2.34s ~ 2.42s(in figure 1). How can we know which functions are executed in CPU0/ CPU1 @timeline 2.376s(in figure 2)?

    The figure 3 is show the functions sheet @timeline 2.376s, but no more information…

    • Answered
    • over 1 year ago
    • Software Tools
    • Arm Development Studio forum
  • How to analyze critical function(in kernel module) that caused CPU-bound task by streamline?

    Myles
    Myles

    We are going to analyze the CPU heavy task(in kernel module), but the streamling tool could not show the detail.

    The problem is that the critical functions(caused CPU-bound task) executed in kernel module, and the streamline tool just showed the unknown…

    • Answered
    • over 1 year ago
    • Software Tools
    • Arm Development Studio forum
  • Accelerating Mali GPU analysis using Arm Mobile Studio

    Peter Harris
    Peter Harris

    Japanese version - 日本語版 
    Korean version - 한국어판

    The Streamline performance analyzer is a sample-based profiler which can present detailed performance information about the Arm CPU and Mali GPU present in a device. Recent versions of Streamline have included…

    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • System Performance Analysis and the Arm Performance Monitor Unit (PMU)

    Jason Andrews
    Jason Andrews

    Carbon cycle accurate models of Arm CPUs enable system performance analysis by providing access to the Performance Monitor Unit (PMU). Carbon models instrument the PMU registers and record PMU events into the The specified item was not found. System…

    • pmu.tar.gz
    • over 5 years ago
    • System
    • SoC Design blog
  • Mali Midgard Family Performance Counters

    Peter Harris
    Peter Harris

    Analysis and optimization of graphics and compute content running on a GPU is an important task when trying to build a top quality system integration, or a compelling high performance application. For developers working with the public APIs, such as…

    • over 4 years ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • Using the ARM Performance Monitor Unit (PMU) Linux Driver

    Jason Andrews
    Jason Andrews

    The Linux kernel provides an ARM PMU driver for counting events such as cycles, instructions, and cache metrics. My previous article covered how to access data from the PMU automatically within SoC Designer by enabling hardware profiling events. It also…

    • over 5 years ago
    • System
    • Embedded blog
  • Mali Bifrost Family Performance Counters

    Peter Harris
    Peter Harris

    Analysis and optimization of graphics and compute content running on a GPU is an important task when trying to build a top quality system integration, or a compelling high performance application. For developers working with the public APIs, such as OpenGL…

    • over 4 years ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • The Mali GPU: An Abstract Machine, Part 2 - Tile-based Rendering

    Peter Harris
    Peter Harris

    Chinese version of this blog - thanks to vincent for the translation!

    In my previous blog I started defining an abstract machine which can be used to describe the application-visible behaviors of the Mali GPU and driver software. The purpose of this…

    • over 6 years ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • The Mali GPU: An Abstract Machine, Part 3 - The Midgard Shader Core

    Peter Harris
    Peter Harris

    EDIT: Updated March 2015 to include more information on the GPU memory system to help developers optimizing compute shaders.

    In the first two blogs of this series I introduced the frame-level pipelining [The Mali GPU: An Abstract Machine, Part 1 - Frame…

    • over 6 years ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • The Mali GPU: An Abstract Machine, Part 1 - Frame Pipelining

    Peter Harris
    Peter Harris

    Chinese version of this blog

    Optimization of graphics workloads is often essential to many modern mobile applications, as almost all rendering is now handled directly or indirectly by an OpenGL ES based rendering back-end. One of my colleagues, Michael…

    • over 6 years ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • The Mali GPU: An Abstract Machine, Part 4 - The Bifrost Shader Core

    Peter Harris
    Peter Harris

    We have recently announced the first GPU in the Mali Bifrost architecture family, the Mali-G71. While the overall rendering model it implements is similar to previous Mali GPUs – the Bifrost family is still a deeply pipelined tile-based renderer…

    • over 4 years ago
    • Graphics and Gaming
    • Graphics and Gaming blog
>