Arm Community
Site
Search
User
Site
Search
User
Arm Developer
Documentation
Learning Paths
On-Demand Videos
Groups
Arm Ambassadors
Education Hub
Open Source Software and Platforms
Research Collaboration and Enablement
Forums
AI forum
Architectures and Processors forum
Arm Development Platforms forum
Arm Development Studio forum
Automotive forum
Compilers and Libraries forum
Embedded and Microcontrollers forum
Internet of Things (IoT) forum
Keil forum
Laptops and Desktops forum
Mobile, Graphics, and Gaming forum
Morello forum
Operating Systems forum
Servers and Cloud Computing forum
SoC Design and Simulation forum
SystemReady Forum
Blogs
AI blog
Announcements
Architectures and Processors blog
Automotive blog
Embedded and Microcontrollers blog
Internet of Things (IoT) blog
Laptops and Desktops blog
Mobile, Graphics, and Gaming blog
Operating Systems blog
Servers and Cloud Computing blog
SoC Design and Simulation blog
Tools, Software and IDEs blog
Support
Arm Support Services
Documentation
Downloads
Training
Arm Approved program
Arm Design Reviews
Community Help
More
Cancel
Arm Community blogs
Architectures and Processors blog
Blogs
Mentions
Sub-Groups
Tags
Jump...
Cancel
Architectures and Processors blog
Tags
Subscribe by email
More
Cancel
By date
By view count
By comment count
Descending
Ascending
Caches and Self-Modifying Code: Working with Threads
Jacob Bramley
How to synchronize JIT-compiled instructions across threads.
January 21, 2025
Caches and Self-Modifying Code: Implementing `__clear_cache`
Jacob Bramley
How to implement `__clear_cache` using assembly.
January 20, 2025
The when, why and how of waiting and backoff in multi-threaded applications on Arm
Ola Liljedahl
Read about the different user space delays and wait implementations for the Armv8+ architecture and best practices for the purpose of improving throughput and fair access to shared resources.
December 13, 2024
Using SVE in C#
Alan Hayward
.NET 9 introduces SVE support on Arm, allowing users to write simplified vectorised code. This blog post gives examples in C# and compares it to C++.
November 20, 2024
Part 3: Enabling PAC and BTI on AArch64 for Linux
Bill Roberts
Supporting C++ style exceptions and DWARF for Pointer Authentication Codes (PAC) signed pointers.
November 20, 2024
Part 2: Enabling PAC and BTI on AArch64 for Linux
Bill Roberts
Utilizing Pointer Authentication Codes (PAC) and Branch Target Instructions (BTI) together and optimizations in instruction counts.
November 19, 2024
Enabling PAC and BTI on AArch64 for Linux
Bill Roberts
This post will show how Linux developers can enable Pointer Authentication Codes (PAC) and Branch Target Instructions (BTI) them in C/C++ code bases.
November 18, 2024
Meeting the demand for software interoperability on Arm-based hardware
Dong Wei
How vendors can reduce cost and adapt to longer product lifecycles with an updated, compliance-based Arm SystemReady programme.
November 1, 2024
Arm A-Profile Architecture Developments 2024
Martin Weidmann
Arm's 2024 v9.6-A extensions introduce significant updates to enhance computing performance, efficiency, and security for today’s AI enabled software.
October 1, 2024
Accelerate multi-token search in strings with SVE2 SVMATCH instruction
Yibo Cai
The SVMATCH instruction in Arm SVE2 accelerates multi-token string matches, boosting performance in tasks like JSON decoding.
September 25, 2024
Accelerating video decode and image processing with Armv9 CPUs and SVE2
Poulomi Dasgupta
This blog post explores three video and image use cases demonstrating the proven impact of the Armv9 CPU architectural features.
September 23, 2024
Part 3: Matrix-matrix multiplication. Neon, SVE, and SME compared
Khalid Saadi
This blog post describes how to implement the same matrix-matrix multiplication algorithm using three different Arm technologies: Neon, SVE, and SME.
August 6, 2024
Part 2: Arm Scalable Matrix Extension (SME) Instructions
Zenon Xiu (修志龙)
This blog is the second half of a two-part blog for SME Instructions. See link to Part 1 in the note at the top of this blog post.
June 24, 2024
Part 1: Arm Scalable Matrix Extension (SME) Introduction
Zenon Xiu (修志龙)
This blog series provides an introduction to the Arm Scalable Matrix Extension (SME) including SVE and SVE2.
May 23, 2024
MPAM-Style cache partitioning with ATP-Engine and gem5
Hristo Belchev
Upstream gem5 and ATP-Engine MPAM-style cache partitioning are discussed, with experiments for the feature being proposed and analyzed.
April 24, 2024
Optimizing your programs for Arm platforms
Tamar Christina
This blog covers techniques and tips that are useful to create better performing programs through compilers whether you are creating Android, Desktop or Server applications.
April 24, 2024
Deep dive into the PMU value of L2D_CACHE_WR on the Neoverse N2 server
Ker Liu
In-depth analysis of what the PMU of L2D_CACHE_WR counts on the Neoverse N2 server.
April 15, 2024
Arm SPE: SoC Telemetry & Performance Analysis using Statistical Profiling Extension
Brian Jeff
We refer to the SPE performance methodology whitepaper published by Arm for details on the content of this blog.
December 8, 2023
Implementing the WebAssembly bitmask operations on the 64-bit Arm architecture
Anton Kirilov
We discuss some of the challenges that we face when we are trying to implement the WebAssembly SIMD bitmask operations on the 64-bit Arm architecture.
December 6, 2023
Profile firmware with Performance Monitor Unit (PMU) in Armv8-A CPU
Jiaming Guo
This blog introduces how to profile firmware with Performance Monitor Unit (PMU) in Armv8-A CPU. It provides a reference implementation of the PMU library.
November 8, 2023
Views on Relaxed Atomics in C++ from Arm’s technical leadership team
Richard Grisenthwaite
Views from the Arm technical leadership team that it believes that today, the cost implications of tightening C++ relaxed atomics outweigh its benefit.
November 3, 2023
>