Home
TECHNICAL PAPERS

Effects Of Hardware Prefetchers For Scientific Application Kernels Running on High-End Processors

popularity

A new technical paper titled “Memory Prefetching Evaluation of Scientific Applications on A Modern HPC Arm-based Processor” was published by researchers at Jülich Supercomputing Centre and KTH Royal Institute of Technology.

Abstract
“Memory prefetching is a well-known technique for mitigating the negative impact of memory access latencies on memory bandwidth. This problem has become more pressing as improvements in memory bandwidth have not kept pace with increases in computational power. While much existing work has been devoted to finding appropriate prefetching techniques for specific workloads, few provide insight into the behavior of scientific applications to better understand the impact of prefetchers. This paper investigates the impact of hardware prefetchers on the latest Arm-based high-end processor architectures. In this work, we investigate memory access patterns by analyzing locality properties and visualizing delta and repetitive address patterns. A deeper understanding of memory access patterns allows the use of the appropriate prefetcher and reaching a better correlation between access pattern properties and prefetcher performance. This can guide future co-design efforts. We evaluated traditional and innovative prefetchers using a gem5-based model of Arm Neoverse V1 cores. The model features a 16-core architecture, using Amazon’s Graviton 3 processor as a hardware reference, but substituting DDR5 by high bandwidth memory (HBM2). We performed a detailed prefetching evaluation focusing on stencil, sparse matrix-vector multiplication, and Breadth-First Search kernels. These kernels represent a broad range of the applications running on today’s High-Performance Computing (HPC) systems, which are sensitive to memory performance.”

Find the technical paper here. May 2025.

N. Ho, C. Falquez, A. Portero, E. Suarez and D. Pleiter, “Memory Prefetching Evaluation of Scientific Applications on A Modern HPC Arm-based Processor,” in IEEE Access, doi: 10.1109/ACCESS.2025.3569533.



Leave a Reply


(Note: This name will be displayed publicly)