Beyond The Demo: Deploying And Evaluating Open-Source AI Workloads

By Odin Shen - 11 Jun, 2026 - Comments: 0

As more open-source AI models move closer to real-world adoption, developers are changing how they evaluate edge deployment. The question is no longer simply whether a model can run, but whether it can be deployed reproducibly on a concrete platform, observed in practice, and turned into meaningful deployment decisions based on actual technical evidence. For developers, the CIX Armv9 platfor... » read more

Introducing “The Architecture Speaks”

By Jade Alglave - 14 May, 2026 - Comments: 0

What are specifications used for? How do you use them? Are they intelligible? These questions are at the heart of the project that produces a new tool called "The Architecture Speaks". This is an experimental chatbot tool built on generative AI that aims to provide quick answers to complex questions about the Arm architecture. It also provides links to the Arm Architecture Reference Manual. Th... » read more

Rethinking Robotics Reinforcement Learning: A Practical Humanoid Training Workflow

By Odin Shen - 09 Apr, 2026 - Comments: 0

Reinforcement learning (RL) for robotics is often associated with large GPU clusters, distributed infrastructure, and x86-based development environments. Training a humanoid robot with high-fidelity simulation is a resource-intensive workflow that runs in the data center. What if that workflow could run on a single workstation? In this blog post, we explore a complete robotics pipeline bu... » read more

Rethinking Voice AI At The Edge: A Practical Offline Pipeline

By Odin Shen - 12 Mar, 2026 - Comments: 0

Cloud-based AI dominates the headlines, but responsive and private interaction lies at the edge. This blog post shows how to build a fully offline, real-time voice assistant using the Arm-based NVIDIA DGX Spark platform. The system integrates open-source components such as faster-whisper and vLLM. It delivers low-latency, human-like dialogue without sending data outside the local environment. ... » read more

Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues

By Bolt Liu - 12 Feb, 2026 - Comments: 0

This blog post explains the cross-NUMA memory access issue that occurs when you run llama.cpp in Neoverse. It also introduces a proof-of-concept patch that addresses this issue and can provide up to a 55% performance increase for text generation when you run the llama3_Q4_0 model on the ZhuFeng Neoverse system. Cross-NUMA memory access problem In llama.cpp, performance drops when the number o... » read more

Smarter Write Barriers For Arm64 In .NET CoreCLR

By Alan Hayward - 15 Jan, 2026 - Comments: 0

Last year, I explored how you can use the Arm Scalable Vector Extension (SVE) in .NET to unlock SIMD performance at scale. This year, my focus has shifted to something less visible but just as fundamental to runtime performance. Write barriers in the CoreCLR garbage collector (GC). Write barriers are not a feature most .NET developers ever think about. They do not change how you write C# cod... » read more

Rethinking The Role Of CPUs In AI: A Practical RAG Implementation

By Odin Shen - 11 Dec, 2025 - Comments: 0

In many enterprise environments, engineers and technical staff need to find information quickly. They search internal documents such as hardware specifications, project manuals, and technical notes. These materials are often scattered, making traditional search inefficient. These documents are often confidential or proprietary. This constraint prevents these documents from being processed by... » read more

Future Architecture Technologies: POE2 And vMTE

By Martin Weidmann - 13 Nov, 2025 - Comments: 0

Future Architecture Technologies are features being developed for currently unreleased versions of the Arm architecture. Arm provides the ecosystem with relevant information and specifications in advance, ensuring software support for when new technologies are realized in hardware. This blog introduces two future technologies: Permission Overlay Extension version 2 (POE2), and Virtual T... » read more

Integrated Modular Firmware Solutions: A Vital Component Of Custom Silicon Chiplet Architecture Designs

By Marc Meunier - 16 Oct, 2025 - Comments: 0

By Marc Meunier and Srini Narayana The shift from monolithic SoC designs to chiplet-based architecture isn’t just a packaging innovation. It’s a fundamental rethinking of how custom silicon is designed, manufactured, and deployed. This transition is driven by the growing impracticality of scaling large monolithic dies at advanced nodes. As die sizes increase, so do the costs, yield ri... » read more

How Neural Super Sampling Works: Architecture, Training, And Inference

By Liam O'Neil - 11 Sep, 2025 - Comments: 0

This blog post is the second in our Neural Super Sampling (NSS) series. The post explores why we introduced NSS and explains its architecture, training, and inference components. In August 2025, we announced Arm neural technology that will ship in Arm GPUs in 2026. The first use case of the technology is Neural Super Sampling (NSS). NSS is a next-generation, AI-powered upscaling solution. ... » read more

← Older posts

category: At The Core

category: Auto, Security & Edge AI

Beyond The Demo: Deploying And Evaluating Open-Source AI Workloads

Introducing “The Architecture Speaks”

Rethinking Robotics Reinforcement Learning: A Practical Humanoid Training Workflow

Rethinking Voice AI At The Edge: A Practical Offline Pipeline

Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues

Smarter Write Barriers For Arm64 In .NET CoreCLR

Rethinking The Role Of CPUs In AI: A Practical RAG Implementation

Future Architecture Technologies: POE2 And vMTE

Integrated Modular Firmware Solutions: A Vital Component Of Custom Silicon Chiplet Architecture Designs

How Neural Super Sampling Works: Architecture, Training, And Inference

Trending Articles

The Sub-2nm Paradox

Chip Industry Week In Review

Chip Industry Week In Review

Toward Agentic Verification

Swapping Out Chiplets: I/Os Vs. Compute

Knowledge Centers
Entities, people and technologies explored

Related Articles

CPO Is Extending The Limits Of What’s Possible In AI Data Centers

Flash Getting Stacked High-Bandwidth Version

Can Edge AI Keep Up?

Chiplets Need A New Workflow

HBM4E Raises The Bar For AI Memory Bandwidth

Scale Up, Scale Out Get a New Partner

AI Power on the Edge

Gates Add Functionality, But Wires Create Problems

Sponsors

Recent Comments

About

Navigation

Connect With Us

category: At The Core

category: Auto, Security & Edge AI

Beyond The Demo: Deploying And Evaluating Open-Source AI Workloads

Introducing “The Architecture Speaks”

Rethinking Robotics Reinforcement Learning: A Practical Humanoid Training Workflow

Rethinking Voice AI At The Edge: A Practical Offline Pipeline

Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues

Smarter Write Barriers For Arm64 In .NET CoreCLR

Rethinking The Role Of CPUs In AI: A Practical RAG Implementation

Future Architecture Technologies: POE2 And vMTE

Integrated Modular Firmware Solutions: A Vital Component Of Custom Silicon Chiplet Architecture Designs

How Neural Super Sampling Works: Architecture, Training, And Inference

Trending Articles

The Sub-2nm Paradox

Chip Industry Week In Review

Chip Industry Week In Review

Toward Agentic Verification

Swapping Out Chiplets: I/Os Vs. Compute

Knowledge Centers Entities, people and technologies explored

Related Articles

CPO Is Extending The Limits Of What’s Possible In AI Data Centers

Flash Getting Stacked High-Bandwidth Version

Can Edge AI Keep Up?

Chiplets Need A New Workflow

HBM4E Raises The Bar For AI Memory Bandwidth

Scale Up, Scale Out Get a New Partner

AI Power on the Edge

Gates Add Functionality, But Wires Create Problems

Sponsors

Newsletter Signup

Popular Tags

Recent Comments

About

Navigation

Connect With Us

Knowledge Centers
Entities, people and technologies explored