NPU Acceleration For Multimodal LLMs


Transformer-based models have rapidly spread from text to speech, vision, and other modalities. This has created challenges for the development of Neural Processing Units (NPUs). NPUs must now efficiently support the computation of weights and propagation of activations through a series of attention blocks. Increasingly, NPUs must be able to process models with multiple input modalities with ac... » read more

Dual-Grid Interpolation: For Improved Accuracy Of Overset Grid Systems


Computational fluid dynamics (CFD) has become an integral part of engineering decision-making, providing a deeper understanding of how fluids behave in various scenarios, from the high skies in aerospace all the way to the fast-paced realm of automotive engineering. The task of accurately simulating fluid dynamics, particularly when faced with complex shapes or moving parts, demands innovative ... » read more

Ghostbusting With Simulation: Solving Engineering Challenges In Automotive Radar Development


Probably the biggest trick to the adoption of full autonomy in the automotive space is learning how to safely achieve a level of perception that matches that of a human driver. Carmakers are rising to the challenge with a combination of advanced camera, radar, and lidar sensing technologies, machine learning, and artificial intelligence that makes self-driving possible. This includes the adva... » read more

The Future Of AI For Games


Earlier this month, I had the pleasure of attending the inaugural AI and Games Conference at Goldsmiths in London, for which Arm was an associate sponsor. Hosted by Dr. Tommy Thompson, and borrowing its name from his AI and Games YouTube channel, the day really delivered on the promise of bringing experts and enthusiasts (and subscribers) together for interesting talks on the intersecti... » read more

Small Language Models: A Solution To Language Model Deployment At The Edge?


While Large Language Models (LLMs) like GPT-3 and GPT-4 have quickly become synonymous with AI, LLM mass deployments in both training and inference applications have, to date, been predominately cloud-based. This is primarily due to the sheer size of the models; the resulting processing and memory requirements often overwhelm the capabilities of edge-based systems. While the efficiency of Exped... » read more

To (B)atch Or Not To (B)atch?


When evaluating benchmark results for AI/ML processing solutions, it is very helpful to remember Shakespeare’s Hamlet, and the famous line: “To be, or not to be.” Except in this case the “B” stands for Batched. Batch size matters There are two different ways in which a machine learning inference workload can be used in a system. A particular ML graph can be used one time, preced... » read more

Extending The DDR5 Roadmap With MRDIMM


Given the voracious memory bandwidth and capacity demands of Gen AI and other advanced workloads, we’ve seen a rapid progression through the generations of DDR5 memory. Multiplexed Registered DIMMs (MRDIMMs) offer a new memory module architecture capable of extending the DDR5 roadmap and expanding the capabilities of server main memory. MRDIMM reuses the lion’s share of existing DDR5 infras... » read more

Harnessing Computational Storage For Faster Data Processing


By Ujjwal Negi and Prashant Dixit In the evolving landscape of data storage, computational storage devices (CSDs) are revolutionizing how we process and store data. By embedding processing capabilities within storage units, these devices enable in-situ data manipulation, minimizing data movement between storage and CPUs and dramatically improving performance and efficiency. This paradigm shi... » read more

Simulating Multiple DSPs As Multiple x86 Processes


An increasing number of embedded designs are multi-core systems. At the pre-silicon stage, customers use a simulation platform for architectural exploration and software development. Architects want to quantify the impact of the number of cores, local memory size, system memory latency, and interconnect bandwidth. Software teams wish to have a practical development platform that is not excrucia... » read more

Building Safe And Secure Software With Rust On Arm


The Rust Programming Language has gained the attention of government security agencies, and even the White House, due to its unique blend of safety, performance and productivity. Rust is designed to remove common programming burdens and handle issues like use-after-free errors at compile time. Remarkably, it achieves this without using a garbage collector, generating machine code that rivals th... » read more

← Older posts Newer posts →