How Neural Super Sampling Works: Architecture, Training, And Inference


This blog post is the second in our Neural Super Sampling (NSS) series. The post explores why we introduced NSS and explains its architecture, training, and inference components. In August 2025, we announced Arm neural technology that will ship in Arm GPUs in 2026. The first use case of the technology is Neural Super Sampling (NSS). NSS is a next-generation, AI-powered upscaling solution. ... » read more

How Fast A GPU Do You Need For Your User Interface?


Graphical user interfaces (GUIs) are common across all walks of life, from your smartphones to your TVs and even in your cars. Over the past decade, their complexity has evolved, moving from a simple background with basic icons into beautiful device differentiators, with 3D elements and micro-interactions that enhance the experience: shifting the perception when a phone tilts, or providing an... » read more

Cloud vs. Edge Gaming: Performance Gap Is Shrinking


Chip designers and gaming companies are scrambling to figure out whether the gaming market will tilt toward the cloud, the edge, or some combination of both. Multi-gigabit internet allows more people to play high-end games in the cloud, but edge-based gaming consoles and devices remain well-rooted, more secure, and private. Which one wins? So far, there are more questions than answers. Handh... » read more

Complex Mix Of Processors At The Edge


With AI changing so fast, it’s a juggle for companies to ensure they can deliver the best performance now while also future-proofing for unknown AI models or a completely different approach to training and inference that may emerge. There are a slew of options for high-end and budget phones, hyperscalers, and low-cost, low-power edge devices, and while GPUs keep making headlines, many designe... » read more

Start Experimenting With Neural Super Sampling For Mobile Graphics


Mobile game developers around the world face increasing pressure to meet user expectations for sharper visuals, smoother gameplay, and longer battery life. Balancing these goals on constrained mobile devices often means making trade-offs. Traditional upscaling methods offer limited flexibility. Real-time AI rendering remains complex, power-hungry, or hardware dependent. Neural Super Sampling... » read more

Efficient Failure-Detection Methods for GPU Control-Logic (Hitachi, Osaka Univ., Kyoto Univ.)


A new technical paper titled "A Hardware-Aware Failure-Detection Method for GPU Control-Logic" was published by researchers at Hitachi, Ltd., Osaka University, and Kyoto University. Excerpt "Various failure detection methods have been proposed for SDCs caused by faults in data units such as registers. However, effective methods for detecting SDCs resulting from faults in control logic, such... » read more

Rowhammer Attack On NVIDIA GPUs With GDDR6 DRAM (University of Toronto)


A new technical paper titled "GPUHammer: Rowhammer Attacks on GPU Memories are Practical" was published by researchers at University of Toronto. Abstract: "Rowhammer is a read disturbance vulnerability in modern DRAM that causes bit-flips, compromising security and reliability. While extensively studied on Intel and AMD CPUs with DDR and LPDDR memories, its impact on GPUs using GDDR memorie... » read more

NVIDIA GPU Confidential Computing: Threat Model And Security Insights (IBM Research, Ohio State)


A new technical paper titled "NVIDIA GPU Confidential Computing Demystified" was published by IBM Research and Ohio State University. Abstract "GPU Confidential Computing (GPU-CC) was introduced as part of the NVIDIA Hopper Architecture, extending the trust boundary beyond traditional CPU-based confidential computing. This innovation enables GPUs to securely process AI workloads, providing ... » read more

A New Era Of Edge AI: E-Series GPU IP


Welcome to the Age of Parallel Compute The age of parallel compute has arrived. The emergence of artificial intelligence is having profound implications on hardware development in every single market. Data centres are being built with massive compute resources to handle the demands of training and connected inference. At the edge, embedded hardware system designers are dealing with the chal... » read more

Offline RL Framework That Dynamically Controls The GPU Clock And Server Fan Speed To Optimize Power Consumption And Computation Time (KAIST)


A new technical paper titled "Power Consumption Optimization of GPU Server With Offline Reinforcement Learning" was published by researchers at Korea Advanced Institute of Science and Technology (KAIST) and KT Research and Development Center. "Optimizing GPU server power consumption is complex due to the interdependence of various components. Conventional methods often involve trade-offs: in... » read more

← Older posts Newer posts →