Comprehensive Performance Study of Zero-Knowledge Proofs on GPUs (Univ. of Michigan)

By Technical Paper Link - 14 Oct, 2025 - Comments: 0

A new technical paper titled "ZKProphet: Understanding Performance of Zero-Knowledge Proofs on GPUs" was published by researchers at the University of Michigan. Abstract: "Zero-Knowledge Proofs (ZKP) are protocols which construct cryptographic proofs to demonstrate knowledge of a secret input in a computation without revealing any information about the secret. ZKPs enable novel applications in p...

GPU Driver Update Adds Support For Additional Vulkan And OpenCL Extensions

By Patrik Masson - 09 Oct, 2025 - Comments: 0

Here are some of the highlights of what has been updated in the latest Imagination GPU Linux and Android Driver Development Kits. Leveraging cooperative matrix in Vulkan: To help accelerate graphics post-processing, neural shaders, physics simulations, and machine learning inference on the GPU, DDK 25.2 implements support for VK_KHR_cooperative_matrix. This extension provides Vulkan developers...

The Rise Of AI Co-Processors

By Ed Sperling - 29 Sep, 2025 - Comments: 0

Figuring out the best kinds of processors to use for different AI workloads is a challenge. AI algorithms are undergoing rapid and frequent changes, and the workloads tied to them can vary by data type, by user, and sometimes because of software/firmware updates. On top of that, AI computations tend to require much higher utilization rates than traditional computing, and that will only become m...

Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)

By Technical Paper Link - 15 Sep, 2025 - Comments: 0

A new technical paper titled "Analog in-memory computing attention mechanism for fast and energy-efficient large language models" was published by researchers at Forschungszentrum Jülich and RWTH Aachen. Abstract: "Transformer networks, driven by self-attention, are central to large language models. In generative transformers, self-attention uses cache memory to store token projec...

Optimizing LLM Training Under GPU Memory Constraints (Argonne, RIT)

By Technical Paper Link - 09 Sep, 2025 - Comments: 0

A new technical paper titled "MLP-Offload: Multi-Level, Multi-Path Offloading for LLM Pre-training to Break the GPU Memory Wall" was published by researchers at Argonne National Laboratory and Rochester Institute of Technology. Abstract: "Training LLMs larger than the aggregated memory of multiple GPUs is increasingly necessary due to the faster growth of LLM sizes compared to GPU memory. To...

What Do LLMs Want from Hardware

By Geoff Tate - 08 Sep, 2025 - Comments: 0

Figure 1: Noam Shazeer, Google Gemini vice president, presented this in his Hot Chips 2025 talk. Noam Shazeer is Google’s vice president of engineering for Gemini, its LLM competitor to ChatGPT. He recently spoke at Hot Chips on “Predictions for the Next Phase of AI.” He has worked on LLMs for a decade and co-invented the transformer model in 2017. As his slide says, LLMs can take adv...

Power Stabilization To Allow Continued Scaling Of AI Training Workloads (Microsoft, OpenAI, NVIDIA)

By Technical Paper Link - 28 Aug, 2025 - Comments: 0

A new technical paper titled "Power Stabilization for AI Training Datacenters" was published by researchers at Microsoft, OpenAI, and NVIDIA. Abstract: "Large Artificial Intelligence (AI) training workloads spanning several tens of thousands of GPUs present unique power management challenges. These arise due to the high variability in power consumption during the training. Given the synchron...

LLM Inference: Core Bottlenecks Imposed By Memory, Compute Capacity, Synchronization Overheads (NVIDIA)

By Technical Paper Link - 01 Aug, 2025 - Comments: 0

A new technical paper titled "Efficient LLM Inference: Bandwidth, Compute, Synchronization, and Capacity are all you need" was published by NVIDIA. Abstract: "This paper presents a limit study of transformer-based large language model (LLM) inference, focusing on the fundamental performance bottlenecks imposed by memory bandwidth, memory capacity, and synchronization overhead in distributed ...

GPU Acceleration Of Rigorous Lithography Simulations

By Wolfgang Demmerle - 24 Jul, 2025 - Comments: 0

Producing modern semiconductor devices is an immensely challenging process. Successful execution entails advanced process nodes, novel device architectures, new materials, and many fabrication steps. One especially challenging area is lithography, in which light is sent through a photomask, passes through a projection system of lenses and mirrors, and strikes the substrate to create the device ...

Data Center CPU Dominance Is Shifting To AMD And Arm

By Geoff Tate - 14 Jul, 2025 - Comments: 0

Fig. 1: Created by ChatGPT from a text prompt. The data center processor market has seen two major tectonic shifts in the last decade. It used to be that all data center compute was x86, and well more than 90% of that was Intel. GPUs first appeared in the data center in 2016 (Pascal GPU). Now, the majority of computation is done on GPUs. AMD is looking to pass Intel in x86 share, and...


tag: GPUs

