Stacking Persistent Embedded Memories Based On Oxide Transistors Upon GPGPU Platforms (Georgia Tech)


A new technical paper titled "CMOS+X: Stacking Persistent Embedded Memories based on Oxide Transistors upon GPGPU Platforms" was published by Georgia Tech. Abstract "In contemporary general-purpose graphics processing units (GPGPUs), the continued increase in raw arithmetic throughput is constrained by the capabilities of the register file (single-cycle) and last-level cache (high bandwidth... » read more

Review Paper: Wafer-Scale Accelerators Versus GPUs (UC Riverside)


A new technical paper titled "Performance, efficiency, and cost analysis of wafer-scale AI accelerators vs. single-chip GPUs" was published by researchers at UC Riverside. "This review compares wafer-scale AI accelerators and single-chip GPUs, examining performance, energy efficiency, and cost in high-performance AI applications. It highlights enabling technologies like TSMC’s chip-on-wafe... » read more

Arithmetic Intensity In Decoding: A Hardware-Efficient Perspective (Princeton University)


A new technical paper titled "Hardware-Efficient Attention for Fast Decoding" was published by researchers at Princeton University. Abstract "LLM decoding is bottlenecked for large batches and long contexts by loading the key-value (KV) cache from high-bandwidth memory, which inflates per-token latency, while the sequential nature of decoding limits parallelism. We analyze the interplay amo... » read more

Embarrassingly Parallel Problems: Definitions, Challenges And Solutions


One of the reasons GPUs are regularly discussed in the same breath as AI is that AI shares the same fundamental class of problems as 3D graphics. They are both embarrassingly parallel. Embarrassingly parallel problems refer to computational tasks that:

- Exhibit independence: Subtasks do not rely on intermediate results from other tasks.
- Require minimal interaction: Parallel task... » read more
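As a minimal illustration of the definition above (a generic sketch, not code from the article; the `shade_pixel` helper and its workload are hypothetical stand-ins), each subtask below depends only on its own input and the workers never exchange intermediate results:

```python
# Minimal embarrassingly parallel example: every item is computed
# independently, so the work can be split across processes with no
# communication between subtasks. Illustrative sketch only.
from multiprocessing import Pool

def shade_pixel(index):
    # Stand-in for an independent subtask (one pixel, one ray, one inference)
    x = (index * 2654435761) % 2**32      # cheap hash as fake work
    return (index, x / 2**32)

if __name__ == "__main__":
    with Pool() as pool:
        # map() distributes independent subtasks; no task waits on another's
        # intermediate result, and outputs are only gathered at the end
        results = pool.map(shade_pixel, range(100_000))
    print(len(results), results[:3])
```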

TCAD For GPUs And GPUs For TCAD


It is well known that many steps in chip development become exponentially harder as feature sizes shrink and instance counts balloon. Billions of transistors are now commonplace, and wafer-scale devices with trillions are on the horizon. Such massive chips put pressure on every electronic design automation (EDA) tool in the development flow, from front-end architectural modeling to signoff and ... » read more

Memory Wall Problem Grows With LLMs


The growing imbalance between the amount of data that needs to be processed to train large language models (LLMs) and the inability to move that data back and forth fast enough between memories and processors has set off a massive global search for a better and more energy- and cost-efficient solution. Much of this is evident in the numbers. The GPU market is forecast to reach $190 billion in ... » read more
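To make that imbalance concrete, a rough illustrative calculation (the model size and bandwidth figures are assumptions, not numbers from the article) of the time needed just to stream a large model's weights from high-bandwidth memory for each generated token:

```python
# Rough illustration of the memory wall for LLM inference: the time just to
# read the weights from HBM once per generated token. Numbers are assumed
# for illustration, not taken from the article.

params        = 70e9        # hypothetical 70B-parameter model
bytes_per_w   = 2           # fp16 weights
hbm_bandwidth = 3.35e12     # ~3.35 TB/s, roughly an H100-class GPU

weight_bytes = params * bytes_per_w
t_stream = weight_bytes / hbm_bandwidth   # seconds per token if bandwidth-bound
print(f"{t_stream*1e3:.1f} ms/token just to read weights "
      f"(~{1/t_stream:.0f} tokens/s upper bound at batch size 1)")
```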

Striking A Balance On Efficiency, Performance, And Cost


Experts at the Table: Semiconductor Engineering sat down to discuss power-related issues such as voltage droop, application-specific processing elements, the impact of physical effects in advanced packaging, and the benefits of backside power delivery, with Hans Yeager, senior principal engineer, architecture, at Tenstorrent; Joe Davis, senior director for Calibre interfaces and EM/IR product m... » read more

Managing kW Power Budgets


Experts at the Table: Semiconductor Engineering sat down to discuss increasing power demands and how to address them, with Hans Yeager, senior principal engineer, architecture, at Tenstorrent; Joe Davis, senior director for Calibre interfaces and EM/IR product management at Siemens EDA; Mo Faisal, CEO of Movellus; Trey Roessig, CTO and senior vice president of engineering at Empower Semiconductor.... » read more

Opportunities Grow For GPU Acceleration


Experts at the Table: Semiconductor Engineering sat down to discuss the impact of GPU acceleration on mask design and production, as well as other process technologies, with Aki Fujimura, CEO of D2S; Youping Zhang, head of ASML Brion; Yalin Xiong, senior vice president and general manager of the BBP and reticle products division at KLA; and Kostas Adam, vice president of engineering at Synopsys. W... » read more

A HW-Aware Scalable Exact-Attention Execution Mechanism For GPUs (Microsoft)


A technical paper titled “Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers” was published by researchers at Microsoft. Abstract: "Transformer-based models have emerged as one of the most widely used architectures for natural language processing, natural language generation, and image generation. The size of the state-of-the-art models has in... » read more
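For context on what the decode phase computes, here is a minimal single-query attention step over a cached K/V, written as a generic reference sketch with assumed shapes; it is not the paper's Lean Attention algorithm:

```python
# One decode step of standard attention for a single new token: the query
# attends over the full cached K/V. Generic reference sketch with assumed
# shapes; not the paper's Lean Attention implementation.
import numpy as np

def decode_attention_step(q, k_cache, v_cache):
    # q: (n_heads, head_dim); k_cache, v_cache: (n_heads, seq_len, head_dim)
    d = q.shape[-1]
    scores = np.einsum("hd,hsd->hs", q, k_cache) / np.sqrt(d)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return np.einsum("hs,hsd->hd", weights, v_cache)   # (n_heads, head_dim)

# Example with assumed sizes: 8 heads, 4k cached tokens, head_dim 128
rng = np.random.default_rng(0)
out = decode_attention_step(rng.standard_normal((8, 128)),
                            rng.standard_normal((8, 4096, 128)),
                            rng.standard_normal((8, 4096, 128)))
print(out.shape)  # (8, 128)
```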
