RPU: A Chiplet-Based Architecture To Address The Challenges of the Modern Memory Wall (Harvard University)

By Technical Paper Link - 04 Mar, 2026 - Comments: 0

Researchers from Harvard University have released “RPU -- A Reasoning Processing Unit”. Abstract “Large language model (LLM) inference performance is increasingly bottlenecked by the memory wall. While GPUs continue to scale raw compute throughput, they struggle to deliver scalable performance for memory bandwidth bound workloads. This challenge is amplified by emerging reasonin... » read more

Knowledge Centers
Entities, people and technologies explored

Advanced Packaging Limits Come Into Focus

Mechanical and process control limits are now shaping what can be manufactured at scale.

by Gregory Haley

Startup Funding: Q1 2026

Massive rounds for AI, EDA, and manufacturing; 80 startups raise $8.4B.

by Jesse Allen

All AI Data Center Interconnects Will Be Optical Within 5 Years

InP and SiPho join CMOS as critical technologies. Lasers, CPO and OCS will be everywhere (indium phosphide, silicon photonics, co-packaged optics, optical circuit switch).

by Geoff Tate

Making Hybrid Bonding Better

Why this technology is so essential for multi-die assemblies, and how it can be improved.

by Laura Peters

When Semiconductor Materials Misbehave

The gap between lab performance and fab reality is growing wider as packages grow more complex.

by Gregory Haley

The Sub-2nm Paradox

Reducing variation in manufacturing, monitoring behavior over time, and targeting specific workloads can have a big impact on power, performance, and area/cost.

by Ed Sperling

TSMC Tech Symposium 2026, By The Numbers

Foundry rolls out aggressive new roadmap, focusing on area, power, and latency.

by Barry Pangrle

CPO Is Extending The Limits Of What’s Possible In AI Data Centers

Co-packaged optics technology will have a big impact on system power and the cost of data movement.

by Ann Mutschler

tag: AI hardware acceleration

RPU: A Chiplet-Based Architecture To Address The Challenges of the Modern Memory Wall (Harvard University)

Trending Articles

The Sub-2nm Paradox

Chip Industry Week In Review

Chip Industry Week In Review

Toward Agentic Verification

Swapping Out Chiplets: I/Os Vs. Compute

Knowledge Centers
Entities, people and technologies explored

Related Articles

Advanced Packaging Limits Come Into Focus

Startup Funding: Q1 2026

All AI Data Center Interconnects Will Be Optical Within 5 Years

Making Hybrid Bonding Better

When Semiconductor Materials Misbehave

The Sub-2nm Paradox

TSMC Tech Symposium 2026, By The Numbers

CPO Is Extending The Limits Of What’s Possible In AI Data Centers

Sponsors

Recent Comments

About

Navigation

Connect With Us

tag: AI hardware acceleration

RPU: A Chiplet-Based Architecture To Address The Challenges of the Modern Memory Wall (Harvard University)

Trending Articles

The Sub-2nm Paradox

Chip Industry Week In Review

Chip Industry Week In Review

Toward Agentic Verification

Swapping Out Chiplets: I/Os Vs. Compute

Knowledge Centers Entities, people and technologies explored

Related Articles

Advanced Packaging Limits Come Into Focus

Startup Funding: Q1 2026

All AI Data Center Interconnects Will Be Optical Within 5 Years

Making Hybrid Bonding Better

When Semiconductor Materials Misbehave

The Sub-2nm Paradox

TSMC Tech Symposium 2026, By The Numbers

CPO Is Extending The Limits Of What’s Possible In AI Data Centers

Sponsors

Newsletter Signup

Popular Tags

Recent Comments

About

Navigation

Connect With Us

Knowledge Centers
Entities, people and technologies explored