Moving AI Workloads To The Edge

By Ann Mutschler - 06 Nov, 2025

Experts At The Table: Semiconductor Engineering gathered a group of experts to discuss how some AI workloads are better suited for on-device processing to achieve consistent performance, avoid network connectivity issues, reduce cloud computing costs, and ensure privacy. The panel included Frank Ferro, group director in the Silicon Solutions Group at Cadence; Eduardo Montanez, vice president an... » read more

Ebook: The Impact of AI On Data Center Design

By Cadence - 29 Oct, 2025

AI is reshaping the data center industry. Rising power demands, advanced cooling needs, and digital twin technology are redefining how facilities are designed and operated. Download our free ebook on AI-optimized data centers to learn how AI workloads are driving massive increases in power and cooling requirements, why liquid cooling is becoming essential for AI infrastructure ... » read more

Enabling The 448G Era: System Architecture And Standards For Next-Gen AI Networks

By Synopsys - 15 Oct, 2025

As Artificial Intelligence (AI) and Machine Learning (ML) workloads continue to reshape data center infrastructure, demand for higher bandwidth and lower latency has accelerated the need for next-generation Ethernet. This white paper examines the industry’s shift toward 448G signaling, driven by scale-up and scale-out AI cluster demands, and outlines the evolving system architecture... » read more

Speeding Time To Market With A Future-Proof Fabric

By Baya Systems - 24 Sep, 2025

This whitepaper covers how Tenstorrent is elevating its AI fabric to new heights of performance, efficiency, and productivity through a collaboration with Baya Systems. Tenstorrent’s in-house fabric has set a new standard for efficiency and performance in AI compute in its current-generation products and is proactively addressing the needs of the next generation. By combining Tenstorrent... » read more

Balancing Workloads In AI Processor Designs

By Ann Mutschler - 11 Sep, 2025

A growing number of AI processors are being designed around specific workloads rather than standardized benchmarks, optimizing performance and power efficiency, but often with enough flexibility to adapt to future changes. While the fundamentals of matrix multiplication and software optimization still apply, those alone are no longer sufficient. Designs need to address specific data types, w... » read more

The Criticality of Performance per Watt Optimization for AI Chip Development

By Synopsys - 03 Sep, 2025

Chip developers are seeing an urgent rise in demand for compute processing capability driven by AI workloads. This increase in compute requirements drives a corresponding increase in power consumption. For example, a ChatGPT query requires nearly 10 times as much power, on average, as a Google search. Power has traditionally been treated as a secondary constraint, with perform... » read more

Power Stabilization To Allow Continued Scaling Of AI Training Workloads (Microsoft, OpenAI, NVIDIA)

By Technical Paper Link - 28 Aug, 2025

A new technical paper titled "Power Stabilization for AI Training Datacenters" was published by researchers at Microsoft, OpenAI, and NVIDIA. Abstract: "Large Artificial Intelligence (AI) training workloads spanning several tens of thousands of GPUs present unique power management challenges. These arise due to the high variability in power consumption during the training. Given the synchron... » read more

Thermally-Aware, Multi-Objective Scheduling Framework for DL Workloads on Heterogeneous Multi-Chiplet PIM Architectures (UW–Madison, Washington State)

By Technical Paper Link - 19 Aug, 2025

A new technical paper titled "THERMOS: Thermally-Aware Multi-Objective Scheduling of AI Workloads on Heterogeneous Multi-Chiplet PIM Architectures" was published by researchers at the University of Wisconsin–Madison and Washington State University. Abstract: "Chiplet-based integration enables large-scale systems that combine diverse technologies, enabling higher yield, lower costs, and sca... » read more

Report: The AI Efficiency Boom

By Arm - 16 Jul, 2025

Artificial Intelligence (AI) is undergoing a fundamental transformation. While early AI models were large, compute-heavy, and dependent on cloud processing, a new wave of efficiency-driven innovations is moving AI inference—the generation of model results—to the edge. Smaller models, improved memory and compute performance, and the need for privacy, low latency, and energy efficiency are dr... » read more

Scaling GenAI Training And Inference Chips With Runtime Monitoring

By proteanTecs - 10 Jul, 2025

GenAI’s rapid growth is pushing the limits of semiconductor technology, demanding breakthroughs in performance, power efficiency, and reliability. Training and inference workloads for models like GPT-4 and GPT-5 require massive computational resources, leading to skyrocketing costs, energy consumption, and hardware failures. Traditional optimization methods, such as static guard bands and per... » read more


tag: AI workloads
