Moving AI Workloads To The Edge


Experts At The Table: Semiconductor Engineering gathered a group of experts to discuss how some AI workloads are better suited for on-device processing to achieve consistent performance, avoid network connectivity issues, reduce cloud computing costs, and ensure privacy. The panel included Frank Ferro, group director in the Silicon Solutions Group at Cadence; Eduardo Montanez, vice president an... » read more

Ebook: The Impact of AI On Data Center Design


AI is reshaping the data center industry. Rising power demands, advanced cooling needs, and digital twin technology are redefining how facilities are designed and operated. Download our free ebook on AI-optimized data centers to learn: How AI workloads are driving massive increases in power and cooling requirements Why liquid cooling is becoming essential for AI infrastructure ... » read more

Enabling The 448G Era: System Architecture And Standards For Next-Gen AI Networks


As Artificial Intelligence (AI) and Machine Learning (ML) workloads continue to reshape data center infrastructure, the need for higher bandwidth and lower latency has accelerated the need for a next-generation Ethernet. This white paper examines the industry’s shift toward 448G signaling—driven by scale-up and scale-out AI cluster demands—and outlines the evolving system architecture... » read more

Speeding Time To Market With A Future-Proof Fabric


This whitepaper covers how Tenstorrent is elevating their AI fabric to new heights of performance, efficiency, and productivity through a collaboration with Baya Systems. Tenstorrent’s in-house fabric has set a new standard for efficiency and performance in AI compute in their current generation products and is proactively addressing the needs of the next generation. By combining Tenstorrent�... » read more

Balancing Workloads In AI Processor Designs


A growing number of AI processors are being designed around specific workloads rather than standardized benchmarks, optimizing performance and power efficiency, but often with enough flexibility to adapt to future changes. While the fundamentals of matrix multiplication and software optimization still apply, those alone are no longer sufficient. Designs need to address specific data types, w... » read more

The Criticality of Performance per Watt Optimization for AI Chip Development


Chip developers are seeing an urgent rise in demand for compute processing capability driven by AI workloads. This increase in compute requirements drives a corresponding increase in the demand for power consumption. For example, a ChatGPT query requires nearly 10 times as much power, on average, as a Google search. Power has traditionally been treated as a secondary constraint, with perform... » read more

Power Stabilization To Allow Continued Scaling Of AI Training Workloads (Microsoft, OpenAI, NVIDIA)


A new technical paper titled "Power Stabilization for AI Training Datacenters" was published by researchers at Microsoft, OpenAI, and NVIDIA. Abstract "Large Artificial Intelligence (AI) training workloads spanning several tens of thousands of GPUs present unique power management challenges. These arise due to the high variability in power consumption during the training. Given the synchron... » read more

Thermally-Aware, Multi-Objective Scheduling Framework for DL Workloads on Heterogeneous Multi-Chiplet PIM Architectures (UW–Madison, Washington State)


A new technical paper titled "THERMOS: Thermally-Aware Multi-Objective Scheduling of AI Workloads on Heterogeneous Multi-Chiplet PIM Architectures" was published by researchers at the University of Wisconsin–Madison and Washington State University. Abstract "Chiplet-based integration enables large-scale systems that combine diverse technologies, enabling higher yield, lower costs, and sca... » read more

Report: The AI Efficiency Boom


Artificial Intelligence (AI) is undergoing a fundamental transformation. While early AI models were large, compute-heavy, and dependent on cloud processing, a new wave of efficiency-driven innovations is moving AI inference—the generation of model results—to the edge. Smaller models, improved memory and compute performance, and the need for privacy, low latency, and energy efficiency are dr... » read more

Scaling GenAI Training And Inference Chips With Runtime Monitoring


GenAI’s rapid growth is pushing the limits of semiconductor technology, demanding breakthroughs in performance, power efficiency, and reliability. Training and inference workloads for models like GPT-4 and GPT-5 require massive computational resources, leading to skyrocketing costs, energy consumption, and hardware failures. Traditional optimization methods, such as static guard bands and per... » read more

← Older posts Newer posts →