GPU Or ASIC For LLM Scale-Up?

By Geoff Tate - 17 Mar, 2025 - Comments: 1

The CEOs of OpenAI, Anthropic, and xAI share a strikingly similar vision — AI's progress is exponential, it will change humanity, and its impact will be greater than most people expect. This is more than just speculation. The market for AI, and its value, are real today: A human developer with GitHub CoPilot codes 55% faster with AI. GPT-4 scores 88th percentile on the LSAT vs. 50t... » read more

RISC-V High Performance Multicore and GPU SoC Platform For Safety Critical System

By Technical Paper Link - 03 Mar, 2025 - Comments: 0

A new technical paper titled "A RISC-V Multicore and GPU SoC Platform with a Qualifiable Software Stack for Safety Critical Systems" published by researchers at Universitat Politecnica de Catalunya and Barcelona Supercomputing Center. Abstract "In the context of the Horizon Europe project, METASAT, a hardware platform was developed as a prototype of future space systems. The platform is bas... » read more

Speeding Up Computational Lithography With The Power And Parallelism Of GPUs

By Thuc Dam - 20 Feb, 2025 - Comments: 0

There are so many challenges in producing modern semiconductor devices that it’s amazing for the industry to pull it off at all. From the underlying physics to fabrication processes to the development flow, there is no shortage of tough issues to address. Some of the biggest arise in lithography for deep submicron chips. A recent post outlined the major trends in lithography and summarized a ... » read more

HW-Aligned Sparse Attention Architecture For Efficient Long-Context Modeling (DeepSeek et al.)

By Technical Paper Link - 18 Feb, 2025 - Comments: 0

A new technical paper titled "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention" was published by DeepSeek, Peking University and University of Washington. Abstract "Long-context modeling is crucial for next-generation language models, yet the high computational cost of standard attention mechanisms poses significant computational challenges. Sparse attention... » read more

Uncore Frequency Scaling For Energy Optimization In Heterogeneous Systems (UIC, Argonne)

By Technical Paper Link - 12 Feb, 2025 - Comments: 0

A new technical paper titled "Exploring Uncore Frequency Scaling for Heterogeneous Computing" was published by researchers at University of Illinois Chicago and Argonne National Laboratory. Abstract "High-performance computing (HPC) systems are essential for scientific discovery and engineering innovation. However, their growing power demands pose significant challenges, particularly as sys... » read more

Transforming Industrial IoT With Edge AI And AR

By Eleanor Brash - 06 Feb, 2025 - Comments: 0

The Internet of Things (IoT) has evolved significantly from its early days of centralized cloud processing. Initially, IoT applications relied heavily on cloud-based data processing, where data from various devices was collected, processed, and analyzed in the cloud before insights were sent back to the devices. While effective, this approach has limitations, particularly in environments requir... » read more

The Use Of GPU Compute In Automotive

By Eleanor Brash - 09 Jan, 2025 - Comments: 0

The pace of innovation in automotive is accelerating. Electrification, advanced driver assistance systems (ADAS) and vehicle connectivity are revolutionizing the in-car experience, which is now largely determined by the capabilities of the car’s software and electronic hardware. When a vehicle can receive software upgrades while it is on the road, the electronic control units (ECUs) that a... » read more

GPUs: Bandit Based Framework To Dynamically Reduce Energy Consumption

By Technical Paper Link - 17 Oct, 2024 - Comments: 0

A new technical paper titled "Online Energy Optimization in GPUs: A Multi-Armed Bandit Approach" was published by researchers at Illinois Institute of Technology, Argonne National Lab and Emory University. Abstract "Energy consumption has become a critical design metric and a limiting factor in the development of future computing architectures, from small wearable devices to large-scale lea... » read more

New AI Processors Architectures Balance Speed With Efficiency

By Ed Sperling - 04 Sep, 2024 - Comments: 1

Leading AI systems designs are migrating away from building the fastest AI processor possible, adopting a more balanced approach that involves highly specialized, heterogeneous compute elements, faster data movement, and significantly lower power. Part of this shift revolves around the adoption of chiplets in 2.5D/3.5D packages, which enable greater customization for different workloads and ... » read more

GPU Microarchitecture Integrating Dedicated Matrix Units At The Cluster Level (UC Berkeley)

By Technical Paper Link - 27 Aug, 2024 - Comments: 0

A new technical paper titled "Virgo: Cluster-level Matrix Unit Integration in GPUs for Scalability and Energy Efficiency" was published by UC Berkeley. Abstract "Modern GPUs incorporate specialized matrix units such as Tensor Cores to accelerate GEMM operations central to deep learning workloads. However, existing matrix unit designs are tightly coupled to the SIMT core, limiting the size a... » read more

← Older posts Newer posts →

Knowledge Centers
Entities, people and technologies explored

Startup Funding: Q1 2025

AI chips and data center communications see big funding; 75 startups raise $2 billion.

by Jesse Allen

Advanced Packaging Fundamentals for Semiconductor Engineers

New SE eBook examines the next phase of semiconductor design, testing, and manufacturing.

by Bryon Moyer

Chip Industry Week in Review

AI export rule to be scrapped; SEMI, EU request; Cadence, Nvidia supercomputer; AI co-processor; Imagination's new GPU; semi sales up; imec, TNO photonics lab; NSF key to national security; flexible packaging control system; SiConic test engineering; USB 4 support; SiC JFETS; magnetic behavior in hematite.

by The SE Staff

tag: GPU

GPU Or ASIC For LLM Scale-Up?

RISC-V High Performance Multicore and GPU SoC Platform For Safety Critical System

Speeding Up Computational Lithography With The Power And Parallelism Of GPUs

HW-Aligned Sparse Attention Architecture For Efficient Long-Context Modeling (DeepSeek et al.)

Uncore Frequency Scaling For Energy Optimization In Heterogeneous Systems (UIC, Argonne)

Transforming Industrial IoT With Edge AI And AR

The Use Of GPU Compute In Automotive

GPUs: Bandit Based Framework To Dynamically Reduce Energy Consumption

New AI Processors Architectures Balance Speed With Efficiency

GPU Microarchitecture Integrating Dedicated Matrix Units At The Cluster Level (UC Berkeley)

Trending Articles

RISC-V’s Increasing Influence

Chip Industry Week in Review

Co-Packaged Optics Reaches Power Efficiency Tipping Point

Chip Industry Week in Review

TSMC: King Of Data Center AI

Knowledge Centers
Entities, people and technologies explored

Related Articles

Startup Funding: Q1 2025

Advanced Packaging Fundamentals for Semiconductor Engineers

Chip Industry Week in Review

Chip Industry Week in Review

RISC-V’s Increasing Influence

Chip Industry Week in Review

What Exactly Are Chiplets And Heterogeneous Integration?

Big Changes Ahead For Interposers And Substrates

Sponsors

Recent Comments

About

Navigation

Connect With Us

tag: GPU

GPU Or ASIC For LLM Scale-Up?

RISC-V High Performance Multicore and GPU SoC Platform For Safety Critical System

Speeding Up Computational Lithography With The Power And Parallelism Of GPUs

HW-Aligned Sparse Attention Architecture For Efficient Long-Context Modeling (DeepSeek et al.)

Uncore Frequency Scaling For Energy Optimization In Heterogeneous Systems (UIC, Argonne)

Transforming Industrial IoT With Edge AI And AR

The Use Of GPU Compute In Automotive

GPUs: Bandit Based Framework To Dynamically Reduce Energy Consumption

New AI Processors Architectures Balance Speed With Efficiency

GPU Microarchitecture Integrating Dedicated Matrix Units At The Cluster Level (UC Berkeley)

Trending Articles

RISC-V’s Increasing Influence

Chip Industry Week in Review

Co-Packaged Optics Reaches Power Efficiency Tipping Point

Chip Industry Week in Review

TSMC: King Of Data Center AI

Knowledge Centers Entities, people and technologies explored

Related Articles

Startup Funding: Q1 2025

Advanced Packaging Fundamentals for Semiconductor Engineers

Chip Industry Week in Review

Chip Industry Week in Review

RISC-V’s Increasing Influence

Chip Industry Week in Review

What Exactly Are Chiplets And Heterogeneous Integration?

Big Changes Ahead For Interposers And Substrates

Sponsors

Newsletter Signup

Popular Tags

Recent Comments

About

Navigation

Connect With Us

Knowledge Centers
Entities, people and technologies explored