Inference Framework For Deployment Challenges of Large Generative Models On GPUs (Google)

By Technical Paper Link - 03 May, 2025 - Comments: 0

A new technical paper titled "Scaling On-Device GPU Inference for Large Generative Models" was published by researchers at Google and Meta Platforms. Abstract "Driven by the advancements in generative AI, large machine learning models have revolutionized domains such as image processing, audio synthesis, and speech recognition. While server-based deployments remain the locus of peak perform... » read more

Bandwidth Utilization Side-Channel On ML Inference Accelerators

By Arm - 10 Nov, 2021 - Comments: 0

Abstract—Accelerators used for machine learning (ML) inference provide great performance benefits over CPUs. Securing confidential model in inference against off-chip side-channel attacks is critical in harnessing the performance advantage in practice. Data and memory address encryption has been recently proposed to defend against off-chip attacks. In this paper, we demonstrate that bandwidth... » read more

Hardware Security For AI Accelerators

By Rambus - 03 Jun, 2020 - Comments: 0

Dedicated accelerator hardware for artificial intelligence and machine learning (AI/ML) algorithms are increasingly prevalent in data centers and endpoint devices. These accelerators handle valuable data and models, and face a growing threat landscape putting AI/ML assets at risk. Using fundamental cryptographic security techniques performed by a hardware root of trust can safeguard these as... » read more

AI Chip Architectures Race To The Edge

By Kevin Fogarty - 28 Nov, 2018 - Comments: 0

As machine-learning apps start showing up in endpoint devices and along the network edge of the IoT, the accelerators that make AI possible may look more like FPGA and SoC modules than current data-center-bound chips from Intel or Nvidia. Artificial intelligence and machine learning need powerful chips for computing answers (inference) from large data sets (training). Most AI chips—both tr... » read more

Knowledge Centers
Entities, people and technologies explored

EUV’s Future Looks Even Brighter

Demand for AI chips is growing exponentially, but costs and complexity limit the technology to a handful of companies. That could soon change.

by Gregory Haley

Speeding Up Computational Lithography With The Power And Parallelism Of GPUs

A new lithography library brings mask optimization operations to GPUs.

by Thuc Dam

Startup Funding: Q1 2025

AI chips and data center communications see big funding; 75 startups raise $2 billion.

by Jesse Allen

Advanced Packaging Fundamentals for Semiconductor Engineers

New SE eBook examines the next phase of semiconductor design, testing, and manufacturing.

by Bryon Moyer

Chip Industry Week in Review

AI export rule to be scrapped; SEMI, EU request; Cadence, Nvidia supercomputer; AI co-processor; Imagination's new GPU; semi sales up; imec, TNO photonics lab; NSF key to national security; flexible packaging control system; SiConic test engineering; USB 4 support; SiC JFETS; magnetic behavior in hematite.

by The SE Staff

tag: ML accelerators

Inference Framework For Deployment Challenges of Large Generative Models On GPUs (Google)

Bandwidth Utilization Side-Channel On ML Inference Accelerators

Hardware Security For AI Accelerators

AI Chip Architectures Race To The Edge

Trending Articles

Chip Industry Week in Review

Chip Industry Week in Review

Co-Packaged Optics Reaches Power Efficiency Tipping Point

RISC-V’s Increasing Influence

Chip Industry Week in Review

Knowledge Centers
Entities, people and technologies explored

Related Articles

EUV’s Future Looks Even Brighter

Speeding Up Computational Lithography With The Power And Parallelism Of GPUs

Startup Funding: Q1 2025

Advanced Packaging Fundamentals for Semiconductor Engineers

Chip Industry Week in Review

Linear Pluggable Optics Save Energy In Data Centers

Chip Industry Week in Review

Interconnects Approach Tipping Point

Sponsors

Recent Comments

About

Navigation

Connect With Us

tag: ML accelerators

Inference Framework For Deployment Challenges of Large Generative Models On GPUs (Google)

Bandwidth Utilization Side-Channel On ML Inference Accelerators

Hardware Security For AI Accelerators

AI Chip Architectures Race To The Edge

Trending Articles

Chip Industry Week in Review

Chip Industry Week in Review

Co-Packaged Optics Reaches Power Efficiency Tipping Point

RISC-V’s Increasing Influence

Chip Industry Week in Review

Knowledge Centers Entities, people and technologies explored

Related Articles

EUV’s Future Looks Even Brighter

Speeding Up Computational Lithography With The Power And Parallelism Of GPUs

Startup Funding: Q1 2025

Advanced Packaging Fundamentals for Semiconductor Engineers

Chip Industry Week in Review

Linear Pluggable Optics Save Energy In Data Centers

Chip Industry Week in Review

Interconnects Approach Tipping Point

Sponsors

Newsletter Signup

Popular Tags

Recent Comments

About

Navigation

Connect With Us

Knowledge Centers
Entities, people and technologies explored