GDDR6 Memory Enables High-Performance AI/ML Inference

By Frank Ferro - 10 Nov, 2022 - Comments: 0

A rapid rise in the size and sophistication of inference models has necessitated increasingly powerful hardware deployed at the network edge and in endpoint devices. Keeping these inference processors and accelerators fed with data requires state-of-the-art memory that delivers extremely high bandwidth. This blog will explore how GDDR6 supports the memory and performance requirements of artif...

Getting Better Edge Performance & Efficiency From Acceleration-Aware ML Model Design

By Vlad Bronstein - 04 Nov, 2021 - Comments: 0

The advent of machine learning techniques has benefited greatly from the use of acceleration technology such as GPUs, TPUs and FPGAs. Indeed, without the use of acceleration technology, it’s likely that machine learning would have remained in the province of academia and not had the impact that it is having in our world today. Clearly, machine learning has become an important tool for solving...

Tradeoffs Between Edge Vs. Cloud

By Brian Bailey - 09 Sep, 2021 - Comments: 0

Increasing amounts of processing are being done on the edge, but how the balance will change between what's computed in the cloud versus the edge remains unclear. The answer may depend as much on the value of data and other commercial reasons as on technical limitations. The pendulum has been swinging from doing all processing in the cloud to doing increasing amounts of processing at the ...

GDDR6 Memory On The Leading Edge

By Frank Ferro - 12 Aug, 2021 - Comments: 0

With the accelerating growth in data traffic, it is unsurprising that the number of hyperscale data centers keeps rocketing skyward. According to analysts at the Synergy Research Group, in nine months (Q2’20 to Q1’21), 84 new hyperscale data centers came online, bringing the total worldwide to 625. Hyperscaler capex set a record $150B over the last four quarters, eclipsing the $121B spent in ...

RaPiD: AI Accelerator for Ultra-low Precision Training and Inference

By Technical Paper Link - 02 Jul, 2021 - Comments: 0

Abstract: "The growing prevalence and computational demands of Artificial Intelligence (AI) workloads has led to widespread use of hardware accelerators in their execution. Scaling the performance of AI accelerators across generations is pivotal to their success in commercial deployments. The intrinsic error-resilient nature of AI workloads present a unique opportunity for performance/energy i...

Challenges Of Edge AI Inference

By Vinay Mehta - 01 Jul, 2021 - Comments: 0

Bringing convolutional neural networks (CNNs) to your industry, whether it be medical imaging, robotics, or some other vision application entirely, has the potential to enable new functionalities and reduce the compute requirements for existing workloads. This is because a single CNN can replace more computationally expensive image processing, denoising, and object detection algorithms. Howev...

Challenges In Developing A New Inferencing Chip

By Ed Sperling - 01 Jul, 2021 - Comments: 0

Cheng Wang, co-founder and senior vice president of software and engineering at Flex Logix, sat down with Semiconductor Engineering to explain the process of bringing an inferencing accelerator chip to market, from bring-up, programming and partitioning to tradeoffs involving speed and customization. SE: Edge inferencing chips are just starting to come to market. What challenges di...

Architectural Considerations For AI

By Brian Bailey - 24 Jun, 2021 - Comments: 2

Custom chips, labeled as artificial intelligence (AI) or machine learning (ML), are appearing on a weekly basis, each claiming to be 10X faster than existing devices or to consume 1/10 the power. Whether that is enough to dethrone existing architectures, such as GPUs and FPGAs, or whether they will survive alongside those architectures isn't clear yet. The problem, or the opportunity, is that t...

Why Reconfigurability Is Essential For AI Edge Inference Throughput

By Vinay Mehta - 06 May, 2021 - Comments: 0

For a neural network to run at its fastest, the underlying hardware must run efficiently on all layers. During inference of any CNN, whether it is based on an architecture such as YOLO, ResNet, or Inception, the workload regularly shifts from being bottlenecked by memory to being bottlenecked by compute resources. You can think of each convolutional layer as its own mini-workload, and so...

Applications, Challenges For Using AI In Fabs

By Mark LaPedus - 14 Apr, 2021 - Comments: 0

Experts at the Table: Semiconductor Engineering sat down to discuss chip scaling, transistors, new architectures, and packaging with Jerry Chen, head of global business development for manufacturing & industrials at Nvidia; David Fried, vice president of computational products at Lam Research; Mark Shirey, vice president of marketing and applications at KLA; and Aki Fujimura, CEO of D2S. Wh...


tag: inference
