Challenges Of Edge AI Inference


Bringing convolutional neural networks (CNNs) to your industry—whether it be medical imaging, robotics, or some other vision application entirely—has the potential to enable new functionalities and reduce the compute requirements for existing workloads. This is because a single CNN can replace more computationally expensive image processing, denoising, and object detection algorithms. Howev... » read more

Integrating Embedded FPGA Made Easy


Chip designers have been integrating hard and soft IPs for decades – some being easy to integrate and others much more difficult. But what about eFPGA? It’s relatively new on the IP landscape, and according to data from Gartner, the market for semiconductors with eFPGA is expected to approach $10B in 2023, growing at greater than 50% compounded annually. So, this raises the question ... » read more

Why Reconfigurability Is Essential For AI Edge Inference Throughput


For a neural network to run at its fastest, the underlying hardware must run efficiently on all layers. Through the inference of any CNN—whether it be based on an architecture such as YOLO, ResNet, or Inception—the workload regularly shifts from being bottlenecked by memory to being bottlenecked by compute resources. You can think of each convolutional layer as its own mini-workload, and so... » read more
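The shift between memory-bound and compute-bound layers can be made concrete with a back-of-the-envelope arithmetic-intensity calculation (MACs per byte moved). This is a hypothetical sketch with assumed layer shapes and int8 data, ignoring stride and padding; it is not a model of any particular accelerator:

```python
# Hypothetical sketch: arithmetic intensity (MACs per byte moved) varies
# widely between convolutional layers, so a given layer can be
# compute-bound or memory-bound depending on its shape.
def conv_stats(h, w, cin, cout, k, bytes_per_elem=1):
    """Return (MACs, MACs-per-byte) for a k x k conv at h x w resolution."""
    macs = h * w * cin * cout * k * k            # multiply-accumulates
    weight_bytes = cin * cout * k * k * bytes_per_elem
    act_bytes = (h * w * cin + h * w * cout) * bytes_per_elem
    return macs, macs / (weight_bytes + act_bytes)

# Early layer: large activations, few channels (activation-dominated traffic)
early_macs, early_ai = conv_stats(224, 224, 3, 64, 7)
# Late layer: small activations, many channels (weight-dominated traffic)
late_macs, late_ai = conv_stats(7, 7, 512, 512, 3)
# The two layers differ by roughly 3x in MACs per byte, so the hardware
# bottleneck shifts even within one network.
```

With assumed shapes like these, the early layer lands around 140 MACs/byte and the late layer around 48, which is why a fixed hardware configuration rarely runs every layer efficiently.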

Maximizing Edge AI Performance


Inference of convolutional neural network models is algorithmically straightforward, but to get the fastest performance for your application there are a few pitfalls to keep in mind when deploying. A number of factors make efficient inference difficult; we will first step through them before diving into specific solutions for each. By the end of this article, you will be arm... » read more

The Best AI Edge Inference Benchmark


When evaluating the performance of an AI accelerator, there’s a range of methodologies available to you. In this article, we’ll discuss some of the different ways to structure your benchmark research before moving forward with an evaluation that directly runs your own model. Just like when buying a car, research will only get you so far before you need to get behind the wheel and give your ... » read more

Edge Inference Applications And Market Segmentation


Until recently, most AI was in data centers/cloud, and most of that was training. Things are changing quickly. Projections suggest AI sales will grow rapidly to tens of billions of dollars by the mid-2020s, with most of the growth in edge AI inference. Data center/cloud vs. edge inference: What’s the difference? The data center/cloud is where inference started on Xeons. To gain efficiency, much ... » read more

ResNet-50 Does Not Predict Inference Throughput For MegaPixel Neural Network Models


Customers are considering applications for AI inference and want to evaluate multiple inference accelerators. As we discussed last month, TOPS do NOT correlate with inference throughput and you should use real neural network models to benchmark accelerators. So is ResNet-50 a good benchmark for evaluating relative performance of inference accelerators? If your application is going to p... » read more
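One reason a 224x224 ResNet-50 benchmark transfers poorly to megapixel workloads is that activation footprints grow with input resolution, which changes whether intermediate tensors fit in on-chip memory. A minimal sketch, using assumed resolutions and int8 activations (the shapes are illustrative, not taken from any specific model):

```python
# Hypothetical sketch: activation memory for one 64-channel feature map
# at ResNet-50's benchmark resolution vs a megapixel camera frame.
def activation_mb(h, w, channels, bytes_per_elem=1):
    """Size of one activation tensor in megabytes (int8 assumed)."""
    return h * w * channels * bytes_per_elem / 1e6

resnet50_map = activation_mb(224, 224, 64)      # ~3.2 MB
megapixel_map = activation_mb(1080, 1920, 64)   # ~132.7 MB
# A ~40x larger activation may spill out of on-chip SRAM into DRAM,
# so the accelerator that won at 224x224 can lose at megapixel sizes.
```

The point is not the exact numbers but the scaling: a benchmark run at small resolution exercises a different memory regime than the deployment workload.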

One More Time: TOPS Do Not Predict Inference Throughput


Many times you’ll hear vendors talking about how many TOPS their chip has, implying that more TOPS means better inference performance. If you use TOPS to pick your AI inference chip, you will likely not be happy with what you get. Recently, Vivienne Sze, a professor at MIT, gave an excellent talk entitled “How to Evaluate Efficient Deep Neural Network Approaches.” Slides are also av... » read more
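The gap between paper TOPS and delivered throughput comes down to utilization: how much of the peak compute the chip actually sustains on your model. A hypothetical sketch with assumed numbers (the chips, utilization figures, and per-image workload are invented for illustration):

```python
# Hypothetical sketch: why peak TOPS alone doesn't predict throughput.
# Delivered images/sec depends on how well each layer keeps the MACs busy,
# which varies with layer shapes, memory bandwidth, and scheduling.
def images_per_sec(peak_tops, model_gops_per_image, avg_utilization):
    """Throughput from peak compute, per-image work, and real utilization."""
    effective_ops_per_sec = peak_tops * 1e12 * avg_utilization
    return effective_ops_per_sec / (model_gops_per_image * 1e9)

# Chip A: big paper TOPS number, poor utilization on this model (assumed)
chip_a = images_per_sec(peak_tops=100, model_gops_per_image=8,
                        avg_utilization=0.15)   # 1875 img/s
# Chip B: 2.5x fewer peak TOPS, but high sustained utilization (assumed)
chip_b = images_per_sec(peak_tops=40, model_gops_per_image=8,
                        avg_utilization=0.60)   # 3000 img/s
# Chip B wins despite having far fewer TOPS on the datasheet.
```

This is why the article's advice is to benchmark with your own model rather than compare datasheet TOPS.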

Apples, Oranges & The Optimal AI Inference Accelerator


There are a wide range of AI inference accelerators available and a wide range of applications for them. No AI inference accelerator will be optimal for every application. For example, a data center class accelerator almost certainly will be too big, burn too much power, and cost too much for most edge applications. And an accelerator optimal for keyword recognition won’t have the capabil... » read more

Performance Metrics For Convolutional Neural Network Accelerators


Across the industry, there are few benchmarks that customers and potential end users can employ to evaluate an inference acceleration solution end-to-end. Early on in this space, the performance of an accelerator was measured as a single number: TOPS. However, the limitations of using a single number have been covered in detail in previous blogs. Nevertheless, if the method of cal... » read more
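Moving beyond a single TOPS figure usually means reporting several metrics side by side. A minimal sketch of the kind of scorecard an end-to-end evaluation might produce; the metric names and all numbers here are assumptions for illustration, not measurements of any real accelerator:

```python
# Hypothetical sketch: a multi-metric scorecard instead of a single TOPS
# number. Latency, throughput, and energy efficiency are measured on the
# actual deployment model, not a synthetic peak-ops figure.
def scorecard(latency_ms_batch1, imgs_per_sec, watts):
    """Bundle end-to-end metrics for one accelerator on one model."""
    return {
        "batch-1 latency (ms)": latency_ms_batch1,
        "throughput (img/s)": imgs_per_sec,
        "efficiency (img/J)": imgs_per_sec / watts,
    }

# Assumed measurements for one accelerator running one CNN:
accel = scorecard(latency_ms_batch1=4.2, imgs_per_sec=950, watts=12)
# A latency-sensitive robot and a throughput-driven video pipeline would
# weight these three numbers very differently when picking hardware.
```

Reporting all three lets a latency-bound and a throughput-bound application each judge the same hardware on the axis that matters to it.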
