eFPGAs Offer Practical Solution For Embedded Vision Applications

Processing data from multiple cameras in real time is a challenge for surround-view systems.

December 6th, 2018 - By: Alok Sanghavi

Video applications, such as surveillance, object detection and motion analysis, rely on 360° embedded vision and high-resolution fish-eye camera lenses with a wide-angle field of view (FOV). These systems process up to six real-time camera streams together, frame by frame. Each frame is corrected for distortion and other image artifacts, adjusted for exposure and white balance, then stitched dynamically into a single 360° panoramic view. The output, at 4K 60 fps, is projected onto a spherical coordinate space.

Writing multiple-camera input data to external memory, reading it back in real time and then processing it as a single frame is a persistent difficulty for surround-view camera systems. The hardware must keep the latency between incoming raw sensor data from the input cameras and the stitched output video within one frame.
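To put the one-frame constraint in numbers, the following back-of-the-envelope sketch computes the time budget and pixel rate of the stitched output. The 3840 × 2160 reading of "4K", the 3 bytes per pixel, and the single write plus single read of external memory are assumptions made for illustration; the article does not specify them.

```python
# Rough budget for the stitched output stream.
# Assumptions (not from the article): "4K" means 3840 x 2160 (UHD),
# pixels are 24-bit RGB, and each frame is written to and read from
# external memory exactly once.
OUT_WIDTH, OUT_HEIGHT, OUT_FPS = 3840, 2160, 60
BYTES_PER_PIXEL = 3
MEMORY_PASSES = 2  # one write plus one read


def frame_budget_ms(fps):
    """Time available to produce one frame, in milliseconds."""
    return 1000.0 / fps


def pixel_rate(width, height, fps):
    """Pixels that must be produced per second."""
    return width * height * fps


def memory_traffic_gb_per_s(width, height, fps, bytes_per_pixel, passes):
    """External-memory traffic in gigabytes (1e9 bytes) per second."""
    return pixel_rate(width, height, fps) * bytes_per_pixel * passes / 1e9


if __name__ == "__main__":
    print(f"frame budget : {frame_budget_ms(OUT_FPS):.2f} ms")
    print(f"pixel rate   : {pixel_rate(OUT_WIDTH, OUT_HEIGHT, OUT_FPS):,} pixels/s")
    traffic = memory_traffic_gb_per_s(
        OUT_WIDTH, OUT_HEIGHT, OUT_FPS, BYTES_PER_PIXEL, MEMORY_PASSES
    )
    print(f"memory traffic: {traffic:.2f} GB/s (output stream only)")
```

These figures cover the output stream alone. The input side, with up to six camera streams, adds to the memory traffic in proportion to the sensor resolution and frame rate, which the article does not state.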

High-performance computing platforms are moving toward the use of FPGAs combined with CPUs to provide specialized hardware acceleration for real-time image processing tasks. In this configuration, CPUs focus on complex algorithms where they quickly switch threads and contexts, while repetitive tasks are relegated to an FPGA to function as a configurable hardware accelerator/coprocessor/offload engine.

Pairing FPGAs and CPUs as discrete devices improves overall system efficiency because the two technologies complement each other. Fish-eye stitching, however, exposes the limits of this arrangement. Because images obtained from fish-eye lenses suffer from distortion, stitching the output of multiple cameras is a compute-intensive, per-pixel task that requires significant real-time image processing and a highly parallelized architecture. This application outstrips the ability of discrete FPGAs to perform this role, primarily because of delays in moving data on and off chip, which degrade the overall latency, throughput and performance of the system.
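The per-pixel nature of distortion correction can be illustrated with a minimal sketch. It assumes the common equidistant fish-eye model, in which a ray at angle θ from the optical axis lands at radius f·θ on the sensor, whereas an undistorted (rectilinear) image places it at f·tan θ. The article does not say which lens model is used, and the function and parameter names here are hypothetical.

```python
import math


def fisheye_source_coord(x_out, y_out, cx, cy, focal_px):
    """Map one pixel of the corrected image back to the fish-eye image.

    Assumes an equidistant fish-eye lens (r_distorted = f * theta) and a
    rectilinear output (r_undistorted = f * tan(theta)). (cx, cy) is the
    optical centre and focal_px the focal length in pixels.
    """
    dx, dy = x_out - cx, y_out - cy
    r_undistorted = math.hypot(dx, dy)
    if r_undistorted == 0.0:
        return float(cx), float(cy)  # the optical centre maps to itself
    theta = math.atan2(r_undistorted, focal_px)  # angle from the optical axis
    r_distorted = focal_px * theta               # equidistant projection
    scale = r_distorted / r_undistorted
    return cx + dx * scale, cy + dy * scale


def build_remap_table(width, height, cx, cy, focal_px):
    """Source coordinate for every output pixel, in row-major order.

    Each entry depends only on its own (x, y), so all entries can be
    computed independently, which is why the task parallelizes well.
    """
    return [
        [fisheye_source_coord(x, y, cx, cy, focal_px) for x in range(width)]
        for y in range(height)
    ]
```

A real pipeline would also interpolate between neighbouring source pixels and handle coordinates that fall outside the sensor; both are omitted here. The point of the sketch is that every output pixel requires its own trigonometric mapping and memory fetch, repeated for every camera and every frame.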

In a recent development, embedded vision designers are looking to eFPGAs as a practical solution. An eFPGA IP can be embedded along with a CPU in an SoC because its fabric offers unique advantages, including higher performance when compared to a standalone FPGA plus CPU solution.

Low latency is important for complex real-time image processing, for example when correcting fish-eye distortion. An eFPGA connects directly to the rest of the ASIC through a wide parallel interface, with no I/O buffers in between, giving higher throughput and latency counted in single-digit clock cycles.

Another advantage of eFPGAs is that they can be sized to fit the specific application. For instance, Speedcore eFPGA IP users specify their logic, memory and DSP resource needs, and the IP is configured to meet their individual requirements. Look-up tables (LUTs), RAM blocks and DSP64 blocks can be assembled like building blocks to create an optimal programmable fabric for any given application.

Additionally, users can define their own custom functions to be included in the eFPGA fabric. They are integrated into the logic fabric alongside traditional building blocks, increasing the capability of the eFPGA by adding functions optimized to decrease area and/or increase performance of targeted applications, especially for embedded vision and image processing algorithms.

An example of how custom blocks enable high-performance image processing is the implementation of “you only look once” (YOLO), a state-of-the-art, real-time object detection algorithm that uses neural networks and outperforms earlier methods. This algorithm relies on a large number of matrix multipliers. When implemented in an FPGA, these matrix multipliers are built using DSP and RAM blocks.
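To see why matrix multiplication maps onto DSP and RAM blocks, it helps to write it as a sequence of multiply-accumulate (MAC) operations. The following is a plain software sketch, not a description of any particular hardware implementation; the function name is hypothetical.

```python
def matmul_mac(a, b):
    """Multiply matrices a (rows x inner) and b (inner x cols).

    Returns the product and the number of multiply-accumulate (MAC)
    operations performed. In an FPGA, each MAC is the kind of operation
    a DSP block carries out, while the operands (for a neural network,
    typically weights and activations) are held in RAM blocks.
    """
    rows, inner, cols = len(a), len(b), len(b[0])
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions do not match")
    out = [[0] * cols for _ in range(rows)]
    macs = 0
    for i in range(rows):
        for j in range(cols):
            acc = 0
            for k in range(inner):
                acc += a[i][k] * b[k][j]  # one multiply-accumulate
                macs += 1
            out[i][j] = acc
    return out, macs
```

The MAC count is rows × inner × cols, so it grows quickly with matrix size. That is why both the number of multipliers and the width and depth of the memories feeding them matter to the area and performance of the fabric.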

A problem arises from the mismatch between the DSP and RAM block configurations that are optimal for YOLO and those found in a typical FPGA fabric. For example, an FPGA fabric might offer DSP blocks with 18 × 27 multiplication/accumulation and 32 × 128 RAMs, whereas the optimal solution would be a fabric with 16 × 8 DSP blocks and 48 × 1024 RAMs. By creating custom blocks that implement the optimal DSP and RAM block configurations, the Speedcore fabric uses 40% less die area to implement the same functionality while also achieving a higher level of system performance.
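A rough way to quantify this mismatch is to ask how much of a generic block a YOLO-sized operation would actually use. The sketch below uses the product of operand widths as a crude proxy for multiplier size and reads the RAM dimensions as width × depth in bits. Both are assumptions made for illustration, and the calculation is not how the 40% figure above was derived.

```python
def multiplier_utilization(op_a_bits, op_b_bits, hw_a_bits, hw_b_bits):
    """Fraction of a hardware multiplier used by narrower operands.

    Uses the product of operand widths as a crude stand-in for the size
    of the multiplier array (an assumption, not a die-area model).
    """
    fits = (op_a_bits <= hw_a_bits and op_b_bits <= hw_b_bits) or (
        op_a_bits <= hw_b_bits and op_b_bits <= hw_a_bits
    )
    if not fits:
        raise ValueError("operands do not fit in one hardware multiplier")
    return (op_a_bits * op_b_bits) / (hw_a_bits * hw_b_bits)


def ram_blocks_needed(need_width, need_depth, blk_width, blk_depth):
    """Generic RAM blocks needed to build one wider, deeper memory.

    Dimensions are read as width x depth in bits (an assumption).
    """
    across = -(-need_width // blk_width)  # ceiling division
    down = -(-need_depth // blk_depth)
    return across * down


if __name__ == "__main__":
    dsp_use = multiplier_utilization(16, 8, 18, 27)
    blocks = ram_blocks_needed(48, 1024, 32, 128)
    ram_use = (48 * 1024) / (blocks * 32 * 128)
    print(f"16 x 8 multiply on an 18 x 27 multiplier uses {dsp_use:.0%} of it")
    print(f"48 x 1024 memory needs {blocks} blocks of 32 x 128 ({ram_use:.0%} used)")
```

Under these assumptions, most of each generic multiplier sits idle and a quarter of the stitched-together RAM capacity goes unused, which gives some intuition for why purpose-sized custom blocks can save area.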

Embedding FPGA fabrics in SoCs provides two additional system-level benefits:

Lower power: Programmable I/O circuitry accounts for half of the total power consumption of standalone FPGAs. An eFPGA has direct wire connections to other blocks within the host SoC, eliminating the need for large programmable I/O buffers altogether.
Lower system cost: The die size of an eFPGA is smaller than that of an equivalent standalone FPGA. The eFPGA can be sized for the specific target function. Moreover, functions found in standalone FPGAs, such as programmable I/O buffers and interface logic, are not needed in the eFPGA.

Ultra-low latency and real-time processing are driving the need for efficient implementation of 360° vision-based systems. eFPGAs with custom blocks, working with a CPU in the same host SoC, are well suited to implementing dedicated functions such as object detection and image recognition, warping and distortion correction, and stitching together final images.

Alok Sanghavi

Alok Sanghavi is a senior marketing manager at Achronix.
