HW and SW Architecture Approaches For Running AI Models

By Karen Heyman - 31 Oct, 2024 - Comments: 0

How best to run AI inference models is a current topic of much debate as a wide breadth of systems companies look to add AI to a variety of systems, spurring both hardware innovation and the need to revamp models. Hardware developers are making progress with AI accelerators and SoCs. But on the model side, questions abound about whether the answer might come from revisiting older, less compl... » read more

Fundamental Issues In Computer Vision Still Unresolved

By Karen Heyman - 02 May, 2024 - Comments: 1

Given computer vision’s place as the cornerstone of an increasing number of applications from ADAS to medical diagnosis and robotics, it is critical that its weak points be mitigated, such as the ability to identify corner cases or if algorithms are trained on shallow datasets. While well-known bloopers are often the result of human decisions, there are also fundamental technical issues that ... » read more

FP8: Cross-Industry Hardware Specification For AI Training And Inference (Arm, Intel, Nvidia)

By Technical Paper Link - 16 Sep, 2022 - Comments: 0

Arm, Intel, and Nvidia proposed a specification for an 8-bit floating point (FP8) format that could provide a common interchangeable format that works for both AI training and inference and allow AI models to operate and perform consistently across hardware platforms. Find the technical paper titled " FP8 Formats For Deep Learning" here. Published Sept 2022. Abstract: "FP8 is a natural p... » read more

There’s More To Machine Learning Than CNNs

By Bryon Moyer - 09 Jun, 2021 - Comments: 1

Neural networks – and convolutional neural networks (CNNs) in particular – have received an abundance of attention over the last few years, but they're not the only useful machine-learning structures. There are numerous other ways for machines to learn how to solve problems, and there is room for alternative machine-learning structures. “Neural networks can do all this really comple... » read more

Making Sense Of New Edge-Inference Architectures

By Bryon Moyer - 04 Mar, 2021 - Comments: 0

New edge-inference machine-learning architectures have been arriving at an astounding rate over the last year. Making sense of them all is a challenge. To begin with, not all ML architectures are alike. One of the complicating factors in understanding the different machine-learning architectures is the nomenclature used to describe them. You’ll see terms like “sea-of-MACs,” “systolic... » read more

Edge-Inference Architectures Proliferate

By Bryon Moyer - 04 Feb, 2021 - Comments: 0

First part of two parts. The second part will dive into basic architectural characteristics. The last year has seen a vast array of announcements of new machine-learning (ML) architectures for edge inference. Unburdened by the need to support training, but tasked with low latency, the devices exhibit extremely varied approaches to ML inference. “Architecture is changing both in the comp... » read more

The MCU Dilemma

By Brian Bailey - 13 Feb, 2020 - Comments: 2

The humble microcontroller is getting squeezed on all sides. While most of the semiconductor industry has been able to take advantage of Moore's Law, the MCU market has faltered because flash memory does not scale beyond 40nm. At the same time, new capabilities such as voice activation and richer sensor networks are requiring inference engines to be integrated for some markets. In others, re... » read more

IP Management And Development At 5/3nm

By Ann Mutschler - 24 Oct, 2019 - Comments: 1

The growing complexity of moving to new process nodes is making it much more difficult to create, manage and re-use IP. There are more rules, more data to manage, and more potential interactions as density increases, both in planar implementations and in advanced packaging. And the problems only get worse as designs move to 5nm and 3nm, and as more heterogeneous components such as accelerato... » read more

Edge Inferencing Challenges

By Ed Sperling - 10 Jan, 2019 - Comments: 0

Geoff Tate, CEO of Flex Logix, talks about balancing different variables to improve performance and reduce power at the lowest cost possible in order to do inferencing in edge devices. https://youtu.be/1BTxwew--5U » read more

Making AI Run Faster

By Ed Sperling - 11 Oct, 2018 - Comments: 0

The semiconductor industry has woken up to the fact that heterogeneous computing is the way forward and that inferencing will require more than a GPU or a CPU. The numbers being bandied about by the 30 or so companies working on this problem are 100X improvements in performance. But how to get there isn't so simple. It requires four major changes, as well as some other architectural shifts. ... » read more

← Older posts

Knowledge Centers
Entities, people and technologies explored

Startup Funding: Q1 2025

AI chips and data center communications see big funding; 75 startups raise $2 billion.

by Jesse Allen

Advanced Packaging Fundamentals for Semiconductor Engineers

New SE eBook examines the next phase of semiconductor design, testing, and manufacturing.

by Bryon Moyer

Chip Industry Week in Review

AI export rule to be scrapped; SEMI, EU request; Cadence, Nvidia supercomputer; AI co-processor; Imagination's new GPU; semi sales up; imec, TNO photonics lab; NSF key to national security; flexible packaging control system; SiConic test engineering; USB 4 support; SiC JFETS; magnetic behavior in hematite.

by The SE Staff

tag: RNNs

HW and SW Architecture Approaches For Running AI Models

Fundamental Issues In Computer Vision Still Unresolved

FP8: Cross-Industry Hardware Specification For AI Training And Inference (Arm, Intel, Nvidia)

There’s More To Machine Learning Than CNNs

Making Sense Of New Edge-Inference Architectures

Edge-Inference Architectures Proliferate

The MCU Dilemma

IP Management And Development At 5/3nm

Edge Inferencing Challenges

Making AI Run Faster

Trending Articles

Power Delivery Challenges For AI Chips

TSMC: King Of Data Center AI

Novel Assembly Approaches For 3D Device Stacks

Chip Industry Week in Review

EDA’s Top Execs Map Out An AI-Driven Future

Knowledge Centers
Entities, people and technologies explored

Related Articles

Startup Funding: Q1 2025

Advanced Packaging Fundamentals for Semiconductor Engineers

Chip Industry Week in Review

Chip Industry Week in Review

RISC-V’s Increasing Influence

Chip Industry Week in Review

Big Changes Ahead For Interposers And Substrates

What Exactly Are Chiplets And Heterogeneous Integration?

Sponsors

Recent Comments

About

Navigation

Connect With Us

tag: RNNs

HW and SW Architecture Approaches For Running AI Models

Fundamental Issues In Computer Vision Still Unresolved

FP8: Cross-Industry Hardware Specification For AI Training And Inference (Arm, Intel, Nvidia)

There’s More To Machine Learning Than CNNs

Making Sense Of New Edge-Inference Architectures

Edge-Inference Architectures Proliferate

The MCU Dilemma

IP Management And Development At 5/3nm

Edge Inferencing Challenges

Making AI Run Faster

Trending Articles

Power Delivery Challenges For AI Chips

TSMC: King Of Data Center AI

Novel Assembly Approaches For 3D Device Stacks

Chip Industry Week in Review

EDA’s Top Execs Map Out An AI-Driven Future

Knowledge Centers Entities, people and technologies explored

Related Articles

Startup Funding: Q1 2025

Advanced Packaging Fundamentals for Semiconductor Engineers

Chip Industry Week in Review

Chip Industry Week in Review

RISC-V’s Increasing Influence

Chip Industry Week in Review

Big Changes Ahead For Interposers And Substrates

What Exactly Are Chiplets And Heterogeneous Integration?

Sponsors

Newsletter Signup

Popular Tags

Recent Comments

About

Navigation

Connect With Us

Knowledge Centers
Entities, people and technologies explored