Packing Neural Networks Into End-User Client Devices

How number representation shrinks the footprint.


Most of today’s neural networks can only run on high-performance servers. There’s a big push to change this and simplify network processing to the point where the algorithms can run on end-user client devices. One approach is to eliminate complexity by replacing floating-point representation with fixed-point representation. We take a different approach, and recommend a mix of the two, so as to reduce memory and power requirements while retaining accuracy.

To read more, click here.

Leave a Reply

(Note: This name will be displayed publicly)