Application-Optimized Processors

How to improve performance and decrease power in neural networks.

popularity

Executing a neural network on top of an NPU requires an understanding of application requirements, such as latency and throughput, as well as the potential partitioning challenges. Sharad Chole, chief scientist and co-founder of Expedera, talks about fine-grained dependencies, why processing packets out of order can help optimize performance and power, and when to use voltage and frequency scaling versus clock gating.



Leave a Reply


(Note: This name will be displayed publicly)