Many times you’ll hear vendors talking about how many TOPS their chip has and imply that more TOPS means better inference performance.
If you use TOPS to pick your AI inference chip, you will likely not be happy with what you get.
Recently, Vivienne Sze, a professor at MIT, gave an excellent talk entitled “How to Evaluate Efficient Deep Neural Network Approaches.” Slides are also av...
» read more