in

(video) Co-Design of Deep Neural Nets and Neural Net Accelerators for Embedded Vision Applications

Prof. Kurt Keutzer’s presentation at the 2018 CVPR Workshop on Efficient Deep Learning for Computer Vision

1. Co-design of DNNs and accelerators requires detailed layer-by-layer analysis of contemporary DNNs
2. Key to efficient DNN computation is data reuse
3. Different CNN/DNN layers have different patterns of data reuse
4. Different NN accelerator architectures favor different types of reuse (outputs vs weights)
5. We present a NN accelerator design, the Squeezelerator, that supports Weight-Stationary (WS) and Output-Stationary (OS) dataflow and matches the data reuse in the accelerator to the characteristics of the layer
6. Careful attention to layer-by-layer speed/energy of mobile DNNs resulted in up to 6x speedup over other accelerator architectures

Report

What do you think?

486 Points
Upvote Downvote

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

GIPHY App Key not set. Please check settings

(video) L’AUTONOM TRACT AT135 est là !

(video) Meet the Test Specialists