(video) Co-Design of Deep Neural Nets and Neural Net Accelerators for Embedded Vision Applications

Prof. Kurt Keutzer’s presentation at the 2018 CVPR Workshop on Efficient Deep Learning for Computer Vision

1. Co-design of DNNs and accelerators requires detailed layer-by-layer analysis of contemporary DNNs
2. Key to efficient DNN computation is data reuse
3. Different CNN/DNN layers have different patterns of data reuse
4. Different NN accelerator architectures favor different types of reuse (outputs vs weights)
5. We present a NN accelerator design, the Squeezelerator, that supports Weight-Stationary (WS) and Output-Stationary (OS) dataflow and matches the data reuse in the accelerator to the characteristics of the layer
6. Careful attention to layer-by-layer speed/energy of mobile DNNs resulted in up to 6x speedup over other accelerator architectures

(video) Co-Design of Deep Neural Nets and Neural Net Accelerators for Embedded Vision Applications

(video) DeepScale object detection

(video) Kurt Keutzer, Co-Founder of DeepScale. How Autonomous Driving Challenges Computer Vision Research

(video) Forrest Iandola, CEO of DeepScale. Tips and Tricks for Developing Smaller Neural Nets

(video) Developing Enabling Technologies for Automated Driving

AV Talent War Heats Up as Physical AI Firms Drive Salaries Higher – News and Statistics

Nissan to Streamline Global Car Lineup, Expand AI Driving Technology Integration

Software-Defined Vehicle Market to Reach US$ 726.0 Billion

Amazon’s Zoox to begin robotaxi testing in Austin

BMW launches pilot project for in-plant driverless mobility

Here’s How Much A Waymo Ride Will Cost You Compared To Uber And Lyft