Curbing the roofline: A scalable and flexible architecture for CNNs on FPGA

Meloni, Paolo; Deriu, Gianfranco; Raffo, Luigi
2016-01-01

Abstract

Convolutional Neural Networks (CNNs) have reached outstanding results in several complex visual recognition tasks, such as classification and scene parsing. CNNs are composed of multiple filtering layers that perform 2D convolutions over input images. The intrinsic parallelism in such a computation kernel makes it suitable to be effectively accelerated on parallel hardware. In this paper we propose a highly flexible and scalable architectural template for acceleration of CNNs on FPGA devices, based on the cooperation between a set of software cores and a parallel convolution engine that communicate via a tightly coupled L1 shared scratchpad. Our accelerator structure, tested on a Xilinx Zynq XC-Z7045 device, delivers peak performance up to 80 GMAC/s, corresponding to 100 MMAC/s for each DSP slice in the programmable fabric. Thanks to the flexible architecture, convolution operations can be scheduled in order to reduce input/output bandwidth down to 8 bytes per cycle without degrading the performance of the accelerator in most of the meaningful use-cases.
2016
English
2016 ACM International Conference on Computing Frontiers - Proceedings
9781450341288
Association for Computing Machinery, Inc
376
383
8
ACM International Conference on Computing Frontiers, CF 2016
Anonymous experts
2016
ita
international
scientific
Accelerator; Convolutional Neural Network; FPGA; Software
4 Contribution in Conference Proceedings (Proceeding)::4.1 Contribution in Conference Proceedings
Meloni, Paolo; Deriu, Gianfranco; Conti, Francesco; Loi, Igor; Raffo, Luigi; Benini, Luca
273
6
4.1 Contribution in Conference Proceedings
reserved
info:eu-repo/semantics/conferencePaper
Files in This Item:
File Size Format  
p376-meloni.pdf

Archive managers only

Type: publisher's version
Size 182.35 kB
Format Adobe PDF

