TY - GEN
T1 - Medusa
T2 - 28th International Conference on Field-Programmable Logic and Applications, FPL 2018
AU - Shen, Yongming
AU - Ji, Tianchu
AU - Ferdman, Michael
AU - Milder, Peter
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/11/9
Y1 - 2018/11/9
N2 - To cope with the increasing demand and computational intensity of deep neural networks (DNNs), industry and academia have turned to accelerator technologies. In particular, FPGAs have been shown to provide a good balance between performance and energy efficiency for accelerating DNNs. While significant research has focused on how to build efficient layer processors, the computational building blocks of DNN accelerators, relatively little attention has been paid to the on-chip interconnects that sit between the layer processors and the FPGA's DRAM controller. We observe a disparity between DNN accelerator interfaces, which tend to comprise many narrow ports, and FPGA DRAM controller interfaces, which tend to be wide buses. This mismatch causes traditional interconnects to consume significant FPGA resources. To address this problem, we designed Medusa: An optimized FPGA memory interconnect which transposes data in the interconnect fabric, tailoring the interconnect to the needs of DNN layer processors. Compared to a traditional FPGA interconnect, our design can reduce LUT and FF use by 4.7x and 6.0x, and improves frequency by 1.8x.
AB - To cope with the increasing demand and computational intensity of deep neural networks (DNNs), industry and academia have turned to accelerator technologies. In particular, FPGAs have been shown to provide a good balance between performance and energy efficiency for accelerating DNNs. While significant research has focused on how to build efficient layer processors, the computational building blocks of DNN accelerators, relatively little attention has been paid to the on-chip interconnects that sit between the layer processors and the FPGA's DRAM controller. We observe a disparity between DNN accelerator interfaces, which tend to comprise many narrow ports, and FPGA DRAM controller interfaces, which tend to be wide buses. This mismatch causes traditional interconnects to consume significant FPGA resources. To address this problem, we designed Medusa: An optimized FPGA memory interconnect which transposes data in the interconnect fabric, tailoring the interconnect to the needs of DNN layer processors. Compared to a traditional FPGA interconnect, our design can reduce LUT and FF use by 4.7x and 6.0x, and improves frequency by 1.8x.
KW - Accelerator
KW - Deep Neural Network
KW - FPGA
KW - Memory Interconnect
UR - https://www.scopus.com/pages/publications/85060284486
U2 - 10.1109/FPL.2018.00026
DO - 10.1109/FPL.2018.00026
M3 - Conference contribution
AN - SCOPUS:85060284486
T3 - Proceedings - 2018 International Conference on Field-Programmable Logic and Applications, FPL 2018
SP - 101
EP - 105
BT - Proceedings - 2018 International Conference on Field-Programmable Logic and Applications, FPL 2018
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 26 August 2018 through 30 August 2018
ER -