Open Access System for Information Sharing

Login Library

 

Conference
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Maximizing System Performance by Balancing Computation Loads in LSTM Accelerators

Title
Maximizing System Performance by Balancing Computation Loads in LSTM Accelerators
Authors
PARK, JUN KIKUNG, JAEHAYI, WOOSEOKKIM, JAE JOON
Date Issued
2018-03-20
Publisher
ACM/IEEE
Abstract
The LSTM is a popular neural network model for modeling or analyzing the time-varying data. The main operation of LSTM is a matrix-vector multiplication and it becomes sparse (spMxV) due to the widely-accepted weight pruning in deep learning. This paper presents a new sparse matrix format, named CBSR, to maximize the inference speed of the LSTM accelerator. In the CBSR format, the speed-up is achieved by balancing out the computation loads over PEs. Along with the new format, we present a simple network transformation to completely remove the hardware overhead incurred when using the CBSR format. Also, the detailed analysis on the impact of network size or the number of PEs is performed which lacks in the prior work. The simulation results show 16∼38% improvement in the system performance compared to the well-known CSC/CSR format. The power analysis is also performed in 65nm CMOS technology to show 9∼22% energy savings.
URI
https://oasis.postech.ac.kr/handle/2014.oak/41592
Article Type
Conference
Citation
Design, Automation and Test in Europe (DATE) 2018, 2018-03-20
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

김재준KIM, JAE JOON
Dept. Convergence IT Engineering
Read more

Views & Downloads

Browse