Open Access System for Information Sharing

Login Library

 

Conference
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads
Full metadata record
Files in This Item:
There are no files associated with this item.
DC FieldValueLanguage
dc.contributor.authorSong, Haena-
dc.contributor.authorYoon, Jongho-
dc.contributor.authorKim, Dohun-
dc.contributor.authorKwon, Eunji-
dc.contributor.authorOh, Tae-Hyun-
dc.contributor.authorKang, Seokhyeong-
dc.date.accessioned2024-03-06T08:10:55Z-
dc.date.available2024-03-06T08:10:55Z-
dc.date.created2024-02-22-
dc.date.issued2023-04-17-
dc.identifier.urihttps://oasis.postech.ac.kr/handle/2014.oak/122626-
dc.description.abstractNumerous network compression methods have been proposed to deploy deep neural networks in a resource-constrained embedded system. Among them, block-circulant matrix (BCM) compression is one of the promising hardware-friendly methods for both acceleration and compression. However, it has several limitations; (i) limited representation due to the structural characteristic of circulant matrix, (ii) limitation of the compression parameter, (iii) need to specialize the dataflow for BCM-compressed network accelerators. In this paper, rank-enhanced and highly-pruned block-circulant matrices compression (RP-BCM) framework is proposed to overcome these limitations. RP-BCM comprises two stages: Hadamard-BCM and BCM-wise pruning. Moreover, a dedicated skip scheme is introduced to processing element design for exploiting high-parallelism with BCM-wise sparsity. Furthermore, we propose specialized dataflow for a BCM-compressed network on a resource-constrained FPGA. As a result, the proposed method achieves parameter reduction and FLOPs reduction for ResNet-50 in ImageNet by 92.4% and 77.3%, respectively. Moreover, the proposed hardware design achieves 3.1× improvement in energy efficiency on the Xilinx PYNQ-Z2 FPGA board for ResNet-18 on ImageNet compared to the GPU.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.relation.isPartOf2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023-
dc.relation.isPartOfProceedings -Design, Automation and Test in Europe, DATE-
dc.titleFPGA-Based Accelerator for Rank-Enhanced and Highly-Pruned Block-Circulant Neural Networks-
dc.typeConference-
dc.type.rimsCONF-
dc.identifier.bibliographicCitation2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023-
dc.citation.conferenceDate2023-04-17-
dc.citation.conferencePlaceBE-
dc.citation.title2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023-
dc.contributor.affiliatedAuthorOh, Tae-Hyun-
dc.contributor.affiliatedAuthorKang, Seokhyeong-
dc.description.journalClass1-
dc.description.journalClass1-

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Views & Downloads

Browse