Open Access System for Information Sharing

Conference
Full metadata record
Files in This Item:
There are no files associated with this item.
dc.contributor.author: Kwon, Eunji
dc.contributor.author: Song, Haena
dc.contributor.author: Park, Jihye
dc.contributor.author: Kang, Seokhyeong
dc.date.accessioned: 2024-03-06T08:10:43Z
dc.date.available: 2024-03-06T08:10:43Z
dc.date.created: 2024-02-22
dc.date.issued: 2023-04-17
dc.identifier.uri: https://oasis.postech.ac.kr/handle/2014.oak/122623
dc.description.abstract: It is difficult to employ transformer models for computer vision on mobile devices because of their memory- and computation-intensive properties. Accordingly, there is ongoing research on methods for compressing transformer models, such as pruning. However, general-purpose computing platforms such as central processing units (CPUs) and graphics processing units (GPUs) are not energy-efficient when accelerating the pruned model because of its structured sparsity. This paper proposes a low-power accelerator for transformers with various sizes of structured sparsity induced by pruning at different granularities. The proposed design can accelerate a transformer that has been pruned in a head-wise, line-wise, or block-wise manner. We developed a head scheduling algorithm that supports head-wise skip operations and resolves the processing-engine (PE) load-imbalance problem caused by the differing numbers of operations across heads. Moreover, we implemented a sparse general matrix-to-matrix multiplication (sparse GEMM) module that supports line-wise and block-wise skipping. As a result, compared with a mobile GPU and a mobile CPU, respectively, the proposed accelerator achieved 6.1x and 13.6x improvements in energy efficiency for the detection transformer (DETR) model, and approximately 2.6x and 7.9x improvements in energy efficiency on average for vision transformer (ViT) models.
dc.language: English
dc.publisher: Institute of Electrical and Electronics Engineers Inc.
dc.relation.isPartOf: 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023
dc.relation.isPartOf: Proceedings - Design, Automation and Test in Europe, DATE
dc.title: Mobile Accelerator Exploiting Sparsity of Multi-Heads, Lines, and Blocks in Transformers in Computer Vision
dc.type: Conference
dc.type.rims: CONF
dc.identifier.bibliographicCitation: 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023
dc.citation.conferenceDate: 2023-04-17
dc.citation.conferencePlace: BE
dc.citation.title: 2023 Design, Automation and Test in Europe Conference and Exhibition, DATE 2023
dc.contributor.affiliatedAuthor: Kang, Seokhyeong
dc.description.journalClass: 1
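The abstract above mentions a head scheduling algorithm that supports head-wise skipping and balances the differing per-head operation counts across processing engines (PEs). The paper's actual algorithm is not reproduced in this record; the Python sketch below is an illustration only, assuming a greedy longest-job-first assignment of surviving heads to the least-loaded PE (the function name and its inputs are hypothetical).

import heapq

def schedule_heads(head_ops, num_pes):
    # head_ops: operation count per attention head; pruned heads are
    # marked with 0 and are skipped (head-wise skipping).
    # Returns one list of head indices per PE.
    surviving = [(ops, h) for h, ops in enumerate(head_ops) if ops > 0]
    surviving.sort(reverse=True)                  # heaviest heads first
    pe_heap = [(0, pe) for pe in range(num_pes)]  # (accumulated ops, PE id)
    heapq.heapify(pe_heap)
    assignment = [[] for _ in range(num_pes)]
    for ops, head in surviving:
        load, pe = heapq.heappop(pe_heap)         # pick the least-loaded PE
        assignment[pe].append(head)
        heapq.heappush(pe_heap, (load + ops, pe))
    return assignment

For example, schedule_heads([120, 0, 80, 80, 0, 60], num_pes=2) skips the pruned heads 1 and 4 and returns [[0, 5], [3, 2]], leaving the two PEs with loads of 180 and 160 operations.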
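The sparse GEMM module is likewise described only at a high level in the abstract. As a rough sketch of the block-wise skipping idea, assuming pruned regions are exactly zero and are tested at block granularity (the block size, threshold, and function name are assumptions, not the paper's design):

import numpy as np

def block_sparse_gemm(a, b, block=16, eps=0.0):
    # Multiply a block-sparse (pruned) matrix `a` by a dense matrix `b`,
    # skipping all multiply-accumulate work for all-zero blocks of `a`.
    m, k = a.shape
    k2, n = b.shape
    assert k == k2
    out = np.zeros((m, n), dtype=a.dtype)
    for i in range(0, m, block):
        for j in range(0, k, block):
            blk = a[i:i+block, j:j+block]
            if not np.any(np.abs(blk) > eps):   # pruned block: skip entirely
                continue
            out[i:i+block, :] += blk @ b[j:j+block, :]
    return out

In this simplified view, line-wise skipping can be seen as the degenerate case in which a skipped block spans an entire row or column of the matrix.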

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
