Open Access System for Information Sharing

Department of Electrical Engineering (전자전기공학과) 1. Journal Papers

Article

Cited 12 times in Web of Science

Cited 17 times in Scopus

Full metadata record

Files in This Item: There are no files associated with this item.

DC Field	Value	Language
dc.contributor.author	BAEK, JONGCHAN	-
dc.contributor.author	JEON, HAYEONG	-
dc.contributor.author	PARK, JONGYEOK	-
dc.contributor.author	LEE, HAKJUN	-
dc.contributor.author	HAN, SOOHEE	-
dc.date.accessioned	2021-06-01T01:53:08Z	-
dc.date.available	2021-06-01T01:53:08Z	-
dc.date.created	2021-03-08	-
dc.date.issued	2021-09	-
dc.identifier.issn	0278-0046	-
dc.identifier.uri	https://oasis.postech.ac.kr/handle/2014.oak/105104	-
dc.description.abstract	Recent advancements in deep reinforcement learning for real control tasks have received interest from many researchers and field engineers in a variety of industrial areas. However, in most cases, optimal policies obtained by deep reinforcement learning are difficult to implement on cost-effective and lightweight platforms such as mobile devices. This can be attributed to their computational complexity and excessive memory usage. For this reason, this study proposes an off-policy deep reinforcement learning algorithm called the sparse variational deterministic policy gradient (SVDPG). SVDPG provides highly efficient policy network compression under the standard reinforcement learning framework. The proposed SVDPG integrates Bayesian pruning, which is known as a state-of-the-art neural network compression technique, with the policy update in an actor-critic architecture for reinforcement learning. It is demonstrated that SVDPG achieves a high compression rate of policy networks for continuous control benchmark tasks while preserving a competitive performance. The superiority of SVDPG in low-computing power devices is proven by comparing the level of compression in terms of the memory requirements and computation time on a commercial microcontroller unit. Finally, it is confirmed that the proposed SVDPG is also reliable in real-world scenarios since it can be applied to the swing-up control of an inverted pendulum system.	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers	-
dc.relation.isPartOf	IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS	-
dc.subject	Benchmarking	-
dc.subject	Cost effectiveness	-
dc.subject	Deep learning	-
dc.subject	Real time control	-
dc.subject	Reinforcement learning	-
dc.subject	Actor-critic architectures	-
dc.subject	Competitive performance	-
dc.subject	Continuous control	-
dc.subject	High compressions	-
dc.subject	Inverted pendulum system	-
dc.subject	Memory requirements	-
dc.subject	Microcontroller unit	-
dc.subject	Real-world scenario	-
dc.subject	Learning algorithms	-
dc.title	Sparse Variational Deterministic Policy Gradient for Continuous Real Time Control	-
dc.type	Article	-
dc.identifier.doi	10.1109/TIE.2020.3021607	-
dc.type.rims	ART	-
dc.identifier.bibliographicCitation	IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, v.68, no.10, pp.9800 - 9810	-
dc.identifier.wosid	000670541800070	-
dc.citation.endPage	9810	-
dc.citation.number	10	-
dc.citation.startPage	9800	-
dc.citation.title	IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS	-
dc.citation.volume	68	-
dc.contributor.affiliatedAuthor	BAEK, JONGCHAN	-
dc.contributor.affiliatedAuthor	JEON, HAYEONG	-
dc.contributor.affiliatedAuthor	PARK, JONGYEOK	-
dc.contributor.affiliatedAuthor	LEE, HAKJUN	-
dc.contributor.affiliatedAuthor	HAN, SOOHEE	-
dc.identifier.scopusid	2-s2.0-85112513702	-
dc.description.journalClass	1	-
dc.description.journalClass	1	-
dc.description.isOpenAccess	N	-
dc.type.docType	Article	-
dc.subject.keywordAuthor	Bayes methods	-
dc.subject.keywordAuthor	Machine learning	-
dc.subject.keywordAuthor	Neural networks	-
dc.subject.keywordAuthor	Learning (artificial intelligence)	-
dc.subject.keywordAuthor	Computational modeling	-
dc.subject.keywordAuthor	Standards	-
dc.subject.keywordAuthor	Optimization	-
dc.subject.keywordAuthor	Bayesian compression	-
dc.subject.keywordAuthor	deep reinforcement learning	-
dc.subject.keywordAuthor	inverted pendulum system	-
dc.subject.keywordAuthor	sparse Bayesian deep learning	-
dc.relation.journalWebOfScienceCategory	Automation & Control Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Instruments & Instrumentation	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Automation & Control Systems	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Instruments & Instrumentation	-

Communities & Collection

Department of Electrical Engineering (전자전기공학과)

Related Researcher

HAN, SOOHEE (한수희): Department of Electrical Engineering
