Open Access System for Information Sharing

Login Library

 

Thesis
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads
Full metadata record
Files in This Item:
There are no files associated with this item.
DC FieldValueLanguage
dc.contributor.author전현민-
dc.date.accessioned2022-10-31T16:32:37Z-
dc.date.available2022-10-31T16:32:37Z-
dc.date.issued2021-
dc.identifier.otherOAK-2015-09654-
dc.identifier.urihttp://postech.dcollection.net/common/orgView/200000506809ko_KR
dc.identifier.urihttps://oasis.postech.ac.kr/handle/2014.oak/114201-
dc.descriptionMaster-
dc.description.abstractIn this thesis, we propose Dialogue System with Optimizing a Recurrent Action Policy using Efficient Context (DORA), a multi-domain task-oriented dialogue system that uses supervised learning (SL) with subsequently applied reinforcement learning (RL) for optimization by using a recurrent dialogue policy. This dialogue policy recurrently generates explicit system actions as a both word-level and high-level policy. The system actions are interpretable and controllable; therefore, we propose approaches to the system actions by using them for rewards and by controlling them. As a result, DORA is clearly optimized during both SL and RL steps by using the explicit system action policy that considers an efficient input context instead of the entire dialogue history. In the experiments, DORA achieved state-of-the-art success rate with improvement by 6.6 points on MultiWOZ 2.0 and by 10.9 points on MultiWOZ 2.1.-
dc.languageeng-
dc.publisher포항공과대학교-
dc.titleRecurrent Action Policy Optimization for Multi-domain Task-oriented Dialogue System-
dc.title.alternative다중 도메인 목적 지향 대화 시스템을 위한 순환 행동 정책 최적화-
dc.typeThesis-
dc.contributor.college컴퓨터공학과-
dc.date.degree2021- 8-

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Views & Downloads

Browse