Open Access System for Information Sharing

Login Library

 

Conference
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking

Title
Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking
Authors
LEE, GARY GEUNBAELee, JihyunJeon, YejinLee, WonjunKim, Yunsu
Date Issued
2023-12-16
Publisher
Institute of Electrical and Electronics Engineers Inc.
Abstract
Dialogue state tracking plays a crucial role in extracting information in task-oriented dialogue systems. However, preceding research are limited to textual modalities, primarily due to the shortage of authentic human audio datasets. We address this by investigating synthetic audio data for audio-based DST. To this end, we develop cascading and end-to-end models, train them with our synthetic audio dataset, and test them on actual human speech data. To facilitate evaluation tailored to audio modalities, we introduce a novel PhonemeF1 to capture pronunciation similarity. Experimental results showed that models trained solely on synthetic datasets can generalize their performance to human voice data. By eliminating the dependency on human speech data collection, these insights pave the way for significant practical advancements in audio-based DST. Data and code are available at https://github.com/JihyunLee1/E2E-DST.
URI
https://oasis.postech.ac.kr/handle/2014.oak/121219
Article Type
Conference
Citation
2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, 2023-12-16
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Views & Downloads

Browse