Open Access System for Information Sharing

Login Library

 

Conference
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Score-balanced Loss for Multi-aspect Pronunciation Assessment

Title
Score-balanced Loss for Multi-aspect Pronunciation Assessment
Authors
LEE, GARY GEUNBAEDo, HeejinKim, Yunsu
Date Issued
2023-08-20
Publisher
International Speech Communication Association
Abstract
With rapid technological growth, automatic pronunciation assessment has transitioned toward systems that evaluate pronunciation in various aspects, such as fluency and stress. However, despite the highly imbalanced score labels within each aspect, existing studies have rarely tackled the data imbalance problem. In this paper, we suggest a novel loss function, score-balanced loss, to address the problem caused by uneven data, such as bias toward the majority scores. As a re-weighting approach, we assign higher costs when the predicted score is of the minority class, thus, guiding the model to gain positive feedback for sparse score prediction. Specifically, we design two weighting factors by leveraging the concept of an effective number of samples and using the ranks of scores. We evaluate our method on the speechocean762 dataset, which has noticeably imbalanced scores for several aspects. Improved results particularly on such uneven aspects prove the effectiveness of our method.
URI
https://oasis.postech.ac.kr/handle/2014.oak/121435
Article Type
Conference
Citation
24th International Speech Communication Association, Interspeech 2023, page. 4998 - 5002, 2023-08-20
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Views & Downloads

Browse