Open Access System for Information Sharing

Login Library

 

Conference
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads
Full metadata record
Files in This Item:
There are no files associated with this item.
DC FieldValueLanguage
dc.contributor.authorLEE, GARY GEUNBAE-
dc.contributor.authorLee, Wonjun-
dc.contributor.authorKim, Yunsu-
dc.date.accessioned2024-03-06T00:57:36Z-
dc.date.available2024-03-06T00:57:36Z-
dc.date.created2024-02-20-
dc.date.issued2023-12-16-
dc.identifier.urihttps://oasis.postech.ac.kr/handle/2014.oak/121218-
dc.description.abstractThis research optimizes two-pass cross-lingual transfer learning in low-resource languages by enhancing phoneme recognition and phoneme-to-grapheme translation models. Our approach optimizes these two stages to improve speech recognition across languages. We optimize phoneme vocabulary coverage by merging phonemes based on shared articulatory characteristics, thus improving recognition accuracy. Additionally, we introduce a global phoneme noise generator for realistic ASR noise during phoneme-to-grapheme training to reduce error propagation. Experiments on the CommonVoice 12.0 dataset show significant reductions in Word Error Rate (WER) for low-resource languages, highlighting the effectiveness of our approach. This research contributes to the advancements of two-pass ASR systems in low-resource languages, offering the potential for improved cross-lingual transfer learning.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.relation.isPartOf2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023-
dc.relation.isPartOf2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023-
dc.titleOptimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition And Phoneme To Grapheme Translation-
dc.typeConference-
dc.type.rimsCONF-
dc.identifier.bibliographicCitation2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023-
dc.citation.conferenceDate2023-12-16-
dc.citation.conferencePlaceCH-
dc.citation.title2023 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023-
dc.contributor.affiliatedAuthorLEE, GARY GEUNBAE-
dc.description.journalClass1-
dc.description.journalClass1-

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Views & Downloads

Browse