Open Access System for Information Sharing

Login Library

 

Article
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Pruning-based unsupervised segmentation for Korean SCIE SCOPUS

Title
Pruning-based unsupervised segmentation for Korean
Authors
Kang, ISNa, SHLee, JH
Date Issued
2006-10
Publisher
IEICE-INST ELECTRONICS INFORMATION CO
Abstract
Compound noun segmentation is a key component for Korean language processing. Supervised approaches require some types of human intervention such as maintaining lexicons, manually segmenting the corpora, or devising heuristic rules. Thus, they suffer from the unknown word problem, and cannot distinguish domain-oriented or corpus-directed segmentation results from the others. These problems can be overcome by unsupervised approaches that employ segmentation clues obtained purely from a raw corpus. However, most unsupervised approaches require tuning of empirical parameters or learning of the statistical dictionary. To develop a tuning-less, learning-free unsupervised segmentation algorithm, this study proposes a pruning-based unsupervised technique that eliminates unhelpful segmentation candidates. In addition, unlike previous unsupervised methods that have relied on purely character-based segmentation clues, this study utilizes word-based segmentation clues. Experimental evaluations show that the pruning scheme is very effective to unsupervised segmentation of Korean compound nouns, and the use of word-based prior knowledge enables better segmentation accuracy. This study also shows that the proposed algorithm performs competitively with or better than other unsupervised methods.
URI
https://oasis.postech.ac.kr/handle/2014.oak/10385
DOI
10.1093/ietisy/e89-d.10.2670
ISSN
0916-8532
Article Type
Article
Citation
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, vol. E89-D, no. 10, page. 2670 - 2677, 2006-10
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

이종혁LEE, JONG HYEOK
Grad. School of AI
Read more

Views & Downloads

Browse