Open Access System for Information Sharing

Login Library

 

Thesis
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

언어모델과 재순위화를 이용한 형태소 기반 한국어 품사 태거

Title
언어모델과 재순위화를 이용한 형태소 기반 한국어 품사 태거
Authors
이의현
Date Issued
2017
Publisher
포항공과대학교
Abstract
In Korean Part-of-Speech (POS) tagging task, morpheme-based tagging is linguistically standard approach compared to character-based tagging, because character itself means nothing. Moreover, despite of its high speed and easy customizing, morpheme-based approach is not general in recent Korean POS tagging, because of its complex architecture and poor precision. Because segmentation of morpheme is also determined in decoding step, it is hard to use broad context features such as higher n-gram feature. This thesis applied re-ranking framework into POS tagging system to utilize global feature. The baseline POS tagger generates K-best segmented and tagged candidates and a higher n-gram language model re-scores them. By using re-ranking with language model, it improved word precision by 1.2\% points compared to baseline tagger.
URI
http://postech.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=000002378462
https://oasis.postech.ac.kr/handle/2014.oak/93555
Article Type
Thesis
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Views & Downloads

Browse