Open Access System for Information Sharing

Login Library

 

Article
Cited 6 time in webofscience Cited 8 time in scopus
Metadata Downloads

Entity Translation Mining from Comparable Corpora: Combining Graph Mapping with Corpus Latent Features SCIE SCOPUS

Title
Entity Translation Mining from Comparable Corpora: Combining Graph Mapping with Corpus Latent Features
Authors
Kim, JHwang, SWJiang, LSong, YIZhou, M
Date Issued
2013-08
Publisher
IEEE COMPUTER SOC
Abstract
This paper addresses the problem of mining named entity translations from comparable corpora, specifically, mining English and Chinese named entity translation. We first observe that existing approaches use one or more of the following named entity similarity metrics: entity, entity context, and relationship. Motivated by this observation, we propose a new holistic approach by 1) combining all similarity types used and 2) additionally considering relationship context similarity between pairs of named entities, a missing quadrant in the taxonomy of similarity metrics. We abstract the named entity translation problem as the matching of two named entity graphs extracted from the comparable corpora. Specifically, named entity graphs are first constructed from comparable corpora to extract relationship between named entities. Entity similarity and entity context similarity are then calculated from every pair of bilingual named entities. A reinforcing method is utilized to reflect relationship similarity and relationship context similarity between named entities. We also discover "latent" features lost in the graph extraction process and integrate this into our framework. According to our experimental results, our holistic graph-based approach and its enhancement using corpus latent features are highly effective and our framework significantly outperforms previous approaches.
Keywords
Data mining; text mining
URI
https://oasis.postech.ac.kr/handle/2014.oak/14877
DOI
10.1109/TKDE.2012.117
ISSN
1041-4347
Article Type
Article
Citation
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, vol. 25, no. 8, page. 1787 - 1800, 2013-08
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

황승원HWANG, SEUNG WON
Dept of Computer Science & Enginrg
Read more

Views & Downloads

Browse