Open Access System for Information Sharing

Login Library

 

Article
Cited 3 time in webofscience Cited 3 time in scopus
Metadata Downloads

Clustering Noise-Included Data by Controlling Decision Errors SCIE SCOPUS

Title
Clustering Noise-Included Data by Controlling Decision Errors
Authors
Park, Hae-SangJeonghwa LeeJun, CH
Date Issued
2014-05
Publisher
Springer
Abstract
Cluster analysis is an unsupervised learning technique for partitioning objects into several clusters. Assuming that noisy objects are included, we propose a soft clustering method which assigns objects that are significantly different from noise into one of the specified number of clusters by controlling decision errors through multiple testing. The parameters of the Gaussian mixture model are estimated from the EM algorithm. Using the estimated probability density function, we formulated a multiple hypothesis testing for the clustering problem, and the positive false discovery rate (pFDR) is calculated as our decision error. The proposed procedure classifies objects into significant data or noise simultaneously according to the specified target pFDR level. When applied to real and artificial data sets, it was able to control the target pFDR reasonably well, offering a satisfactory clustering performance.
URI
https://oasis.postech.ac.kr/handle/2014.oak/13787
DOI
10.1007/S10479-012-1238-7
ISSN
0254-5330
Article Type
Article
Citation
Annals of Operations Research, vol. 216, no. 1, page. 129 - 144, 2014-05
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

전치혁JUN, CHI HYUCK
Dept of Industrial & Management Enginrg
Read more

Views & Downloads

Browse