Open Access System for Information Sharing


Article
Cited 263 times in Web of Science; cited 331 times in Scopus
Full metadata record
Files in This Item:
There are no files associated with this item.
DC Field: Value
dc.contributor.author: Lee, CK
dc.contributor.author: Lee, GG
dc.date.accessioned: 2016-04-01T02:04:43Z
dc.date.available: 2016-04-01T02:04:43Z
dc.date.created: 2009-08-21
dc.date.issued: 2006-01
dc.identifier.issn: 0306-4573
dc.identifier.other: 2005-OAK-0000005443
dc.identifier.uri: https://oasis.postech.ac.kr/handle/2014.oak/24365
dc.description.abstract: Most previous work on feature selection emphasized only the reduction of the high dimensionality of the feature space. But in cases where many features are highly redundant with each other, we must resort to other means, for example, more complex dependence models such as Bayesian network classifiers. In this paper, we introduce a new information gain and divergence-based feature selection method for statistical machine learning-based text categorization that does not rely on more complex dependence models. Our feature selection method strives to reduce redundancy between features while maintaining information gain in selecting appropriate features for text categorization. Empirical results are given on a number of datasets, showing that our feature selection method is more effective than Koller and Sahami's method [Koller, D., & Sahami, M. (1996). Toward optimal feature selection. In Proceedings of ICML-96, 13th international conference on machine learning], which is one of the greedy feature selection methods, and than conventional information gain, which is commonly used in feature selection for text categorization. Moreover, our feature selection method sometimes enables conventional machine learning algorithms to improve over support vector machines, which are known to give the best classification accuracy. (c) 2004 Elsevier Ltd. All rights reserved.
dc.description.statementofresponsibility: X
dc.language: English
dc.publisher: PERGAMON-ELSEVIER SCIENCE LTD
dc.relation.isPartOf: INFORMATION PROCESSING & MANAGEMENT (postech rank 1)
dc.subject: text categorization
dc.subject: feature selection
dc.subject: information gain and divergence-based feature selection
dc.title: Information gain and divergence-based feature selection for machine learning-based text categorization
dc.type: Article
dc.contributor.college: Department of Computer Science and Engineering
dc.identifier.doi: 10.1016/j.ipm.2004.08.006
dc.author.google: Lee, CK
dc.author.google: Lee, GG
dc.relation.volume: 42
dc.relation.issue: 1
dc.relation.startpage: 155
dc.relation.lastpage: 165
dc.contributor.id: 10103841
dc.relation.journal: INFORMATION PROCESSING & MANAGEMENT (postech rank 1)
dc.relation.index: SCI-level, SCOPUS-indexed paper
dc.relation.sci: SCIE
dc.collections.name: Journal Papers
dc.type.rims: ART
dc.identifier.bibliographicCitation: INFORMATION PROCESSING & MANAGEMENT (postech rank 1), v.42, no.1, pp.155-165
dc.identifier.wosid: 000232355300010
dc.date.tcdate: 2019-01-01
dc.citation.endPage: 165
dc.citation.number: 1
dc.citation.startPage: 155
dc.citation.title: INFORMATION PROCESSING & MANAGEMENT (postech rank 1)
dc.citation.volume: 42
dc.contributor.affiliatedAuthor: Lee, GG
dc.identifier.scopusid: 2-s2.0-23744432473
dc.description.journalClass: 1
dc.description.wostc: 125
dc.type.docType: Article
dc.subject.keywordAuthor: text categorization
dc.subject.keywordAuthor: feature selection
dc.subject.keywordAuthor: information gain and divergence-based feature selection
dc.relation.journalWebOfScienceCategory: Computer Science, Information Systems
dc.relation.journalWebOfScienceCategory: Information Science & Library Science
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Computer Science
dc.relation.journalResearchArea: Information Science & Library Science
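The abstract contrasts the paper's divergence-based method with conventional information gain, the common baseline for feature selection in text categorization. As an illustration of that baseline only (not the authors' proposed method), here is a minimal sketch that scores a term t by IG(t) = H(C) - [P(t)H(C|t) + P(not t)H(C|not t)] on a hypothetical toy corpus; all names and data below are invented for the example:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy H(C) of a class-label list, in bits."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(docs, labels, term):
    """IG(term) = H(C) - [P(t) H(C|t) + P(not t) H(C|not t)],
    where each doc is represented as a set of terms."""
    with_t = [lab for doc, lab in zip(docs, labels) if term in doc]
    without_t = [lab for doc, lab in zip(docs, labels) if term not in doc]
    n = len(labels)
    conditional = sum((len(part) / n) * entropy(part)
                      for part in (with_t, without_t) if part)
    return entropy(labels) - conditional

# Toy corpus (invented): documents as term sets, with category labels.
docs = [{"ball", "goal"}, {"ball", "team"}, {"vote", "law"}, {"vote", "court"}]
labels = ["sports", "sports", "politics", "politics"]

# Rank candidate features by information gain.
for term in ["ball", "vote", "team"]:
    print(term, round(information_gain(docs, labels, term), 3))
```

Note that plain information gain scores each term independently, so two perfectly redundant terms (here "ball" and "vote" both score 1.0) are both kept; reducing that redundancy is exactly the gap the paper's divergence-based method targets.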

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
