Open Access System for Information Sharing

Login Library

Department of Computer Science & Engineering (컴퓨터공학과) 3. Theses_Ph.D.

Thesis

Cited 0 time in webofscience

webofscience

Cited 0 time in scopus

scopus

Metadata Downloads

Full metadata record

Files in This Item:: There are no files associated with this item.

DC Field	Value	Language
dc.contributor.author	이예하	en_US
dc.date.accessioned	2014-12-01T11:48:00Z	-
dc.date.available	2014-12-01T11:48:00Z	-
dc.date.issued	2012	en_US
dc.identifier.other	OAK-2014-00989	en_US
dc.identifier.uri	http://postech.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=000001218603	en_US
dc.identifier.uri	https://oasis.postech.ac.kr/handle/2014.oak/1491	-
dc.description	Doctor	en_US
dc.description.abstract	Since the advent of the Internet, it has become one of the most important channels for communicating information among users including individuals and news organizations.Many news organizations have started to distribute news stories on the Internet, and a large number of news stories are published by various news channels, on a daily basis.This makes it difficult to keep track of important news stories.As a result, users' need to identify top news stories has increased, and news story search has played an increasingly important role in users' Internet activity.The objective of this dissertation is to identify important news stories for a given date, using the blogosphere.Blogs consists of blog posts that are user-generated contents, and reflects diverse the opinion of users about news stories.Therefore, a news story that attracts much attention in the blogosphere is likely to be important.In this dissertation, we define the popularity of a news story as the amount of attention it receives from users within the blogosphere.We first evaluate the popularity of a news story in terms of content similarity between the story and blog posts published on a given date.For this purpose, we propose several approaches to estimate language models for each of the story and the blog posts.We also generate a temporal profile of a news story by analyzing the distribution of the number of blog posts relevant to the story over several days, and evaluate the popularity of the story based on the temporal profile.The experimental results on the TREC 2009 and 2010 Blog Track show that our approach is effective in identifying the important news stories.In particular, the proposed approach achieved the state-of-the-art performance.Furthermore, we propose a simple but effective approach to deal with the noisy information of blog posts.In general, blog posts include several types of noisy information including blog templates, advertisements and navigation panels.This noisy information is not user-generated contents, and has a bad influence on our system for identifying important news stories.The motivation for our approach is that most of the noisy contents do not change across several consecutive posts within the same blog.To eliminate the noisy information, we compare two consecutive posts belonging to the same blog.Then, we consider common parts of the two posts as the noisy contents, and remove them.Experimental results from the TREC blog track are remarkable, showing that the retrieval system using the proposed method results in an important performance improvement of about 10% MAP (Mean Average Precision) increase over that of the baseline system.	en_US
dc.language	eng	en_US
dc.publisher	포항공과대학교	en_US
dc.rights	BY_NC_ND	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/2.0/kr	en_US
dc.title	News Story Ranking Using Blogosphere	en_US
dc.type	Thesis	en_US
dc.contributor.college	일반대학원 컴퓨터공학과	en_US
dc.date.degree	2012- 2	en_US
dc.type.docType	Thesis	-

Show simple item record

qr_code

트윗하기

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Communities & Collection

Department of Computer Science & Engineering (컴퓨터공학과)

Views & Downloads

OAK

개인정보처리방침 Personal Information Protection Policy

library@postech.ac.kr Tel: 054-279-2548

Copyrights © by 2017 Pohang University of Science ad Technology All right reserved.

Browse

Login Library Help