Open Access System for Information Sharing

Login Library

 

Article
Cited 1 time in webofscience Cited 0 time in scopus
Metadata Downloads
Full metadata record
Files in This Item:
There are no files associated with this item.
DC FieldValueLanguage
dc.contributor.authorHAN, WOOK SHIN-
dc.contributor.authorKwak, Wooseong-
dc.contributor.authorYu, Hwanjo-
dc.date.accessioned2018-09-03T00:52:19Z-
dc.date.available2018-09-03T00:52:19Z-
dc.date.created2018-08-30-
dc.date.issued2010-03-
dc.identifier.issn1084-4627-
dc.identifier.urihttps://oasis.postech.ac.kr/handle/2014.oak/92224-
dc.description.abstractCommercial tuple extraction systems have enjoyed some success to extract tuples by regarding HTML pages as tree structures and exploiting XPath queries to find attributes of tuples in the HTML pages. However, such systems would be vulnerable to small changes on the web pages. In this paper, we propose a robust tuple extraction system which utilizes spatial relationships among elements rather than the XPath queries of the elements. Our system regards elements in the rendered page as spatial objects in the 2-D space and executes spatial joins to extract target elements. Since humans also identify an element in a web page by its relative spatial location, our system extracting elements by their spatial relationships could possibly be as robust as manual extraction and is far more robust than existing tuple extraction systems.-
dc.languageEnglish-
dc.publisherIEEE-
dc.relation.isPartOfProceedings - International Conference on Data Engineering-
dc.titleOn Supporting Effective Web Extraction-
dc.typeArticle-
dc.identifier.doi10.1109/ICDE.2010.5447932-
dc.type.rimsART-
dc.identifier.bibliographicCitationProceedings - International Conference on Data Engineering, pp.773 - 775-
dc.identifier.wosid000286933100082-
dc.citation.endPage775-
dc.citation.startPage773-
dc.citation.titleProceedings - International Conference on Data Engineering-
dc.contributor.affiliatedAuthorHAN, WOOK SHIN-
dc.description.journalClass1-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.type.docTypeProceedings Paper-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.description.journalRegisteredClassscie-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

한욱신HAN, WOOK SHIN
Grad. School of AI
Read more

Views & Downloads

Browse