Open Access System for Information Sharing

Login Library

 

Conference
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Guaranteeing the O(AGM/OUT) Runtime for Uniform Sampling and Size Estimation over Joins

Title
Guaranteeing the O(AGM/OUT) Runtime for Uniform Sampling and Size Estimation over Joins
Authors
HAN, WOOK SHINKIM, KYOUNG MIN하재현Fletcher, George
Date Issued
2023-06-19
Publisher
ACM
Abstract
We propose a new method for estimating the number of answers OUT of a small join query 𝑄 in a large database 𝐷, and for uniform sampling over joins. Our method is the first to satisfy all the following statements. • Support arbitrary 𝑄, which can be either acyclic or cyclic, and contain binary and non-binary relations. • Guarantee an arbitrary small error with a high probability always in 𝑂˜(𝐴𝐺𝑀/OUT) time, where 𝐴𝐺𝑀 is the AGM bound (an upper bound of OUT), and 𝑂˜ hides the polylogarithmic factor of input size. We also explain previous join size estimators in a unified framework. All methods including ours rely on certain indexes on relations in 𝐷, which take linear time to build offline. Additionally, we extend our method using generalized hypertree decompositions (GHDs) to achieve a lower complexity than 𝑂˜ (𝐴𝐺𝑀/OUT) when OUT is small, and present optimization techniques for improving estimation efficiency and accuracy
URI
https://oasis.postech.ac.kr/handle/2014.oak/123207
Article Type
Conference
Citation
42nd Symposium on Principles of Database Systems, page. 113 - 125, 2023-06-19
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Views & Downloads

Browse