TY - GEN
T1 - Answering why-not spatial keyword top-k queries via keyword adaption
AU - Chen, Lei
AU - Xu, Jianliang
AU - Lin, Xin
AU - Jensen, Christian S.
AU - Hu, Haibo
N1 - This work is supported by HK-RGC grants 12201615, 12202414, and 12200114. Xin Lin is the corresponding author and is supported by Shanghai Pujiang Program.
PY - 2016/5/16
Y1 - 2016/5/16
N2 - Web objects, often associated with descriptive text documents, are increasingly being geo-tagged. A spatial keyword top-k query retrieves the best k such objects according to a scoring function that considers both spatial distance and textual similarity. However, it is in some cases difficult for users to identify the exact keywords that describe their query intent. After a user issues an initial query and gets back the result, the user may find that some expected objects are missing and may wonder why. Answering the resulting why-not questions can aid users in retrieving better results. However, no existing techniques are able to answer why-not questions by adapting the query keywords. We propose techniques capable of adapting an initial set of query keywords so that expected, but missing, objects enter the result along with other relevant objects. We develop a basic algorithm with a set of optimizations that sequentially examines a sequence of candidate keyword sets. In addition, we present an index-based bound-and-prune algorithm that is able to determine the best sample out of a set of candidates in just one pass of index traversal, thus speeding up the query processing. We also extend the proposed algorithms to handle multiple missing objects. Extensive experimental results offer insight into the efficiency of the proposed techniques in terms of running time and I/O cost.
AB - Web objects, often associated with descriptive text documents, are increasingly being geo-tagged. A spatial keyword top-k query retrieves the best k such objects according to a scoring function that considers both spatial distance and textual similarity. However, it is in some cases difficult for users to identify the exact keywords that describe their query intent. After a user issues an initial query and gets back the result, the user may find that some expected objects are missing and may wonder why. Answering the resulting why-not questions can aid users in retrieving better results. However, no existing techniques are able to answer why-not questions by adapting the query keywords. We propose techniques capable of adapting an initial set of query keywords so that expected, but missing, objects enter the result along with other relevant objects. We develop a basic algorithm with a set of optimizations that sequentially examines a sequence of candidate keyword sets. In addition, we present an index-based bound-and-prune algorithm that is able to determine the best sample out of a set of candidates in just one pass of index traversal, thus speeding up the query processing. We also extend the proposed algorithms to handle multiple missing objects. Extensive experimental results offer insight into the efficiency of the proposed techniques in terms of running time and I/O cost.
UR - http://www.scopus.com/inward/record.url?scp=84980337101&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2016.7498282
DO - 10.1109/ICDE.2016.7498282
M3 - Conference proceeding
AN - SCOPUS:84980337101
T3 - Proceedings of IEEE International Conference on Data Engineering, ICDE
SP - 697
EP - 708
BT - Proceedings of IEEE 32nd International Conference on Data Engineering, ICDE 2016
PB - IEEE
T2 - 32nd IEEE International Conference on Data Engineering, ICDE 2016
Y2 - 16 May 2016 through 20 May 2016
ER -