Published June 11, 2018
| Version v1
Journal article
Open
TOP-K DOMINATING QUERY PROCESSING OVER DISTRIBUTED DATA STREAMS
Authors/Creators
Description
Data stream has been widely used in lots of modern applications such as Social networks and the Internet of things. Aiming at the problem of Top-k dominating query in distributed data stream, a distributed Top-k query algorithm based on Spark Streaming framework is proposed. Based on partitioning, double pruning techniques are implemented on the data. Local and global pruning can significantly reduce the number of candidate sets, reduce the computational overhead and space costs, and improve the query efficiency. Experimental results show that the algorithm has good performance and scalability.
Files
Files
(421.8 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:f180b00b299457c806139dd2e7facabf
|
421.8 kB | Download |