An efficient theta-join query processing in distributed environment

W Liu, Z Li - Journal of Parallel and Distributed Computing, 2018 - Elsevier
Theta-join queries are very useful in many data analysis tasks, but they are not processed
efficiently in distributed environments, especially on large-scale data. Although there has been
much progress in handling theta-joins with the MapReduce paradigm, existing methods are either
complex, requiring fundamental changes to the MapReduce framework, or consider only the overhead
of load balancing in the network. When the data scale is large, they incur high computation cost
and induce OOM (Out of Memory) errors. In this work, we propose a filter method for theta-join on …