Robust and Efficient Large-large Table Outer Joins on Distributed Infrastructures

From International Center for Computational Logic

Toggle side column

Robust and Efficient Large-large Table Outer Joins on Distributed Infrastructures

Long ChengLong Cheng,  Spyros KotoulasSpyros Kotoulas,  Tomas E. WardTomas E. Ward,  Georgios TheodoropoulosGeorgios Theodoropoulos
Robust and Efficient Large-large Table Outer Joins on Distributed Infrastructures


Long Cheng, Spyros Kotoulas, Tomas E. Ward, Georgios Theodoropoulos
Robust and Efficient Large-large Table Outer Joins on Distributed Infrastructures
Proc. 20th International European Conference on Parallel Processing (Euro-Par'14), 258-269, August 2014. Springer
  • KurzfassungAbstract
    Outer joins are ubiquitous in many workloads but are sensitive to load-balancing problems. Current approaches mitigate such problems caused by data skew by using (partial) replication. However, contemporary replication-based approaches (1) introduce overhead, since they usually result in redundant data movement, (2) are sensitive to parameter tuning and value of data skew and (3) typically require that one side is small. In this paper, we propose a novel parallel algorithm, Redistribution and Efficient Query with Counters (REQC), aimed at robustness in terms of size of join sides, variation in skew and parameter tuning. Experimental results demonstrate that our algorithm is faster, more robust and less demanding in terms of network bandwidth, compared to the state-of-the-art.
  • Weitere Informationen unter:Further Information: Link
  • Forschungsgruppe:Research Group: Wissensbasierte SystemeKnowledge-Based Systems
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-09873-9_22.
@inproceedings{CKWT2014,
  author    = {Long Cheng and Spyros Kotoulas and Tomas E. Ward and Georgios
               Theodoropoulos},
  title     = {Robust and Efficient Large-large Table Outer Joins on Distributed
               Infrastructures},
  booktitle = {Proc. 20th International European Conference on Parallel
               Processing (Euro-Par'14)},
  publisher = {Springer},
  year      = {2014},
  month     = {August},
  pages     = {258-269},
  doi       = {10.1007/978-3-319-09873-9_22}
}