Comparative Study Parallel Join Algorithms for MapReduce environment.
There are the following techniques that are used to analyze massive amounts of data: MapReduce paradigm, parallel DBMSs, column-wise store, and various combinations of these approaches. We focus in a MapReduce environment. Unfortunately, join algorithms is not directly supported in MapReduce. The aim of this work is to generalize and compare existing equi-join algorithms with some optimization techniques.
Proceedings of the Institute for System Programming, vol. 23, 2012, pp. 285-306.
ISSN 2220-6426 (Online), ISSN 2079-8156 (Print).
DOI: 10.15514/ISPRAS-2012-23-17Full text of the paper in pdf Back to the contents of the volume