Proceedings of ISP RAS

Comparative Study Parallel Join Algorithms for MapReduce environment.

A. Pigul.


Several techniques are used to analyze massive amounts of data: the MapReduce paradigm, parallel DBMSs, column-wise stores, and various combinations of these approaches. We focus on the MapReduce environment. Unfortunately, join algorithms are not directly supported in MapReduce. The aim of this work is to generalize and compare existing equi-join algorithms, together with some optimization techniques.


parallel join algorithms, MapReduce, optimization


Proceedings of the Institute for System Programming, vol. 23, 2012, pp. 285-306.

ISSN 2220-6426 (Online), ISSN 2079-8156 (Print).

DOI: 10.15514/ISPRAS-2012-23-17
