Proceedings of ISP RAS


Comparative Study Parallel Join Algorithms for MapReduce environment.

A. Pigul.

Abstract

There are the following techniques that are used to analyze massive amounts of data: MapReduce paradigm, parallel DBMSs, column-wise store, and various combinations of these approaches. We focus in a MapReduce environment. Unfortunately, join algorithms is not directly supported in MapReduce. The aim of this work is to generalize and compare existing equi-join algorithms with some optimization techniques.

Keywords

parallel join algorithms, MapReduce, optimization

Edition

Proceedings of the Institute for System Programming, vol. 23, 2012, pp. 285-306.

ISSN 2220-6426 (Online), ISSN 2079-8156 (Print).

DOI: 10.15514/ISPRAS-2012-23-17

Full text of the paper in pdf Back to the contents of the volume