Proceedings of ISP RAS


Enabling Data Driven Projects for a Modern Enterprise

A.R. Topchyan (YSU, Yerevan, Armenia)

Abstract

With the growing volume of and demand for data, a major concern for an organization trying to implement data-driven projects is not only how to technically collect, cleanse, integrate, and access the data, but even more so how and why to use it. There is a lack of unification on a logical and technical level between data scientists, IT departments, and business departments, as it is very unclear where the data comes from, what it looks like, what it contains, and how to process it in the context of existing systems. In this paper we present a platform for data exploration and processing that enables data-driven projects. It does not require a complete organizational revamp, but provides a workflow and technical basis for such projects.

Keywords

data-driven projects, CRISP, Hadoop, data vault, sandbox, Mesos, Kafka

Edition

Proceedings of the Institute for System Programming, vol. 28, issue 3, 2016, pp. 209-230

ISSN 2220-6426 (Online), ISSN 2079-8156 (Print).

DOI: 10.15514/ISPRAS-2016-28(3)-13
