epiC: an Extensible and Scalable System for Processing Big DataepiC

2015-06-02 Vistors:217

 

作者: Dawei Jiang; Gang Chen;Beng Chin Ooi.
来源:IEEE VLDB     出版年: SEP. 2014

This paper presents epiC, an extensible system to tackle the Big Data’s data variety challenge. epiC introduces a general Actor-like concurrent programming model, independent of the data processing models, for specifying parallel omputations. Users process multi-structured datasets with appropriate epiC extensions, the implementation of a data processing model best suited for the data type and auxiliary code for mapping that data processing model into epiC’s concurrent programming model. Like Hadoop, programs written in this way can be automatically parallelized and the run- time system takes care of fault tolerance and inter-machine communications. We present the design and implementation of epiC’s concurrent programming model. We also present two customized data processing model, an optimized MapReduce extension and a relational model, on top of epiC. Experiments demonstrate the effectiveness and efficiency of our proposed epiC. This paper (supervisor: Prof. Chen Gang) obtained the Best Paper Award of IEEE VLDB 2014, which is the first time for Chinese scholars.

Contact US
Email : fit@zju.edu.cn
Tel:86-571-87951772
FAX : 0086-571-87951077
Addr:No. 38 ZheDa Road