Does RDMA-based Enhanced Hadoop MapReduce Need a New Performance Model? Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat S. Islam, and Dhabaleswar K. (DK) Panda The Ohio State University, {rahmanmd,luxi,islamn,panda}@cse.ohio-state.edu

Recent studies [17, 12] show that leveraging benefits of high performance interconnects like InfiniBand, MapReduce performance in terms of job execution time can be greatly enhanced by using additional features like in-memory merge, pipelined merge and reduce, and prefetching and caching of map outputs. In this paper, we validate that it is time to have a new performance model for the RDMA-based design of MapReduce over high performance interconnects. Our initial results derived from the proposed analytical model matches the experimental results within a 3-5% range.

1 Motivation Authors in [17, 12] present enhanced designs and algorithms for the RDMA-based MapReduce framework. With these design changes, MapReduce job execution can be greatly accelerated by leveraging the benefits of high-performance interconnects. The high performance design of Hadoop (Hadoop-RDMA) [3] also shows significant performance benefits achievable through RDMA-capable interconnects using enhanced designs of various components (HDFS [6], MapReduce [12], RPC [9]) inside Hadoop. On the other hand, much performance modeling research [4, 8, 2, 1, 13, 5, 7, 10, 11] has been carried out to deeply analyze the default MapReduce framework. But, because of the inherent architectural changes, these models are not appropriate for performance prediction of RDMA-based enhanced MapReduce. For example, Table 1 captures the performance evaluation for the Sort benchmark using default Hadoop [16] and enhanced MapReduce with RDMA [12] and compares these with the performance model in [4]. This clearly illustrates the necessity of a new model for the enhanced design of MapReduce. c 2013 by the Association for Computing Machinery, Inc. Copyright (ACM). Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author. SoCC’13, 1–3 Oct. 2013, Santa Clara, California, USA. ACM 978-1-4503-2428-1. http://dx.doi.org/10.1145/2523616.2525953

Benchmark Sort (20GB)

Hadoop 707 sec

Model [4] 691.78 sec

RDMA [12] 324 sec

Table 1: Comparison using Sort

2 Our Approach For the RDMA-based enhanced design of MapReduce, all of the new features are added inside the ReduceTask. Thus, to predict the performance correctly for this design, we approach to model the performance of the ReduceTask from scratch. In the default MapReduce framework, execution time for a single ReduceTask, tRT is calculated from the execution times of different phases in the ReduceTask. tRT = tshu f f le + tmerge + treduce (1) For the RDMA-based design, on the other hand, tRT , will not be as simple as the default one. Because of the fully overlapping feature among these three phases, tRT can be rewritten as: tRT = max(tshu f f le ,tmerge ) + α ∗ treduce (2) α represents the fraction of the total data that resides in memory yet to be reduced, while both shuffle and merge phases have completed their execution. Also, because of the architectural changes in the enhanced design, all of the parameters tshu f f le , tmerge , and treduce need to be re-modeled to incorporate all of the new design enhancements.

3 Contribution 1,200 Job Executiion Time (sec)

Abstract

1,000

Experimental Model

800 600 400 200 0

8

16

32

64

128

Cluster Size

Figure 1: Model validation in Stampede Cluster

We validate our model for enhanced MapReduce using terasort [15] on Stampede [14]. We vary the cluster size from 8 to 128, while increasing the data size exponentially from 40 GB to 640 GB. As shown in Figure 1, we observe that the model successfully validates the experimental results with a difference of 3-5% range.

References [1] T. Condie, N. Conway, P. Alvaro, J. M. Hellerstein, K. Elmeleegy, and R. Sears. MapReduce Online. In Proceedings of the 7th USENIX conference on Networked systems design and implementation, NSDI’10, pages 21–21, Berkeley, CA, USA, 2010. USENIX Association. [2] J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM, 51(1):107–113, Jan. 2008. [3] Hadoop-RDMA: High-Performance Design of Hadoop over RDMA-enabled Interconnects. http: //hadoop-rdma.cse.ohio-state.edu/. [4] H. Herodotou. Hadoop Performance Models. Technical Report CS-2011-05, Computer Science Department, Duke University. [5] Y. Huai, R. Lee, S. Zhang, C. H. Xia, and X. Zhang. DOT: A Matrix Model for Analyzing, Optimizing and Deploying Software for Big Data Analytics in Distributed Systems. In Proceedings of the 2nd ACM Symposium on Cloud Computing, SOCC ’11, pages 4:1–4:14, New York, NY, USA, 2011. ACM. [6] N. S. Islam, M. W. Rahman, J. Jose, R. Rajachandrasekar, H. Wang, H. Subramoni, C. Murthy, and D. K. Panda. High Performance RDMA-based Design of HDFS over InfiniBand. In The International Conference for High Performance Computing, Networking, Storage and Analysis (SC), November 2012. [7] H. Karloff, S. Suri, and S. Vassilvitskii. A Model of Computation for MapReduce. In Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’10, pages 938– 948, Philadelphia, PA, USA, 2010. Society for Industrial and Applied Mathematics. [8] X. Lin, Z. Meng, C. Xu, and M. Wang. A Practical Performance Model for Hadoop MapReduce. In Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on, pages 231–239, 2012. [9] X. Lu, N. S. Islam, M. W. Rahman, J. Jose, H. Subramoni, H. Wang, and D. K. Panda. HighPerformance Design of Hadoop RPC with RDMA over InfiniBand. In IEEE 42nd International Conference on Parallel Processing (ICPP), 2013. [10] K. Morton, M. Balazinska, and D. Grossman. ParaTimer: A Progress Indicator for MapReduce

DAGs. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, SIGMOD ’10, pages 507–518, New York, NY, USA, 2010. ACM. [11] Mumak: Map-Reduce Simulator. https://issues. apache.org/jira/browse/MAPREDUCE-728. [12] M. W. Rahman, N. S. Islam, X. Lu, J. Jose, H. Subramoni, H. Wang, and D. K. Panda. HighPerformance RDMA-based Design of Hadoop MapReduce over InfiniBand. In International Workshop on High Performance Data Intensive Computing(HPDIC), in conjunction with IEEE International Parallel and Distributed Processing Symposium (IPDPS), May 2013. [13] N. Rapolu, K. Kambatla, S. Jagannathan, and A. Grama. TransMR: Data-centric Programming beyond Data Parallelism. In Proceedings of the 3rd USENIX conference on Hot topics in cloud computing, HotCloud’11, pages 19–19, Berkeley, CA, USA, 2011. USENIX Association. [14] Stampede at Texas Advanced Computing Center. http://www.tacc.utexas.edu/resources/hpc/ stampede. [15] TeraSort. http://hadoop.apache.org/docs/r0. 20.0/api/org/apache/hadoop/examples/terasort/ TeraSort.html. [16] The Apache Software Foundation. The Apache Hadoop Project. http://hadoop.apache.org/. [17] Y. Wang, X. Que, W. Yu, D. Goldenberg, and D. Sehgal. Hadoop Acceleration through Network Levitated Merge. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC ’11, 2011.

Does RDMA-based Enhanced Hadoop MapReduce ...

The Ohio State University, {rahmanmd,luxi,islamn,panda}@cse.ohio-state.edu. Abstract. Recent studies [17, 12] show that leveraging bene- fits of high performance interconnects like InfiniBand,. MapReduce performance in terms of job execution time can be greatly enhanced by using additional features like in-memory ...

50KB Sizes 5 Downloads 163 Views

Recommend Documents

Intro Hadoop and MapReduce Certificate.pdf
There was a problem loading more pages. Intro Hadoop and MapReduce Certificate.pdf. Intro Hadoop and MapReduce Certificate.pdf. Open. Extract. Open with.

Writing An Hadoop MapReduce Program In Python - Michael G.pdf ...
Writing An Hadoop MapReduce Program In Python - Michael G.pdf. Writing An Hadoop MapReduce Program In Python - Michael G.pdf. Open. Extract.

SIGMETRICS Tutorial: MapReduce
Jun 19, 2009 - A programming model for large-scale distributed data ..... Could be hard to debug in .... Reading from local disk is much faster and cheaper.

Cloud MapReduce: a MapReduce Implementation on ...
a large-scale system design and implementation if we build on top of it. Unfortunately .... The theorem states that, of the three properties of shared-data systems ...

Cloud MapReduce: a MapReduce Implementation on ...
The theorem states that, of the three properties of shared-data systems – data ...... then copies over the results to the hard disks on the destination node when ...

MapReduce/Bigtable for Distributed Optimization
our global model parameters, we use a distributed data-store known as Bigtable ... allows backup workers to duplicate effort without producing erroneous output.

Incoop: MapReduce for Incremental Computations
Max Planck Institute for Software Systems (MPI-SWS) and ... The second approach would be to develop systems that ... plexity and development effort low. ...... Acar, G. E. Blelloch, and R. Harper. Adaptive functional programming. ACM Trans.

Does Z′ equal 1 or 2? Enhanced powder NMR ...
Jun 2, 2017 - ... University Library on 11/07/2017 16:03:31. View Article Online · View Journal | View Issue ...... 2007, 9, 360. 50 R. K. Harris, S. A. Joyce, C. J. ...

Understanding MapReduce-based Next-Generation ... - GitHub
solutions leveraging algorithmic advances, tools and services, and .... Figure 1: PMR architecture and the workflow for a MapReduce task: The compute and data units are basic blocks of scheduling in Pilot abstractions ... network resources.

Distributed Programming with MapReduce
Jun 4, 2009 - a programming system for large-scale data processing ... save word_count to persistent storage … → will take .... locality. ○ backup tasks ...

Distributed Computing: MapReduce and Beyond!
14 Jan 2008 - 7. Example Distributed System: Google File System. • GFS is a distributed file system written at Google for Google's needs. (lots of data, lots of cheap computers, need for speed). • We use it to store the data from our web crawl, b

Combining MapReduce and Virtualization on ... - Semantic Scholar
Feb 4, 2009 - Keywords-Cloud computing; virtualization; mapreduce; bioinformatics. .... National Center for Biotechnology Information. The parallelization ...

Does demolitionorrefurbishmentofoldandinefficienthomeshelpto ...
... 36 (2008) 4487–4501 4489. Page 3 of 15. Does demolitionorrefurbishmentofoldandinefficienthom ... reaseourenvironmental,socialandeconomicviability.pdf.

MapReduce: Simplified Data Processing on Large Clusters - flowdas
mentation of the MapReduce interface tailored towards ... Reverse Web-Link Graph: The map function outputs. (target, source) pairs for each link to a target. URL found in a page named ..... GFS to open the set of 1000 input files and to get the.

MapReduce: Simplified Data Processing on ... - Research at Google
For example, during one MapReduce operation, network maintenance on a running ..... struction of highly-available networked services. Like. MapReduce ...

Privacy-enhanced display device
Jan 15, 2009 - 5,463,428 A 10/1995 Ligtg? et al'. Darrell L. Lelgh ... 5,963,371 A 10/1999 Needham et al. .... This is a particular problem for laptop computers.

Hydrogen Enhanced Combustion
Page 2 ...... [12]Emissions reductions using hydrogen from plasmatron fuel converters, L. Bromberg, D.R. Cohn, K. Hadidi, J.B. Heywood and. A. Rabinovich, MIT.

Data-Intensive Text Processing with MapReduce
Apr 11, 2010 - 3.1.1 Combiners and In-Mapper Combining. 41 ..... erational, the LHC will produce roughly 15 petabytes of data a year.3 ...... ware development process, the same is also true of algorithms: increasing the degree .... Finally, the compu

Online Load Balancing for MapReduce with Skewed ...
strategy is a constrained version of online minimum makespan and, in the ... server clusters, offering a highly flexible, scalable, and fault tolerant solution for ...

A Scalable MapReduce Framework for All-Pair ... - Research at Google
stage computes the similarity exactly for all candidate pairs. The V-SMART-Join ... 1. INTRODUCTION. The recent proliferation of social networks, mobile appli- ...... [12] eHarmony Dating Site. http://www.eharmony.com. [13] T. Elsayed, J. Lin, ...

(MVT) Enhanced Learning
macro-level representations (e.g., graph of current vs time) (e.g., Activity 1 in ... Students' moment-by-moment actions and input as required by the software can be ..... naïve scientism believing their model represents real phenomena that .... Cre

(MVT) Enhanced Learning
Designing and Assessing Modeling and Visualization Technologies ... The new reform agenda in US education (benchmarks for Scientific Literacy, 1993; National Standards .... education system, that impact student learning significantly.