in cooperation with

Communication Networks

IBPM: An Open-Source-Based Framework for InfiniBand Performance Monitoring Michael Hoefling1, Michael Menth1, Christian Kniep2, Marcus Camen2

Background: InfiniBand (IB)     

State-of-the art communication technology for interconnection in high-performance computing data centers Point-to-point bidirectional links High throughput (40 Gbit/s with QDR) Low latency Dynamic on-line network reconfiguration

Rate Measurement in IB Networks

IBPM: Demo Overview ibsim-Based Network Simulation  ibsim simulates an IB network  Simple topology changes possible (GUI)  ibsim limitations  No performance simulation possible  No data rate changes possible Real IB Network  Physical network  Allows performance measurements  GUI controlled traffic scenarios

Idea  Extract raw network information from IB network  Analyze output  Derive statistics about performance of the network Topology Extraction  Subnet discovery using ibnetdiscover  Produces human readable file of network topology  Process output to produce graphical representation of the network Remote Counter Readout  Each port has its own set of performance counters  Counters measure, e.g., transferred data, congestion, errors, link states changes

IBPM: Open-Source-Based InfiniBand Performance Monitoring Features  Automatic topology extraction and visualization  Visualization of traffic locality  Visualization of link utilization  Visualization of congestion  Visualization of port performance history Architecture

IBPM: Demo Scenarios Scenario 1: Topology Changes  Node and/or switch becomes unavailable  Connectivity state is represented in the topology map Scenario 2: Port Performance and Link Utilization  Nodes communicate with each other  Port performance accessible through simple point-and-click interface on a node or switch  Link utilization is visualized through utilization-based-coloring of the links in the performance map Scenario 3: Traffic Locality  Nodes use pre-defined traffic patterns  Traffic locality is visualized through locality-based-coloring of the switches in the locality map Michael Hoefling, Michael Menth, Christian Kniep, and Marcus Camen: "IBPM: An Open-Source-Based Framework for InfiniBand Performance Monitoring", in Proceedings of the 16th GI/ITG Conference on Measurement, Modeling, and Evaluation of Computer and Communication Systems (MMB) and Dependability and Fault Tolerance (DFT), March 2012, Kaiserslautern, Germany

1 2

These authors are with the University of Tuebingen, Tuebingen, Germany. These authors are with science+computing ag, Tuebingen, Germany.

University of Tuebingen · Sand 13 · 72076 Tübingen Phone: +49-7071-29-70507 [email protected] http://kn.inf.uni-tuebingen.de/staff/hoefling

IBPM: An Open-Source-Based Framework for InfiniBand ... - GitHub

Evaluation of Computer and Communication Systems (MMB) and. Dependability and ... 2 These authors are with science+computing ag, Tuebingen, Germany.

443KB Sizes 13 Downloads 343 Views

Recommend Documents

Communication Networks IBPM - GitHub
Evaluation of Computer and Communication Systems (MMB) and. Dependability and Fault ... 2 These authors are with science+computing ag, Tuebingen, Germany. ▻ State-of-the art communication technology for interconnection in high-performance ... Extra

An Integrated Security Framework For GOSS Power Grid ... - GitHub
Sep 24, 2014 - potential network failures (N-1) ... in one of these roles in order to ... Users can't find out about data/services they don't have access for ...

The Coco Framework - GitHub
Aug 10, 2017 - failure. In a consortium of banks, members could be large, global, systemically important financial institutions (GSIFIs). ... End users, such as a bank's customers, do not have an identity in the Coco network and cannot transact .....

Generic Load Regulation Framework for Erlang - GitHub
Erlang'10, September 30, 2010, Baltimore, Maryland, USA. Copyright c 2010 ACM ...... rate on a budget dual-core laptop was 500 requests/s. Using parallel.

Open Modeling Framework - GitHub
Prepared for the U.S. Department of Energy, Office of Electricity Delivery and Energy Reliability, under Contract ... (ORNL), and the National Renewable Energy.

Introduction to Framework One - GitHub
Introduction to Framework One [email protected] ... Event Management, Logging, Caching, . ... Extend framework.cfc in your Application.cfc. 3. Done. (or in the ... All controllers are passed the argument rc containing the request.context, and all v

An Architectural Framework for Interactive Music Systems
Software Architecture, Interactive Systems, Music soft- ... synthesis of data media of different nature. ... forms (e.g. Max/MSP [19] and Pure Data [24]), and oth-.

AN EVIDENCE FRAMEWORK FOR BAYESIAN ...
generalization, and achieve desirable recognition performance for unknown test speech. Under this framework, we develop an EM iterative procedure to ...

Floatworld : A Simple Artificial Life Framework for Simulated ... - GitHub
which virtual “creatures” compete for space and energy. We will ... the ability of evolution by natural selection to drive the increase in fitness of ..... of energies ϵ.

A Framework for Flexible and Scalable Replica-Exchange on ... - GitHub
a type of application with multiple scales of communication. ... Chemistry and Chemical Biology, Rutgers University, Piscataway,. NJ 08854. †Electrical .... ity built on the BigJob/SAGA distributed computing envi- ronment ... Fortunately, great pro

A textual graph-based modeling framework for education ... - GitHub
Item 1 - 6 - Proc-2013-1-3-15.pdf (A visual DPF tool implemented by our students) ... Elements (in the signature) having the same name are considered ... {LOCAL PATH ON YOUR COMPUTER}/no.hib.dpf.text.updatesite/target/site. 8. Open the ...

OpenCUDA+MPI - A Framework for Heterogeneous GP-GPU ... - GitHub
Kenny Ballou, Boise State University Department of Computer Science ... computing limit scientists and researchers in various ways. The goal of.

OpenCUDA+MPI - A Framework for Heterogeneous GP ... - GitHub
CUDA. Compute Unified Device Architecture. Established interface with (nVidia) GPU's. pyCUDA. Deferred CUDA kernel compilation. Familiarity. Ballou. OpenCUDA+MPI ...

Open Data publishing method framework - GitHub
Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. Build. Assess. Iterate. Certification. Publish!

MPI over InfiniBand: Early Experiences
Aug 18, 2003 - design the next generation clusters and servers with high .... Virtual Data Lanes (VLs) in additon to a Dedicated Management Lane (VL15). 5 ...

Develop for an international audience - GitHub
About me. • Developer and QA at www.transifex.com ... A Django based startup. • It's like a Github ... msginit -i app.pot -o locale/en/LC_MESSAGES/en.po -l en.

Reimagining IT for an omnichannel world - GitHub
Figure 1: IT budgets are growing rapidly, and most of the expansion is in new projects. 25%. CAGR. Ratio. 2.4x. Indexed IT hours by project type. Note: Run category includes IT services, maintenance, admin, PMO admin and training time; grow category

An Extended Framework of STRONG for Simulation ...
Feb 29, 2012 - Indeed, STRONG is an automated framework with provable .... Construct a local model rk(x) around the center point xk. Step 2. .... We call the sample size required for each iteration a sample size schedule, which refers to a.

Instrumentino: An open-source modular Python framework for ...
Official Full-Text Paper (PDF): Instrumentino: An open-source modular ... 1. Introduction. In the process of scientific research, many laboratories around ..... [18] N. Barroca, et al., Wireless sensor networks for temperature and .... The communicat

An Event-based Framework for Characterizing the ...
for evolving networks, based on our framework. Categories and Subject Descriptors: H.2.8 Database. Management: Database Applications - Data Mining.

Zemberek, an open source NLP framework for Turkic ...
source, platform independent NLP framework not only for Turkish but also all Turkic languages. In. This paper .... There is no agreed standard naming scheme defined for. Turkish yet. Most suffixes contain a production ..... environment: AMD Athlon 64

An Empirical Framework for Automatically Selecting the Best Bayesian ...
Keywords: Bayesian networks; Data mining; Classifi- cation; Search ... In deciding which classifier will work best for a given dataset there .... The software used to ...

Reference Framework for Handling Concept Drift: An ...
In predictive analytics, machine learning and data mining the phenomenon ...... [13] A. Bifet, R. Gavalda, Learning from time-changing data with adaptive.

An island-model framework for evolving neuro ...
showing that the aforementioned framework can parallelise the controller design .... will outline ongoing work into the application of migration to more complex ..... http://www.cnes.fr/web/5719-msl-09-at-a-glance.php. [28] D. P. Miller and T. L. ...