Lustre Monitoring Tool Version 3

Jim Garlick

Livermore Computing, Lawrence Livermore National Laboratory
LLNL-PRES-459655-DRAFT

NERSC, Oct. 28, 2010 – p.1/11

Background

Lustre provides some very nice metrics in /proc, but no mechanism to aggregate metrics across a file system.
LMT version 1 was a Python application for visualizing Lustre metrics, mainly used in-house at LLNL.
LMT version 2, a rewrite in C/Java, added MySQL for capturing historical data, plotting capability in the GUI client, and the ltop text utility.
MySQL history is cool! [Uselton 2009 CUG]
ltop in particular has become a useful sysadmin tool.

LMT Overview

The Lustre Monitoring Tool uses cerebro and MySQL for data collection and storage.
Data can be mined directly from MySQL or visualized with LMT clients (ltop, lwatch).

[Figure: architecture diagram. Node labels: Desktop or ??, Mgmt Node, MDS, OSS, LNET router. Package labels: lmt-gui, lmt-server, lmt-server-agents. Link labels: Cerebro multicast, MySQL API. Storage: MySQL.]

LMT Version 3 uses the same architecture as Version 2.

LMT Version 2 Problems

Based on LLNL experience and the lmt-discuss mailing list:
Lustre config must be expressed in an odd language, then pre-loaded into MySQL.
Nothing functions until both MySQL and cerebro are up.
Poor error handling and logging make debugging difficult.
There are two overlapping config files in odd locations.
The cerebro module code is prototype quality and brittle.
Result: frustration for new users and the maintainer!

Improved in Version 3

Lustre config is automatically determined on the fly.
ltop now functions as soon as cerebro is up.
MySQL is actually optional now.
Error handling and logging are rewritten/improved.
Cerebro module code has been refactored/rewritten.
There is a single new config file: /etc/lmt/lmt.conf
More data is collected/shown in ltop [demo later].

Unchanged in Version 3

The architecture is the same (except ltop).
The database schema is unchanged.
The lwatch/lstat Java clients are unchanged (moved to a separate lmt-gui package).
Cron aggregation scripts that convert high → low-res MySQL sample data still exist (kludge!).
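
The high → low-res conversion can be pictured as simple downsampling. The following is a minimal sketch, assuming samples are (timestamp, value) pairs that get averaged into fixed-width time buckets; the function name, the bucket width, and the averaging rule are illustrative assumptions, not a description of the actual LMT cron scripts or schema.

```python
from collections import defaultdict


def aggregate(samples, bucket_seconds=3600):
    """Average (timestamp, value) samples into fixed-width time buckets.

    Hypothetical helper for illustration only; not part of LMT.
    """
    buckets = defaultdict(list)
    for timestamp, value in samples:
        # Align each sample to the start of the bucket it falls in.
        bucket_start = timestamp - (timestamp % bucket_seconds)
        buckets[bucket_start].append(value)
    # One averaged (bucket_start, mean) row per bucket, in time order.
    return [
        (start, sum(values) / len(values))
        for start, values in sorted(buckets.items())
    ]


if __name__ == "__main__":
    high_res = [(0, 10.0), (5, 20.0), (3600, 30.0), (3605, 50.0)]
    for start, mean in aggregate(high_res):
        print(start, mean)
```

Each low-res row replaces many high-res rows, which is why the aggregated data is cheaper to plot over long time ranges.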

LMT Version 3 Setup

1. Install packages.
2. Configure cerebro and restart cerebrod on LMT and Lustre servers. (Test with ltop.)
3. Run mysql_secure_installation or equivalent, then /usr/share/lmt/mkusers.sql.
4. Set up /etc/lmt/lmt.conf.
5. Create a database for each file system to be monitored: lmtinit -a fsname. (Test with lwatch.)
6. Add a cron job for the aggregation scripts.

Lua-based lmt.conf

    lmt_cbr_debug = 0
    lmt_proto_debug = 0
    lmt_db_debug = 0
    lmt_db_host = nil
    lmt_db_port = 0
    lmt_db_rouser = "lwatchclient"
    lmt_db_ropasswd = nil
    lmt_db_rwuser = "lwatchadmin"
    f = io.open("/etc/lmt/rwpasswd")
    if (f) then
        lmt_db_rwpasswd = f:read("*all")
        f:close()
    else
        lmt_db_rwpasswd = nil
    end
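
Because the config file is ordinary Lua, a setting can be computed rather than hard-coded. The last lines of the file read the read-write password from /etc/lmt/rwpasswd if that file can be opened and fall back to nil otherwise. The same logic is sketched below in Python purely to make the control flow explicit; the function name read_rw_password is hypothetical, and LMT itself evaluates the Lua file, not this code.

```python
import os
import tempfile


def read_rw_password(path="/etc/lmt/rwpasswd"):
    """Return the whole file contents, or None if the file cannot be opened.

    Mirrors the Lua fallback above. Like Lua's f:read("*all"), read()
    returns the contents unchanged, including any trailing newline.
    """
    try:
        with open(path) as f:
            return f.read()
    except OSError:
        # Missing or unreadable file: same outcome as the Lua 'else' branch.
        return None


if __name__ == "__main__":
    # Demonstrate both branches with a temporary file (hypothetical contents).
    with tempfile.NamedTemporaryFile("w", delete=False) as tmp:
        tmp.write("example-password\n")
    print(repr(read_rw_password(tmp.name)))
    os.remove(tmp.name)
    print(repr(read_rw_password(tmp.name)))
```

Keeping the password in a separate file lets lmt.conf stay world-readable while the password file carries tighter permissions.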

Version 3 Metric Protocol Changes

mds.v2 → mdt.v1: MDS + multiple MDT data in one metric. Dropped 60 seldom-used mdops (81 − 60 = 21).
oss.v1 + ost.v1 → ost.v2: OSS + multiple OST data in one metric. Added IOPS, lock count, lock grant/cancel rate, (re-)connects, recovery state.
osc.v1: OST state from the MDS point of view, definitive OST list.
router.v1: No change.
New data is only displayed in ltop, not stored in MySQL.
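
The idea of carrying one server plus several of its targets in a single metric can be sketched as a nested record. This is a minimal illustration only: the class names, field names, and sample values are assumptions chosen for the example and do not reflect LMT's actual wire format.

```python
from dataclasses import dataclass, field


@dataclass
class OstSample:
    """Per-OST counters (hypothetical fields)."""
    name: str
    read_bytes: int
    write_bytes: int


@dataclass
class OstV2Metric:
    """Hypothetical combined record: one OSS and all of its OSTs."""
    oss_name: str
    osts: list = field(default_factory=list)

    def total_read_bytes(self):
        # Server-wide totals can be derived from the per-OST entries.
        return sum(ost.read_bytes for ost in self.osts)


def combine(oss_name, ost_samples):
    """Bundle separate per-OST samples into one per-OSS metric."""
    return OstV2Metric(oss_name=oss_name, osts=list(ost_samples))


if __name__ == "__main__":
    metric = combine(
        "oss1",
        [OstSample("fs-OST0000", 100, 10), OstSample("fs-OST0001", 50, 5)],
    )
    print(metric.oss_name, len(metric.osts), metric.total_read_bytes())
```

Bundling the records this way means a consumer sees a consistent snapshot of a server and its targets from a single message instead of correlating two separate metrics.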

Future Work

LMT schema needs a revision to accommodate new data.
lwatch should display new data.

Add support for the Lustre 2.x ioctl interface.
Add support for Lustre 2.x ZFS servers.
New ltop screens for routers, MDSs, etc.
ltop should also support direct /proc, MySQL.

Use lnet/ptlrpc to gather data, for tighter integration with Lustre.
What other Lustre metrics should be monitored/visualized?

LMT Support and Downloads

Google code project: http://code.google.com/p/lmt
Email support list: http://groups.google.com/lmt-discuss
