Sysman: A Virtual File System for Cluster System Management

Sysman is a Linux system management infrastructure for HPC clusters similar to the /proc file system and provides a familiar yet powerful interface for server and storage system management. In the Sysman virtual file system each managed entity (e.g., power on/off button of a server), is represented by a file. Reading from Sysman files obtains active information from devices being managed by Sysman (e.g., power state of storage systems, BladeCenter chassis temperature, CPU type and boot order information of servers). Writing to Sysman files initiates tasks such as turning on/off server blades, discovering new devices, and changing the boot order of a blade. The combination of the file access semantics and existing Unix utilities such as grep and find that operate on multiple files allow the creation of very short but powerful system management procedures for large HPC clusters. Sysman is an extendable framework and has a simple interface through which new system management procedures can be easily added.

By: Mohammad Banikazemi; David Daly; Bulent Abali

Published in: RC24273 in 2007

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

RC24273.pdf

Questions about this service can be mailed to reports@us.ibm.com .