Reliability of Data Storage Systems under Network Rebuild Bandwidth Constraints

To improve the reliability of data storage systems, certain data placement schemes spread replicas across several nodes. This enables parallelizing the rebuild process which in turn results in reducing the rebuild times. However, the underlying assumption is that the parallel rebuild process is facilitated by sufficient availability of network bandwidth to transfer data across nodes. In a large-scale data storage system where the network bandwidth for rebuild is constrained, such placement schemes will not be as effective. In this paper, it is shown through analysis and simulation how the spread of replicas across nodes affects system reliability under a system network bandwidth constraint. Efficient placement schemes that can achieve high reliability in the presence of bandwidth constraints are proposed. Furthermore, in a dynamically changing storage system, in which the number of nodes and the network rebuild bandwidth can change over time, the data placement can be accordingly adapted to maintain the highest level of reliability.

An updated version of this report has been published in: Proc. 2012 IEEE 20th Int'l Symp. on Modelling, Analysis, and Simulation of Computer and Communication Systems "MASCOTS," Washington, DC (IEEE, August 2012) 189-197.

By: V. Venkatesan, I. Iliadis, R. Haas

Published in: RZ3821 in 2012

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

rz3821.pdf

Questions about this service can be mailed to reports@us.ibm.com .