Understanding Data Survivability in Archival Storage Systems

Appeared in Proceedings of the 5th Annual International Systems and Storage Conference (SYSTOR 2012).

Abstract

Preserving data for a long period of time in the face of faults, large and small, is crucial for designing reliable archival storage systems. However, the survivability of data is different from the reliability of storage because typically, data are stored in more than one storage at a given moment. Previous studies of reliability ignore the former. We present a framework for relating data survivability and storage reliability, and use the framework to gauge the impact of rare but large-scale events on data survivability. We also present a method to track all copies of data and the condition of all the online and offline media, devices and systems on which they are stored uninterruptedly over the whole lifetime of the data. With this method, the survivability of the data can be closely monitored, and potential dangers can be handled in a timely manner. A better understanding of data survivability can be used in reducing unnecessary data replicas, thus reducing the cost.

Publication date:
June 2012

Authors:
Yan Li
Darrell D. E. Long
Ethan L. Miller

Projects:
Archival Storage
Reliable Storage

Available for download:

Full text:
Download as PDF

Bibtex entry

@inproceedings{li12-systor,
  author       = {Yan Li and Darrell D. E. Long and Ethan L. Miller},
  title        = {Understanding Data Survivability in Archival Storage Systems},
  booktitle = {Proceedings of the 5th Annual International Systems and Storage
Conference (SYSTOR 2012)},
  month        = jun,
  year         = {2012},
}
Last modified 13 Dec 2013