No upcoming events at this time.
Center for Research in Storage Systems
Deduplicating in-line data on primary storage is hampered by the disk bottleneck problem, an issue which results from the need to keep an index mapping portions of data to hash values in memory in order to detect duplicate data without paying the performance penalty of disk paging. The index size is proportional to the volume of unique data, so placing the entire index into RAM is not cost effective with a deduplication ratio below 45%.
HANDS is a framework that dynamically pre-fetches fingerprints from disk into memory cache according to working sets statistically derived from access patterns. We use a simple neighborhood grouping as our statistical technique to demonstrate the effectiveness of our approach. HANDS is modular and requires only spatio-temporal data, making it suitable for a wide range of storage systems without the need to modify host file systems. HANDS reduces the amount of in-memory index storage required by up to 99% while still achieving between 30% and 90% of the deduplication a full memory-resident index provides, making primary deduplication cost effective in workloads with deduplication rates as low as 8%.
The Center for Research in Storage Systems (CRSS) is a partnership between academia and industry to explore and develop new technologies and techniques to improve the manageability, scalability, security, reliability, longevity, and performance of storage systems. CRSS researchers conduct high-quality, industrially-relevant fundamental research, often in collaboration with CRSS industrial members. CRSS also facilitates collaboration in research and education, and provides pathways to simplify direct transfer of university developed ideas, research results, and technology to its industrial sponsors, helping them to improve their competitive posture in the global marketplace. CRSS also focuses on providing talented graduate and undergraduate students the technical background and industry interaction they will use as they go on to become the next generation of scientists and engineers.
CRSS conducts research in a wide range of storage-related fields, including archival storage, scalable distributed indexing and non-hierarchical file systems, large-scale distributed storage systems, file systems for next-generation storage devices, and data deduplication. We also have particular focus in cross-cutting issues such as security and reliability in file and storage systems. While CRSS was established in 2013, it inherits a long history of advanced storage research from the Storage Systems Research Center in the Jack Baskin School of Engineering. We encourage you to learn about SSRC projects that predate the formation of CRSS.
Please see the CRSS fact sheet for more information on the Center and its organization. You're also encouraged to browse the Web site for more information about research at CRSS. Please note that some parts of the site are restricted to CRSS sponsors.
If you're interested in becoming an industrial member of CRSS, please contact our Executive Director, Dr. Andy Hospodor or one of the CRSS faculty, Prof. Ethan Miller and Prof. Darrell Long. Prospective students interesting in working in the CRSS can contact one of the faculty, and are encouraged to communicate with current students to learn about their experiences working as part of CRSS.
Center for Research in Storage Systems
Engineering 2 Building CRSS Labs:
Last modified 18 Nov 2014
|© 2014 CRSS & UC Santa Cruz||Home | Research | People | Publications | Seminars | Sponsors|
|Site powered by Django|
|This material is based on work supported in part by the National Science Foundation under grant IIP-1266400. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.|