The Zebra striped network file system
- 1 December 1993
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGOPS Operating Systems Review
- Vol. 27 (5), 29-43
- https://doi.org/10.1145/173668.168622
Abstract
Zebra is a network file system that increases throughput by striping file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an approach similar to a log-structured file system. This provides high performance for writes of small files as well as for reads and writes of large files. Zebra also writes parity information in each stripe in the style of RAID disk arrays; this increases storage costs slightly but allows the system to continue operation even while a single storage server is unavailable. A prototype implementation of Zebra, built in the Sprite operating system, provides 4--5 times the throughput of the standard Sprite file system or NFS for large files and a 20%-3x improvement for writing small files.Keywords
This publication has 19 references indexed in Scilit:
- Bridge: a high performance file system for parallel processorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- The TickerTAIP parallel RAID architecturePublished by Association for Computing Machinery (ACM) ,1993
- Non-volatile memory for fast, reliable file systemsPublished by Association for Computing Machinery (ACM) ,1992
- Replication in the harp file systemPublished by Association for Computing Machinery (ACM) ,1991
- Measurements of a distributed file systemACM SIGOPS Operating Systems Review, 1991
- Maximizing performance in a striped disk arrayPublished by Association for Computing Machinery (ACM) ,1990
- Scale and performance in a distributed file systemACM Transactions on Computer Systems, 1988
- Reimplementing the Cedar file system using logging and group commitPublished by Association for Computing Machinery (ACM) ,1987
- GrapevineCommunications of the ACM, 1982
- Concurrency Control in Distributed Database SystemsACM Computing Surveys, 1981