The Burrows-Wheeler Transform for Block Sorting Text Compression: Principles and Improvements

1 January 1996

journal article
research article
Published by Oxford University Press (OUP) in The Computer Journal

Vol. 39 (9), 731-740
https://doi.org/10.1093/comjnl/39.9.731

Abstract

A recent development in text compression is a ‘block sorting’ algorithm which permutes the input text according to a special sort procedure and then processes the permuted text with Move-To-Front (MTF) and a final statistical compressor. The technique combines good speed with excellent compression performance. This paper investigates the fundamental operation of the algorithm and presents some improvements based on that analysis. Although block sorting is clearly related to previous compression techniques, it appears that it is best described by techniques derived from work by Shannon on the prediction and entropy of English text. A simple model is developed which relates the compression to the proportion of zeros after the MTF stage.

Keywords

BURROWS WHEELER TRANSFORM

Cited by 82 articles