Compression of a Set of Strings

Abstract
Many compression methods use a dictionary, and some of them store the dictionary in the compressed message. In such cases, better dictionary compression can improve the performance of the compression method as a whole. We focus on compressing dictionaries of words or syllables. Dictionary compression is often based on a tree representation of the dictionary, and we assume that a suitable encoding of the tree can save considerable space. We therefore concentrate on minimizing the information stored for each tree node. For each node we store only the number of children, encoded with the Elias gamma code; the difference between the node's character and that of its left sibling, encoded with the Elias delta code; and a single bit indicating whether the node represents a dictionary item. The difference from the left sibling is omitted for the root, and the dictionary-item bit is omitted for leaves.
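The per-node layout described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions the abstract does not specify: the tree is a plain character trie written in pre-order, children are sorted by code point, the first child's difference is taken from 0, the child count is stored as count + 1 (the Elias gamma code cannot represent 0), every leaf is taken to be a dictionary item, the set is non-empty, and no character has code point 0. All function names are hypothetical.

```python
def elias_gamma(n):
    """Elias gamma code of an integer n >= 1, as a string of '0'/'1'."""
    b = bin(n)[2:]
    return "0" * (len(b) - 1) + b


def elias_delta(n):
    """Elias delta code of n >= 1: gamma(bit length), then n without its top bit."""
    b = bin(n)[2:]
    return elias_gamma(len(b)) + b[1:]


def read_gamma(bits, pos):
    """Decode one gamma code starting at pos; return (value, new position)."""
    zeros = 0
    while bits[pos] == "0":
        zeros += 1
        pos += 1
    end = pos + zeros + 1
    return int(bits[pos:end], 2), end


def read_delta(bits, pos):
    """Decode one delta code starting at pos; return (value, new position)."""
    length, pos = read_gamma(bits, pos)
    end = pos + length - 1
    return int("1" + bits[pos:end], 2), end


def build_trie(words):
    """Build a character trie; each node is {'kids': {char: node}, 'word': bool}."""
    root = {"kids": {}, "word": False}
    for w in words:
        node = root
        for ch in w:
            node = node["kids"].setdefault(ch, {"kids": {}, "word": False})
        node["word"] = True
    return root


def encode_node(node, out, is_root=True, diff=0):
    """Append the bit fields of node and its subtree to out, in pre-order."""
    if not is_root:
        out.append(elias_delta(diff))        # difference from left sibling (root: omitted)
    kids = sorted(node["kids"].items())
    out.append(elias_gamma(len(kids) + 1))   # assumption: count + 1, since gamma needs n >= 1
    if kids:
        out.append("1" if node["word"] else "0")  # item bit (leaves: omitted)
    prev = 0                                 # assumption: first child differs from 0
    for ch, child in kids:
        encode_node(child, out, False, ord(ch) - prev)
        prev = ord(ch)


def encode_words(words):
    """Encode a non-empty set of strings as a bit string."""
    out = []
    encode_node(build_trie(words), out)
    return "".join(out)


def decode_words(bits):
    """Rebuild the list of strings (in sorted pre-order) from the bit string."""
    words = []

    def walk(pos, prefix):
        count, pos = read_gamma(bits, pos)
        count -= 1
        if count == 0:                       # leaf: always a dictionary item
            words.append(prefix)
            return pos
        if bits[pos] == "1":
            words.append(prefix)
        pos += 1
        prev = 0
        for _ in range(count):
            diff, pos = read_delta(bits, pos)
            prev += diff
            pos = walk(pos, prefix + chr(prev))
        return pos

    walk(0, "")
    return words


if __name__ == "__main__":
    sample = {"car", "cat", "do", "dog"}
    encoded = encode_words(sample)
    print(len(encoded), "bits")
    print(decode_words(encoded))
```

In this sketch the child count is written before the item bit, so the decoder already knows whether a node is a leaf when it decides whether a bit follows. Sorting the children keeps every sibling difference positive, which the Elias delta code requires.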
