Murdoch University Research Repository


An approach to phrase selection for offline data compression

Turpin, A. and Smyth, W.F. (2002) An approach to phrase selection for offline data compression. In: 25th Australasian Computer Science Conference, Jan/Feb 2002, Monash University, Melbourne



Several recently published offline data compression schemes expend large amounts of computing resources when encoding a file but decode it quickly. These compressors work by identifying phrases in the input data and storing the data as a series of pointers to these phrases. This paper explores the use of an algorithm that computes all repeating substrings within a string for phrase selection in an offline data compressor. Using our approach, we obtain compression similar to that of the best known offline compressors on genetic data, but poor results on general text. It seems, however, that an alternative approach based on selecting repeating substrings is feasible.
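The general idea of phrase-based offline compression can be illustrated with a minimal Python sketch. It is not the algorithm from the paper: it assumes a naive, length-bounded enumeration of repeating substrings in place of the all-repeats computation the abstract mentions, and a rough greedy saving heuristic for choosing phrases. The names `repeating_substrings`, `encode`, and `decode`, the `min_len` and `max_len` parameters, and the token format are all hypothetical.

```python
from collections import defaultdict


def repeating_substrings(text, min_len=2, max_len=8):
    """Map each substring occurring at least twice to its sorted start positions.

    Naive bounded scan, used here only for illustration.
    """
    positions = defaultdict(list)
    for length in range(min_len, max_len + 1):
        for start in range(len(text) - length + 1):
            positions[text[start:start + length]].append(start)
    return {s: p for s, p in positions.items() if len(p) >= 2}


def encode(text, min_len=2, max_len=8):
    """Return (phrases, tokens); a token is a literal character or a phrase index."""
    candidates = repeating_substrings(text, min_len, max_len)

    def saving(item):
        # Assumed cost model: the phrase is stored once and each
        # occurrence costs one pointer. Overlaps are ignored here.
        phrase, starts = item
        return (len(starts) - 1) * len(phrase) - len(starts)

    covered = [False] * len(text)
    phrase_at = {}  # start position -> index into phrases
    phrases = []
    for phrase, starts in sorted(candidates.items(), key=saving, reverse=True):
        length = len(phrase)
        usable, next_free = [], 0
        for s in starts:
            # Keep occurrences that overlap neither each other nor earlier phrases.
            if s >= next_free and not any(covered[s:s + length]):
                usable.append(s)
                next_free = s + length
        if len(usable) < 2:
            continue
        phrases.append(phrase)
        for s in usable:
            phrase_at[s] = len(phrases) - 1
            covered[s:s + length] = [True] * length

    tokens, i = [], 0
    while i < len(text):
        if i in phrase_at:
            tokens.append(phrase_at[i])  # pointer to a phrase
            i += len(phrases[phrase_at[i]])
        else:
            tokens.append(text[i])  # literal character
            i += 1
    return phrases, tokens


def decode(phrases, tokens):
    """Decoding is a single pass of table lookups, hence fast."""
    return "".join(phrases[t] if isinstance(t, int) else t for t in tokens)
```

Calling `encode` on a short DNA-like string such as `"ACGTACGTTTACGTACGA"` yields a phrase table and a token list, and `decode` reconstructs the input from them. The asymmetry described in the abstract shows up even in this toy version: encoding searches the whole input for repeats, while decoding only follows pointers.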

Item Type: Conference Paper