Data Compression (2000)
By Ginger Rosenkrans
Introduction
Compression
techniques reduce the amount of memory needed for storing still and video images
and the bandwidth for transporting them (Dipert, 1998; Saha, 2000; Wu &
Irwin, 1998). Even the smallest or simplest picture on a Web page is compressed
before it is placed on the page, and then it is decompressed by the user’s
computer. Compression is a process where algorithms reduce the number of bytes
needed to encode data (England & Finney, 1999; Tannenbaum, 1998). Video in
multimedia is digital; therefore, it has to be compressed due to the high data
rate used by raw video, storage devices that are too slow, and networks that
have too narrow bandwidth to rapidly transmit the data that is needed to deliver
the multimedia presentation in real time (England & Finney, 1999; Tannenbaum,
1998). There are at least two kinds of compression algorithms: lossy and
lossless (Tannenbaum, 1998; Web Reference, 2000). Additionally, video
compression is commonly known as bit rate reduction. JPEG and MPEG are two
standards applied to video compression, and GIF is an image format widely used
on the Web (Dipert, 1998; England & Finney, 1999; Olsen, 1997; Web
Reference, 2000; Wu & Irwin, 1998).
Lossy
and Lossless Compression
Lossy compression
creates smaller files by discarding some information about the original image (Tannenbaum,
1998; Web Reference, 2000). Lossy algorithms remove details and color
alterations that are deemed too small for the brain to detect. They do not
significantly reduce the effect of the presentation; therefore, some distortion
is acceptable because it may be imperceptible or may not distract the brain (Tannenbaum,
1998). Additionally, lossy compression algorithms are not perfectly reversible.
When a compressed file is decompressed, the data is not bit-by-bit identical
with the original data. It does not significantly modify the perception of the
data from what was originally intended. Motion and still image files can be
compressed using lossy algorithms (Tannenbaum, 1998; Web Reference, 2000).
A lossless
compression algorithm does not discard any information about the original file (Saha,
2000; Tannenbaum, 1998; Web Reference, 2000). The data retrieved is identical to
the original data. Saha (2000) asserts that in lossless compression, the
reconstructed image after compression is "numerically identical to the
original image on a pixel-by-pixel basis." Lossless compression is done by
a recording of the original data using an efficient code that takes less
storage. One type of lossless compression algorithm is run-length coding, which
"replaces the long sequence of identical big patterns with one instance of
the pattern plus a number specifying how many times the pattern is to be
repeated" (Tannenbaum, 1998, p. 335). If a multimedia program needs to be
compressed for transmission or storage, a lossless algorithm is used.
JPEG,
GIF, and MPEG
JPEG, GIF, and MPEG
are common compression algorithms (Dipert, 1998; England & Finney, 1999;
Tannenbaum, 1998; Web Reference, 2000; Wu & Irwin, 1998). JPEG, which is the
common name for the raster format defined by the Joint Photographic Experts
Groups, is one of the best methods to compress photographic images (Coffee,
1997; Dipert, 1998; Saha, 2000; Siegel, 1997). GIF, which is the acronym for
Graphics Interchange Format, is one of the oldest graphic formats on the Web.
MPEG, which is the name for Motion Picture Experts Group, is a standard for
distributed system video (Olsen, 1997; Tannenbaum, 1998).
Lossy JPEG which is
a lossy compression algorithm, is designed for use with continuous tone images
like photographs or natural artwork (Dibert, 1998; Olsen, 1997; Saha, 2000;
Siegel, 1997; Web Reference, 2000; Wu & Irwin, 1998). JPEG separates the
brightness information from the color hues to reduce file size and converts the
spatial image representation into a frequency map (Siegel, 1997). The greater
the compression, the lossier the file will be. Siegel (1997) recommends starting
with the lowest-quality JPEG setting, discarding the most information, and
reducing file size as much as possible.
The GIF format uses
a lossless compression algorithm (Siegel, 1997; Web Reference, 2000). The GIF
format uses a variant of the Lempel-Ziv-Welch (LZW) compression (Siegel, 1997;
Web Reference, 2000). LZW, a lossless compression algorithm, is a
repeated-string compressor and uses a translation table to represent linear
sequences of data in the uncompressed input stream (Siegel, 1997; Web Reference,
2000). When a sequence is first encountered, a code is added to the translation
table and subsequent matching sequences are represented by the code. Generally,
GIF compression is used for anything that is not photographic (e.g., line art,
type). Additionally, the average compression ration for GIF images is 4:1. GIFs
can support up to 8 bits per pixel, which is a maximum of 256 colors (Web
Reference, 2000).
MPEG compression
includes the compression of moving picture images and the accompanying audio
(England & Finney, 1999; Olsen, 1997; Tannenbaum, 1998). MPEG compression
for audio is accomplished by a subband coding process, and compression for
images is accomplished by encoding only the changes in images that occur from
frame to frame (Coffee, 1997; Olsen, 1997; Tannenbaum, 1998)
References
Coffee, P. (1997,
December 1). How to make a multimedia look good. PC Week, 14 (50),68.
Dipert, B.(1998,
June 18). Compression puts images on a diet. EDN, 43 (13), 71.
England, E., &
Finney, A. (1999). Managing Multimedia (2nd ed.). Harlow,
England:Addison-Wesley Longman Limited.
Melamed, B. (1997).
Modeling compressed full-motion video. Proceedings of the 1997 Winter
Simulation Conference.
Olsen, G. (1997). Multimedia
Design. Cincinnati, OH: F&W Publications.
Saha, S. (2000,
Spring). Image compression—from DCT to wavelets: a review. Crossroads,
6 (3), 12-21.
Siegel, D. (1997). Creating
Killer Web Sites. Indianapolis, IN: Hayden Books.
Tannenbaum, R. S.
(1998). Theoretical Foundations of Multimedia. New York, NY: W.H. Freeman
and Company.
Web Reference.
(2000, February 24). Compression: Optimizing Web Graphics. Available: http://www.webreference.com/dev/graphics/compress.htm
Accessed February 25, 2000.
Wu, C. & Irwin
J. D. (1998). Emerging Multimedia Computer Communication Technologies,
Upper Saddle River, NJ: Prentice Hall, Inc.
|