Appendix I:  Horizontal Differencing Predictor


This appendix, written after the release of Revision 5.0 of the TIFF specification, is
still in draft form.  Please send any comments to the Aldus Developers Desk.


Revision 5.0 of the TIFF specification defined a new tag called _Predictor_ that
describes techniques that may be used in conjuction with TIFF compression schemes.  We
now define a Predictor that greatly improves compression ratios for some images.

The horizontal differencing predictor is assigned the tag value Predictor = 2:

Predictor
Tag	= 317 (13D)
Type	= SHORT
N	= 1

A predictor a mathematical operator that is applied to the image data before the
encoding scheme is applied.  Currently (as of revision 5.0) this tag is used only with
LZW (Compression=5) encoding, since LZW is probably the only TIFF encoding scheme that
benefits significantly from a predictor step.  See Appendix F.

1 = No prediction scheme used before coding.
2 = Horizontal differencing. See Appendix I.

Default is 1.


The algorithm

The idea is to make use of the fact that many continuous tone images rarely vary much
in pixel value from one pixel to the next.  In such images, if we replace the pixel
values by differences between consecutive pixels, many of the differences should be 0,
plus or minus 1, and so on.  This reduces the apparent information content, and thus
allows LZW to encode the data more compactly.

Assuming 8-bit grayscale pixels for the moment, a basic C implementation might look
something like this:

	char	image[ ][ ];
	int	row, col;

	/* take horizontal differences:
	 */
	for (row = 0; row < nrows; row++)
		for (col = ncols - 1; col >= 1; col--)
			image[row][col] -= image[row][col-1];

If we don_t have 8-bit samples, we need to work a little harder, so that we can make
better use of the architecture of most CPUs.  Suppose we have 4-bit samples, packed two
to a byte, in normal TIFF uncompressed (i.e., Compression=1) fashion.  In order to find
differences, we want to first expand each 4-bit sample into an 8-bit byte, so that we
have one sample per byte, low-order justified.  We then perform the above horizontal
differencing.  Once the differencing has been completed, we then repack the 4-bit
differences two to a byte, in normal TIFF uncompressed fashion.

If the samples are greater than 8 bits deep, expanding the samples into 16-bit words
instead of 8-bit bytes seems like the best way to perform the subtraction on most
computers.

Note that we have not lost any data up to this point, nor will we lose any data later
on.  It might at first seem that our differencing might turn 8-bit samples into 9-bit
differences, 4-bit samples into 5-bit differences, and so on.  But it turns out that we
can completely ignore the _overflow_ bits caused by subtracting a larger number from a
smaller number and still reverse the process without error.  Normal twos complement
arithmetic does just what we want.  Try an example by hand if you need more convincing.

Up to this point we have implicitly assumed that we are compressing bilevel or
grayscale images.  An additional consideration arises in the case of color images.

If PlanarConfiguration is 2, there is no problem.  Differencing proceeds the same way
as it would for grayscale data.

If PlanarConfiguration is 1, however, things get a little trickier.  If we didnt do
anything special, we would be subtracting red sample values from green sample values,
green sample values from blue sample values, and blue sample values from red sample
values, which would not give the LZW coding stage much redundancy to work with.  So we
will do our horizontal differences with an offset of SamplesPerPixel (3, in the RGB
case).  In other words, we will subtract red from red, green from green, and blue from
blue.  The LZW coding stage is identical to the SamplesPerPixel=1 case.  We require
that BitsPerSample be the same for all 3 samples.


Results and guidelines

LZW without differencing works well for 1-bit images, 4-bit grayscale images, and
synthetic color images.  But natural 24-bit color images and some 8-bit grayscale
images do much better with differencing.  For example, our 24-bit natural test images
hardly compressed at all using _plain_ LZW: the average compression ratio was 1.04 to
1.  The average compression ratio with horizontal differencing was 1.40 to 1.  (A
compression ratio of 1.40 to 1 means that if the uncompressed image is 1.40MB in size,
the compressed version is 1MB in size.)

Although the combination of LZW coding with horizontal differencing does not result in
any loss of data, it may be worthwhile in some situations to give up some information
by removing as much noise as possible from the image data before doing the
differencing, especially with 8-bit samples.  The simplest way to get rid of noise is
to mask off one or two low-order bits of each 8-bit sample.  On our 24-bit test images,
LZW with horizontal differencing yielded an average compression ratio of 1.4 to 1. 
When the low-order bit was masked from each sample, the compression ratio climbed to
1.8 to 1; the compression ratio was 2.4 to 1 when masking two bits, and 3.4 to 1 when
masking three bits.  Of course, the more you mask, the more you risk losing useful
information along with the noise.  We encourage you to experiment to find the best
compromise for your device.  For some applications it may be useful to let the user
make the final decision.

Interestingly, most of our RGB images compressed slightly better using
PlanarConfiguration=1.  One might think that compressing the red, green, and blue
difference planes separately (PlanarConfiguration=2) might give better compression
results than mixing the differences together before compressing
(PlanarConfiguration=1), but this does not appear to be the case.

Incidentally, we tried taking both horizontal and vertical differences, but the extra
complexity of two-dimensional differencing did not appear to pay off for most of our
test images.  About one third of the images compressed slightly better with
two-dimensional differencing, about one third compressed slightly worse, and the rest
were about the same.