Gary Lucas created SANSELAN-80:
----------------------------------

             Summary: Incorrect code for tiled TIFF files applyPredictor method 
                 Key: SANSELAN-80
                 URL: https://issues.apache.org/jira/browse/SANSELAN-80
             Project: Commons Sanselan
          Issue Type: Bug
          Components: Format: TIFF
            Reporter: Gary Lucas
            Priority: Minor


I believe that the DataReaderTiled class used for reading tiled TIFF files 
invokes the applyPredictor method with incorrect arguments and will not be able 
to properly decode TIFF files that use predictors.  The bug was found during a 
code inspection. Unfortunately, I do not have any samples of data in this 
format (there are none in the Apache Imaging test files) and cannot verify that 
this is the case.

Some Background
TIFF files are often used to store images in technical applications where data 
must be faithfully preserved, so lossy compression methods like JPEG are 
inappropriate and non-lossy method like LZW must be used. However, continuous 
tone images like satellite images or photographs often do not compress well 
since there is little apparent redundancy in the data. To improve the 
redundancy of the data, TIFF uses a simple predictor.  The first pixel (gray 
tone or RGB value) in a tile is stored as a literal value.  All subsequent 
pixels are stored as differences.  To see how this works, imagine a monochrome 
picture where the gray tones gradually fade from white to black at a steady 
rate. Although no particular data value is ever repeated (so there is little 
apparent redundancy in the source data) the delta values remain constant (so a 
set of delta values will compress very well). When transformed in this matter, 
certain images show substantial improvements in compression ratio.

The Probem
The DataReaderTiff class uses a method called applyPredictor that takes an 
argument telling it whether the sample passed in is the first value, and should 
be treated as a literal, or whether it is a subsequent value and should be 
treated as a delta.   Unfortunately, the parameter it uses is the x coordinate 
of the pixel to be decoded.  While this approach works for TIFF strip files 
(where the first pixel always has a coordinate of zero), it does not work for 
tiles where the first pixel in the tile could fall anywhere in the image. 
  
The Fix
While we could simply fix the argument passed into the predictor, there is a 
better solution. The predictor performs an if/then operation on the input 
parameter to find out if it is the first sample in the tile. Once it unpacks a 
sample, it retains it as the "last" value so that it may be added to the next 
delta value.  Why not simply get rid of the if/then operation and just ensure 
that the last value gets zeroed out before beginning the processing of a strip 
or tile.  This would save an if/then operation and fix the bug.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to