Re: [libjpeg-turbo-users] jpegtran - flipping problem

'DRC' via libjpeg-turbo User Discussion/Support Tue, 22 Oct 2024 11:35:13 -0700

New users are moderated by default (unfortunately necessary because ofspambots), so that's why your initial message did not appear to gothrough. (Presumably that's why you made a duplicate post.)

jpegtran is working as designed. It's just that the transform you arerequesting is "imperfect."

To explain what an imperfect transform is, I first need to roughlyexplain the stages of JPEG compression:

1. Color Conversion: The packed source pixels are typically convertedfrom RGB to YCbCr, which allows the "luminance" (brightness) to beseparated from the "chrominance" (color.) The Y, Cb, and Cr componentsare organized into component planes.

2. Chrominance Subsampling (AKA Downsampling): The chrominance (Cb, Cr)components are optionally "subsampled", which typically involvesdiscarding every other component in either the horizontal or verticaldirection or both. (The human eye is more sensitive to spatial changesin brightness than spatial changes in color, so with a photograph orother image content that has gradual variations in color, you candiscard 1/2 or even 3/4 of the color data without much if any perceptualquality loss.) If necessary, the chrominance component planes arepadded to the nearest multiple of 8 components in both directions, andthe luminance component plane is padded to the nearest multiple of 8 *the horizontal or vertical subsampling factor (for instance, 8x8 in thecase of no subsampling, 16x8 in the case of 2x1 chrominance subsampling,and 16x16 in the case of 2x2 chrominance subsampling.) The padded areaat the right of the plane is filled with the right-most component, andthe padded area at the bottom of the plane is filled with thebottom-most component.

3. Forward DCT: Each 8x8 block of each component plane is processed withthe discrete cosine transform (DCT) algorithm to produce an 8x8 block ofDCT coefficients. Within an 8x8 DCT coefficient block, thelowest-frequency coefficient (the "DC" coefficient) from thecorresponding 8x8 component block is stored at the upper left, and thefrequencies increase in a zigzag pattern toward the highest-frequencycoefficient at the lower right. The minimum set of DCT coefficientsthat includes an 8x8 coefficient block from all components is called an"iMCU" (interleaved minimum coded unit.) For example, an iMCU in animage that uses 2x2 (AKA "4:2:0") subsampling contains four 8x8coefficient blocks from the luminance plane and one 8x8 coefficientblock from each chrominance plane. If the image width or height isn'tevenly divisible by the iMCU width or height, then the iMCUs at theright or bottom of the image are "partial" iMCUs, i.e. they don'tcorrespond to full 8x8 blocks of components in the source image.

4. Quantization: The highest-frequency coefficients from each 8x8coefficient block are removed, depending on the JPEG quality level. (Quality 100 removes no coefficients. Quality 0 removes all but the DCcoefficient.) This and chrominance subsampling are the two "lossy"parts of JPEG encoding. (If you create a JPEG image with Quality 100and no subsampling, then the only loss comes from round-off error.) Theidea is that, with a photograph or other image content that has gradualvariations in color, you can discard the highest frequencies withoutmuch if any perceptual quality loss. (MP3 compression is based on thesame assumption with respect to audio.)

5. Entropy Encoding: The purpose of all of the aforementionedreorganization and conversion of data is to create long runs of zeroesthat can compress well with a lossless codec, so the last step islosslessly compressing the quantized DCT coefficients using eitherHuffman coding or arithmetic coding.

jpegtran works by performing entropy decoding on a JPEG image,rearranging the DCT coefficients, then performing entropy re-encoding onthe rearranged DCT coefficients to produce a transformed JPEG image. However, because the DCT coefficients are organized into 8x8 blocksbased on frequency, there are limits to how they can be rearranged. Ifyou have a full iMCU, then you can always transform the correspondingpixels/components by changing the order of the DCT coefficients. However, you can't always do that with a partial iMCU, because thecomponents at the right or bottom of the corresponding component blockare just padding. In general, if the image width or height isn't evenlydivisible by the iMCU width or height, then some transform operationsare "imperfect", i.e. they cannot transform the partial iMCUs in the image:

- Horizontal flipping, transverse transposition, 180-degree rotation,and 270-degree rotation are imperfect if the image contains any partialiMCUs along its right edge.

- Vertical flipping, transverse transposition, 90-degree rotation, and180-degree rotation are imperfect if the image contains any partialiMCUs along its bottom edge.


- Regular (non-transverse) transposition is always perfect.

To put this another way, if a coefficient block comes from a componentblock that was padded on the right side, then you can't transform theblock in such a way that the right side would become the left or topside. If a coefficient block comes from a component block that waspadded on the bottom side, then you can't transform it in such a waythat the bottom side would become the left or top side.

The test image is 236x236 and uses 4:2:0 (2x2) subsampling, so the iMCUsize is 16x16. Since 236 isn't evenly divisible by 16, both the rightand bottom edges contain partial iMCUs. By default, jpegtran leavespartial iMCUs in place. You can optionally discard partial iMCUs withthe -trim option, but that's probably not what you want either.Unfortunately, there is no other solution except to decompress the imageand transform it in the spatial domain (which would incur generationalloss if you recompressed it into a JPEG image.) Reorganizing DCTcoefficients in order to losslessly transform a JPEG image in thefrequency domain is a neat trick that Tom Lane came up with, but perabove, it is limited by the structure of the JPEG format.

BTW, all of this is documented in the jpegtran man page, as well as inusage.txt and wizard.txt. Those are the official and vetted sources ofdocumentation regarding this. My summary above is quick & dirty andbased on my own memory. Thus, it may contain errors and should not beconsidered official or canonical documentation.


DRC

On 10/21/24 5:47 AM, SFA wrote:

Dear libjpeg-turbo team,
we are experiencing an issue when vertically flipping a JPEG imageusing jpegtran. Please find attached a ZIP file containing the inputand output images.
The command used was: jpegtran -flip vertical imgflipv.jpeg > flipped.jpeg
It appears there may be a bug in the library causing the incorrectoutput. We hope this issue will be resolved in future versions ofjpegtran.
Any potential solutions or workarounds would be greatly appreciated.Please feel free to ask for further information.
Sincerely,
SFA
--
You received this message because you are subscribed to the GoogleGroups "libjpeg-turbo User Discussion/Support" group.To unsubscribe from this group and stop receiving emails from it, sendan email to libjpeg-turbo-users+unsubscr...@googlegroups.com.To view this discussion on the web visithttps://groups.google.com/d/msgid/libjpeg-turbo-users/25cbb128-38da-45e5-b45d-c488b45f8f4fn%40googlegroups.com<https://groups.google.com/d/msgid/libjpeg-turbo-users/25cbb128-38da-45e5-b45d-c488b45f8f4fn%40googlegroups.com?utm_medium=email&utm_source=footer>.


--
You received this message because you are subscribed to the Google Groups 
"libjpeg-turbo User Discussion/Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to libjpeg-turbo-users+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/libjpeg-turbo-users/0d1d23cb-3a36-40d4-9f4d-8f18f7b0705b%40virtualgl.org.

Re: [libjpeg-turbo-users] jpegtran - flipping problem

Reply via email to