Re: [Pixman] [cairo] Planar YUV support

Bill Spitzak Fri, 04 Mar 2011 13:48:11 -0800

Soeren Sandmann wrote:

Siarhei Siamashka <[email protected]> writes:

The pipeline as it is now:

   1 convert image sample to a8r8g8b8
   2 extend sample grid in all directions according to repeat
   3 interpolate between sample according to filter
   4 transform
   5 resample
   6 combine
   7 store

What is the difference between "3 interpolate between sample according
to filter" and "5 resample"?


The output of stage 3 is an image that is defined on all of the real
plane. There are no pixels any more, so there is no question about what
stage 4, "transform", means. Stage 5 converts back to pixels by point
sampling.

This is a very poor description of what should be happening. You cannotdo Stage 5 as a point sample. This is what the bilinear interpolation isdoing, and everybody should have realized by now that the output imageis no good for scales less than .5.

It is MUCH better to combine steps 3,4,5 together. The goal is toproduce a pixel in the output coordinate system. This is done by makingup a filter that will vary depending on the transform and the outputpixel, applying it to the source image, and the result is the outputpixel. It is absolutely impossible to do a "sample" step last that doesnot take into account the transform.

For affine transforms an output pixel maps to a parallelogram on theinput image. This parallelogram can be much bigger or much smaller thana single pixel. The parallelogram has 6 degres of freedom. It is obviousthat two numbers cannot describe the parallelogram, and therefore youcannot use point sampling (no matter how fancy your bilinearinterpolation is) to produce the output image.

To add support for potentially subsampled YUV, some additional stages
have to be inserted before the first:

  -2 interpolate subsampled components of YUV to get the same
     resolution as the Y plane

  -1 if the format is planar, stitch together components to form YUV
     pixels

   0 convert to sRGB

Stage -2 is important because the filter used in that interpolation
should probably be user-specifiable eventually, which has the
implication that whatever simple support is added first, it needs to be
clear what filter precisely is being used.

No I do not think the filter for UV should be "user specified". You areadding meaningless complexity to the API and actually *preventing* theinterpolation from being improved.

It is quite possible to merge step -2 with the transform. Theparallelogram I described above would be 1/2 as big for the UV planes.Also it may be shifted 1/2 pixel between U and V (because most producerssubsample the UV by averaging different pairs of pixels).

It does mean you cannot do "extend" of "black" as an earlier step.However I very strongly believe that the current cairo behavior is notwanted by anybody and is inefficient on modern hardware. See below aboutthis.

Stage 0 is a color space conversion and need to eventually be
configurable too, which means it has to be specified which matrix is
being used.

I believe some assumptions can be made about the color samples so thatstage 0 can be moved to a later point.

All color spaces of interest have orthogonal channels which can befiltered independently. Thus the filtering can be done before conversion.

If a channel is non-linear, it technically will effect the filtering.See below for comments on why I think this may not be necessary. Even ifit is necessary, all interesting non-linear channels are so close to apower of 2 that a single alternate filter that squares the input image,applies the same filter, and does the square root, will produce ananswer that is accurate to 5 bits for the worst case of a white pixelnext to black, and well over 12 bits for most photographic images.

Note that if some day we add compositing in linear RGB, the alternative
process breaks down because the initial interpolation will be taking
place in non-linear color space, whereas with intermediates in linear
RGB, you'd want to do the second interpolation (but not the first) in
linear light.

I do not think there is a requirement that the transform filtering bedone in linear space. It could be useful, but it will not completelybreak doing the rest of the composite in linear RGB.

The reason is that for low-contrast images the gamma curve between twoadjacent pixels is extremely close to a straight line and thus theresult is almost identical.


There are problems with doing transforms in linear space:

For very large scales of high contrast images users are unhappy withtrue linear filtering and prefer the gamma filtering. The reason is thatonce the pixels become visible it becomes a perceptual rather thanphysical appearance and the image just looks "wrong". This will mostlyeffect "magnifier" applications for enlarging already-rendered text.

Linear filtering can also have very nasty side effects if the images arepremultiplied. The premultiplied pixels have been stored at much lowerresolution, in effect, and linearization can produce very bright colorsthat will produce artifacts when blended with neighboring pixels.

If you do linear filtering it may only want to be done for scales lessthan 1. Also there is no need to do it on color spaces where highcontrast is already poorly supported, so there is no need to do it tothe UV channels.

There is also a question of what to do with YUV images with a
non-premultiplied alpha channel. Interpolating the samples of such an
image direclty is definitely wrong, but it may be that simply
premultiplying first will work.

Filtering non-premulitplied data is a problem with all data formats, notjust YUV!

The problem is that where the alpha is zero the color is often black. Afilter that covers this area will bleed black into the object, makingthe resulting image as though the object turns slightly darker at theedges. The only way to get "correct" results is to ignore thecontribution of alpha zero pixels to the filter for the color channels.Depending on the source you may have to ignore tiny alphas as well (someprograms produce this with black due to internal filtering).

You do have to watch out for "premultiplied" YUV where the UV channelsgo towards what is really the maximum negative value, rather than theneutral value, as the alpha goes to zero. You can easily correct theseby adding (255-alpha)/2 to the UV channels.

The two-interpolation pipeline has the practical benefit that chroma
reconstruction can be done in the fetchers, at least as long as the
chroma filter is fixed, where as the one-step process means the general
code for bilinear filtering would have to sample each component
individually, then filter, and then do a color conversion. It would no
longer be able to simply ask the underlying system to fetch an RGB
pixel.

Here is the steps as I see it, with the parts that I believe CANNOT beseparated are made a single step:


  1. Widen to 8 bit components
  2. Extend sample grid but use "repeat" for "black outside"
  3. Transform/filter to 1 sample per output pixel
  4. Convert to interlaced
  5. Convert to sRGB
  6. Do "black outside" by multiplying by an antialiased quad
  7. Composite into output buffer

_______________________________________________
Pixman mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/pixman

Re: [Pixman] [cairo] Planar YUV support

Reply via email to