Re: RAW v JPEG

Cory Papenfuss Wed, 15 Jun 2005 04:09:04 -0700

All a TIFF file is is a standardized format for storing bitmappedimages, as well as whatever tags you like. The thing with RAW is that it'snot a color model, it's a monochrome image. It's understood that themonochrome image should be interpretted according to a RGBG Bayer pattern.
I think you can argue that this does indeed make it a colour model, or rather- the data is just data of course, the "RGBG Bayer pattern" interpretationwould be the colour model. The actual data in a normal RGB TIFF file is afterall also indistinguishable from monochrome image data...

I suppose one could call it a color model. The color models I wasthinking of is more like Lab, YUV, xyY, that sort of stuff (icc profiles,color management, etc). The "RGB" in the Bayer data is only monochromeand doesn't have any meaning until one defines exactly what "red is red","green is green," and "blue is blue." Those are all relative, dependingon the dyes of the sensor. Some cameras even use CYM dyes, IIRC. None ofthat info is in the RAW TIFF... it's just monochrome.

... but one can call it a color model if you want... it's gotassumed RGB values.

Probably...

I've looked at the files a fair bit with linux libtiff utilities.The only significant tweaking they've done is pack 12 bits into 12 bits...the tiff format does not allow 12 bit data. The other stuff (exif info,jpeg thumbnails, lens info, etc) is all stored as TAGS and DIRECTORY infowithin the TIFF file structure.

I've also been thinking that they should not really call it RAW, by the way.A "raw" image file is traditionally a file containing the pixel data andnothing else.

... true, but then it would confuse the issue as compared toeveryone else. Since it's TIFF, it does have RAW pixel data in theopen... just has some fuzz of other data on either side of it... :)

Somewhat smaller, but not much I wouldn't think. The standard TIFFsupports only a few types of compression, most of which aren't all thatgreat. It's similar to taking a document, copying it three times, makingsmall changes in each one, and then compressing the three. Chances are it'sgoing to be a fair bit bigger than the original compressed.
I must admit I haven't tried this, but isn't some of the "extra" dataintroduced by interpolation duplicate date that might easily be reduced insize by traditional compression algorithms.

Yes, it's duplicated, but I'm not sure how intelligent thecompression algorithms in TIFF are. I know two of them are simple RLE andLZW (zip-style) encodings. I doubt either of them check forcompressibility along the minor axis (e.g. vertically if image is writtenin horizonal stripes). A photo will have spacial compressibility in thatdirection as well. I also doubt either of them try to be clever betweenmultiple image planes (R, G, B)... you end up with three separate,independently compressed images.

I've played with gzip (zip-style) and bzip2 on my RAW files to seeif I can have them take up less space for archival to DVD. They reallydon't compress very well with that. I think I got a minimum of about 80%of the original size... even when the compression parameters were cranked.Basically, one needs to be more clever in compressing images than a simplezip or it doesn't help much. Zip doesn't know it's an image or anythingspecial... just tries to compress the binary data. Similarly for music.Zipping a WAV file doesn't get it very small, but using something likeFLAC (fully-lossless audio compression) or MonkeyAudio or whatever does apretty good job.

The way it's typically done in-camera (8-bit, white-balanced,gamma-corrected), it does have almost no advantages over minimallycompressed JPEGs, and the disadvantage of being much larger...
It has the advantage that you know you have lossless compression. Even"minimal" JPEG is lossy, isn't it? Unless you use JPEG-2000, which alsoallows lossless compression, as far as I understand. And has several otherfeatures making it a generally much more usable image format, but the articleof course dismisses it without any discussion...

Lossless *compression*, yes... but you've already thrown away1/3 of the data that was captured by the sensor, and posterized the colorsto only 1/16 of their original accuracy. Add to that the fact that you'vetaken this image data which has only 2/3 of the sensor data, andinterpolated (duplicated) it three times (R, G, B instead of the BayerRGBG). It's just not an efficient means of storage, and it *is* lossy(unless it's 16-bit linear, which I doubt any cameras do). It may nothave JPEG artifacts, but those are fairly minimal unless repeated multipletimes or have the compression level turned way up.


-Cory

--

*************************************************************************
* Cory Papenfuss                                                        *
* Electrical Engineering candidate Ph.D. graduate student               *
* Virginia Polytechnic Institute and State University                   *
*************************************************************************

Re: RAW v JPEG

Reply via email to