On Monday, 6 July 2015 at 13:48:53 UTC, Rikki Cattermole wrote:

> Please destroy!


You asked for it! :)

As a reference to a library that is used to handle images on a professional level (VFX industry), I'd encourage you to look at the feature set and interfaces of OpenImageIO. Sure, it's a big library and some of it is definitely out of scope for what you're trying to accomplish (image tile caching and texture sampling, obviously).

Yet, there are some features I specifically want to mention here to challenge the scope of your design:

- arbitrary channel layouts in images: this is a big one. You mention 3D engines as a targeted use case in the specification. 3D rendering is one of the worst offenders when it comes to crazy channel layouts in textures (which are obviously stored as image files). If you have a data texture that requires 2 channels (e.g. UV offsets for texture lookups in shaders, or some crazy data tables), its memory layout should also only ever have two channels. Don't expand it to RGB transparently or anything else braindead. Don't change the data type of the pixel values wildly without being asked to do so. The developer most likely has chosen a 16 bit signed integer per channel (or whatever else) for a good reason. Some high end file formats like OpenEXR even allow users to store completely arbitrary channels, often with a different per-channel data format (leading to layouts like RGBAZ with an additional mask channel on top). But support for that really bloats image library interfaces. I'd stick with a sane subset of the uncompressed texture formats listed in the OpenGL specification as the target set of supported in-memory image formats. That mostly matches current GPU hardware support and probably will for some time to come.
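To make this concrete, here is a minimal sketch in C of what a pixel format descriptor along those lines might look like. All of these names are mine, not from your library; the enum roughly mirrors the kind of per-channel types the OpenGL spec enumerates:

```c
#include <stdint.h>
#include <stddef.h>

/* Hypothetical per-channel data types, roughly mirroring the
 * uncompressed formats in the OpenGL specification. */
typedef enum {
    CH_UINT8, CH_SINT16, CH_UINT16, CH_FLOAT16, CH_FLOAT32
} ChannelType;

/* A format is described by channel count plus per-channel type,
 * not forced into RGB(A). A two-channel sint16 UV-offset texture
 * keeps exactly that layout in memory. */
typedef struct {
    uint8_t     channels;   /* 1..4 */
    ChannelType type;       /* shared by all channels here */
} PixelFormat;

static size_t channel_size(ChannelType t) {
    switch (t) {
    case CH_UINT8:   return 1;
    case CH_SINT16:
    case CH_UINT16:
    case CH_FLOAT16: return 2;
    case CH_FLOAT32: return 4;
    }
    return 0;
}

static size_t pixel_size(PixelFormat f) {
    return f.channels * channel_size(f.type);
}
```

The point being: a two-channel sint16 format stays 4 bytes per pixel, end to end, unless the user explicitly asks for a conversion.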

- padding and memory alignment: depending on the platform, image format and task at hand you may want the in-memory layout of your image to be padded in various ways. For example, you would want your scanlines and pixel values aligned to certain offsets to make use of SIMD instructions, which often carry alignment restrictions with them. This is one of the reasons why RGB images are sometimes expanded to have a dummy channel between the triplets. Also, aligning the start of each scanline may be important, which introduces a "pitch" between them that is greater than just the storage size of each scanline by itself. Again, this may help speed up image processing.
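The pitch computation itself is trivial, which is part of why it's frustrating when a library doesn't support it. A sketch (the function name is mine):

```c
#include <stddef.h>

/* Round a scanline's byte size up to the next multiple of `align`
 * (align must be a power of two, e.g. 16 for SSE, 32 for AVX). */
static size_t aligned_pitch(size_t width, size_t bytes_per_pixel,
                            size_t align)
{
    size_t row = width * bytes_per_pixel;
    return (row + align - 1) & ~(align - 1);
}

/* e.g. a 101-pixel-wide RGB8 image, 16-byte aligned:
 * aligned_pitch(101, 3, 16) == 304, not 303 */
```

Allocation and per-row addressing then use the pitch instead of `width * bytes_per_pixel` everywhere.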

- subimages: this one may seem obscure, but it happens in a number of common file formats (gif, mng, DDS, probably TIFF and others). Subimages can be - for instance - individual animation frames or precomputed mipmaps. This means that they may have metadata attached to them (e.g. framerate or delay to next frame) or they may come in totally different dimensions (mipmap levels).
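Structurally this just means the "image" becomes a container. A rough sketch of what I mean (again, hypothetical names, and a real design would carry more metadata):

```c
#include <stdint.h>
#include <stddef.h>

/* One subimage: an animation frame or a mip level. Dimensions can
 * differ per subimage, and frame delay only applies to animations. */
typedef struct {
    uint32_t width, height;
    uint32_t delay_ms;   /* delay to next frame; 0 if not animated */
    void    *pixels;
} SubImage;

/* The file-level object is a sequence of subimages, not one buffer. */
typedef struct {
    size_t    count;
    SubImage *subimages;
} Image;
```

A loader that only ever hands back `subimages[0]` silently discards the rest of a gif or a DDS mip chain.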

- window regions: now this is not quite your average image format feature, but it is relevant for some use cases. The gist of it is that the image file may define a coordinate system for a whole image frame but only contain actual data within certain regions that do not cover the whole frame. These regions may even extend beyond the defined image frame (used e.g. in VFX image postprocessing to have properly defined pixel values to filter into the visible part of the final frame). The OpenEXR documentation explains this feature nicely. That said, I think this is likely out of scope for this library.
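For reference, the OpenEXR model boils down to two rectangles, a "display window" (the nominal frame) and a "data window" (where pixels actually exist), which may overlap in any way. A toy sketch of the idea in C (OpenEXR itself uses an `Imath::Box2i` for this; the helpers here are mine):

```c
/* Inclusive pixel-coordinate rectangle, as in OpenEXR's Box2i. */
typedef struct { int x0, y0, x1, y1; } Box2i;

static int box_width(Box2i b)  { return b.x1 - b.x0 + 1; }
static int box_height(Box2i b) { return b.y1 - b.y0 + 1; }

/* A file might declare a 1920x1080 display window but carry data
 * extending 8 pixels past every edge for filtering headroom:
 *   display = {0, 0, 1919, 1079}
 *   data    = {-8, -8, 1927, 1087}   -> 1936 x 1096 pixels stored */
```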

My first point also leads me to this criticism:

- I do not see a way to discover the actual data format of a PNG file through your loader. Is it 8 bit palette-based, 8 bits per channel, or 16 bits per channel? Especially the latter should not be transparently converted to 8 bits per channel if encountered, because that is a lossy transformation. As I see it right now, you have to know the pixel format up front to instantiate the loader. I consider that bad design. You can only have true knowledge of the file contents after the image header has been parsed. The same is generally true of most actually useful image formats out there.
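What I'd expect is a two-phase API: parse the header, let the caller inspect the stored format, and only then decode. For PNG the information is right there in the IHDR chunk's bit-depth and colour-type fields, so the classification step is a handful of lines (a sketch, with names of my own invention):

```c
#include <stdint.h>

typedef enum {
    FMT_PALETTE8, FMT_GRAY8, FMT_RGB8, FMT_RGB16, FMT_OTHER
} StoredFormat;

/* Classify a PNG's stored format from the IHDR bit-depth and
 * colour-type fields (colour type 0 = greyscale, 2 = truecolour,
 * 3 = palette, per the PNG specification). A loader should expose
 * this *before* committing to a destination pixel format. */
static StoredFormat classify_png(uint8_t bit_depth, uint8_t colour_type)
{
    if (colour_type == 3 && bit_depth == 8)  return FMT_PALETTE8;
    if (colour_type == 0 && bit_depth == 8)  return FMT_GRAY8;
    if (colour_type == 2 && bit_depth == 8)  return FMT_RGB8;
    if (colour_type == 2 && bit_depth == 16) return FMT_RGB16;
    return FMT_OTHER;
}
```

With that in hand, a 16-bit file can be loaded losslessly by default, and conversion to 8 bits becomes an explicit, opt-in step.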

- Could support for image data alignment be added by defining a new ImageStorage subclass? The actual in-memory data is not exposed to direct access, is it? Access to the raw image data would be preferable for those cases where you know exactly what you are doing. Going through per-pixel access functions for large image regions is going to be dreadfully slow in comparison to what can be achieved with proper processing/filtering code.
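The kind of escape hatch I have in mind is a raw view of the storage: base pointer, pitch, bytes per pixel. Per-pixel accessors can still exist on top of it, but bulk processing and uploads go straight to memory. A sketch (hypothetical names again):

```c
#include <stdint.h>
#include <stddef.h>

/* A raw view of image storage, alongside any per-pixel accessors.
 * Bulk processing and GPU uploads use data/pitch directly. */
typedef struct {
    uint8_t *data;     /* start of pixel memory */
    size_t   pitch;    /* bytes from one scanline to the next */
    size_t   bpp;      /* bytes per pixel */
    uint32_t width, height;
} RawView;

static uint8_t *pixel_ptr(RawView v, uint32_t x, uint32_t y)
{
    return v.data + (size_t)y * v.pitch + (size_t)x * v.bpp;
}
```

Note that the pitch in the view is exactly what lets an aligned/padded ImageStorage subclass and tight-packed storage share one processing path.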

- Also, uploading textures to the GPU requires passing raw memory blocks and a format description of sorts to the 3D API. Being required to slowly copy the image data in question into a temporary buffer for this process is not an adequate solution.

Let me know what you think!
