On Thursday, 28 April 2016 at 06:22:18 UTC, Relja Ljubobratovic wrote:

Can you share with us some of your experience working on the image and video processing modules in the app, such as the filters here:
http://www.infognition.com/VideoEnhancer/filters.html

If I may ask, was that part implemented in D, C++, or was some 3rd party library used?

Thanks!

The filters listed there are third-party plugins originally created for VirtualDub ( http://virtualdub.org/ ) by different people, in C++. We made just 2-3 of them, like the motion-based temporal denoiser (Film Dirt Cleaner) and the Intelligent Brightness filter for automatic brightness/contrast correction. Our most interesting and distinctive piece of tech is our Super Resolution engine for video upsizing; it's not in that list, it's built into the app (and also available separately as plugins for some other hosts).

All this image processing is written in C++ and works directly with raw image bytes, no special libraries involved. When video processing starts, our filters usually launch a bunch of worker threads, and these threads work in parallel, each on its own part of the video frame (usually divided into horizontal stripes). Inside, they often work block-wise, and we have a bunch of template classes for different blocks (RGB or monochrome) parameterized by pixel data type and often by block size, so the size is often known at compile time and the compiler can unroll the loops properly. When doing motion search we use our vector class parameterized by precision, so we have vectors of different precisions (low-res pixel, high-res pixel, half-pixel, quarter-pixel etc.), and the type system makes sure I don't add or mix vectors of different precision and don't pass a half-pixel-precise vector to a block reading routine that expects quarter-pixel-precise coordinates. Where it makes sense and is possible, we use SIMD classes like F32vec4 and/or SIMD intrinsics for pixel operations.
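To make that concrete, here is a minimal C++ sketch of both ideas. All names here (Block, MVec, Precision, readBlock) are hypothetical illustrations, not our actual classes: a pixel block with a compile-time size the compiler can unroll, and a motion vector whose precision lives in its type so mixing precisions is a compile error.

    #include <cstdint>

    // Block parameterized by pixel type and size; with N known at
    // compile time, the compiler can fully unroll these loops.
    template <typename Pixel, int N>
    struct Block {
        Pixel data[N][N];

        int sad(const Block& other) const {  // sum of absolute differences
            int sum = 0;
            for (int y = 0; y < N; ++y)
                for (int x = 0; x < N; ++x)
                    sum += (data[y][x] > other.data[y][x])
                             ? data[y][x] - other.data[y][x]
                             : other.data[y][x] - data[y][x];
            return sum;
        }
    };

    // Precision is part of the vector's type: adding vectors of
    // different precision, or passing a half-pel vector where
    // quarter-pel coordinates are expected, fails to compile.
    enum class Precision { LowResPel, Pel, HalfPel, QuarterPel };

    template <Precision P>
    struct MVec {
        int x, y;
        MVec operator+(MVec o) const { return {x + o.x, y + o.y}; }
    };

    // Conversions between precisions are explicit and intentional:
    inline MVec<Precision::QuarterPel> toQuarterPel(MVec<Precision::HalfPel> v) {
        return {v.x * 2, v.y * 2};
    }

    // A block-reading routine can then state exactly what it expects:
    template <typename Pixel, int N>
    Block<Pixel, N> readBlock(const Pixel* frame, int stride,
                              MVec<Precision::QuarterPel> at);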

Video Enhancer allows chaining several VD filters and our SR rescaler instances into a pipeline, and this pipeline is also parallelized: when the first filter finishes with frame X, it can immediately start working on frame X+1 while the next filter is still working on frame X. Previously this was organized as a chain of DirectShow filters with a special Parallelizer filter inserted between the video processing ones; this Parallelizer had a frame queue inside and separate receiving and sending threads, allowing the connected filters to work in parallel.

In version 2 it's trickier, since we need to be able to seek to different positions in the video, and some filters may request a few frames before and after the current one, so a sequential pipeline doesn't suffice anymore. Now we build a virtual chain inside one big DirectShow filter; each node in that chain has its own worker thread, and the nodes communicate via message passing. In the end we have one big DirectShow filter, about 11K lines of C++, that does Super Resolution resizing, invokes VirtualDub plugins (imitating VirtualDub for them), performs colorspace conversions where necessary, and organizes it all into a pipeline that is pull-based inside but behaves as a push-based DirectShow filter outside.
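For illustration, here is a minimal sketch of the queue-between-stages idea in C++ (hypothetical names, not the real filter code). It shows only the strictly sequential case, like the old Parallelizer: each stage owns a thread and a bounded queue, so stage N can start on frame X+1 while stage N+1 still works on frame X. Seeking and multi-frame lookahead, which forced the message-passing design in version 2, can't be expressed with plain queues like these.

    #include <condition_variable>
    #include <cstdint>
    #include <functional>
    #include <mutex>
    #include <queue>
    #include <thread>
    #include <vector>

    struct Frame { std::vector<std::uint8_t> pixels; std::int64_t number; };

    class FrameQueue {  // bounded queue between two stages
        std::queue<Frame> q_;
        std::mutex m_;
        std::condition_variable cv_;
        size_t cap_;
    public:
        explicit FrameQueue(size_t cap) : cap_(cap) {}
        void push(Frame f) {
            std::unique_lock<std::mutex> lk(m_);
            cv_.wait(lk, [&] { return q_.size() < cap_; });
            q_.push(std::move(f));
            cv_.notify_all();
        }
        Frame pop() {
            std::unique_lock<std::mutex> lk(m_);
            cv_.wait(lk, [&] { return !q_.empty(); });
            Frame f = std::move(q_.front());
            q_.pop();
            cv_.notify_all();
            return f;
        }
    };

    // One pipeline stage: pull a frame, filter it, push it downstream.
    void runStage(FrameQueue& in, FrameQueue& out,
                  std::function<void(Frame&)> filter) {
        for (;;) {
            Frame f = in.pop();
            if (f.number < 0) { out.push(std::move(f)); break; }  // end-of-stream marker
            filter(f);
            out.push(std::move(f));
        }
    }

Stages are then connected by creating one FrameQueue per link and launching std::thread(runStage, std::ref(qIn), std::ref(qOut), filter) for each node.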

So the D part uses COM to build and run a DirectShow graph with all the readers, splitters, codecs and, of course, our big video processing DirectShow filter. It talks to that filter via COM and some callbacks, but doesn't do much with the video frames themselves apart from copying.
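For reference, the standard DirectShow graph-building sequence looks roughly like the following; our D code does the equivalent of these COM calls through its own bindings. Shown in C++ to match the other sketches, with error handling, the callbacks, and our filter's insertion into the graph omitted; "input.avi" is just a placeholder file name.

    #include <dshow.h>
    #pragma comment(lib, "strmiids.lib")

    int main() {
        CoInitialize(nullptr);

        IGraphBuilder* graph = nullptr;
        CoCreateInstance(CLSID_FilterGraph, nullptr, CLSCTX_INPROC_SERVER,
                         IID_IGraphBuilder, reinterpret_cast<void**>(&graph));

        // A real host adds its own processing filter to the graph here
        // (IGraphBuilder::AddFilter) before rendering the file.
        graph->RenderFile(L"input.avi", nullptr);  // builds readers/splitters/decoders

        IMediaControl* control = nullptr;
        graph->QueryInterface(IID_IMediaControl,
                              reinterpret_cast<void**>(&control));
        control->Run();  // start streaming

        // ... wait for completion via IMediaEvent, then tear down ...
        control->Release();
        graph->Release();
        CoUninitialize();
    }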

Btw, if you're interested in an image processing app in pure D, I've got one too:
http://www.infognition.com/blogsort/
(sources: https://bitbucket.org/infognition/bsort )
