On 2012-03-06, Fons Adriaensen wrote:

I'd agree with what Sampo wrote in a previous post: it should be possible to find out the encoding by analysing the recordings. This is just a bit of 'applied cryptology', and since everything is assumed to be linear in this case, rather easy cryptology.

So what are the algorithms -- preferably simple ones -- which would allow us to do this? As a preservation freak, I'd like to get at some actual, programmatic, blind solutions here, so that simply digitizing whatever people have and feeding it through a program (one that wouldn't be too difficult to develop) would, with high accuracy, identify and tag the encoding, in some cases also decode it, and in any case transcode it losslessly into a preservable format.

Personally, I would approach this sort of thing as a statistical classification problem. There's neat, powerful math to be used there, ready-made code, and it meshes well with digital signal processing even in its simplest forms (audio signals tend to be Gaussian thanks to the central limit theorem, and so on).

The simplest approach, I think, would be to build a) a model of a generic encoder which covers all of the formats we care about, b) a collection of source material, preferably including at least one pair of pre-encoding and post-encoding signals per format, and c) some machinery which plots out how the signals behave, e.g. on the Scheiber sphere. After all, if the different formats distinguish themselves, long term, in a single analysis like that on the sphere, there's no need to do anything more sophisticated: we just aggregate all of the loci which the signals reach and see which complete encoding locus fits the resulting trajectory best.
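
To make a) concrete, here's a minimal sketch of such a generic encoder model in Python/NumPy: every phase-amplitude matrix boils down to a table of complex coefficients mixing the source channels into Lt/Rt, with the imaginary part realized as a 90-degree (Hilbert) phase shift. The coefficients below are illustrative placeholders in a roughly Dolby-MP-like style, not the published specs.

    import numpy as np
    from scipy.signal import hilbert

    # Illustrative placeholder coefficients, NOT published specifications.
    # Source channel order assumed here: (L, C, R, S).
    FORMATS = {
        "dolby_mp_like": {
            "Lt": [1.0, 0.707, 0.0, 0.707j],    # surround shifted +90 deg
            "Rt": [0.0, 0.707, 1.0, -0.707j],   # surround shifted -90 deg
        },
        # ...further candidate formats would be added here
    }

    def encode_track(channels, coeffs):
        """Mix real source channels into one encoded track.

        A complex coefficient c is applied as c.real * x + c.imag * H{x},
        where H{x} (the imaginary part of the analytic signal) is a
        90-degree phase-shifted copy of x.
        """
        out = np.zeros(len(channels[0]))
        for x, c in zip(channels, coeffs):
            x = np.asarray(x, dtype=float)
            out += c.real * x + c.imag * np.imag(hilbert(x))
        return out

    def encode(channels, fmt):
        coeffs = FORMATS[fmt]
        return (encode_track(channels, coeffs["Lt"]),
                encode_track(channels, coeffs["Rt"]))

Parts b) and c) then amount to running known material through each of these and watching where the resulting Lt/Rt pair lands on the sphere.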

The code to do that involves two Hilbert transformers, a bit of complex modulus and binning arithmetic, and a big enough three-dimensional (hash?) array. (The combined channel amplitude can be normalized away.)
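
A minimal sketch of that analysis side, assuming the usual Stokes-parameter-style mapping onto the sphere (with total power normalized away, as noted):

    import numpy as np
    from scipy.signal import hilbert

    def scheiber_histogram(lt, rt, n_bins=64, eps=1e-12):
        """Accumulate a histogram of where a two-channel recording sits on
        the Scheiber sphere.

        The analytic (Hilbert) signals of Lt/Rt give an instantaneous
        amplitude ratio and phase difference; Stokes-style parameters
        normalized by total power map each sample onto the unit sphere,
        so the combined channel amplitude drops out.
        """
        al = hilbert(np.asarray(lt, dtype=float))
        ar = hilbert(np.asarray(rt, dtype=float))
        s0 = np.abs(al) ** 2 + np.abs(ar) ** 2 + eps
        s1 = (np.abs(al) ** 2 - np.abs(ar) ** 2) / s0
        s2 = 2.0 * np.real(al * np.conj(ar)) / s0
        s3 = 2.0 * np.imag(al * np.conj(ar)) / s0
        hist, edges = np.histogramdd(np.column_stack([s1, s2, s3]),
                                     bins=n_bins, range=[(-1.0, 1.0)] * 3)
        return hist, edges

The long-term mass of that histogram should trace out the trajectory; the classification step is then just scoring it against the locus each candidate encoder predicts, nearest-locus distance being the crudest possible fit.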

If that doesn't work, then we might have to build a source model and do some cross-covariance analysis against it. (Long-term averaging of the recording-side background noise alone ought to bring up the locus.)
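
The simplest instance of that I can think of -- not a full source model yet, just its degenerate case of roughly uncorrelated, comparable-power source channels -- is a long-term averaged, normalized cross-spectrum between the two encoded channels; under that assumption each candidate matrix predicts a characteristic complex inter-channel correlation. A sketch:

    import numpy as np
    from scipy.signal import csd, welch

    def interchannel_signature(lt, rt, fs, nperseg=4096):
        """Long-term averaged, normalized cross-spectrum between Lt and Rt.

        The complex result (magnitude <= 1) summarizes the amplitude and
        phase relation the encoding matrix imposes on average; quiet or
        background-noise passages should make it stand out most clearly.
        """
        f, s_lr = csd(lt, rt, fs=fs, nperseg=nperseg)
        _, s_ll = welch(lt, fs=fs, nperseg=nperseg)
        _, s_rr = welch(rt, fs=fs, nperseg=nperseg)
        return f, s_lr / np.sqrt(s_ll * s_rr)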

Having a source model ought to let us identify the center front and any diffuse channels, as in Dolby MP, while L/R separation would probably remain an issue unless the orientation was given as metadata. Up-down, and vertical vs. horizontal, ought to be easy enough with just about any material. So, in theory, arbitrary channel orders (with more than two channels as well) ought to be doable to a degree.
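
A crude heuristic sketch of that identification step (my assumption, not an established method): rank the decoded channels by how correlated they are with everything else -- the most-correlated one is a decent center-front candidate, the least-correlated ones behave like diffuse feeds.

    import numpy as np

    def rank_channel_roles(channels):
        """Guess channel roles from inter-channel correlations.

        `channels` is an (n_channels, n_samples) array. Returns the index
        of the best center-front candidate, the indices of the two least
        correlated (diffuse-looking) channels, and the raw scores.
        """
        x = np.asarray(channels, dtype=float)
        c = np.corrcoef(x)
        # average absolute correlation with the other channels
        score = (np.abs(c).sum(axis=1) - 1.0) / (len(c) - 1)
        center = int(np.argmax(score))
        diffuse = np.argsort(score)[:2].tolist()
        return center, diffuse, score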

Next in line, it'd seem that at least separation of diffuse sound from direct sources, and other DirAC-like processing, could help track the prominent sources and so let us see which kinds of loci they follow. Averaged over time, they again ought to plot out the average encoding locus, and combined with some kind of (possibly adaptive and frequency-sensitive) source model, they could help us finally tell even the evilest, most actively encoded pieces apart.
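
A rough sketch of that separation step, using short-term inter-channel coherence in the STFT domain as the diffuseness proxy (an approximation on my part -- DirAC proper works on B-format intensity and energy, which we don't have here):

    import numpy as np
    from scipy.signal import stft

    def directness_map(lt, rt, fs, nperseg=1024, avg_frames=8):
        """Time-frequency map of how 'direct' the encoded pair is.

        Coherence near 1 marks coherent (direct-source) bins whose
        phase/amplitude relation can feed the sphere histogram above;
        coherence near 0 marks the diffuse remainder.
        """
        _, _, sl = stft(lt, fs=fs, nperseg=nperseg)
        _, _, sr = stft(rt, fs=fs, nperseg=nperseg)
        k = np.ones(avg_frames) / avg_frames

        def smooth(z):
            # moving average over a few STFT frames, per frequency bin
            return np.array([np.convolve(row, k, mode="same") for row in z])

        s_ll = smooth(np.abs(sl) ** 2)
        s_rr = smooth(np.abs(sr) ** 2)
        s_lr = smooth(sl * np.conj(sr))
        return np.abs(s_lr) ** 2 / (s_ll * s_rr + 1e-12)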

And if nothing else really works, perhaps we need to build an actual encoder for each format we want to discern, plus some backwards-projecting code through each of them, and just run them all at the same time to see which one produces the best simulacrum of what we actually have, starting from the best common, adaptive source model? That's already evil, but it does work: I once used that approach to blindly identify languages behind unknown character encodings, for example. (The basic framework is simple as hell, but then your Bayesian arithmetic easily bumps into precision trouble. If we have to go this far, we'll have to recruit a numerical analyst as well.)
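
For what it's worth, most of that precision trouble goes away if the whole comparison stays in the log domain. A minimal sketch, assuming each candidate encoder/decoder pipeline already spits out per-frame log-likelihoods for the recording:

    import numpy as np

    def best_format(log_likelihoods, log_priors=None):
        """Rank candidate formats by posterior probability.

        `log_likelihoods` maps format name -> array of per-frame
        log p(data | format). Working with sums of logs and normalizing
        via logaddexp avoids the underflow a naive product would hit.
        """
        names = list(log_likelihoods)
        if log_priors is None:
            log_priors = {n: -np.log(len(names)) for n in names}  # uniform
        log_post = np.array([log_priors[n] + np.sum(log_likelihoods[n])
                             for n in names])
        log_post -= np.logaddexp.reduce(log_post)   # normalize in log domain
        order = np.argsort(log_post)[::-1]
        return [(names[i], float(np.exp(log_post[i]))) for i in order]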

Fons, does this seem like a research outline, already? :)
--
Sampo Syreeni, aka decoy - [email protected], http://decoy.iki.fi/front
+358-50-5756111, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2