Re: [Sursound] UHJ disc discovery?

Sampo Syreeni Mon, 04 Nov 2013 19:22:19 -0800

On 2013-11-05, Fons Adriaensen wrote:

I have often wondered if there was a way to confirm matrix encodingby using a scope to display phase differences, but have found thatnot to be the case
It should be possible to do that, but the process will be a bit morecomplicated.


This has also been talked about before.

Calculate a series of FFTs on overlapping windows, over the entirelenght of the recording, and look for components that have equalmagnitude in L and R.

On my part, I'd rather do a continuous (perhaps FFT-enabled, perhapsfrequency segregated) wideband Hilbert transform, and in way or anothertry to bin+aggregate the phase relationships between the two carrierchannels, so as to build up an histogram of where the phaserelationships landed over time. I believe, over time, that would be theeasiest and most adaptable way of gathering the basic data, to then besubjected to some prima facie Bayesian reasoning and/or algorithmsderived from machine learning from-sample.

I suspect support vector machines derived fromVapnik-Cervonekis-complexity-limited low order Chebyshev-polynomialsmight be useful in the latter, too: their inverse problem is the mostwell behaved I know with regard to all kinds of noise and nonlinearity,in these kinds of problems, and at the same time, if you can somehowmarry them to an efficient preprocessing and prebinning stage, maybe youcould even do realtime, adaptive recognition of the original analogencoding.

But obviously, since I've never actually coded anything like this out orseen the intermediate statistics, this must remain just a hint tosomeone more willing than me to "actually talk the talk".

If they were panned to center using a normal stereo pan pot theyshould be exactly in phase.

That, and the fact that in analogue material you can't really rely uponthe stuff even staying within the same encoding regime, is why I abovementioned time spans. And why I believe this sort of blind decoding, iftried, should be able to do mixed decoding and/or seamless transitionsfrom the start. Because, once you put in even a statistically optimalrecognizer, it'll be just half sure, half of the time, and will betelling you lots of unexpected things in the middle of thathalf-and-half.

Thus, the worst problem might not be to detect what you want or do notwant to see at all, and then just decode what is there. The real problemmight be to deal with continually and variously detecting something inbetween, and how to decode that, then, without sounding hideous/sillypretty much all of the time.

If the signal is UHJ encoded AMB there will be a phase difference ofaround 35 degrees.

In fact, under some rather mundane assumptions, at least two channel UHJcan even be automatically and reliably detected to the level ofresolving L from R. The same isn't true for e.g. Dolby MR, where theleft-right difference doesn't exist at a level deeper than simple 180degree phase reversal.

If you find that consistently on all center front panned componentsthat would be strong evidence of UHJ encoding.

So, exactly that. At the same time, the accumulation over the Scheibersphere of phase-amplitude points, combined with a couple of ratherminimalistic a priories on what sound fields ought to look like in reallife, ought to be able to statistically differentiate between prettymuch all of the extant, untagged, analogue systems. MP/SQ/QS/UHJ/RM, allthat typical stuff I at least know about, and with a couple of tricksover time, probably even the various extant versions of each of thosesystems. In fact, using algorithms derived from the audio encoding litof the past ten years or so of mobile phone codec algorithms, you shouldeven be able to efficiently (i.e. in real time) compensate for staticdelay differences between the channels, slowly varying same, quite a lotof backround noise, and even certain kinds of speaker-microphone-likerapidly varying angular distortion. In some regards even Doppler, evenif it's particularly nasty, as a non-shift-invariant phenomenon.


So, exactly that, and even more.

Looking at the complete signal it's probably impossible to decide ifany phase difference is significant or not, you need the 'logicdecoding' first.

The way I see it myself, as an amateur (and perhaps soon freelanceprofessional) audio DSP guy, as well as an amateur economist, is thatyou shouldn't look at the instantaneous phase differentials as such.Instead, you should recognize that there is a whole spectrum ofdifferent timespans between none at all and hours, relevant to thesolution, all of them interworking the whole time.

Granted, I might be making this a bit more difficult from the start thanit has to be. But still, think about a typical audio feed from yourcurrent Western television set. It does have commercial breaks in it,no, which might contain totally fucked up audio wrt the audio you had inthe movie they just interrupted. No? So in principle, do you not have toswitch fluently and rather often between encodings, not only in kind,but in time as well? I think you have to be able to do that.

Also, you have to be able to do that even in mixture, because in mostwork I've heard of, "the stuff" was *not* mixed any single system. As Irecall, even such iconic things as the Star Wars soundtrack (almost)always contained ad hoc elements which didn't utilize the underlyingDolby encoding, but were placed as such onto the raw channels. If Iremember right, you'd have to be able to decode even time-variantmixtures, from the start, in order to be useful in the real world. And,you in fact *can* do that, at least in theory, but what I'm then sayingis that the methods you'd use to do that can't stay within the typicalsingle-rate LTI framework; the'll have to be multistage and multirate,often nonlinear ones.

(And actually I might have to ask the group a couple of salientquestions here, in a short time, because it just might be I have toutilize some ambisonic methods just the way I've described, on shortorder. ;) )

--
Sampo Syreeni, aka decoy - [email protected], http://decoy.iki.fi/front
+358-40-3255353, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound

Re: [Sursound] UHJ disc discovery?

Reply via email to