Re: [Sursound] the recent 2-channel 3D sound formats and their viability for actual 360 degree sound

Stefan Schreiber Mon, 18 Jul 2011 11:34:16 -0700

Jörn Nettingsmeier wrote:

On 07/18/2011 06:18 PM, Stefan Schreiber wrote:
Which means that they are probably using HRTF techniques. Because HRTF
is an individual parameter, they would have to use some form of
"standard" HRTF, as long as they don't perform individual measurements.
For me, the interviewer didn't ask the right questions.
quite obviously, the interviewer either doesn't have much insight intosurround sound psychoacoustics as a whole, or he's deliberatelyplaying dumb for the (dubious) benefit of his readers.

Jörn, yes, but I tried to distinguish between the interviewer and thetechnique which is actually reviewed. ..

And again, that’s not just amplitude.
master of suspense. to the uninitiated, this wording implies highmagic. to the slightly more initiated, the word "phase" begins to glowin deep blue letters on the wall, and we have read so many amazingthings in our hifi magazines about phase, and our friends in the pubdon't understand it.



Right you are ;-) , even completey right, but see my first commentary above.

So we’re taking advantage of
what we learned there to create this feeling that things are being
projected into space in the D axis, the depth axis.
<sound of coffee being expelled through the nose>

the what?
so this is 4d spacetime, right? x, y, z, and d :) now this funnydrone noise, is that minkowski spinning in his grave?



Careful, here I differ!

In a parametric approach, d makes a lot of sense. It is not clear fromthe interview < how > the distance cues are reproduced, agreed.

Music representation according to this approach is clearlyfive-dimensional (x,y,z, d and t!), so they call this "multidimensialaudio"/MDA... O:-) :-)

This < might > be something new, and indeed difficult to obtain with 5.1
or (classical) Ambisonics. (If at all.)
ambisonics is about recreating a sound field (for many listeners).head-tracked binaural (whether fed over loudspeakers or headphones) isa single-listener thing.any cues that will work without head tracking for more than a singleperson with known orientation in the room can be tacked to ambisonicsjust as well.

Ambisonics 1st order doesn't reproduce close distance. And maybe it isjust for one or two listeners. We have to be fair...

However, X-talk cancelling techniques would require close speakers.
i'm not sure about this. from what i've heard, rwth aachen are runninga CAVE with head tracking and binaural feeds delivered by a cube ofspeakers (as that is the only layout that wouldn't interfere too muchwith their screen configuration). no idea how exactly they do it, butthere should be some papers out there. iirc they can even accomodatemore than one listener. haven't heard it, though.

Heinrich Hertz Institut (Berlin) does reproduction of 3D video withoutglasses, while they are tracking observer positions.Even the XBox might track players, so what? (Kinect, distance cuesquite directly via IR camera, if I remember well.)

What I heard that day at SRS was a witch’s brew of breakthrough audio
technologies, a combination of new psychoacoustic depth-rendering
techniques applied through the filter of a game-changing approach to
mixing movie soundtracks that SRS calls Multi-dimensional Audio, or
MDA. Together, they form the basis of CircleCinema 3D, a feature that
will begin appearing in flat-panel HDTVs and soundbars from SRS
licensees in 2012, and perhaps later, in A/V receivers.


this is gibberish.

Look, he is just a journalist, not a sursound-trained suroundscientist... 8-)One technique journalist I know has told me that he plans to visit SRSwhen he is next time in LA, which will be soonly. The interview shouldinclude better question, he already knows...

But the coding of depth cues seems to be something new, and if this
works, it is really impressing.
actually, i don't see that happening for more than one person, withouthead tracking.

Very unclear, indeed. Somebody has to review the approach from a moretechnical point of view!

P.S.: The next surround system has to be independent of speaker
configurations, and to include the 3D/"sphere" aspect. If you can
reproduce distance cues, even better.
distance cues are mostly gimmickry in my opinion. you can fakedistance in a number of ways, but most are really dependent on thespectrum and envelope of the program material. most aspects ofdistance encoding are also orthogonal to most surround techniques,which means they can be added at will, today. they don't evennecessitate a fancy new name.



Ok. So just < do > this in a commecial system?!

But again, if they design some parametric or "audio object" basedsystem, it is natural to add some distance parameter. (In 3D video, theparallel approach would be "2D and depth". It is pretty natural andefficient, although there are some limits in accuiracy.)

you could just say "i'm doing crosstalk-cancelled binaual delivery viaspeakers using near-field hrtfs as described by menzies and others",or you could say "i'm using vector-base amplitude panning of anechoicaudio objects as introduced by pulkki, combined with room synthesisbased on well-known algorithms a, b, and c, some lowpass to mimic airabsorption and adaptive resampling delay to obtain doppler shifts".

Absolutely, but this an area where they just might stop talking. Ialready have suspected that they are using VBAP and X-talk cancelledbinaural representation (not on this list). Your analysis seems to bevery sophisticated, and probably pretty close to the real thing. Bravo!

But even if they use ingredients which are all known in the scientificcommunity, they are trying to define a new standard, or at least a newcommercial system which is based on all this science. It is hard todesign anything commercial when the science behind is not understood. Inthis sense, I don't have any problem with the SRS appoach. (It is stillhard enough to get a system work...)

Now, compare this to "our" different European WFS attempts, and you willsee what I mean. (This is very interesting and scientifically probablymore advanced, but commecially, this is just going nowhere. A new cinemaaudio standard would intend to introduce at least < some > meaningful 3Dclues. I hope that I don't have some 97 new enemies from the WFScommunity... :-[ )

of course you could also say "we are harnessing ultrasound-triggeredectoplasm for real 4-d sound projectiong using our proprietaryone-more-dimension-than-your-mum technology". yawn.

Disagree, and strongly! They are demonstrating their technology, this isnot about vaporware.

it's so friggin' hard to make the walls of the listening roomdisappear (with _any_ surround technique) that i don't see how themajority of consumers would ever respond to distance cues properly,with the exception of some bumblebee-in-your-ear tricks or deptheffects mediated by visuals. the former are often limited to veryspecific content, and for the latter, if you have visuals, then likeit or not, mono is totally adequate and the brain will do the rest(exaggerated, but only very slightly).

And maybe they want to have some "proximity effects" in cinema/filmaudio? (Some can be done now, I know.)

My opinion: It is the right time to introduce some improved surroundsystem into the market, at least in the cinema area there seems to bereal demand.Scientifically, we should have gathered enough knowledge to be able todo so, by now.



Surround is not just about Ambisonics and maybe WFS, yet again.

Even if SRS is using technology which is an old hat and based on somevery old math conceived by Huygens or even earlier :-D , well, at leastthey do something.



Best,

Stefan
_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound

Re: [Sursound] the recent 2-channel 3D sound formats and their viability for actual 360 degree sound

Reply via email to