Jörn Nettingsmeier wrote:

On 07/18/2011 06:18 PM, Stefan Schreiber wrote:



Which means that they are probably using HRTF techniques. Because HRTF
is an individual parameter, they would have to use some form of
"standard" HRTF, as long as they don't perform individual measurements.
For me, the interviewer didn't ask the right questions.


quite obviously, the interviewer either doesn't have much insight into surround sound psychoacoustics as a whole, or he's deliberately playing dumb for the (dubious) benefit of his readers.


Jörn, yes, but I tried to distinguish between the interviewer and the technique which is actually reviewed. ..


And again, that’s not just amplitude.


master of suspense. to the uninitiated, this wording implies high magic. to the slightly more initiated, the word "phase" begins to glow in deep blue letters on the wall, and we have read so many amazing things in our hifi magazines about phase, and our friends in the pub don't understand it.


Right you are ;-) , even completey right, but see my first commentary above.


So we’re taking advantage of
what we learned there to create this feeling that things are being
projected into space in the D axis, the depth axis.


<sound of coffee being expelled through the nose>

the what?

so this is 4d spacetime, right? x, y, z, and d :) now this funny drone noise, is that minkowski spinning in his grave?


Careful, here I differ!

In a parametric approach, d makes a lot of sense. It is not clear from the interview < how > the distance cues are reproduced, agreed.

Music representation according to this approach is clearly five-dimensional (x,y,z, d and t!), so they call this "multidimensial audio"/MDA... O:-) :-)


This < might > be something new, and indeed difficult to obtain with 5.1
or (classical) Ambisonics. (If at all.)


ambisonics is about recreating a sound field (for many listeners). head-tracked binaural (whether fed over loudspeakers or headphones) is a single-listener thing. any cues that will work without head tracking for more than a single person with known orientation in the room can be tacked to ambisonics just as well.


Ambisonics 1st order doesn't reproduce close distance. And maybe it is just for one or two listeners. We have to be fair...


However, X-talk cancelling techniques would require close speakers.


i'm not sure about this. from what i've heard, rwth aachen are running a CAVE with head tracking and binaural feeds delivered by a cube of speakers (as that is the only layout that wouldn't interfere too much with their screen configuration). no idea how exactly they do it, but there should be some papers out there. iirc they can even accomodate more than one listener. haven't heard it, though.


Heinrich Hertz Institut (Berlin) does reproduction of 3D video without glasses, while they are tracking observer positions. Even the XBox might track players, so what? (Kinect, distance cues quite directly via IR camera, if I remember well.)


What I heard that day at SRS was a witch’s brew of breakthrough audio
technologies, a combination of new psychoacoustic depth-rendering
techniques applied through the filter of a game-changing approach to
mixing movie soundtracks that SRS calls Multi-dimensional Audio, or
MDA. Together, they form the basis of CircleCinema 3D, a feature that
will begin appearing in flat-panel HDTVs and soundbars from SRS
licensees in 2012, and perhaps later, in A/V receivers.


this is gibberish.


Look, he is just a journalist, not a sursound-trained suround scientist... 8-) One technique journalist I know has told me that he plans to visit SRS when he is next time in LA, which will be soonly. The interview should include better question, he already knows...


But the coding of depth cues seems to be something new, and if this
works, it is really impressing.


actually, i don't see that happening for more than one person, without head tracking.


Very unclear, indeed. Somebody has to review the approach from a more technical point of view!


P.S.: The next surround system has to be independent of speaker
configurations, and to include the 3D/"sphere" aspect. If you can
reproduce distance cues, even better.


distance cues are mostly gimmickry in my opinion. you can fake distance in a number of ways, but most are really dependent on the spectrum and envelope of the program material. most aspects of distance encoding are also orthogonal to most surround techniques, which means they can be added at will, today. they don't even necessitate a fancy new name.


Ok. So just < do > this in a commecial system?!

But again, if they design some parametric or "audio object" based system, it is natural to add some distance parameter. (In 3D video, the parallel approach would be "2D and depth". It is pretty natural and efficient, although there are some limits in accuiracy.)


you could just say "i'm doing crosstalk-cancelled binaual delivery via speakers using near-field hrtfs as described by menzies and others", or you could say "i'm using vector-base amplitude panning of anechoic audio objects as introduced by pulkki, combined with room synthesis based on well-known algorithms a, b, and c, some lowpass to mimic air absorption and adaptive resampling delay to obtain doppler shifts".

Absolutely, but this an area where they just might stop talking. I already have suspected that they are using VBAP and X-talk cancelled binaural representation (not on this list). Your analysis seems to be very sophisticated, and probably pretty close to the real thing. Bravo!

But even if they use ingredients which are all known in the scientific community, they are trying to define a new standard, or at least a new commercial system which is based on all this science. It is hard to design anything commercial when the science behind is not understood. In this sense, I don't have any problem with the SRS appoach. (It is still hard enough to get a system work...)

Now, compare this to "our" different European WFS attempts, and you will see what I mean. (This is very interesting and scientifically probably more advanced, but commecially, this is just going nowhere. A new cinema audio standard would intend to introduce at least < some > meaningful 3D clues. I hope that I don't have some 97 new enemies from the WFS community... :-[ )


of course you could also say "we are harnessing ultrasound-triggered ectoplasm for real 4-d sound projectiong using our proprietary one-more-dimension-than-your-mum technology". yawn.


Disagree, and strongly! They are demonstrating their technology, this is not about vaporware.


it's so friggin' hard to make the walls of the listening room disappear (with _any_ surround technique) that i don't see how the majority of consumers would ever respond to distance cues properly, with the exception of some bumblebee-in-your-ear tricks or depth effects mediated by visuals. the former are often limited to very specific content, and for the latter, if you have visuals, then like it or not, mono is totally adequate and the brain will do the rest (exaggerated, but only very slightly).

And maybe they want to have some "proximity effects" in cinema/film audio? (Some can be done now, I know.)


My opinion: It is the right time to introduce some improved surround system into the market, at least in the cinema area there seems to be real demand. Scientifically, we should have gathered enough knowledge to be able to do so, by now.


Surround is not just about Ambisonics and maybe WFS, yet again.

Even if SRS is using technology which is an old hat and based on some very old math conceived by Huygens or even earlier :-D , well, at least they do something.


Best,

Stefan
_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound

Reply via email to