Re: [Sursound] 3DAA | Audio Alliance

Sampo Syreeni Fri, 31 Dec 2010 14:00:02 -0800

On 2010-12-23, [email protected] wrote:

Presumably, the 3D/AA has embraced "object-oriented audio" in order toa) abstract from speaker layouts b) reduce number of audio "channels"to 6 or 8 (i.e. fit into 5.1/7.1 distribution media like Blu-ray) andc) to make production more streamlined.

Object oriented audio usually denotes mono sources which aresynthetically panned right in the end, at the receiver side. This is allfine and good -- most synthetic ambisonic material is born this way aswell, and with irregular arrays, this can in fact be the best way to go.I mean, a) it decouples the panning algorithm from the transmission one,so that ambisonic, WFS, whatever holophonic formalism can be used torender the stuff (sometimes the low cost render can be better using someother formalism than ambisonic), and b) the material remains unmixedright to the end, so that it encourages reuse if the object basedintermediary is also exposed to reusers. (Cf. MPEG4 Structured Audio.)

What pains *me*, and what I told 3DAA directly already, is that thisdoes not address captured physical soundfields at all. In that spaceconversions (between e.g. WFS and HOA) are extremely painful if at allworked out, HOA seems like the only technology with real heightcapability as of now, and so I wonder whether 3DAA can actually deliverreal 3D there without incorporating some old ambisonic tech. In a veryscalable form as well, because net bandwidth is still a real issue formultichannel online playback, even factoring in compression.

They've yet to answer my email. So I think it could do some good if thelist's big guns could also weigh in on this, individually, wrt 3DAA.Before the latter settle on something too limiting and impracticable fore.g. cinematic soundscape work. Which *is* one of the biggest sellersafter all, and something the Alliance recognizes even in their publicitymaterial...

If the quality of the resulting 3D surround sounds terrible -- yes,QUAD comes to mind -- then what a waste of time it will have been.

They seem to be standardizing a transmission format. An object based onehas its limitations (e.g. usually just mono sources), but then it canalso stimulate innovation in how to render the stuff -- or arerecognition of how well ambisonic panning actually does. It doesn'tset the eventual sound, it sets a stage for precisely the kind ofdecoder innovation MAG envisioned. Only on digital age steroids. So Idon't think we have a problem in there, at all.

As I already said, I see the problem in settling for WFS *only* (I havea really bad whiff of it and the patents underlying it, Iosono/Dolby,here). This could be very bad for free/open content, online.

I also see a nasty preservation problem: WFS pickups are pretty muchnever dense enough to capture higher harmonics, nor well-regularizedenough to deal gracefully with oblique angles of arrival, nor do theyespecially attenuate spatial aliasing at HF within even the prominentlobe like coincident pickups do. So a hundred years from now, what dothe librarians and audio preservationists have to work with wrt capturedsoundfields? A totally irregular representation which isn't any too easyto make right even with the highest end DSP and psychoacousticalmodelling. I don't like that picture at all. (The synthetic stuff willbe pristine as ever, provided that they really parametrized it right andopenly published the parametrization. Kudos for that. But it's notenough per se.)

Is there an Ambisonic and/or Ambi-derived system that sounds reallygood that should be considered?

I actually think synthetic sources should be kept separate and renderedonly at the receiving end. Because that effectively leads to "infiniteorder decoding". The best of them all, by a far shot, and conducive to arace in better and better decoders. Just what we want.

As for today's capture of live performance/sound(effects) in a 3D space,even Plain Old Ambisonic fares better than WFS or any other micingtechnique. Having heard Eero Aro's demonstration of POA 4-speakerpantophony, and Ville Pulkki's demonstrations of both unaided and DirACaided pantophonic/periphonic work, I can say with certainty that mostpeople hearing a simple SoundField recording via a state of the artdecoder would be impressed, if not even fooled into thinking it was thereal thing, sitting at the sweet spot and blind-folded. Plus I think wecould probably do so well with even the basic B-format and someperiphonic DirAC derivative that I would be fooled as well, eventhoughI'm a pretty picky listener.

Really, even the old, analog, pantophonic implementation of ambisonicvia BHJ transmission is frighteningly good on only four speakers. You*do* startle at a lateral sound, you *do* turn you head, the source*does* stay stable, and when you turn, it *is* there. Firmly in betweenno speakers. Turning backwards you do hear the phasiness, but it doesn'tannoy you; it's rather like a poor man's hall echo or something. Thatthen using precisely two channels -- and now we're talking about four,ain't we? Full B-format? (Personally I then won't vouch forout-of-sweetspot reproduction. It sounds massively spatially distortedto my ear.)

My guess is that MAG would be on that Working Group, if he was stillalive.


Should be. Probably wouldn't.

Who should be there in his stead?

As far as classical ambisonic goes, I'd say Geoffrey Barton. But as faras the current folks dealing with modern and open/free incarnations ofthe technology go, my vote would be for Angelo Farina. Maybe FilippoFazi, if he could forget the physics for a change and bewilling/prepared to explain the tech in layman terms. I mean let's faceit, people in these committees don't really understand physicalacoustics or the slighter nuances of spatial audio; at least yet.

I'm pretty sure who we'd ideally want to represent the ambisoniccommunity is someone with a) noted doctoral/lectorial or professorialcredentials (so that others make note of se's comments), b) anengineering background (to keep it real, simple and to the point), c)political and interpersonal background/experience (for influence withina working group/committee), d) a tight backing crowd to cover the loadof the factual bombardment that will be coming (this list could be thetechnical one, but where to get the political, financial, etc. ones?),and e) above all a nice guy who's always present (to facilitate and gainsocial appreciation within the crowd).

Unfortunately such well-supported technical-missionary-diplomats are fewand far between. ;)

--
Sampo Syreeni, aka decoy - [email protected], http://decoy.iki.fi/front
+358-50-5756111, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound

Re: [Sursound] 3DAA | Audio Alliance

Reply via email to