Re: [Sursound] an exploratory mail

Stefan Schreiber Fri, 25 Jan 2013 00:58:11 -0800

http://www.content-technology.com/standards/?p=162

Key issues to be addressed are, a compact and bit-efficientrepresentation of multi-channel audio programs with the ability toscale up to transparent quality and the flexible rendering of an audioprogram to an arbitrary number of loudspeakers for arbitraryconfigurations. 3D Audio support via headphones is a key capability inorder to deliver an immersive experience for users of mobile devices.


The "mobile device  part" sounds very much like me, but anyway...       :-D

This technology will form the basis for MPEG-H 3D Audio, the Audiopart (Part 3) of the MPEG-H (ISO/IEC 23008) suite of technologies.
MPEG has launched a new Ad hoc Group (AhG) activity towards respondingto industry needs on the adoption of MPEG Audio technology. This AhGwill assess industry needs in the marketplace, respond to inquiriesand make available information such as conformance streams,application notes and white papers, that might support adoption ofMPEG Audio technology in the various application domains. Theinformation will be disseminated by various means, including websitesand user forums and participation in the AhG is open to the public.


So, supposedly there is an AhG "responding to inquiries".

I, for my part, will try to get into some constructive contact with this< new > group.


Best,

Stefan Schreiber

P.S.: They possibly should check some of our recent discussion.



Bo-Erik Sandholm wrote:

I was at the Stockholm MPEG meeting listening to the 3D-audio session.

If I remember correctly the guy holding the "keynote" speech actually talked 
about several choices of 3D audio encodings in the future standard to choose from for 
different
Target devices and distribution types.
I got the impression that there would be a head parameter field having a flag telling how many sound channels to use and according to what algorithms thesound channels following where to be decoded.When I listened to him I kept thinking that ambisonics B-format in varying orders, maybe in combination with a few dedicated channels for LFE effects and Dialogue would fulfill thestated "vision" for 3D surround sound at least for applications with a modest bandwidth need.
How BAD would 3D audio really be with a Ambisonics B+ system  with 4 channels 
for 3D soundscape and 2 additional discrete channels for Dialogue and LFE?
This as a low entry point and higher or mixed orders could be used for more 
advanced systems.
One of the speakers presented the sound design process for movie sound, The impression of that process was that almost 100% of current movie sound spatialisation iscompletely artificial and very multichannel with a lot of mono sound samples to be placed and controlled in 3D, and looks a lot how the sound scape artists currently work.
In that context I felt that not all people in the industry have understood the 
great idea of separating the working tools and formats for creation of a 
soundscape and the
Distribution formats and decoding to arbitrary speaker localization.
From the 3D-Auro presentation, I got the impression that they need 24 bits in the audio 
5.1 files to "hide" their 3D information, so their format might not be ideal 
for distribution to
Bandwidth starved devices (if there will be any such devices in the future :-). 
So 3D-auro is mostly the old 5.1 with a added higer level, and how well will 
that work for realistic soundscapes :-).

The 3D-auro idea of 5 slim high speaker cabinets with speaker elements for  
ground level and height in addition to LFE  was smart, but Ambisonics could use 
the same speakers and in
Addition compensate if the speakers are not placed according to the standard.

The Japanese 22.2 format is a fixed speaker setup and it is a lot of discrete 
channels to distribute, my wife will NOT accept a 22.2 system in our living 
room :-).

I  really think this is the real (last ?) window of opportunity for more 
widespread use of ambisonics.

Maybe our target for ambisonics  is not to be a solution for all 
implementations, but to find a niche in the device landscape with different 
distribution formats.

I agree with Stefan but in addition I  would like to push the advantage of a B+ 
with only 6 channels.
But as ambisonics have been in use for large outdoor conserts it should 
probably be good enough for a cinema:-).
And it would be an extremely large advantage I think to be able to present a 
LIVE demo of a 3D sound system at the next session, even if only a small one.
I was a bit disappointed that the 3D-Auro session was only in stereo :-), the 
being the only talk with any sound as far as I remember.

Bo-Erik Sandholm
Ambisonic hobby recorder.
Stockholm Sweden

-----Original Message-----
From: [email protected] [mailto:[email protected]] On 
Behalf Of Stefan Schreiber
Sent: den 24 januari 2013 04:33
To: Surround Sound discussion group
Subject: Re: [Sursound] an exploratory mail

Richard Dobson wrote:
On 23/01/2013 01:39, Stefan Schreiber wrote:

..
Why are you actually not reading what I was posting? One of therequirements is "arbitrary" speaker layouts. Full stop. (There willbe some fixed layouts, I guess. But still.)
...
of multi-channel audio programs and the ability to flexibly renderan audio program to an arbitrary number of loudspeakers witharbitrary configurations.
Possibly people were working on the basis of your initial comment:
"However, my impression is that the MPEG's intention is more to settleon something relatively simple, like 22.2, Auro-3D speaker layout etc.
"

Which would narrow the range of layouts considerably.
I can well understand the attraction that dealing with specificcompanies would have for the MPEG committees. They have clearlyidentified and authoritative individuals to deal with who representthe company - whether a CTO or a CEO. Who will stand up to be the CTOor CEO of "Ambisonics", with the support of the community? It wouldostensibly need to be someone (or a small group) not encumbered bypossible conflicts of interests with commercial organisations theywork for. But also someone who can discuss and accommodate thespecial needs of cinema while making the broader argument too. So theywould still need one way or another to speak with the authentic voiceof "the industry".
Ambisonics may in all sorts of ways be both the superior and the mostappropriate technology, but even now it has barely escaped thelaboratory and the concert hall. We can be sure Auro-3D etc will belobbying intensively, not least on the strength of existing industryadoption (e.g. Auro-3D's list of cinemas using their system). Soambisonics has quite a bit of ground to make up, in effect not only tomake its case, but also to make the case against the existing andalready more established choices.
Richard Dobson
I would suggest that the "fast track" Ambisonics based proposal which I have made - relying on 
already defined elements, or elements which could be defined and implemented in an easy and fast way, like a 
"modified B+ scheme with higher order and more direct channels" - is way superiour to Auro-3D, even 
if using "only" 3rd order.

In fact, the "3D performance" of any reasonable .AMB/HOA based approach (referring to the 
potential "vertical resolution" of 2nd+ order height
elements) seems to beat any proposed commercial 3D audio solution for cinemas which I 
have seen recently. (Let us also consider that the "3D performance" of 
classical WFS doesn't seem to be that good, too. :-) )

Therefore, Ambisonics fits best into the official requirements of the MPEG's 
CfP: It is a very natural approach for 3D audio, and arbitrary speaker layouts 
are no problem.
If I am right (and it seems like...), it would be a no-brainer to chose some Ambisonics based solution as the fexible part of the new MPEG-H part 3 standard. (Of course they still might want to define 11.1 or 22.2 as typical "fixed" solutions. If we speak about solutions for 3D audio, I would not expect that you can install dozens of speakers everywhere.However, you should keep the chance to implement some "excellent"solutions for concert halls/big cinemas etc. But you would have to cover smaller/cheaper installations, home installations, mobile audio via
headphones...)

In spite of a former commentary of Dave Malham I would expect that it would be < 
very > difficult to agree on any standard for audio objects, as there isn't any 
common approach available. (They would have to chose some clearly propietary 
solution, and this doesn't look like a reasonable standard definition.)

Therefore, Ambisonics seems to win again. In spite of different opinions on 
some issues, the technique is based on a mathematical theory, and there is a 
considerable implementation tradition.

If anything, there is far more experience available to implement sound field 
based solutions than to implement to completely new object based solutions.

In this sense, I think we don't have to be too shy. Ambisonics is used in many 
installations. In fact,  Dolby Atmos and (DTS) Headphone:X or MDA are the 
newcomers Ambisonics is used for open air concerts.

FLAC made it without CEOs or CTO's, so what? (It got a typical de-facto standard for 
lossless audio compression, without < any > company support.)
But also someone who can discuss and accommodate the special needs ofcinema while making the broader argument too. So they would still needone way or another to speak with the authentic voice of "the industry".
Yes. Audio "for 3D movies" was the initial motivation, and they are certainly 
interested in 3D audio for UHD TV...

I gave some feedback to think about a flexible and broad solution for everything. They should 
present a 3D audio codec in the wide sense, because this is what they really seem to need. If the 
required and "to be defined" 3D audio codec is strong enough for cinema use and not too 
"heavy", it should  fit for UHD, games and mobile audio, too.


Best regards

Stefan

_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound


_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound

Re: [Sursound] an exploratory mail

Reply via email to