Re: [Sursound] Two new approaches for the distribution of surround sound/3D audio

Stefan Schreiber Sun, 11 Aug 2013 21:22:08 -0700

Paul Hodges wrote:

--On 29 July 2013 03:57 +0100 Stefan Schreiber <st...@mail.telepac.pt>
wrote:

UHJ (surround/3D audio) as extension of stereo based files
(distribution via Internet, on discs and streaming, including
YouTube, Spotify etc.)


I like the potential of this idea very much; but it can only move
forward with the free availability of encoders and decoders for
2, 3 and 4-channel UHJ, in both standalone and plugin formats.
Seeing as how mere 2-channel versions have signally failed to
become available at all, I wonder what chance there is.

I had hoped that somebody else would state the obvious, but in the end I have to do this myself... :-)

While I would understand the above argument IF UHJ were some area of its own, my proposal actually implied that you would (in the end) use a B format decoder.

You would < additionally > need a UHJ channel extractor (which works on the AAC file/.M4P/Ogg etc. < input >) and, secondly, the UHJ to B format "translator". (The latter is just the application of some formulas which might not be trivial but are known and/or can be deduced. From an IT perspective, this is very little program code: you just have to apply known formulas. This step also doesn't depend much on the specific programming language used. Mathematics stays mathematics, and the "language" of mathematical formulas is older than programming languages, which explains why formulas look more or less the same in any programming language - well, if I/you exclude Forth and other exotics.... :-D )
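To illustrate how small the "translator" step is, here is a minimal sketch of the commonly published 3/4-channel UHJ encode and decode equations. Assumptions: it works on complex phasors (one frequency component), so the 90-degree phase shift "j" is simply multiplication by 1j; a real implementation would need a wideband 90-degree phase-shift filter on time-domain samples. The function names are hypothetical, and the sum/difference scaling is chosen here so that the round trip has unity gain (published forms differ in that overall factor). Check the coefficients against a trusted reference before relying on them.

```python
# Sketch only: UHJ <-> B format on complex phasors (single frequency).
# J models the 90-degree phase shift; real audio needs a wideband phase shifter.
J = 1j


def b_to_uhj(w, x, y, z=0j):
    """Encode B format (W, X, Y, Z) into UHJ (Left, Right, T, Q)."""
    s = 0.9396926 * w + 0.1855740 * x
    d = J * (-0.3420201 * w + 0.5098604 * x) + 0.6554516 * y
    t = J * (-0.1432 * w + 0.6512 * x) - 0.7071068 * y
    q = 0.9772 * z
    # Left/Right form the stereo-compatible pair; T and Q are the extensions.
    left = (s + d) / 2
    right = (s - d) / 2
    return left, right, t, q


def uhj_to_b(left, right, t, q=0j):
    """Translate 3/4-channel UHJ back to B format (approximate inverse)."""
    # Assumption of this sketch: no extra 1/2 here, so the round trip is unity gain.
    s = left + right
    d = left - right
    k = 0.828 * d + 0.768 * t
    w = 0.982 * s + J * 0.197 * k
    x = 0.419 * s - J * k
    y = 0.796 * d - 0.676 * t + J * 0.187 * s
    z = 1.023 * q
    return w, x, y, z
```

Feeding the output of b_to_uhj() into uhj_to_b() returns the original W, X, Y, Z to within the rounding of the published coefficients (roughly three decimal places). The Left/Right pair alone remains an ordinary stereo signal.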

I would call the two additional steps the < UHJ front end > for a B format decoder.

I know that a lot more work would have to be done to publish B format programs/plugins/mobile apps etc., and to describe B decoder design. Specifically, I believe that B decoders nowadays < should > be able to support output via headphones and binaural techniques. Section III of my 1st posting suggests that head-tracking hardware is both available and cheap enough to be applied in real-world products, including future < surround capable HT headphones >. I mentioned the specific hardware used in the Oculus Rift VR headset, just to give an example of an existing HT chip. (There is plenty of other hardware around.)

It might help to set up an open group which would promote the use and design of B format (HOA? Section II...) decoders: describing the theory behind them, offering (open sourced) program code, distributing free solutions etc. (Setting up a working "open" group requires some organisational skills, but it can be done.)

Again, the real problem seems to be the lack of available B format decoders. (My proposal is, in a simple description, to transport "B format over stereo". If so, it is again obvious that you should see the use of UHJ extension channels just as a front end for B format, because this is the format which has to be decoded.)

I believe that "you" should promote the fact that B format is a real 3D audio format, using just 4 channels. This is obviously an intriguing fact. (Note that the spatial "3D" resolution of full FOA is actually the same as the spatial 2D resolution of XYW, because Ambisonics is isotropic.)

IMO, 2-channel UHJ is something from the past. Why use it if you could distribute the real thing?! Which means B format, not a reduced form of B format. The use of 3/4-channel UHJ (maybe more channels for higher orders) was suggested in order to stay compatible with 2-channel audio/stereo files and streams. It has been shown that existing file/container formats would allow the transport of < UHJ <----> B format > over stereo, via at least two different extension techniques. (File extensions, extensions in current container formats)



Best regards,

Stefan

P.S.: MPEG Surround is also a decoder-based design. (MPS encoder/decoder)

The same is valid for the future (MPEG) 3D audio codec, currently in development. I know that they take the topic "binaural output via headphones" very seriously; you just have to look into their CfP and similar documents...



P.S. 2:

Like everyone else I wish I had the time myself; but when factoring in
the need to learn about DSP programming and modern programming
languages, other commitments, and the slowing down of age...

Paul

No single person could do all the programming, at least not anymore. There are just too many different platforms around....

Nevertheless, B format decoders/apps will be written if Ambisonics is seen as a format which is worth implementing. (Or if there is enough music around in this format.) In this sense, I would look to the applications/aspects which go "beyond" what is offered by the 5.1 ITU layout. (IMO Ambisonics starts to shine if you factor in the inherent capability to record/encode < full-sphere > 3D audio. And because you really could not expect that available 3D audio loudspeaker layouts would look about the same everywhere, the Ambisonics decoder can be seen as a necessary interface to real-world loudspeaker configurations, or to headphones. 2nd advantage... More arguments?!)
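The "decoder as interface to the loudspeaker layout" idea can be sketched with a basic first-order decode, in which the same W/X/Y signals feed whatever speaker ring happens to be present. Assumptions: horizontal-only (height/Z omitted), traditional B format weighting with W scaled by 1/sqrt(2), and a regular (evenly spaced) ring of speakers; irregular layouts and psychoacoustic optimisation (shelf filters etc.) need more than this. The function names are hypothetical.

```python
import math


def encode_horizontal(signal, azimuth_deg):
    """Encode a mono sample into W, X, Y (W weighted by 1/sqrt(2))."""
    a = math.radians(azimuth_deg)
    return signal / math.sqrt(2), signal * math.cos(a), signal * math.sin(a)


def decode_regular_ring(w, x, y, speaker_azimuths_deg):
    """Basic decode to a regular ring: one feed per speaker azimuth."""
    n = len(speaker_azimuths_deg)
    feeds = []
    for az in speaker_azimuths_deg:
        a = math.radians(az)
        # Project the B format signal onto this speaker's direction.
        feeds.append((math.sqrt(2) * w + 2 * (x * math.cos(a) + y * math.sin(a))) / n)
    return feeds


# Same B format signal, two different layouts.
w, x, y = encode_horizontal(1.0, 0.0)  # source straight ahead
square = decode_regular_ring(w, x, y, [45, 135, 225, 315])
hexagon = decode_regular_ring(w, x, y, [0, 60, 120, 180, 240, 300])
```

In both layouts the feeds sum to the original source amplitude, and the speakers nearest the source direction get the largest feeds; only the decoder changes, not the transmitted signal.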

_______________________________________________
Sursound mailing list
Sursound@music.vt.edu
https://mail.music.vt.edu/mailman/listinfo/sursound
