Re: [Sursound] What does a mic with more than 4 channels give you?

Sampo Syreeni Fri, 26 Apr 2013 18:18:43 -0700

On 2013-04-26, Fons Adriaensen wrote:

(Okay, this one is long and filled with intuition-beyond-verified-math.Take it with a grain of salt, even if I think there's a point or twothere..)

Nobody claims there's a hard border between the 'correctlyreconstructed' area and the rest. If you're close enough the errorwill be small, just as x is a reasonable approximation to sin(x) forsmall x, adding an x^3 term will give a better one, etc. The lengthscale is wavelenght.

The same obviously goes for all series expansions, not justFourier-Bessel, unless it just so happens what you're reconstructing isalready 100% some truncated, partial sum of the series you're using. ForF-B and most other useful series that is rarely the case.

Robert, if you're looking for the precise place where the scale entersinto the equations, it's via the radial Bessel term. As you increase theorder, the span (also effective support) of the Bessel functionsincluded thus far grows linearly in distance from the origin as well,and sets the distance from the sweet spot upto which the square normerror stays within some given bound.

In the classical POA framework you don't see the Bessel term directlybecause everything is built up from directivity patterns and otherdistinctly coincident concepts. But if you go through a basis changefrom plane waves to Fourier-Bessel, the radial dependency falls outnaturally and works just as it does in the explicit form utilized bye.g. NFC-HOA and spherical WFS work. Alternatively you could just solvethe wave equation in spherical coordinates using separation ofvariables; Bessel functions constitute the most natural basis forexpressing the radial part under square norm.

What originally messed at least me up was two things: the highdirectionality of higher order systems which is just unphysical at asingle point (precisely four degrees of freedom there), and the factthat we have at least three different decoders going even with POA(systematic, max-Re and in-phase). The first problem is resolvedprecisely by including the radial Fourier component as well, because itshows higher directivity always goes along with a larger volume aroundthe central point, for both reconstruction and pickup design. The secondone is about the mode of convergence we choose for the F-B series, muchlike you can sacrifice fast convergence of ordinary Fourier series tocontrol Gibbs's phenomenon: systematic gives you optimal square normconvergence but leads to oscillating components, in-phase killsoscillation but convergences a lot slower especially at the start of theseries where the relevant directional components are badly localisedover the sphere, and max-Re is somewhere in between.

From this perspective the basic length scale enters analogously to how

the Nyquist frequency enters conventional sampling theory. When you setthe sampling frequency, you at the same time set the upper limit to whatcan be represented faithfully, and rom there on anything above half F_sfolds cyclically. Despite the fact that in POA we conceptually work withinfinitely extended soundfields, we do so roughly using a Fourier-Besselseries, not the continuous transform version of the construct. If weused the latter, there would be no intrinsic length scale, but with theseries you actually have to fix one of the Bessel functions to a givenscale. Or to make the analogy even more exact, of the four differentforms of usual Fourier analysis, we're not using the transform(continuous in time and frequency), the series (periodic in time,discrete in frequency), or the DFT (periodic and discrete in both timeand frequency), but the discrete time/shift Fourier transform (discretein time, periodic in frequency).

In that case the spatial sampling frequency sets the scale above whichspatial-directional frequencies fold -- in mic design we get poorrejection of higher order components present in the field, in sparserigs what happens is exactly the same thing that happens in sphericalWFS above some given frequency. The effect is exactly the same as it iswith the sampling theorem, only it's expressed in a sphericallysymmetric form so that most setups both in mics and in playback rigsviolate the sampling constraint to some degree. (Neither of the usualplanar Fourier basis, truncated, nor the Fourier-Bessel basis, againtruncated, is a subset of the other to any finite degree of truncation.They both span the whole of L^2(R^3) in the limit, but approach thatlimit in fully incomparable ways, even in the usual square norm. Ibelieve this is where the confusion regarding spatial derivatives andspherical harmonical components came from, years ago, e.g. in the formof -- was it -- the ten component "chi-format"; the latter mixed andmatched two different series, with confusing results.)

(I believe this is also the reason Gerzon mentions Gaussian summationformulae in his work on the classical tetrahedral Soundfield mic: thesymmetry inherent in tetrahedra, used together with pickups that have anideal first order directivity pattern because of physical reasons, buysyou extraordinarily high directional rejection of out-of-band componentswhich only breaks down above the asymmetry caused by capsule spacing.That spacing is effectively what sets the length scale/spatial bandwidthlimit within POA. You have to do some nasty mathematical gymnastics inorder to show that for real, but at least my third lobe says the lengthscale is definitely there, despite being masked by the act that in thevery special case of ideal first order POA, you could do without as faras mic design goes. There the representation is essentially scale-freeand only references against whatever frequency you happening to betalking about, leading us to talk about it all in relative terms, likeillposed transform matrices, noise amplification and sensitivity.)

The reason that limit is not part of the orthodox theory is once againan artifact caused by the fully coincident analysis framework and thefunkiness that comes from the fact that we can exchange the first (andonly the first) order components for point velocities, withoutconsidering the extended field at all. POA truly is a special case. Whenyou go to higher orders you actually ought to be doing it the NFC-HOAway, and when you do, the spatial length scale necessarily entersexplicitly, in the form of a fixed array diameter. If you resize thearray, you have to do the analog of bandlimited interpolation in thespherical domain -- implemented in NFC-HOA as the summation over, was itnow, the B(.,.) transfer functions. Those effectively work asdirectional anti-imaging interpolation when you go to higher D, and as adirectional anti-alias when you do the opposite.

Then, just as it happens in more usual forms of resampling, it'sdifficult to see the precise effect before you go into the transformdomain: there the effect is a brickwall filter, while in the dual basedomain the operation seems to be just a soft blurring which stretchesthe half energy cutoff of the basis functions further out. That's thenwhy you don't easily see what the real effect is if you work in the basedomain, as we do in the classical POA derived theory; there we don'tconsider the hard sampling criterion at all but only deal with thesensitivity issues caused by the approaching band edge at each temporalfrequency, and sometimes get surprised by the nastiness caused byunrejected spatial alias or reactive fields. (The latter are just finefrom a theoretical viewpoint, but can bring in unexpected free degreesof freedom and so break e.g. mic designs which didn't expect them. Thatproblem is particular to higher order, because at first order, physicsmeans there is no essential difference between pressure gradient, thefirst order Fourier-Bessel expansion of the pressure field, andvelocity.)

And of course all of that is doubly difficult to to see when you startout with the central, coincident viewpoint: there there is no explicitarray to consider, so no array diameter either. The Bessel terms arefully implicit and have to be dug out by intention if you want to seethem. The conventional theory gives you absolutely no hint that youshould look in that direction before you consider a finite sphericalmic/rig consisting of monopoles and/or first order (cardioid, fig-8)mics (the best we can do with stock components) which can no longer beconsidered coincident. Suddenly you have to consider thespatial-directional sampling effects because in order to get the niceharmony we get from Gaussian interpolation would necessiate physicalmics with at least second order, near-ideal directional selectivity.That you can't buy in a store, so you have to synthesize the responsefrom simpler atoms, and suddenly your math goes haywire because it nowshows all of the nasty radial, dependent on your spatial-directionalsampling cutoff dependent terms as well. (And actually doesn't even getsolved properly unless you impose such a cutoff. In most papers thatcutoff is imposed by accident by restricting the analysis to the orderof the spherical harmonical decomposition that the system aims at; bitmistake: in the process you end up assuming the pressure field is smoothto that order, in precisely the sense that the system requires, which itin reality mostly isn't.)

--

Sampo Syreeni, aka decoy - [email protected], http://decoy.iki.fi/front+358-50-5756111, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2

_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound

Re: [Sursound] What does a mic with more than 4 channels give you?

Reply via email to