On 2013-04-26, Fons Adriaensen wrote:

(Okay, this one is long and filled with intuition-beyond-verified-math. Take it with a grain of salt, even if I think there's a point or two there..)

Nobody claims there's a hard border between the 'correctly reconstructed' area and the rest. If you're close enough the error will be small, just as x is a reasonable approximation to sin(x) for small x, adding an x^3 term will give a better one, etc. The length scale is wavelenght.

The same obviously goes for all series expansions, not just Fourier-Bessel, unless it just so happens what you're reconstructing is already 100% some truncated, partial sum of the series you're using. For F-B and most other useful series that is rarely the case.

Robert, if you're looking for the precise place where the scale enters into the equations, it's via the radial Bessel term. As you increase the order, the span (also effective support) of the Bessel functions included thus far grows linearly in distance from the origin as well, and sets the distance from the sweet spot upto which the square norm error stays within some given bound.

In the classical POA framework you don't see the Bessel term directly because everything is built up from directivity patterns and other distinctly coincident concepts. But if you go through a basis change from plane waves to Fourier-Bessel, the radial dependency falls out naturally and works just as it does in the explicit form utilized by e.g. NFC-HOA and spherical WFS work. Alternatively you could just solve the wave equation in spherical coordinates using separation of variables; Bessel functions constitute the most natural basis for expressing the radial part under square norm.

What originally messed at least me up was two things: the high directionality of higher order systems which is just unphysical at a single point (precisely four degrees of freedom there), and the fact that we have at least three different decoders going even with POA (systematic, max-Re and in-phase). The first problem is resolved precisely by including the radial Fourier component as well, because it shows higher directivity always goes along with a larger volume around the central point, for both reconstruction and pickup design. The second one is about the mode of convergence we choose for the F-B series, much like you can sacrifice fast convergence of ordinary Fourier series to control Gibbs's phenomenon: systematic gives you optimal square norm convergence but leads to oscillating components, in-phase kills oscillation but convergences a lot slower especially at the start of the series where the relevant directional components are badly localised over the sphere, and max-Re is somewhere in between.

From this perspective the basic length scale enters analogously to how
the Nyquist frequency enters conventional sampling theory. When you set the sampling frequency, you at the same time set the upper limit to what can be represented faithfully, and rom there on anything above half F_s folds cyclically. Despite the fact that in POA we conceptually work with infinitely extended soundfields, we do so roughly using a Fourier-Bessel series, not the continuous transform version of the construct. If we used the latter, there would be no intrinsic length scale, but with the series you actually have to fix one of the Bessel functions to a given scale. Or to make the analogy even more exact, of the four different forms of usual Fourier analysis, we're not using the transform (continuous in time and frequency), the series (periodic in time, discrete in frequency), or the DFT (periodic and discrete in both time and frequency), but the discrete time/shift Fourier transform (discrete in time, periodic in frequency).

In that case the spatial sampling frequency sets the scale above which spatial-directional frequencies fold -- in mic design we get poor rejection of higher order components present in the field, in sparse rigs what happens is exactly the same thing that happens in spherical WFS above some given frequency. The effect is exactly the same as it is with the sampling theorem, only it's expressed in a spherically symmetric form so that most setups both in mics and in playback rigs violate the sampling constraint to some degree. (Neither of the usual planar Fourier basis, truncated, nor the Fourier-Bessel basis, again truncated, is a subset of the other to any finite degree of truncation. They both span the whole of L^2(R^3) in the limit, but approach that limit in fully incomparable ways, even in the usual square norm. I believe this is where the confusion regarding spatial derivatives and spherical harmonical components came from, years ago, e.g. in the form of -- was it -- the ten component "chi-format"; the latter mixed and matched two different series, with confusing results.)

(I believe this is also the reason Gerzon mentions Gaussian summation formulae in his work on the classical tetrahedral Soundfield mic: the symmetry inherent in tetrahedra, used together with pickups that have an ideal first order directivity pattern because of physical reasons, buys you extraordinarily high directional rejection of out-of-band components which only breaks down above the asymmetry caused by capsule spacing. That spacing is effectively what sets the length scale/spatial bandwidth limit within POA. You have to do some nasty mathematical gymnastics in order to show that for real, but at least my third lobe says the length scale is definitely there, despite being masked by the act that in the very special case of ideal first order POA, you could do without as far as mic design goes. There the representation is essentially scale-free and only references against whatever frequency you happening to be talking about, leading us to talk about it all in relative terms, like illposed transform matrices, noise amplification and sensitivity.)

The reason that limit is not part of the orthodox theory is once again an artifact caused by the fully coincident analysis framework and the funkiness that comes from the fact that we can exchange the first (and only the first) order components for point velocities, without considering the extended field at all. POA truly is a special case. When you go to higher orders you actually ought to be doing it the NFC-HOA way, and when you do, the spatial length scale necessarily enters explicitly, in the form of a fixed array diameter. If you resize the array, you have to do the analog of bandlimited interpolation in the spherical domain -- implemented in NFC-HOA as the summation over, was it now, the B(.,.) transfer functions. Those effectively work as directional anti-imaging interpolation when you go to higher D, and as a directional anti-alias when you do the opposite.

Then, just as it happens in more usual forms of resampling, it's difficult to see the precise effect before you go into the transform domain: there the effect is a brickwall filter, while in the dual base domain the operation seems to be just a soft blurring which stretches the half energy cutoff of the basis functions further out. That's then why you don't easily see what the real effect is if you work in the base domain, as we do in the classical POA derived theory; there we don't consider the hard sampling criterion at all but only deal with the sensitivity issues caused by the approaching band edge at each temporal frequency, and sometimes get surprised by the nastiness caused by unrejected spatial alias or reactive fields. (The latter are just fine from a theoretical viewpoint, but can bring in unexpected free degrees of freedom and so break e.g. mic designs which didn't expect them. That problem is particular to higher order, because at first order, physics means there is no essential difference between pressure gradient, the first order Fourier-Bessel expansion of the pressure field, and velocity.)

And of course all of that is doubly difficult to to see when you start out with the central, coincident viewpoint: there there is no explicit array to consider, so no array diameter either. The Bessel terms are fully implicit and have to be dug out by intention if you want to see them. The conventional theory gives you absolutely no hint that you should look in that direction before you consider a finite spherical mic/rig consisting of monopoles and/or first order (cardioid, fig-8) mics (the best we can do with stock components) which can no longer be considered coincident. Suddenly you have to consider the spatial-directional sampling effects because in order to get the nice harmony we get from Gaussian interpolation would necessiate physical mics with at least second order, near-ideal directional selectivity. That you can't buy in a store, so you have to synthesize the response from simpler atoms, and suddenly your math goes haywire because it now shows all of the nasty radial, dependent on your spatial-directional sampling cutoff dependent terms as well. (And actually doesn't even get solved properly unless you impose such a cutoff. In most papers that cutoff is imposed by accident by restricting the analysis to the order of the spherical harmonical decomposition that the system aims at; bit mistake: in the process you end up assuming the pressure field is smooth to that order, in precisely the sense that the system requires, which it in reality mostly isn't.)
--
Sampo Syreeni, aka decoy - [email protected], http://decoy.iki.fi/front +358-50-5756111, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
_______________________________________________
Sursound mailing list
[email protected]
https://mail.music.vt.edu/mailman/listinfo/sursound

Reply via email to