Re: [music-dsp] two fundamental questions Re: FFT for realtime synthesis?

Ross Bencina Wed, 31 Oct 2018 03:28:33 -0700

Hi,

Sorry, late to the party and unable to read the backlog, but:

The "FFT^-1" technique that Robert mentions is from a paper by Rodet andDepalle that I can't find right now. It's widely cited in the literatureas "FFT^-1"

That paper only deals with steady-state sinusoids however. It won'taccurately deal with transients or glides.

There has been more recent work on spectral-domain synthesis and I'mfairly sure that some techniques have found their way into some quitefamous commercial products.

Bonada, J.; Loscos, A.; Cano, P.; Serra, X.; Kenmochi, H. (2001)."Spectral Approach to the Modeling of the Singing Voice". In Proc. ofthe 111th AES Convention.




> My goal is to resynthesize arbitary noises.

In that case you need to think about how an FFT represents "arbitrarynoises".

One approach is to split the signal into sinusoids + noise (a.k.a.spectral modeling synthesis).

https://en.wikipedia.org/wiki/Spectral_modeling_synthesis

It is worth reviewing Xavier Serra's PhD thesis for the basics (what wasalready established in the late 1980s.)


http://mtg.upf.edu/content/serra-PhD-thesis

Here's the PDF:
https://repositori.upf.edu/bitstream/handle/10230/34072/Serra_PhDthesis.pdf?sequence=1&isAllowed=y

There was a bunch of in the early 90's on real-time additive synthesisat CNMAT, e.g.


https://quod.lib.umich.edu/i/icmc/bbp2372.1995.091/1/--bring-your-own-control-to-additive-synthesis?page=root;size=150;view=text

Of course there is a ton of more recent work. You could do worse thanlooking at the papers of Xavier Serra and Jordi Bonada:

http://mtg.upf.edu/research/publications



On 31/10/2018 1:35 PM, gm wrote:

But back to my question, I am serious, could you compress a spectrum byjust adding the bins that fall together?

I'm not sure what "compress" means in this context, nor am I sure what"fall together" means. But here's some points to note:

A steady state sine wave in the time domain will be transformed by ashort-time fourier transform into a spectral peak, convolved (in thefrequency domain) by the spectrum of the analysis envelope. If you knowthat all of your inputs are sine waves, then you can perform "spectralpeak picking" (AKA MQ analysis) and reduce your signal to a list of sinewaves and their frequencies and phases -- this is the sinusoidalcomponent of Serra's SMS (explained in the pdf linked above).

Note that since a sinusoid ends up placing non-zero values in every FFTbin, you'd need to account for that in your spectral estimation, whichbasic MQ does not -- hence it does not perfectly estimate the sinusoids.

In any case, most signals are not sums of stationary sinusoids. Andsince signals are typically buried in noise, or superimposed on top ofeach other, so the problem is not well posed. For two very simpleexamples: consider two stable sine waves at 440Hz and 441Hz -- you willneed a very long FFT to distinguish this from a singleamplitude-modulated sine wave? or consider a sine wave plus white noise-- the accuracy of frequency and phase recovery will depend on how muchinput you have to work with.

I think by "compression" you mean "represent sparsely" (i.e. with somereduced representation.) The spectral modeling approach is to "model"the signal by assuming it has some particular structure (e.g.sinusoids+noise, or sinusoids+transients+noise) and then work out how toextract this structure from the signal (or to reassemble it for synthesis).

An alternative (more mathematical) approach is to simply assume that thesignal is sparse in some (unknown) domain. It turns out that if yoursignal is sparse, you can apply a constrained random dimensionalityreduction to the signal and not lose any information. This is the fieldof compressed sensing. Note that in this case, you haven't recovered anystructure.


Ross


















_______________________________________________
dupswapdrop: music-dsp mailing list
music-dsp@music.columbia.edu
https://lists.columbia.edu/mailman/listinfo/music-dsp

Re: [music-dsp] two fundamental questions Re: FFT for realtime synthesis?

Reply via email to