Re: [music-dsp] two fundamental questions Re: FFT for realtime synthesis?

gm Wed, 31 Oct 2018 11:02:17 -0700

Thanks for your time

My question rephrased:

Lets assume a spectrum of size N, can you create a meaningfull spectrumof size N/2

by simply adding every other bin together?

Neglecting the artefacts of the forward transform, lets say anartificial spectrum

(or a spectrum after peak picking that discards the region around the peaks)

Lets say two sinusoids in two adjacent bins, will summing them into asingle bin of a half sized spectrum

make sense and represent them adequately?

In my limited understanding, yes, but I am not sure, and would like toknow why not

if that is not the case.

I'm not sure what "compress" means in this context, nor am I sure what"fall together" means. But here's some points to note:
A steady state sine wave in the time domain will be transformed by ashort-time fourier transform into a spectral peak, convolved (in thefrequency domain) by the spectrum of the analysis envelope. If youknow that all of your inputs are sine waves, then you can perform"spectral peak picking" (AKA MQ analysis) and reduce your signal to alist of sine waves and their frequencies and phases -- this is thesinusoidal component of Serra's SMS (explained in the pdf linked above).
Note that since a sinusoid ends up placing non-zero values in everyFFT bin, you'd need to account for that in your spectral estimation,which basic MQ does not -- hence it does not perfectly estimate thesinusoids.
In any case, most signals are not sums of stationary sinusoids. Andsince signals are typically buried in noise, or superimposed on top ofeach other, so the problem is not well posed. For two very simpleexamples: consider two stable sine waves at 440Hz and 441Hz -- youwill need a very long FFT to distinguish this from a singleamplitude-modulated sine wave? or consider a sine wave plus whitenoise -- the accuracy of frequency and phase recovery will depend onhow much input you have to work with.
I think by "compression" you mean "represent sparsely" (i.e. with somereduced representation.) The spectral modeling approach is to "model"the signal by assuming it has some particular structure (e.g.sinusoids+noise, or sinusoids+transients+noise) and then work out howto extract this structure from the signal (or to reassemble it forsynthesis).
An alternative (more mathematical) approach is to simply assume thatthe signal is sparse in some (unknown) domain. It turns out that ifyour signal is sparse, you can apply a constrained randomdimensionality reduction to the signal and not lose any information.This is the field of compressed sensing. Note that in this case, youhaven't recovered any structure.
Ross


_______________________________________________
dupswapdrop: music-dsp mailing list
music-dsp@music.columbia.edu
https://lists.columbia.edu/mailman/listinfo/music-dsp

Re: [music-dsp] two fundamental questions Re: FFT for realtime synthesis?

Reply via email to