Re: [PD] noise floor: median in audio signals, for peak extraction

Mathieu Bouchard Thu, 22 Dec 2011 07:48:24 -0800

Le 2011-12-20 à 21:29:00, Alexandre Torres Porres a écrit :

this would be kinda like using the [median] or [median_n] objects, butover audio blocks and not number lists.


Ok, lots of heavy details below...

The way [median] and [median_n] are built, they take a lot of time.Sorting each window of 32 elements is slow with any full sort. Even theusually slow process of putting elements one by one in a sorted array isfaster than any full sort like qsort() or [sort], in this kind ofsituation, because you need to look at the result of the sort after eachinsertion/removal.

But a sort based on binary-trees will make your median filter able tocompete with the speed of FFT, for example. You keep your sorted «array»as a sorted binary tree and this makes it fast to insert, delete, and findthe middle. But you can't do that with C++'s std::map because it offersno way to find the middle quickly... you'd need something special.

There may be ways to bend quicksort so that it can do many similar sortsquickly, but you can't do that with qsort() nor anything based on it.

I don't know how zexy's [sort]. Apparently it doesn't use the quicksortmethod that I assumed it did, it uses the shellsort method instead, whichis sometimes slower, sometimes faster. I wonder whether it could speed upyour task if you made a kind of [median_n] that would reuse thealready-sorted list to speed up the next sort. (This would not improve aquicksort, but I wonder whether it'd improve [sort]).

Generally, median-based methods are harder to work with, as they involvelots of comparisons and swaps and such, by sorting or by doing things thatare like sorting ; whereas mean-based methods involve simple fast passesof addition and multiplication. But both give quite different results, inmany situations.

Since there's the need of calculating this in and using the result backin the same block round into the audio chain, I can't put the spectruminto a table, and then calculate the median over bits of it.

with [tabsend~] and a [bang~], then you can send a whole block into themessage domain, compute it, and get it back as a signal, one block later.

But then, how to do it? Should I be able to pull this out only if Iwrite a "median~" or [noise_floor~] external?
Or somehow there's another way to do this with some existing external,or a similar technique, or even some audio math trick using [fexpr~] orsomething?

I don't think you can do any reasonable sort using [fexpr~]... exceptperhaps a strange undecipherable network of a hundred [fexpr~] or so. Ithink it's easier to write an external.

This has to do with the other post I did about a project that attemptsto isolate notes into a chord in a spectrum, something like melodyne isdoes.

So, why is a mean (average) not good enough, while a median would be goodenough ? It's possible, but I'd like to hear an explanation.


 ______________________________________________________________________
| Mathieu BOUCHARD ----- téléphone : +1.514.383.3801 ----- Montréal, QC

_______________________________________________
Pd-list@iem.at mailing list
UNSUBSCRIBE and account-management -> 
http://lists.puredata.info/listinfo/pd-list

Re: [PD] noise floor: median in audio signals, for peak extraction

Reply via email to