Re: Graillon 1.0, VST effect fully made with D

Guillaume Piolat via Digitalmars-d-announce Sun, 29 Nov 2015 08:21:11 -0800

On Sunday, 29 November 2015 at 15:34:34 UTC, Ola Fosheim Grøstadwrote:

I don't now much about current pitch trackers, but I think youcan do a high quality one for voice using filterbanks. Somepeople do resynthesis that way (and well, that is just analternative to FFT after all).

You are precisely right, if you don't need reconstruction nothingforces you to use the FFT!There is also a sample-wise FFT I've came across, which isexpensive but avoids chunking.

I assume you can make a better pitch tracker that isspecialized for voice by thinking about FoF synthesis, thesound of the voice is really a sequence of bursts of roughlythe same shape (like granular synthesis in a way) and youshould be able to figure out some statistical relationshipbetween formants and how they change with pitch.

Looking for similar grains is the idea behind the popularauto-correlation pitch detection methods. Require two periodselse no autocorrelation peak though. The rumor says that thenon-realtime Autotune works with that, along with many modernpitch detection methods.

I'm not saying it is easy. Probably a lot published on thisthough.
I don't know what "voicedness" is? You mean things like vibrato?

vibrato is the pitch variation that occur when the larynx is wellrelaxed.

voicedness is the difference between sssssss(unvoiced) and zzzzzz(voiced).A phonem is voiced when there is periodic glottal closure andopenings.

When the sound isn't voiced, there is no period. There isn't a"pitch" there. So pitch detection tend to come with a confidencemeasure.

The devil in that is that voicedness itself is half a lie, or letsay a leaky abstraction, it breaks down for distorted vocals.

I guess that's why IRCAM can sell licenses to superVP. :)

Their paper on that topic are interesting, they group spectralpeaks by formants and move them together.

Re: Graillon 1.0, VST effect fully made with D

Reply via email to