On 10/23/13 2:19 AM, Ross Bencina wrote:
The idea is to isolate each vocal tract filtered glottal pulse in its own grain (i.e. glottal pulse convolved with the impulse response of the vocal tract). Thus changing the grain rate is more or less equivalent to changing the glottal pulse rate leaving the vocal tract IR remains unchanged (except you're also convolving with a window).

i like to call this "Lent's algorithm" after Keith Lent from a 1989 CMJ. it has also been attributed to Hamon. it needs a very good pitch detector to pull it off. and i wrote an AES paper about it in 1995, i think.


If the IR length is longer than the fundamental period you won't be able to isolate the pulses exactly. But if the IR is shorter than the period then you would expect lowering the frequency to add gaps. Similarly, raising the frequency would increase overlap of each filtered glottal pulse.

What I'd like to know is what's the best way of centering the windows on the pulses? and is it better to use asymmetrical windows?

dunno how to answer the 2nd question... perhaps an asymmetrical window is better.

about the first, i would square the incoming audio and filter it with a LPF but with a high cutoff frequency. look for maximum bumps in that smoothed squared waveform (maximum energy), record the latest bump location, and *nudge* the window location so that the center of the windows (assuming a symmetrical Hann-like window) eventually gets centered around the maximum energy pulses. in other words, 99% of the location of the window should be 1 period later (as determined by the pitch detector) than the previous window location. and 1% or 2% should be nudging it either a little earlier or a little later toward the nearest maximum energy pulse.

if you're doing an asymmetrical window (which i haven't done), perhaps center the maximum of the window around the maximum energy pulse. the problem i have with the non-symmetrical window is making it sufficiently complementary. if you have complementary windows (upslope + downslope = 1), then if there is zero pitch shifting, what comes out is an exact (but delayed) replica of what goes in.

--

r b-j                  r...@audioimagination.com

"Imagination is more important than knowledge."



--
dupswapdrop -- the music-dsp mailing list and website:
subscription info, FAQ, source code archive, list archive, book reviews, dsp 
links
http://music.columbia.edu/cmc/music-dsp
http://music.columbia.edu/mailman/listinfo/music-dsp

Reply via email to