Re: [music-dsp] PSOLA pitch shifting - resample or not?

robert bristow-johnson Sat, 26 Oct 2013 18:31:23 -0700

On 10/23/13 2:19 AM, Ross Bencina wrote:

The idea is to isolate each vocal tract filtered glottal pulse in itsown grain (i.e. glottal pulse convolved with the impulse response ofthe vocal tract). Thus changing the grain rate is more or lessequivalent to changing the glottal pulse rate leaving the vocal tractIR remains unchanged (except you're also convolving with a window).

i like to call this "Lent's algorithm" after Keith Lent from a 1989CMJ. it has also been attributed to Hamon. it needs a very good pitchdetector to pull it off. and i wrote an AES paper about it in 1995, ithink.

If the IR length is longer than the fundamental period you won't beable to isolate the pulses exactly. But if the IR is shorter than theperiod then you would expect lowering the frequency to add gaps.Similarly, raising the frequency would increase overlap of eachfiltered glottal pulse.
What I'd like to know is what's the best way of centering the windowson the pulses? and is it better to use asymmetrical windows?

dunno how to answer the 2nd question... perhaps an asymmetrical windowis better.

about the first, i would square the incoming audio and filter it with aLPF but with a high cutoff frequency. look for maximum bumps in thatsmoothed squared waveform (maximum energy), record the latest bumplocation, and *nudge* the window location so that the center of thewindows (assuming a symmetrical Hann-like window) eventually getscentered around the maximum energy pulses. in other words, 99% of thelocation of the window should be 1 period later (as determined by thepitch detector) than the previous window location. and 1% or 2% shouldbe nudging it either a little earlier or a little later toward thenearest maximum energy pulse.

if you're doing an asymmetrical window (which i haven't done), perhapscenter the maximum of the window around the maximum energy pulse. theproblem i have with the non-symmetrical window is making it sufficientlycomplementary. if you have complementary windows (upslope + downslope =1), then if there is zero pitch shifting, what comes out is an exact(but delayed) replica of what goes in.


--

r b-j                  r...@audioimagination.com

"Imagination is more important than knowledge."



--
dupswapdrop -- the music-dsp mailing list and website:
subscription info, FAQ, source code archive, list archive, book reviews, dsp 
links
http://music.columbia.edu/cmc/music-dsp
http://music.columbia.edu/mailman/listinfo/music-dsp

Re: [music-dsp] PSOLA pitch shifting - resample or not?

Reply via email to