Re: [music-dsp] PSOLA pitch shifting - resample or not?

Theo Verelst Mon, 21 Oct 2013 14:56:24 -0700

Wen Xue wrote:

Maybe a beginner's question here:


when pitch-synchronized OLA is used to modify speech pitch, do we
resample the original signal or not?

From 80s speech coding I recall the analysis of the formant of a signal could be determined by some form of FFT, and I suppose, just like with other applications, you can overlap/average the results, depending on how the bin size works with that particular signal.

However, there's a conceptual difference between two approaches. One is knowing or measuring the length of the waveform, or of N waveforms (presuming there is a single, undisturbed waveform), and making a harmonic analysis of exactly that length. The other is to take a fixed FFT interval length and do a general frequency analysis, without arranging for the fundamental to be the lowest frequency of the FFT analysis. That is, unless you take an arbitrary-length FFT (not uncommon in modern accelerated libs) and are willing to live with the rounding error you'll get, which depends on the number of measured (and partially averaged) waves, their frequency, and the sample frequency. This rounding can be considerable, which for speech coding may be fine.
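To put rough numbers on that rounding, here is a small sketch in Python. The function names and the example values (44.1 kHz, 440 Hz) are my own choices for illustration, not anything from a particular codec. It compares the fundamental you effectively assume when N periods are rounded to a whole number of samples with the distance from the fundamental to the nearest bin of a fixed-length FFT.

```python
import math

def measured_f0(fs, f0, n_periods=1):
    """Fundamental implied by rounding n_periods of f0 to whole samples."""
    exact_length = n_periods * fs / f0      # generally not an integer
    whole_samples = round(exact_length)     # what an FFT length must be
    return n_periods * fs / whole_samples

def nearest_bin_error(fs, f0, n_fft):
    """Distance in Hz from f0 to the closest bin of a fixed-length FFT."""
    spacing = fs / n_fft
    return abs(round(f0 / spacing) * spacing - f0)

fs, f0 = 44100, 440.0
for n in (1, 4, 16):
    est = measured_f0(fs, f0, n)
    cents = 1200.0 * math.log2(est / f0)
    print(f"{n:2d} period(s): implied f0 = {est:.3f} Hz ({cents:+.2f} cents)")

print(f"1024-point FFT: nearest bin is {nearest_bin_error(fs, f0, 1024):.2f} Hz off")
```

Measuring over more periods tends to shrink the error, but it never vanishes unless the period happens to be a whole number of samples.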

You could also do an actual re-sampling of the signal, based on sampling theory, which entails having taken proper equidistant impulse samples with your analog-to-digital converter, using some small or large windowed version of the sinc (sin(x)/x) function, and getting the math and signal flow right.
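As a minimal sketch of what such a re-sampler could look like, assuming a Hann window and a kernel of 16 input samples on each side (both arbitrary choices of mine), the following Python evaluates the windowed sinc directly at each fractional position. It is slow and untuned; a real implementation would use a polyphase table. The names `windowed_sinc` and `resample` are hypothetical.

```python
import math

def windowed_sinc(d, half_width, cutoff=1.0):
    """Hann-windowed sinc kernel at offset d (in input samples)."""
    if abs(d) >= half_width:
        return 0.0
    window = 0.5 * (1.0 + math.cos(math.pi * d / half_width))
    if d == 0.0:
        return cutoff * window
    x = math.pi * cutoff * d
    return cutoff * math.sin(x) / x * window

def resample(x, ratio, half_width=16):
    """Resample x by `ratio` (output rate / input rate)."""
    # When going down in rate, lower the cutoff to limit aliasing.
    cutoff = min(1.0, ratio)
    n_out = int(len(x) * ratio)
    y = []
    for n in range(n_out):
        t = n / ratio                       # fractional input position
        k0 = int(math.floor(t))
        acc = 0.0
        for k in range(k0 - half_width + 1, k0 + half_width + 1):
            if 0 <= k < len(x):             # samples outside x count as zero
                acc += x[k] * windowed_sinc(t - k, half_width, cutoff)
        y.append(acc)
    return y

# Example: a 100 Hz sine at 8 kHz, re-sampled to 12 kHz.
fs = 8000
sine = [math.sin(2.0 * math.pi * 100.0 * n / fs) for n in range(400)]
sine_12k = resample(sine, 1.5)
```

The fixed kernel width means fewer sinc lobes are covered when the cutoff is lowered, so for strong down-sampling you would want to widen it.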

If you did actual re-sampling, and you made sure the re-sampled frequency is higher, or that harmonics were absent or filtered out to prevent aliasing, you could try to match your averaging interval (for N=1 or N>1 full wave shapes, in the case of a single wave with no musical chords or atonal components) with the length in samples of the waveform you're analyzing.
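A sketch of that matching step, under my own assumptions: the fundamental is already known, the new rate is picked by rounding the period up to the next whole sample count (so the rate only goes up), and the re-sampling itself is skipped by generating the test wave directly at the new rate. The helper names are hypothetical. With one period spanning exactly `samples` points, harmonic h of a plain DFT lands on bin h.

```python
import cmath
import math

def integer_period_rate(fs, f0):
    """Smallest rate >= fs at which one period of f0 is a whole sample count."""
    samples = math.ceil(fs / f0)
    return samples, samples * f0

def harmonic_amplitudes(one_period, n_harmonics):
    """Amplitude of harmonics 1..n_harmonics from a DFT over one exact period."""
    n = len(one_period)
    amps = []
    for h in range(1, n_harmonics + 1):
        acc = sum(v * cmath.exp(-2j * math.pi * h * i / n)
                  for i, v in enumerate(one_period))
        amps.append(2.0 * abs(acc) / n)
    return amps

samples, new_fs = integer_period_rate(44100, 440.0)

# Stand-in for the re-sampled signal: three harmonics, one exact period.
true_amps = [1.0, 0.5, 0.25]
wave = [sum(a * math.sin(2.0 * math.pi * (h + 1) * i / samples)
            for h, a in enumerate(true_amps))
        for i in range(samples)]

amps = harmonic_amplitudes(wave, 4)   # fourth harmonic is absent in `wave`
```

Because the analysis length equals the period, there is no leakage between bins here, which is the point of matching the interval to the waveform.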

Presuming you have sufficient spectral components in a general FFT, inverse-FFTing the waveform at a different fundamental frequency is probably going to give you a hard time if you want to be even a little accurate. Serious filtering could rid you of the transients that will mess up your FFT results, but the results are probably going to be relatively crude and have little to do with re-sampling in EE terms, though they may suffice for speech coding on phones or so.
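For comparison, the crude route can be sketched as a sum of sines rebuilt at a new fundamental from a list of harmonic amplitudes and phases. This is an illustration under my own assumptions (steady-state harmonics only, no transients, hypothetical function name), not a description of any particular coder. Note that each amplitude stays attached to its harmonic number, so the spectral envelope moves along with the pitch.

```python
import math

def resynthesize(amps, phases, f0_new, fs, n_samples):
    """Rebuild a steady tone at f0_new from per-harmonic amplitude and phase."""
    nyquist = fs / 2.0
    out = []
    for i in range(n_samples):
        t = i / fs
        v = 0.0
        for h, (a, ph) in enumerate(zip(amps, phases), start=1):
            # Skip harmonics that would alias at the new fundamental.
            if h * f0_new < nyquist:
                v += a * math.sin(2.0 * math.pi * h * f0_new * t + ph)
        out.append(v)
    return out

# Example: three harmonics measured at some pitch, replayed at 150 Hz.
tone = resynthesize([1.0, 0.5, 0.25], [0.0, 0.0, 0.0], 150.0, 8000, 400)
```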



Theo V.

--
dupswapdrop -- the music-dsp mailing list and website:
subscription info, FAQ, source code archive, list archive, book reviews, dsp links
http://music.columbia.edu/cmc/music-dsp
http://music.columbia.edu/mailman/listinfo/music-dsp
