Steve,

I don't have any real experience in DSP methodologies, although I have picked up on the high-level theories in my research. However, I am *very* strong-willed in the "Where there's a will, there's a way" category. :)

Here's my current thought:

Sphinx is an open source STT library that can work in real-time (specifically sphinx2). Could we not pipe the called party's audio into it and then look for a given period of time with no text output?

I also found this site, http://www.cs.wpi.edu/~claypool/courses/525-S01/projects/proj1/ where a prof's got a project for students that fits perfectly with what I'd like to do. He mentions Rabiner and Sambur's algorithm (from 1975) for detecting isolated speech endpoints.

Brad

Steve Underwood wrote:

Hi Brad,

If you want to detect that a sound is voice, rather than something else, it isn't easy. There is information around on the Internet about methods, but I have never tried them and don't know how well they work. Unless you have some understanding of DSP I wouldn't bother trying. On the other hand, if you do have some DSP expertise it might be a fun thing to try.

Regards,
Steve

Brad Waite wrote:

Does anyone know if there's public voice detection algorithms available? I've scoured the net for the last hour or so, and I can't come up with anything except a few proprietary or embedded solutions.

I know dsp.c uses goertzel algorithms for DTMF detection, but how does one detect voice?

I dunno, maybe detecting voice isn't the way to go. I want to begin playback of a file after a phone/answering machine has answered.




_______________________________________________
Asterisk-Users mailing list
[EMAIL PROTECTED]
http://lists.digium.com/mailman/listinfo/asterisk-users


_______________________________________________ Asterisk-Users mailing list [EMAIL PROTECTED] http://lists.digium.com/mailman/listinfo/asterisk-users

Reply via email to