Hi all,
I am trying to build a language learning app in which users can
pronounce foreign words and automatically get feedback on how they
did. It's a client-server setup.
The server is a speech recognizer that I built for this purpose.
The client is an Android activity whose main purpose is to send the
user's speech to the server via a socket.
The app can already send the audio data to the server; I capture it
with the AudioRecord class. I record at a sample rate of 16 kHz,
16-bit PCM, mono. My buffer size is 4096 bytes.
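For reference, here is a rough sketch of how much audio a buffer of
that size holds at these settings. It is only the arithmetic; the
class and method names are hypothetical and not taken from the app.

```java
class BufferMath {
    // Milliseconds of audio held by a buffer of the given size in bytes.
    // (Hypothetical helper, for illustration only.)
    static double bufferMillis(int bufferBytes, int sampleRateHz,
                               int bytesPerSample, int channels) {
        int bytesPerSecond = sampleRateHz * bytesPerSample * channels;
        return 1000.0 * bufferBytes / bytesPerSecond;
    }

    public static void main(String[] args) {
        // 16 kHz, 16-bit PCM (2 bytes per sample), mono, 4096-byte buffer
        System.out.println(bufferMillis(4096, 16000, 2, 1)); // prints 128.0
    }
}
```

So a 4096-byte buffer covers 128 ms of audio at 32000 bytes per
second.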
The main problem is that the recordings contain quite a lot of clicks,
which indicate missing samples. Does anyone know how to make the
recording process less prone to these missing samples?
I think I can go down to 8 kHz without losing much speech recognition
performance, if that would help me get rid of the clicks, but I would
have to retrain my recognizer, so I'd rather not do that :)
Thanks in advance!
