FYI

if anyone wants the files (about 200k) give a holler

you can run the whole mess on a single machine along with FG. The hit to the
frame rate is TBD.

----- Original Message -----
From: "John Wojnaroski" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, September 21, 2004 9:44 AM
Subject: Voice stuff


> Hi David,
>
> Attached are two files:
>
> comm_747 is a really bad hack for the CMU Sphinx ASR engine to create a
> text string and send it out on the network to the "atc_server"
>
> voice.tgz is more hacking to send a text string to the festival server.
> Lines 73 to 76 are my highly sophisticated AI/ATC controller ;-)
> It untars as ..../Voice
>
> You can run festival as a server ("../bin/festival --server") on the same
> machine you run "atc_net_demo", and the audio will be produced on that
> machine. Or you can uncomment lines 295-309 and also send a .wav file back
> to the client machine.
>
> A quick recap:
> Machine #1 runs sphinx2, which receives the audio input from the user,
> converts it to text and sends it over to Machine #2. Machine #2 parses the
> text string into tokens, does its AI/ATC stuff, formulates a text response
> and passes that to the festival server (in this case on 127.0.0.1), which
> creates the audio file and outputs it to the local soundcard. If you run
> the festival server on a separate network machine or back on #1, you need
> to create a short .scm script to add the user to the festival access list,
> as in atc.scm.
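
A minimal Python sketch of the Machine #2 step in the recap above: split the recognized text into tokens, pick a reply, and hand it to festival as a Scheme command. This is not the code in voice.tgz. The keyword table and all function names are made up for illustration, and it assumes the festival server listens on its default port 1314 and evaluates a (SayText "...") expression sent over the socket.

```python
import socket

FESTIVAL_HOST = "127.0.0.1"   # festival server address, as in the recap
FESTIVAL_PORT = 1314          # assumption: festival's default server port

# Hypothetical keyword -> reply table standing in for the AI/ATC logic.
REPLIES = {
    "taxi": "cleared to taxi",
    "takeoff": "cleared for takeoff",
    "landing": "cleared to land",
}


def tokenize(text):
    """Lower-case the recognizer output and split it on whitespace."""
    return text.lower().split()


def atc_reply(tokens):
    """Return the reply for the first known keyword, else a fallback."""
    for tok in tokens:
        if tok in REPLIES:
            return REPLIES[tok]
    return "say again"


def festival_command(text):
    """Wrap text in a Scheme SayText call, escaping backslashes and quotes."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return '(SayText "%s")\n' % escaped


def speak(text, host=FESTIVAL_HOST, port=FESTIVAL_PORT):
    """Send one SayText command to an already running festival server."""
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(festival_command(text).encode("ascii", "replace"))
```

With "festival --server" already running, speak(atc_reply(tokenize("request taxi"))) should produce audio on the server machine. If the server is on another host, the access list mentioned above has to allow the client first.
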
>
> You need to download the ASR sphinx2 stuff and TTS festival stuff from CMU
> or wherever. If you need some help or tips in that area give a holler...
> http://linux-sound.org/speech.html is a list of speech-related websites
> you might find useful. In particular--
> the Festival set, XVoice, Sphinx, and MBROLA. I'm using the
>
> You'll find http://www.speech.cs.cmu.edu/tools/lmtool.html quite useful
> for creating a LM and phonelist for the ASR program. With the smaller
> dictionaries voice recognition is very good, but if you mumble a lot (like
> I do) the XVoice folks have a wiki page with instructions on how to add a
> voice trainer and improve/tailor the AM for individual speech patterns.
>
> Setting up the programs takes a little work. You can use the AM provided
> with sphinx2, but you'll get much better results if you download and
> install the hub4-2000-11-17-1 model. And you will need to create a LM that
> contains ATC phrases and words.
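
A rough Python sketch of preparing the ATC phrase list for the LM. It assumes the LM tool takes a plain text file with one sentence per line; check the lmtool page for the exact input it expects. The callsigns, request phrases and function names are invented for illustration and are not from the attached files.

```python
from itertools import product

# Hypothetical vocabulary; replace with the phrases you actually plan to say.
CALLSIGNS = ["boeing seven four seven", "cessna one two three"]
REQUESTS = ["request taxi", "request takeoff clearance", "ready for departure"]


def build_corpus(callsigns, requests):
    """Return one sentence per callsign/request pair, in input order."""
    return ["%s %s" % (c, r) for c, r in product(callsigns, requests)]


def write_corpus(path, sentences):
    """Write the sentences one per line (the layout assumed for the LM tool)."""
    with open(path, "w") as f:
        f.write("\n".join(sentences) + "\n")
```

Something like write_corpus("atc_corpus.txt", build_corpus(CALLSIGNS, REQUESTS)) gives a small file to feed to the LM tool. Keeping the phrase set small fits the point above that recognition is best with the smaller dictionaries.
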
>
> Good luck. Again, you've got my email; don't hesitate to write if you need
> help or have questions.
>
> Regards
> John W.
>


_______________________________________________
Flightgear-devel mailing list
[EMAIL PROTECTED]
http://mail.flightgear.org/mailman/listinfo/flightgear-devel