Hi,

I am trying to add a new phone to the BU radio phoneset. These are the
steps I have followed so far:

1. Generated the prompts:
 festival --heap 2100000 -b ../../festvox/src/diphones/diphlist.scm \
   ../../festvox/src/diphones/us_withTXH_schema.scm \
   '(diphone-gen-waves "prompt-wav" "prompt-lab" "etc/kaldiph_withTXH.list")'

2. Recorded the prompts

3. Labelled the prompts like so:
../festvox/src/general/do_build label_dtw

4. Building the diphone index with make_diph_index did not work, no matter
what I tried. So I did this step manually, adding one line per diphone to
the dic/kaldiph.est file with the following fields:
<diphone> <fileid> <start time> <mid time> <end time>  (all times in
seconds).

5. Extracted pitch marks, LPC coefficients and mel cepstrum coefficients
with the do_build script and the appropriate arguments.

6. Also added some words containing this new phone to the lexicon, trained
new CART rules, compiled the new lexicon and set this up for use.
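
To make the format of the manual entries in step 4 concrete, here is a
minimal sketch in Python of how such lines could be produced. The function
name, the file IDs and the times are made up for illustration, and it only
covers the entry lines themselves, not anything else the index file may
need.

```python
def format_index_entry(diphone, fileid, start, mid, end):
    """Return one index line: <diphone> <fileid> <start> <mid> <end>.

    All times are in seconds. The values passed in below are
    hypothetical, not taken from my actual label files.
    """
    # The mid point must lie between the start and end of the diphone.
    if not (start <= mid <= end):
        raise ValueError(f"times out of order for {diphone} in {fileid}")
    return f"{diphone} {fileid} {start:.3f} {mid:.3f} {end:.3f}"


# Hypothetical entries for diphones involving the new phone txh.
entries = [
    ("s-txh", "kaldiph_0001", 0.120, 0.185, 0.250),
    ("txh-aa", "kaldiph_0002", 0.310, 0.372, 0.440),
]

for entry in entries:
    print(format_index_entry(*entry))
```

Checking that start <= mid <= end catches the most likely typing mistake
when the times are entered by hand.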

When I run SayPhones from the command line, it seems to work: I can hear
the new phone, although it is not very clear. When I run SayText (also from
the command line), I get the following messages:

  Phoneme: txh has no duration info
  Using UniSyn method to synthesize an utterance.
  UniSyn: using default diphone pau-pau for s-txh

Where am I going wrong? And how can I get txh to have a duration for
SayText?

Regards,
Sriram Shankar.