Hi Rupert, Andreas,

I need information on whether client can input multiple model files at same
time. For e.g. Sphinx requires acoustic( "feat.params", "mdef", "means",
"mixture_weights", "noisedict", "transition_matrices", "variances") ,
language(en-us.lm.dmp), dictionary(en-cmudict.0.6d) model to be processed
simultaneously . Also acoustic model files can't be interchanged with other
acoustic model files.


I went through various source code - only a 'single model' file is always
parsed.

1) Is to possible to parse multiple model files names to ContentItem, if
yes please provide me brief details of usage?

2) Can client parse bundle-name of the model file (i.e
org.apache.stanbol.data.model.wsj ) to the Content-Item?

If yes, it will be helpful for client. Just by* parsing* bundle-name of his
own set of acoustic, language and dictionary bundle , he can use the sphinx
engine, instead of asking such large no. of model files.

In ModelProvider Interface I have done the above ( passing bundle-name to
the method: LanguageModel getModel(String lang, String bundleName) ), bit
different from what you asked.

If *multiple Model file names *can be parsed - than my code needs to be
changed significantly. I will update it after your response.

3) I am building the code for SpeechToTextEngine (I am referencing  Tika
Engine Source code), is there any more thing that I must know for building
the Engine.


Regards,
Suman Saurabh

Reply via email to