On 03/29/2012 11:22 AM, Peter Thygesen wrote:
Using RunAE;
I must be doing something wrong. No model is created and I don't see any
scores being generated...
main class: org.apache.uima.examples.RunAE
arguments: -s2 descriptors/TokenizerTrainer.xml corpus
VM args: -Xmx1000m
CONSOLE OUTPUT:
--------------------------------------
Processed Document aaaaaaa.xmi
.....
Processed Document zzzzzzz.xmi
Mar 29, 2012 11:17:00 AM opennlp.uima.tokenize.TokenizerTrainer
collectionProcessComplete(203)
INFO: Collected 929 token samples.
It was able to find 929 sentences, but maybe they do not
contain tokens?
You should check the sentence and token types in your Tokenizer Trainer
descriptor. Do the types specified there match the annotations
in the CAS?
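One quick way to see which annotation types are actually in the corpus
is to count the XML element names in one of the .xmi files. The sketch
below is only an illustration: the class name XmiTypeCounter is made up,
it uses nothing but the JDK's SAX parser, and it assumes the usual XMI
layout where each annotation is an element named prefix:ShortTypeName
(the prefix is derived from the type's package).

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Map;
import java.util.TreeMap;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of UIMA or OpenNLP.
public class XmiTypeCounter {

    // Counts every XML element by its qualified name, e.g. "ex:Token".
    static Map<String, Integer> countElements(InputStream in) throws Exception {
        final Map<String, Integer> counts = new TreeMap<>();
        // The factory is not namespace aware by default, so qName is
        // always the raw "prefix:name" string from the file.
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.newSAXParser().parse(in, new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                counts.merge(qName, 1, Integer::sum);
            }
        });
        return counts;
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: XmiTypeCounter <file.xmi>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            // One line per element name: count, then name.
            countElements(in).forEach(
                (name, n) -> System.out.println(n + "\t" + name));
        }
    }
}
```

If the short names of the sentence and token types configured in the
descriptor do not show up in that list, or the token count is zero, the
trainer has nothing to build events from.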
Indexing events using cutoff of 5
Computing event counts... done. 0 events
It should be able to generate a couple of thousand events
here, so it is strange that it's zero.
Anyway, we might want to enhance the log output a bit so that
problems like this are easier to find.
Jörn