On 04/12/2012 11:32 AM, Peter Thygesen wrote:
Strange. Still problems. I reduced the corpus files to 10 files. running with RunAE still doesn't produce any events, but when I run it with UIMA Analysis Engine configuration it works.
That sounds strange, because it should not make a difference at all. Trivial reasons for that are that something is really different, e.g. you consume not the same CASes, you use another xml descriptor for the training, etc. I suggest to double check that. Or you are just hitting some kind of bug. To figure that out we should improve the log output of the OpenNLP Tokenizer Trainer AE in a way it actually tells us what is wrong. Would you mind to build a trunk version of OpenNLP and test with that one instead? Jörn
