On 4/21/2012 12:40 PM, Jim - FooBar(); wrote: > On 13/02/12 23:07, Michael Collins wrote: >> Does opennlp provide a way to create the *.train file based on a body >> of text which I provide, or is the *.train file created another way. > Apart from the sentence detector there is no way to automatically > create training data for other tasks (POS,NER etc)...these are often > language and domain dependant. For the sentence detector however it is > easy to create your own private training data (as Jorn said) targeted > especially for your problem domain. assuming of course that the > pre-trained model is not good enough for you...i find it's pretty > good! :) > > Jim Also, unlike a lot of the other models, the sentence detector can actually be trained and works quite well with just a few sentences to train on. ~20-30 does really well.
James
