On 7/8/11 2:58 AM, Amal Elmah wrote:
I built a model with OpenNLP by following the instructions in the documentation,
and I used training data in the following format:

<START:person> Pierre Vinken <END> , 61 years old , will join the board as a nonexecutive director Nov. 29 .
Mr . <START:person> Vinken <END> is chairman of Elsevier N.V. , the Dutch publishing group .
<START:person> Rudolph Agnew <END> , 55 years old and former chairman of Consolidated Gold Fields

I am now trying to evaluate this model, but I don't know whether the test data should
contain tags like the above or not.



The OpenNLP built-in evaluation always needs the tags in the test data. Without them, it treats
the untagged data as the correct reference and counts every name the model finds as a mistake.
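
To illustrate why the tags matter, here is a minimal Python sketch of span-based scoring. It is not OpenNLP's own code: the helpers `parse_spans` and `score` and the hard-coded `predicted` set are hypothetical names made up for this example, and the parser assumes every tag is separated from its neighbours by whitespace. It only shows the general idea that predicted name spans are compared against the spans marked in the test data.

```python
import re

# Matches either an opening tag such as <START:person> or the closing <END>.
TAG = re.compile(r"<START:(\w+)>|<END>")


def parse_spans(line):
    """Split one whitespace-tokenized sentence into tokens and name spans.

    Spans are (start_token, end_token_exclusive, type) tuples.
    Assumes tags are standalone, whitespace-separated pieces.
    """
    tokens, spans = [], set()
    start, kind = None, None
    for piece in line.split():
        m = TAG.fullmatch(piece)
        if m is None:
            tokens.append(piece)          # ordinary token
        elif m.group(1):
            start, kind = len(tokens), m.group(1)   # <START:type>
        else:
            spans.add((start, len(tokens), kind))   # <END>
            start, kind = None, None
    return tokens, spans


def score(gold, predicted):
    """Precision, recall and F1 over exact span matches."""
    tp = len(gold & predicted)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


tagged = "<START:person> Pierre Vinken <END> , 61 years old"
untagged = "Pierre Vinken , 61 years old"

# Hypothetical model output: one person span covering tokens 0 and 1.
predicted = {(0, 2, "person")}

with_tags = score(parse_spans(tagged)[1], predicted)
without_tags = score(parse_spans(untagged)[1], predicted)
print(with_tags)
print(without_tags)
```

With the tagged sentence, the predicted span matches the reference span. With the untagged sentence, the reference contains no names at all, so the same correct prediction is scored as a false positive and precision drops to zero.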

You also need much more training data.

Please feel free to send us a patch to our documentation to clarify this.

Jörn
