On 8/9/11 6:58 PM, William Colen (JIRA) wrote:
What about the methods from Evaluator?

void missclassified(Span references[], Span predictions[], String referenceSample, String predictedSample, String sentence)
void missclassified(Span references[], Span predictions[], String referenceSample, String predictedSample, String[] sentenceTokens)
void missclassified(String references[], String predictions[], String referenceSample, String predictedSample, String[] sentenceTokens)

Or do you think we should take advantage of some structure provided by 
F-Measure classes? I can't see it yet.

Don't we have the samples?

The evaluator knows that a sample was incorrectly classified.

It could provide the original gold sample and the predicted sample.
A report tool could then calculate the difference between the two samples and output or mark it,
or compute statistics about the mistakes.
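
Roughly, this could look like the sketch below. It is only an illustration under assumptions: the MissclassifiedListener interface, the TaggedSample class and the DiffReport tool are hypothetical names made up for this mail, not existing API, and the real sample types would be used instead of TaggedSample.

```java
import java.util.ArrayList;
import java.util.List;

public class MisclassifiedSketch {

    // Hypothetical callback: the evaluator would hand over both samples
    // instead of spans, strings and tokens as separate arguments.
    interface MissclassifiedListener<T> {
        void missclassified(T reference, T prediction);
    }

    // Hypothetical stand-in for a sample: tokens plus one tag per token.
    static final class TaggedSample {
        final String[] tokens;
        final String[] tags;

        TaggedSample(String[] tokens, String[] tags) {
            if (tokens.length != tags.length) {
                throw new IllegalArgumentException("need one tag per token");
            }
            this.tokens = tokens;
            this.tags = tags;
        }
    }

    // Example report tool: marks the differing tokens and counts the mistakes.
    // Assumes both samples share the same tokenization.
    static final class DiffReport implements MissclassifiedListener<TaggedSample> {
        final List<String> lines = new ArrayList<>();
        int mistakes = 0;

        @Override
        public void missclassified(TaggedSample reference, TaggedSample prediction) {
            for (int i = 0; i < reference.tokens.length; i++) {
                if (!reference.tags[i].equals(prediction.tags[i])) {
                    mistakes++;
                    lines.add(reference.tokens[i] + ": expected " + reference.tags[i]
                            + ", got " + prediction.tags[i]);
                }
            }
        }
    }

    public static void main(String[] args) {
        String[] tokens = {"Anna", "lives", "in", "Berlin"};
        TaggedSample gold = new TaggedSample(tokens,
                new String[] {"person", "other", "other", "location"});
        TaggedSample predicted = new TaggedSample(tokens,
                new String[] {"person", "other", "other", "other"});

        DiffReport report = new DiffReport();
        // In the real evaluator this call would happen whenever the
        // prediction differs from the reference.
        report.missclassified(gold, predicted);

        for (String line : report.lines) {
            System.out.println(line); // prints "Berlin: expected location, got other"
        }
        System.out.println("mistakes: " + report.mistakes); // prints "mistakes: 1"
    }
}
```

With a single generic callback the evaluator itself stays small, and each report tool decides on its own how to compare the two samples.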

What do you think?

Jörn
