On 8/9/11 6:58 PM, William Colen (JIRA) wrote:
What about the methods from Evaluator?
void missclassified(Span references[], Span predictions[], String
referenceSample, String predictedSample, String sentence)
void missclassified(Span references[], Span predictions[], String
referenceSample, String predictedSample, String[] sentenceTokens)
void missclassified(String references[], String predictions[], String
referenceSample, String predictedSample, String[] sentenceTokens)
Or do you think we should take advantage of some structure provided by
F-Measure classes? I can't see it yet.
Don't we have the samples?
The evaluator knows that a sample was incorrectly classified.
It could provide the original gold sample, and the predicted sample,
this way a report tool can calculate the difference between the two
samples and output/mark it,
or compute statistics about mistakes.
What do you think?
Jörn