Hi,

I used the Categorizer::SVM library for large data classification (great
tool); however, I'm having trouble analyzing the result from the SVM
learner.

- Categories have scores of either 0 or 1, with 1 being that  this document
belongs to this category, and 0 otherwise. Are there any scores representing
probabilities or confidence level of belong to certain category other than
these 0, 1 values?
- Suppose this document could belong to 3 possible categories: cat1, cat2,
and cat3. The best_category method simply picks the first category as the
classification decision. If you call, $hypothesis->categories, the
categories outputed don't seem to be in the order of probabilities or
confidence level. They seem to be in the fixed order....and whatever listed
first is favored.

I hope someone can clear my confusion on the scores  of categories in the
SVM module.

Thank you very much in advance,
Jennifer

Reply via email to