On Fri, Jul 29, 2011 at 10:39 AM,  <[email protected]> wrote:
> Dear Greg,
>
>
> I have added the code how I generate "my own confusion matrix" to the Wiki.
> In my understanding, my function uses the predictions from the out-of-bag
> prediction. But I guess that I have overlooked some nasty detail.

You call:
  pred,conf=cmp.ClassifyExample(pts[i])
This uses the full composite model to make each prediction, it doesn't
do the same out-of-bag prediction done by
ScreenComposite.ShowVoteResults

It is possible to get both the out-of-bag confusion matrix as a python
object (it's part of the return tuple from
ScreenComposite.ShowVoteResults) and the breakdown of the out-of-bag
predictions by point (not quite as straightforward, but possible).
What exactly are you trying to do?

> Cheers & Thanks,
> Paul
>
>
> P.S.: When comparing the results with a PipelinePilot-based Bayesian
> catagorization model (ECFP_4 & standard settings), I'm surprised to see
> that the PipelinePilot model is significantly better. I thought that the
> MorganFingerprints are comparable to the ECFPs and would have assumed that
> the model quality is in a similar range.

It's probably not the fingerprints, but the model-building approach
that makes the difference here. You can test this if you want in knime
using the RDKit morgan fingerprints with the naive bayes fingerprint
learner they added in version 2.4.

Best Regards,
-greg

------------------------------------------------------------------------------
Got Input?   Slashdot Needs You.
Take our quick survey online.  Come on, we don't ask for help often.
Plus, you'll get a chance to win $100 to spend on ThinkGeek.
http://p.sf.net/sfu/slashdot-survey
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to