> > Hmmm, the reality is that I don't think a binary prediction will
...
> > the relative probabilities of success and failure?
> Could we perhaps encode all of the samples as binary {0,1} data, but
> train the classifier to return a continuous result?  That is, use
> samples like:
> 
> ((time=17411,key=0x3a4b,htl=12), target=1)    // success
> ((time=17480,key=0x3a4b,htl=15), target=1)    // success
> ((time=17485,key=0x3a4c,htl=8), target=1)     // success
> ((time=17487,key=0x3a4b,htl=9), target=0)    // fail
> 
> with the regression mode.  The predictor will then return values
> between 0 and 1 which we can interpret as a probability of success.

Yes I think that's doable.  I've agree it makes sense to encode
things like "host ip" and "key" as binary inputs, so that the SVM
can get a sense of certain class A / class B / class C domains, or
the notion of "close keys."  For time, I think it makes sense to
break it down into the standard minute:hour:weekday:monthday:month:year
format and use each of these as seperate input vectors, as there must
be some usage cycles highly correlated to some of these indicators.

Rudi

_______________________________________________
devl mailing list
[EMAIL PROTECTED]
http://hawk.freenetproject.org:8080/cgi-bin/mailman/listinfo/devl

Reply via email to