On 2010-02-19 18:29 PM, Aahz wrote:
On Fri, Feb 19, 2010, Mark Livingstone wrote:

I am looking for suggestions! I am doing some experimentation and want
to know if there are any utilities available that will take a file as
input, get the num folds and num times, and do the slice and dice file
operation ready then for training / testing?

You will need to expand your jargon if you want anyone unfamiliar with
this specific operation to provide assistance.  (I.e. I have no clue what
you're talking about.)

It's really off-topic for this list, but K-fold cross-validation is a way of testing how well some prediction method will perform. Roughly, you split up the data into K chunks. You use K-1 chunks to train your method and test on the remaining chunk. You then repeat this K times with each chunk playing the role of the test chunk exactly once. Then you average the performance of your prediction method over each of the K tests.

Mark, I recommend that you join the scipy-users mailing list. We'll be happy to field your data analysis questions over there. These kinds of questions really are unrelated to the Apple platform even if you intend to do the analysis on an Apple machine.

  http://www.scipy.org/Mailing_Lists

You may also want to check the SpamBayes project. Their validation framework might be applicable to your problem set.

  http://spambayes.sourceforge.net/

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
 that is made terrible by our own mad attempt to interpret it as though it had
 an underlying truth."
  -- Umberto Eco

_______________________________________________
Pythonmac-SIG maillist  -  Pythonmac-SIG@python.org
http://mail.python.org/mailman/listinfo/pythonmac-sig
unsubscribe: http://mail.python.org/mailman/options/Pythonmac-SIG

Reply via email to