Re: [R] K Fold CrossValidation

2013-05-15 Thread Kevin Hao
Below is some code that may be helpful for you. [There may already be a finished package that includes mlr with cross-validation]; you can check http://cran.r-project.org/web/packages/available_packages_by_date.html. you can check this

Re: [R] K Fold CrossValidation

2013-05-15 Thread Kevin Hao
Maybe the following code is helpful for you. At the same time, you can check the ChemometricsWithR package (http://cran.r-project.org/web/packages/ChemometricsWithR/index.html) from http://cran.r-project.org/web/packages/available_packages_by_date.html. # product crossvalidation index crossvalind
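
The code from the message above is cut off in this listing, so the following is only a minimal sketch of K-fold cross-validation in base R, not the poster's code. The helper name make_folds, the seed, and the choice of the built-in mtcars data with an lm model are all assumptions made for illustration.

```r
# Hypothetical helper (not from the original post): assign each of n
# observations to one of k folds, with fold sizes differing by at most one.
make_folds <- function(n, k, seed = 1) {
  set.seed(seed)
  # rep_len() cycles 1..k up to length n; sample() shuffles the labels
  sample(rep_len(seq_len(k), n))
}

# Assumed example: 5-fold cross-validation of a linear model on mtcars
k <- 5
folds <- make_folds(nrow(mtcars), k)
rmse <- numeric(k)

for (i in seq_len(k)) {
  train <- mtcars[folds != i, ]   # fit on all folds except fold i
  test  <- mtcars[folds == i, ]   # hold out fold i for evaluation
  fit   <- lm(mpg ~ wt + hp, data = train)
  pred  <- predict(fit, newdata = test)
  rmse[i] <- sqrt(mean((test$mpg - pred)^2))
}

# Average held-out error across the k folds
cv_rmse <- mean(rmse)
```

Shuffling a repeated 1..k label vector keeps the folds nearly equal in size, which is roughly what an index generator such as the crossvalind mentioned above is expected to produce.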

[R] need help for Imbalanced classification problems!!!

2013-05-14 Thread Kevin Hao
Hi all, I am facing an imbalanced classification problem. That is, I have a dataset in which the ratio of majority to minority data is 100:1 (or more). In addition, there are many independent variables, and this is a binary classification problem. The model I built gives poor predictive

Re: [R] Read big data (3G ) methods ?

2013-04-27 Thread Kevin Hao
format first (RDS, ff, sqlite, bigmemory, ...). Most packages have routines to convert CSV files to the binary format. With all of the above LaF helps. ffbase contains a routine laf_to_ffdf to convert to ff format. HTH, Jan On 04/27/2013 04:34 AM, Kevin Hao wrote: Thank you very much
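
As a rough illustration of the "convert to a binary format first" advice, here is a small base-R sketch that uses only the RDS option. The file paths and the toy data are assumptions; a real 3G file would be read once with explicit colClasses and then reloaded from the RDS copy in later sessions.

```r
# Assumed toy data standing in for the large CSV file
csv_path <- tempfile(fileext = ".csv")
rds_path <- tempfile(fileext = ".rds")
write.csv(data.frame(id = 1:1000, value = rnorm(1000)),
          csv_path, row.names = FALSE)

# One-time slow step: parse the text file, declaring the column types
# up front so read.csv() does not have to guess them
dat <- read.csv(csv_path, colClasses = c("integer", "numeric"))

# Store the parsed data frame in R's binary serialization format
saveRDS(dat, rds_path)

# Later sessions: reload the binary copy instead of re-parsing the CSV
dat2 <- readRDS(rds_path)
```

This only helps when the data fits in memory; the ff, sqlite, and bigmemory routes mentioned above are aimed at data that does not.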

[R] Read big data (3G ) methods ?

2013-04-26 Thread Kevin Hao
Hi all scientists, Recently, I have been dealing with big data ( 3G txt or csv format ) on my desktop (Windows 7, 64-bit version), but I cannot read it quickly, even though I have searched the internet. [I have defined colClasses for read.table and used the cobycol and limma packages, but they are not so fast].

Re: [R] Read big data (3G ) methods ?

2013-04-26 Thread Kevin Hao
file into smaller ones. On Fri, Apr 26, 2013 at 8:09 AM, Kevin Hao rfans4ch...@gmail.com wrote: Hi all scientists, Recently, I have been dealing with big data ( 3G txt or csv format ) on my desktop (Windows 7, 64-bit version), but I cannot read it quickly, even though I have searched the internet

Re: [R] Read big data (3G ) methods ?

2013-04-26 Thread Kevin Hao
, Ye Lin ye...@lbl.gov wrote: Have you thought of building a database and then letting R read it through that db instead of your desktop? On Fri, Apr 26, 2013 at 8:09 AM, Kevin Hao rfans4ch...@gmail.com wrote: Hi all scientists, Recently, I have been dealing with big data ( 3G txt or csv format ) on my

Re: [R] Read big data (3G ) methods ?

2013-04-26 Thread Kevin Hao
, Kevin Hao rfans4ch...@gmail.com wrote: Hi all scientists, Recently, I have been dealing with big data ( 3G txt or csv format ) on my desktop (Windows 7, 64-bit version), but I cannot read it quickly, even though I have searched the internet. [define colClasses for read.table, cobycol and limma packages I

Re: [R] Read big data (3G ) methods ?

2013-04-26 Thread Kevin Hao
Thank you very much. More and more methods are coming. That sounds great! Thanks, kevin On Fri, Apr 26, 2013 at 7:51 PM, Duncan Murdoch murdoch.dun...@gmail.com wrote: On 13-04-26 3:00 PM, Kevin Hao wrote: Hi Ye, Thanks. That is a good method. Are there any other methods instead of using

[R] Help me make faster R code for Kennard-Stone algorithm [My code is so slow from Matlab]

2013-04-24 Thread Kevin Hao
Hi all, Can you help me change my Kennard-Stone algorithm into a faster one? [The original code runs fast in MATLAB, but when I converted the MATLAB code to R code, it became very slow.] Since my code is crude and has too many loops (converted from MATLAB code), it is too slow. I hope that you can help to improve
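
The poster's code is not shown in this listing, so the following is only a sketch of one common way to reduce the looping in Kennard-Stone selection in R: compute the distance matrix once with dist() and update each point's distance to its nearest selected point with vectorized pmin(). The function name kennard_stone and its arguments are assumptions, and the sketch presumes the full n-by-n distance matrix fits in memory and that the rows of X are not all identical.

```r
# Hypothetical sketch of Kennard-Stone selection (not the poster's code).
# X: numeric matrix or data frame with one sample per row
# k: number of samples to select (2 <= k <= nrow(X))
kennard_stone <- function(X, k) {
  X <- as.matrix(X)
  n <- nrow(X)
  stopifnot(k >= 2, k <= n)

  # All pairwise Euclidean distances, computed once
  D <- as.matrix(dist(X))

  # Start with the two samples that are farthest apart
  first <- which(D == max(D), arr.ind = TRUE)[1, ]
  selected <- as.integer(first)

  # min_d[j]: distance from sample j to its nearest selected sample
  min_d <- pmin(D[, selected[1]], D[, selected[2]])

  while (length(selected) < k) {
    min_d[selected] <- -Inf               # never re-select a chosen sample
    nxt <- as.integer(which.max(min_d))   # maximin criterion
    selected <- c(selected, nxt)
    min_d <- pmin(min_d, D[, nxt])        # vectorized update, no inner loop
  }
  selected
}

# Assumed toy example: four points on a line
X_demo <- cbind(c(0, 1, 2, 10), c(0, 0, 0, 0))
sel <- kennard_stone(X_demo, 3)
```

Only the outer loop over the k selections remains; the per-candidate work is done by pmin() and which.max() on whole vectors, which is usually where loop-heavy code translated from MATLAB loses time in R.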