Re: [R] Using R for Production - Discussion

2010-11-02 Thread Saeed Abu Nimeh
I worked on a project where we used a random forest classifier to predict a binary response. We trained a model in the ec2 cloud with 3 million observations and 44 features. We stored the model that was generated by R using save(mymodel,file=model.Rdata). Now we use model.Rdata locally to predict

Re: [R] Help with time in R

2010-07-21 Thread Saeed Abu Nimeh
You can use strptime to specify the format of the date and time you want, e.g. x1-strptime(x, %Y-%m-%d %H:%M:%S) x1 [1] 2010-04-02 12:00:05 str(x1) POSIXlt[1:1], format: 2010-04-02 12:00:05 On Wed, Jul 21, 2010 at 8:02 AM, Aaditya Nanduri aaditya.nand...@gmail.com wrote: Ms. Chisholm, If

Re: [R] Figures in Latex

2010-07-23 Thread Saeed Abu Nimeh
http://nixtricks.wordpress.com/2009/11/09/latex-multiple-figures-under-the-same-caption-using-subfigure/ It will create two rows of subfigures with two subfigures on each row On Fri, Jul 23, 2010 at 6:43 AM, li li hannah@gmail.com wrote: Hi all,   I want to add 6 plots in the format of 2

Re: [R] transforming dates into years

2010-08-13 Thread Saeed Abu Nimeh
myFrame$year-years(strptime(x)) On Fri, Aug 13, 2010 at 12:36 PM, Dimitri Liakhovitski dimitri.liakhovit...@gmail.com wrote: Hello! If I have in my data frame MyFrame a variable saved as a Date and want to translate it into years, I currently do it like this using zoo: library(zoo) as.year

[R] Importance of levels in a factor variable

2010-08-26 Thread Saeed Abu Nimeh
I have a dataset of multiple variables and a response. For example, str(x) 'data.frame': 3557238 obs. of 44 variables: $ response : Factor w/ 2 levels $ var2: Factor w/5000 levels If var2 for example is a factor with 5000 levels, what is the best approach to determine which of these

Re: [R] Looking for an image (R 64-bit on Linux 64-bit) on Amazon EC2

2010-08-26 Thread Saeed Abu Nimeh
No need to do that. They have some instances that run 64-bit ubuntu. If I remember correctly we had to install 64-bit R from the debian packages on the ubuntu instance. On Wed, Aug 25, 2010 at 6:12 PM, noclue_ tim@netzero.net wrote: You have a 64 bit Linux?  If so... Dowload the sources

Re: [R] Importance of levels in a factor variable

2010-08-27 Thread Saeed Abu Nimeh
- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r- project.org] On Behalf Of Saeed Abu Nimeh Sent: Thursday, August 26, 2010 1:40 PM To: r-help@r-project.org Subject: [R] Importance of levels in a factor variable I have a dataset of multiple variables and a response. For example, str

[R] Collapsing levels of categorical variables

2010-08-31 Thread Saeed Abu Nimeh
In this paper [1] the author mentioned a procedure by M. Greenace that can be used to collapse the levels of a categorical variable by setting up a table with the frequency of each level and the proportion of the target value in each level. Then collapsing the table level by level looking at the

[R] Communicating with an R daemon from perl

2010-10-08 Thread Saeed Abu Nimeh
Is there a way to communicate with a running R daemon from perl. I tried RSPerl but the functions there initiate an R instance first. I would like to keep an R instance running in the background and communicate with it using Perl. The problem is due to a large object that we need which has to be

Re: [R] can not print probabilities in svm of e1071

2010-04-29 Thread Saeed Abu Nimeh
svm.model - svm(y~.,data=dataset,probability=TRUE) svm.pred-predict(svm.model, test.set, decision.values = TRUE, probability = TRUE) library(ROCR) svm.roc - prediction(attributes(svm.pred)$decision.values, test.set) svm.auc - performance(svm.roc, 'tpr', 'fpr') plot(svm.auc) On Thu, Apr 29,

Re: [R] ROC curve in R

2010-07-01 Thread Saeed Abu Nimeh
Try the ROCR package. http://rocr.bioinf.mpi-sb.mpg.de/ROCR.pdf Saeed On Thu, Jul 1, 2010 at 9:50 AM, ashu6886 ashu.infy.m...@gmail.com wrote: Hi, i have a fairly large amount of genomic data. I have created a dataframe which has Reference as one column and Variation as another. I want to

[R] Function similar to combine.levels in Hmisc package

2010-07-09 Thread Saeed Abu Nimeh
Is there a function similar to combine.levels ( in the Hmisc package) that combines the levels of factors, but not based on their frequency. Alternatively, I am looking into using the significance of the dummy variables of factors based on their Pr(|t|) value using the linear model, then deleting

Re: [R] how to make R plot under Linux

2010-02-22 Thread Saeed Abu Nimeh
Try to install xming in your windows box http://www.straightrunning.com/XmingNotes/. Make sure to run xming before plotting. Saeed On Mon, Feb 22, 2010 at 12:46 PM, xin wei xin...@stat.psu.edu wrote: hi, Guys: thank you so much for all the suggestion. Now I seem to be able to set up x11

Re: [R] R Graphics into Latex‏

2010-02-24 Thread Saeed Abu Nimeh
Use \usepackage{epsfig} after your \documentclass. Then make sure to run LaTex not PDFLaTex On Wed, Feb 24, 2010 at 3:29 PM, Lars Bishop lars...@gmail.com wrote: Hi, I'm new in Latex and I'm trying to include an R chart into a Latex document. This is what I'm doing: 1) In R: save the chart

[R] snow package on multi core unix box

2007-12-05 Thread Saeed Abu Nimeh
Is the rmpi package (or rpvm) needed to exploit multiple cores on a single unix box using the snow package. The documentation of the package does not provide info about setting up a single machine with multiple cores. Also, if how effective is it to run a bayesian simulation on parallel (or

[R] R on a multi core unix box

2007-12-06 Thread Saeed Abu Nimeh
Hi, I installed the snow package on a unix box that has multiple cores. To be able to exploit the multiple cores (on one pc) do I still need to install the rmpi package (or rpvm). Another question, if i run a bayesian simulation on the multiple core after setting them up correctly (using snow),

Re: [R] Dual Core vs Quad Core

2007-12-17 Thread Saeed Abu Nimeh
I ran a bayesian simulation sometime ago and it took me 1 week to finish on a debian box (Dell PE 2850 Dual Intel [EMAIL PROTECTED] 6GB). I think it depends on the setting of the experiment and whether the code can be parallelized. Simon Blomberg wrote: I've been running R on a quad-core using

Re: [R] Installing R on BSD

2008-01-08 Thread Saeed Abu Nimeh
add_pkg -r R Kitty Lee wrote: Dear users, I try to follow the instruction on this page to install R on 4.4BSD network. http://cran.r-project.org/doc/manuals/R-admin.html#Using-make I can unpack the file but the system can't recognize the command: ./configure make Any ideas

Re: [R] Invoking R on BSD

2008-01-08 Thread Saeed Abu Nimeh
contributors. Type 'contributors()' for more information and 'citation()' on how to cite R or R packages in publications. Type 'demo()' for some demos, 'help()' for on-line help, or 'help.start()' for an HTML browser interface to help. Type 'q()' to quit R. Kitty Lee wrote: Thanks to Saeed Abu Nimeh. I

[R] ROCR package finding maximum accuracy and optimal cutoff point

2009-03-26 Thread Saeed Abu Nimeh
If we use the ROCR package to find the accuracy of a classifier pred - prediction(svm.pred, testset[,2]) perf.acc - performance(pred,acc) Do we find the maximum accuracy as follows (is there a simplier way?): max(perf@x.values[[1]]) Then to find the cutoff point that maximizes the accuracy 

Re: [R] ROCR package finding maximum accuracy and optimal cutoff point

2009-03-28 Thread Saeed Abu Nimeh
fnr.rocr-cutoff.list.fnr[as.numeric(optimal.cutoff.fnr)] Now acc.rocr, fpr.rocr, fnr.rocr will give you the accuracy, fpr, and fnr percentages Saeed Abu Nimeh wrote: If we use the ROCR package to find the accuracy of a classifier pred - prediction(svm.pred, testset[,2]) perf.acc - performance(pred,acc

[R] ROCR package partial false positive and accuracy

2009-04-06 Thread Saeed Abu Nimeh
Hi, In the ROCR package is there a way to find the accuracy that corresponds to a given false positive rate. In version 1.0-2, the authors of the package added an option to find the partial area under the ROC curve up to a given false positive rate by passing an optional parameter fpr.stop:

Re: [R] how to compute a roc curve

2008-10-31 Thread Saeed Abu Nimeh
Try library(ROCR) Pau Marc Munoz Torres wrote: Hi, I'm trying to set up a prediction software, now i testing the performance of my method, so i need to calculate a ROC curve, specially auc, cut-off, sens and spec, i just looking at ROCH package, but it's a mass for me, i'm not a math guy

Re: [R] Security Data extraction

2009-01-15 Thread Saeed Abu Nimeh
Subba Rao wrote: Hi, Today I came across the R application and I will admit I am not a Statistician. However, I think this application will be useful for me at work. I am a Network/System Security Engineer trying to make sense of the huge security data I collect. I am trying to visualize

Re: [R] SVM

2009-09-17 Thread Saeed Abu Nimeh
read Support Vector Machines in R http://www.jstatsoft.org/v15/i09/paper On Thu, Sep 17, 2009 at 4:39 AM, Samuel Okoye samu...@yahoo.com wrote: Hello, I have 12 sample each sample has got 1000 observation, i.e I have a matrix X with 1000 rows and 12 columns! m - svm(t(X)) p - predict (m)

Re: [R] two questions for R beginners

2010-02-25 Thread Saeed Abu Nimeh
On Thu, Feb 25, 2010 at 9:31 AM, Patrick Burns pbu...@pburns.seanet.com wrote: * What were your biggest misconceptions or stumbling blocks to getting up and running with R? 1- Compared to other programming languages it is hard to learn R by example, because it is hard to find code on the web

Re: [R] two questions for R beginners

2010-02-26 Thread Saeed Abu Nimeh
the volume of the list and will help in targeting lists by expertise. Thanks, Saeed On Fri, Feb 26, 2010 at 2:08 AM, Patrick Burns pbu...@pburns.seanet.com wrote: Saeed, If the R-help list were split, what do you see as the pieces? Pat On 26/02/2010 01:53, Saeed Abu Nimeh wrote: On Thu, Feb

Re: [R] two questions for R beginners

2010-02-26 Thread Saeed Abu Nimeh
Hi Ivan, On 2/26/10 6:30 AM, Ivan Calandra wrote: You are definitely right... What to do with bad beginner's questions is not a simple issue. If a beginner's mailing list is created, who will answer to such questions? If I subscribe to the beginners mailing list, then I have to expect

Re: [R] two questions for R beginners

2010-02-26 Thread Saeed Abu Nimeh
sorry meant community not committee On 2/26/10 8:36 PM, Saeed Abu Nimeh wrote: Hi Ivan, On 2/26/10 6:30 AM, Ivan Calandra wrote: You are definitely right... What to do with bad beginner's questions is not a simple issue. If a beginner's mailing list is created, who will answer

Re: [R] svm of e1071 package

2010-04-06 Thread Saeed Abu Nimeh
I think the problem is that you have R configured as 32-bits. If that is the case, then you will only have access to 4 gigs of RAM (see http://www.brianmadden.com/blogs/brianmadden/archive/2004/02/19/the-4gb-windows-memory-limit-what-does-it-really-mean.aspx). Try booting up an ubuntu instance in

Re: [R] In svm(), how to connect quantitative prediction result to categorical result?

2011-04-12 Thread Saeed Abu Nimeh
I trained a linear svm and did classification. looking at the model I have, with a binary response 0/1, the decision values look like this: head(svm.model$decision.values) 2.5 3.1 -1.0 looking at the fitted values head(svm.model$fitted) 1 1 0 So it looks like anything less than or equal 0 is

Re: [R] prediction error in ROCR package when sampled y consists of only one class

2011-04-15 Thread Saeed Abu Nimeh
Try performing stratified sampling when doing cv. cran.r-project.org/web/packages/ipred On Fri, Apr 15, 2011 at 11:00 AM, Soyeon Kim yunni0...@gmail.com wrote: Dear R users, Hi. I am using prediction function in ROCR package. y consists of two classes 0 and 1. However, since I am using

Re: [R] How to reference a package in academical paper

2011-03-07 Thread Saeed Abu Nimeh
http://www.iiap.res.in/astrostat/School07/R/html/utils/html/citation.html On Mon, Mar 7, 2011 at 4:12 PM, Jan Hornych jh.horn...@gmail.com wrote: Dear, I am now writing more formal academical paper, and would like to reference an R package. Do you have any recommendation how to do it?