I worked on a project where we used a random forest classifier to
predict a binary response. We trained a model in the ec2 cloud with 3
million observations and 44 features. We stored the model that was
generated by R using save(mymodel,file=model.Rdata). Now we use
model.Rdata locally to predict
You can use strptime to specify the format of the date and time you want, e.g.
x1-strptime(x, %Y-%m-%d %H:%M:%S)
x1
[1] 2010-04-02 12:00:05
str(x1)
POSIXlt[1:1], format: 2010-04-02 12:00:05
On Wed, Jul 21, 2010 at 8:02 AM, Aaditya Nanduri
aaditya.nand...@gmail.com wrote:
Ms. Chisholm,
If
http://nixtricks.wordpress.com/2009/11/09/latex-multiple-figures-under-the-same-caption-using-subfigure/
It will create two rows of subfigures with two subfigures on each row
On Fri, Jul 23, 2010 at 6:43 AM, li li hannah@gmail.com wrote:
Hi all,
I want to add 6 plots in the format of 2
myFrame$year-years(strptime(x))
On Fri, Aug 13, 2010 at 12:36 PM, Dimitri Liakhovitski
dimitri.liakhovit...@gmail.com wrote:
Hello!
If I have in my data frame MyFrame a variable saved as a Date and want
to translate it into years, I currently do it like this using zoo:
library(zoo)
as.year
I have a dataset of multiple variables and a response. For example,
str(x)
'data.frame': 3557238 obs. of 44 variables:
$ response : Factor w/ 2 levels
$ var2: Factor w/5000 levels
If var2 for example is a factor with 5000 levels, what is the best
approach to determine which of these
No need to do that. They have some instances that run 64-bit ubuntu.
If I remember correctly we had to install 64-bit R from the debian
packages on the ubuntu instance.
On Wed, Aug 25, 2010 at 6:12 PM, noclue_ tim@netzero.net wrote:
You have a 64 bit Linux? If so...
Dowload the sources
-
From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-
project.org] On Behalf Of Saeed Abu Nimeh
Sent: Thursday, August 26, 2010 1:40 PM
To: r-help@r-project.org
Subject: [R] Importance of levels in a factor variable
I have a dataset of multiple variables and a response. For example,
str
In this paper [1] the author mentioned a procedure by M. Greenace that
can be used to collapse the levels of a categorical variable by
setting up a
table with the frequency of each level and the proportion of the
target value in each level. Then collapsing the table
level by level looking at the
Is there a way to communicate with a running R daemon from perl. I
tried RSPerl but the functions there initiate an R instance first. I
would like to keep an R instance running in the background and
communicate with it using Perl.
The problem is due to a large object that we need which has to be
svm.model - svm(y~.,data=dataset,probability=TRUE)
svm.pred-predict(svm.model, test.set, decision.values = TRUE,
probability = TRUE)
library(ROCR)
svm.roc - prediction(attributes(svm.pred)$decision.values, test.set)
svm.auc - performance(svm.roc, 'tpr', 'fpr')
plot(svm.auc)
On Thu, Apr 29,
Try the ROCR package. http://rocr.bioinf.mpi-sb.mpg.de/ROCR.pdf
Saeed
On Thu, Jul 1, 2010 at 9:50 AM, ashu6886 ashu.infy.m...@gmail.com wrote:
Hi,
i have a fairly large amount of genomic data. I have created a dataframe
which has Reference as one column and Variation as another. I want to
Is there a function similar to combine.levels ( in the Hmisc package)
that combines the levels of factors, but not based on their frequency.
Alternatively, I am looking into using the significance of the dummy
variables of factors based on their Pr(|t|) value using the linear
model, then deleting
Try to install xming in your windows box
http://www.straightrunning.com/XmingNotes/. Make sure to run xming
before plotting.
Saeed
On Mon, Feb 22, 2010 at 12:46 PM, xin wei xin...@stat.psu.edu wrote:
hi, Guys:
thank you so much for all the suggestion. Now I seem to be able to set up
x11
Use \usepackage{epsfig} after your \documentclass. Then make sure to
run LaTex not PDFLaTex
On Wed, Feb 24, 2010 at 3:29 PM, Lars Bishop lars...@gmail.com wrote:
Hi,
I'm new in Latex and I'm trying to include an R chart into a Latex document.
This is what I'm doing:
1) In R: save the chart
Is the rmpi package (or rpvm) needed to exploit multiple cores on a
single unix box using the snow package. The documentation of the package
does not provide info about setting up a single machine with multiple
cores. Also, if how effective is it to run a bayesian simulation on
parallel (or
Hi,
I installed the snow package on a unix box that has multiple cores. To be
able to exploit the multiple cores (on one pc) do I still need to install
the rmpi package (or rpvm). Another question, if i run a bayesian simulation
on the multiple core after setting them up correctly (using snow),
I ran a bayesian simulation sometime ago and it took me 1 week to finish
on a debian box (Dell PE 2850 Dual Intel [EMAIL PROTECTED] 6GB). I think it
depends on the setting of the experiment and whether the code can be
parallelized.
Simon Blomberg wrote:
I've been running R on a quad-core using
add_pkg -r R
Kitty Lee wrote:
Dear users,
I try to follow the instruction on this page to install R on 4.4BSD network.
http://cran.r-project.org/doc/manuals/R-admin.html#Using-make
I can unpack the file but the system can't recognize the command:
./configure
make
Any ideas
contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.
Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.
Kitty Lee wrote:
Thanks to Saeed Abu Nimeh. I
If we use the ROCR package to find the accuracy of a classifier
pred - prediction(svm.pred, testset[,2])
perf.acc - performance(pred,acc)
Do we find the maximum accuracy as follows (is there a simplier way?):
max(perf@x.values[[1]])
Then to find the cutoff point that maximizes the accuracy
fnr.rocr-cutoff.list.fnr[as.numeric(optimal.cutoff.fnr)]
Now acc.rocr, fpr.rocr, fnr.rocr will give you the accuracy, fpr, and
fnr percentages
Saeed Abu Nimeh wrote:
If we use the ROCR package to find the accuracy of a classifier
pred - prediction(svm.pred, testset[,2])
perf.acc - performance(pred,acc
Hi,
In the ROCR package is there a way to find the accuracy that
corresponds to a given false positive rate. In version 1.0-2, the
authors of the package added an option to find the partial area under
the ROC curve up to a given false positive rate by passing an optional
parameter fpr.stop:
Try library(ROCR)
Pau Marc Munoz Torres wrote:
Hi,
I'm trying to set up a prediction software, now i testing the performance
of my method, so i need to calculate a ROC curve, specially auc, cut-off,
sens and spec, i just looking at ROCH package, but it's a mass for me, i'm
not a math guy
Subba Rao wrote:
Hi,
Today I came across the R application and I will admit I am not a
Statistician. However, I think this application will be useful for me
at work. I am a Network/System Security Engineer trying to make sense
of the huge security data I collect. I am trying to visualize
read Support Vector Machines in R http://www.jstatsoft.org/v15/i09/paper
On Thu, Sep 17, 2009 at 4:39 AM, Samuel Okoye samu...@yahoo.com wrote:
Hello,
I have 12 sample each sample has got 1000 observation, i.e I have a matrix X
with 1000 rows and 12 columns!
m - svm(t(X))
p - predict (m)
On Thu, Feb 25, 2010 at 9:31 AM, Patrick Burns pbu...@pburns.seanet.com wrote:
* What were your biggest misconceptions or
stumbling blocks to getting up and running
with R?
1- Compared to other programming languages it is hard to learn R by
example, because it is hard to find code on the web
the volume of the list and will help in
targeting lists by expertise.
Thanks,
Saeed
On Fri, Feb 26, 2010 at 2:08 AM, Patrick Burns pbu...@pburns.seanet.com wrote:
Saeed,
If the R-help list were split, what do you
see as the pieces?
Pat
On 26/02/2010 01:53, Saeed Abu Nimeh wrote:
On Thu, Feb
Hi Ivan,
On 2/26/10 6:30 AM, Ivan Calandra wrote:
You are definitely right...
What to do with bad beginner's questions is not a simple issue.
If a beginner's mailing list is created, who will answer to such
questions?
If I subscribe to the beginners mailing list, then I have to expect
sorry meant community not committee
On 2/26/10 8:36 PM, Saeed Abu Nimeh wrote:
Hi Ivan,
On 2/26/10 6:30 AM, Ivan Calandra wrote:
You are definitely right...
What to do with bad beginner's questions is not a simple issue.
If a beginner's mailing list is created, who will answer
I think the problem is that you have R configured as 32-bits. If that
is the case, then you will only have access to 4 gigs of RAM (see
http://www.brianmadden.com/blogs/brianmadden/archive/2004/02/19/the-4gb-windows-memory-limit-what-does-it-really-mean.aspx).
Try booting up an ubuntu instance in
I trained a linear svm and did classification. looking at the model I
have, with a binary response 0/1, the decision values look like this:
head(svm.model$decision.values)
2.5
3.1
-1.0
looking at the fitted values
head(svm.model$fitted)
1
1
0
So it looks like anything less than or equal 0 is
Try performing stratified sampling when doing cv.
cran.r-project.org/web/packages/ipred
On Fri, Apr 15, 2011 at 11:00 AM, Soyeon Kim yunni0...@gmail.com wrote:
Dear R users,
Hi. I am using prediction function in ROCR package.
y consists of two classes 0 and 1.
However, since I am using
http://www.iiap.res.in/astrostat/School07/R/html/utils/html/citation.html
On Mon, Mar 7, 2011 at 4:12 PM, Jan Hornych jh.horn...@gmail.com wrote:
Dear,
I am now writing more formal academical paper, and would like to reference
an R package. Do you have any recommendation how to do it?
33 matches
Mail list logo