Re: [R] Splitting Data Frame into Two Based on Source Array

2008-09-09 Thread Adam D. I. Kramer
data_main[ match(src,data_main$V1), ] and the compliment of src (call it srcc) data_main[ match(srcc,data_main$V1), ] ...this only works so long as there is only one occurrance of each item in V1 in V1. --Adam On Tue, 9 Sep 2008, Gundala Viswanath wrote: Dear all, Suppose I have this data

Re: [R] cluster/snow question

2008-09-09 Thread Markus Schmidberger
Hi Tolga, in SNOW you have to start a cluster with the command library(snow) cluster - makeCluster(#nodes) The object cluster is a list with an object for each node and each object again is a list with all informations (rank, comm, tags) The size of the cluster is the length of the list.

Re: [R] yahoo finance into R

2008-09-09 Thread Peter Dalgaard
thomastos wrote: Hi R, I am familiar with the basics of R. To learn more I would like how to get data from Yahoo!finance directly into R. So basically I want a data frame or matrix to do some data analysis. How do I do this? RSiteSearch(yahoo) get.hist.quote() from tseries yahooSeries()

[R] match problem by rownames

2008-09-09 Thread Xianming Wei
Hi all, While dat['a1',] and dat['a10',] produce the same results in the following example, I'd like dat['a1',] to return NAs. dat - data.frame(x1 = paste(letters[1:5],10, sep=''), x2=rnorm(5)) rownames(dat) - dat$x1 dat['a1',] dat['a10',] sessionInfo() R version 2.7.2 (2008-08-25)

[R] Compiling date

2008-09-09 Thread Megh Dal
Hi, I have following kind of dataset (all are dates) in my Excel sheet. 09/08/08 09/05/08 09/04/08 09/02/08 09/01/08 29/08/2008 28/08/2008 27/08/2008 26/08/2008 25/08/2008 22/08/2008 21/08/2008 20/08/2008 18/08/2008 14/08/2008 13/08/2008 08/12/08 08/11/08 08/08/08 08/07/08 However I want to use

[R] Memory allocation problem (during kmeans)

2008-09-09 Thread rami batal
Dear all, I am trying to apply kmeans clusterring on a data file (size is about 300 Mb) I read this file using x=read.table('file path' , sep= ) then i do kmeans(x,25) but the process stops after two minutes with an error : Error: cannot allocate vector of size 907.3 Mb when i read the

Re: [R] How to preserve date format while aggregating

2008-09-09 Thread Prof Brian Ripley
This is completely wrong: min _is_ defined for date-times: min(.leap.seconds) [1] 1972-07-01 01:00:00 BST Please do study the posting guide and do your homework before posting: you seem unaware of what the POSIXct class is, so ?DateTimeClasses is one place you need to start. And

Re: [R] isolate elements in vector that match one of many possible values

2008-09-09 Thread Adam D. I. Kramer
Check out ?match, ?%in% x - c(1,2,3,4) y - c(1,2,4) match(y,x) [1] 1 2 4 --Adam On Mon, 8 Sep 2008, Andrew Barr wrote: Hi all, I want to get the index numbers of all elements of a vector which match any of a long series of possible values. Say x - c(1,2,3,4) and I want to know which

Re: [R] make methods work in lapply - remove lapply's environment

2008-09-09 Thread Prof Brian Ripley
This is a side-effect of lapply being in the base namespace and not evaluating its arguments, as explained on its help page which also points out that using a wrapper is sometimes needed. It also points out that code has been written that relies on the current behaviour. On Mon, 8 Sep 2008,

Re: [R] naive variance in GEE

2008-09-09 Thread Prof Brian Ripley
On Mon, 8 Sep 2008, Qiong Yang wrote: The standard error from logistic regression is slightly different from the naive SE from GEE under independence working correlation structure. Shouldn't they be identical? Anyone has insight about this? They are computed quantities from iterations with

[R] S.O.S try doesnot work in boot?

2008-09-09 Thread ctu
First thanks for Jinsong's suggestions I would like to do a bootstrap in a nonlinear model. But it fails to converge in most of time. (it did converge if I just use nls without boot). Thus, I use try function to resolve my problem. This following code is from Jinsong's suggestion.

Re: [R] correct lme syntax for this problem?

2008-09-09 Thread ONKELINX, Thierry
Dear Matthew, First of all I'm forwarding this to R-SIG-Mixed, which is a more appropriate list for your question. Using a mixed effect with only 5 levels is a borderline situation. Douglas Bates recommends at least 6 levels in order to get a more or less reliable estimate. So I would consider

Re: [R] S.O.S try doesnot work in boot?

2008-09-09 Thread Prof Brian Ripley
Returning NA (of the correct length, not length 1) will not help you, as all the derived statistics from the bootstrap runs will be NA. But here you never looked at the result of try. On Tue, 9 Sep 2008, [EMAIL PROTECTED] wrote: First thanks for Jinsong's suggestions I would like to do a

[R] How do I compute interactions with anova.mlm ?

2008-09-09 Thread Schadwinkel, Stefan
Hi, I wish to compute multivariate test statistics for a within-subjects repeated measures design with anova.mlm. This works great if I only have two factors, but I don't know how to compute interactions with more than two factors. I suspect, I have to create a new grouping factor and then

[R] exporting tapply objects to csv-files

2008-09-09 Thread Kunzler, Andreas
Dear Everyone, I try to create a cvs-file with different results form the table function. Imagine a data-frame with two vectors a and b where b is of the class factor. I use the tapply function to count a for the different values of b. tapply(a,b,table) and I use the table function to have a

Re: [R] Read from url requiring authentication?

2008-09-09 Thread Damien
René Sachse wrote: Damien schrieb: I'm looking into opening an url on a server which requires authentication. Under a Windows Operating System you could try to start R with the --internet2 option. This worked in my case. Thanks René it did the trick for me too! Best Regards, Damien

Re: [R] Read from url requiring authentication?

2008-09-09 Thread Damien
On 8 Sep, 20:15, Prof Brian Ripley [EMAIL PROTECTED] wrote: On Mon, 8 Sep 2008, Damien wrote: Hi all, I'm looking into opening an url on a server which requires authentication. After failing to find some kind of connection structure to fill in I turned to explicitly stating the

Re: [R] Compiling date

2008-09-09 Thread David Scott
On Mon, 8 Sep 2008, Megh Dal wrote: Hi, I have following kind of dataset (all are dates) in my Excel sheet. 09/08/08 09/05/08 09/04/08 09/02/08 09/01/08 29/08/2008 28/08/2008 27/08/2008 26/08/2008 25/08/2008 22/08/2008 21/08/2008 20/08/2008 18/08/2008 14/08/2008 13/08/2008 08/12/08 08/11/08

Re: [R] Memory allocation problem (during kmeans)

2008-09-09 Thread Peter Dalgaard
rami batal skrev: Dear all, I am trying to apply kmeans clusterring on a data file (size is about 300 Mb) I read this file using x=read.table('file path' , sep= ) then i do kmeans(x,25) but the process stops after two minutes with an error : Error: cannot allocate vector of size

Re: [R] How do I compute interactions with anova.mlm ?

2008-09-09 Thread Peter Dalgaard
Schadwinkel, Stefan skrev: Hi, I wish to compute multivariate test statistics for a within-subjects repeated measures design with anova.mlm. This works great if I only have two factors, but I don't know how to compute interactions with more than two factors. I suspect, I have to create

Re: [R] match problem by rownames

2008-09-09 Thread Charilaos Skiadas
As suggested in ?[.data.frame, try: dat[match('a1', rownames(dat)),] Haris Skiadas Department of Mathematics and Computer Science Hanover College On Sep 9, 2008, at 2:41 AM, Xianming Wei wrote: Hi all, While dat['a1',] and dat['a10',] produce the same results in the following example, I'd

Re: [R] how to split a data framed with sequences

2008-09-09 Thread jim holtman
Is this what you want: my.df - data.frame(a = c(1:5, 1:10, 1:20), b = runif(35)) split(my.df, c(0, cumsum(diff(my.df$a) 0))) $`0` a b 1 1 0.2655087 2 2 0.3721239 3 3 0.5728534 4 4 0.9082078 5 5 0.2016819 $`1` a b 6 1 0.89838968 7 2 0.94467527 8 3 0.66079779 9 4

[R] how to split a data framed with sequences

2008-09-09 Thread David Carslaw
Hi all, Given a data frame: my.df - data.frame(a = c(1:5, 1:10, 1:20), b = runif(35)) I want to split it by a such that I end up with a list containing 3 components i.e. the first containing a = 1 to 5, the second a = 1 to 10 etc. In other words, sets of sequences of a. I can't seem to find

Re: [R] match problem by rownames

2008-09-09 Thread Dimitris Rizopoulos
try this: dat - data.frame(x1 = paste(letters[1:5],10, sep=''), x2=rnorm(5)) row.names(dat) - dat$x1 dat['a1' %in% row.names(dat), ] dat['a10' %in% row.names(dat), ] I hope it helps. Best, Dimitris Hi all, While dat['a1',] and dat['a10',] produce the same results in the following

[R] plotting group means

2008-09-09 Thread Erich Studerus
Hi all, I want to plot the grouped means of some variables. The dependent variables and the grouping factor are stored in different columns. I want to draw a simple line-plot of means, in which the x-axis represents the variables and y-axis represents the means. The means of the groups should

Re: [R] Question about multiple regression

2008-09-09 Thread Gustaf Rydevik
On Mon, Sep 8, 2008 at 7:47 PM, Dimitri Liakhovitski [EMAIL PROTECTED] wrote: Thank you everyone for your responses. I'll answer several questions. 1. Disclaimer: I have **NO IDEA** of the details of what you want to do or why -- but I am willing to bet that there are better ways of doing

Re: [R] Compiling date

2008-09-09 Thread Henrique Dallazuanna
Try this: strptime(x, ifelse(nchar(x) == 8, '%d/%m/%y', '%d/%m/%Y')) On Tue, Sep 9, 2008 at 3:48 AM, Megh Dal [EMAIL PROTECTED] wrote: Hi, I have following kind of dataset (all are dates) in my Excel sheet. 09/08/08 09/05/08 09/04/08 09/02/08 09/01/08 29/08/2008 28/08/2008

Re: [R] Compiling date

2008-09-09 Thread Dr Eberhard Lisse
Why not Format - Cell in Excell? el on 9/9/08 1:03 PM Henrique Dallazuanna said the following: Try this: strptime(x, ifelse(nchar(x) == 8, '%d/%m/%y', '%d/%m/%Y')) On Tue, Sep 9, 2008 at 3:48 AM, Megh Dal [EMAIL PROTECTED] wrote: Hi, I have following kind of dataset (all are dates)

Re: [R] exporting tapply objects to csv-files

2008-09-09 Thread Henrique Dallazuanna
Try creating a new object: tb - rbind(table(a), do.call(rbind.data.frame, tapply(a, b, table))) names(tb) - unique(a) then write to csv by write.table. On Tue, Sep 9, 2008 at 5:48 AM, Kunzler, Andreas [EMAIL PROTECTED] wrote: Dear Everyone, I try to create a cvs-file with different results

Re: [R] plotting group means

2008-09-09 Thread Chuck Cleland
On 9/9/2008 6:49 AM, Erich Studerus wrote: Hi all, I want to plot the grouped means of some variables. The dependent variables and the grouping factor are stored in different columns. I want to draw a simple line-plot of means, in which the x-axis represents the variables and y-axis

Re: [R] plotting group means

2008-09-09 Thread ONKELINX, Thierry
Dear Erich, Have a look at ggplot2 library(ggplot2) dataset - expand.grid(x = 1:20, y = factor(LETTERS[1:4]), value = 1:10) dataset$value - rnorm(nrow(dataset), sd = 0.5) + as.numeric(dataset$y) plotdata - aggregate(dataset$value, list(x = dataset$x, y = dataset$y), mean) plotdata -

Re: [R] write dataframes

2008-09-09 Thread Williams, Robin
Hi, Just a thought. You wrote: ob1-object1$ORF ob2-object2$ORF and then use cbind like, HG-cbind(on1,ob2) but there is an error. Is there any other function I can use? If you copied and pasted this from R, then your problem is Hg - cbind(on1,ob2) You mean Hg - cbind(ob1,ob2) So perhaps

[R] write dataframes

2008-09-09 Thread Roberto Olivares-Hernández
Hi, After manipulate my data I have ended up with 5 different data frames with different number of observations but the same number of variables (columns) An example, if I write str(object1), I see this, data.frame': 47 obs. of 3 variables: $ ORF: Factor w/ 245 levels

Re: [R] Compiling date

2008-09-09 Thread stephen sefick
this is day month year? look at chron or maybe the easiest is to use excel to change the format On Tue, Sep 9, 2008 at 7:12 AM, Dr Eberhard Lisse [EMAIL PROTECTED] wrote: Why not Format - Cell in Excell? el on 9/9/08 1:03 PM Henrique Dallazuanna said the following: Try this: strptime(x,

Re: [R] plotting group means

2008-09-09 Thread Jim Lemon
Hi Erich, Have a look at brkdn.plot in the plotrix package. Jim __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented,

Re: [R] plotting group means

2008-09-09 Thread hadley wickham
On Tue, Sep 9, 2008 at 6:56 AM, ONKELINX, Thierry [EMAIL PROTECTED] wrote: Dear Erich, Have a look at ggplot2 library(ggplot2) dataset - expand.grid(x = 1:20, y = factor(LETTERS[1:4]), value = 1:10) dataset$value - rnorm(nrow(dataset), sd = 0.5) + as.numeric(dataset$y) Or with

Re: [R] R_USER - in which file should I include it?

2008-09-09 Thread Eduardo M. A. M.Mendes
Hello Many thanks. It works just fine. How about the packages issue? That is, same thing for the installation path. Cheers Ed -Original Message- From: Gabor Grothendieck [mailto:[EMAIL PROTECTED] Sent: Monday, September 08, 2008 10:01 PM To: Eduardo M. A. M.Mendes Cc:

[R] Hardwarefor R cpu 64 vs 32, dual vs quad

2008-09-09 Thread Nic Larson
Need to buy fast computer for running R on. Today we use 2,8 MHz intel D cpu and the calculations takes around 15 days. Is it possible to get the same calculations down to minutes/hours by only changing the hardware? Should I go for an really fast dual 32 bit cpu and run R over linux or xp or go

Re: [R] plotting group means

2008-09-09 Thread Erich Studerus
Thanks for all the suggestions, but it seems, that all these functions need a rearrangement of my data, since in my case, the dependent variables are in different columns. The error.bars.by-function seems to be the only plotting function, that does not need a rearrangement. Are there other

Re: [R] R_USER - in which file should I include it?

2008-09-09 Thread Gabor Grothendieck
You might look at ?.libPaths (note the dot) and play around with adding a .libPaths command to your Rprofile.site and again you may need Administrator rights when editing it. If that does not help then you can try clarifying the problem. In particular what the same refers to and what is

[R] PCA and % variance explained

2008-09-09 Thread pgseye
After doing a PCA using princomp, how do you view how much each component contributes to variance in the dataset. I'm still quite new to the theory of PCA - I have a little idea about eigenvectors and eigenvalues (these determine the variance explained?). Are the eigenvalues related to loadings

Re: [R] plotting group means

2008-09-09 Thread hadley wickham
On Tue, Sep 9, 2008 at 8:38 AM, Erich Studerus [EMAIL PROTECTED] wrote: Thanks for all the suggestions, but it seems, that all these functions need a rearrangement of my data, since in my case, the dependent variables are in different columns. The error.bars.by-function seems to be the only

Re: [R] Vorticity and Divergence

2008-09-09 Thread Ravi Varadhan
Both vorticity and divergence are defined in terms of partial derivatives. You can compute these derivatives using the `grad' function in numDeriv package. U - function(X) { your U function} V - function(X) { your V function} # where X = c(x,y) library(numDeriv) grU - function(X) grad(X,

[R] Question

2008-09-09 Thread Veronique.Pinard
Hi, I'm trying to verify the assumption of homogeneity of variance of residuals in an ANOVA with levene.test. I don't know how to define the groups. I have 3 factors : A, B and C(AxB). What do I have to change or to add in the command to set that I'm working with the residuals and to set

Re: [R] R_USER - in which file should I include it?

2008-09-09 Thread Eduardo M. A. M.Mendes
Many thanks. I shall look at it. In case I run into trouble again, I'll try to clarify the the same. Ed -Original Message- From: Gabor Grothendieck [mailto:[EMAIL PROTECTED] Sent: Tuesday, September 09, 2008 10:46 AM To: Eduardo M. A. M.Mendes Cc: r-help@r-project.org Subject: Re: [R]

Re: [R] Gumbell distribution - minimum case

2008-09-09 Thread Aaron Mackey
If you mean you want an EVD with a fat left tail (instead of a fat right tail), then can;t you just multiply all the values by -1 to reverse the distribution? A new location parameter could then shift the distribution wherever you want along the number line ... -Aaron On Mon, Sep 8, 2008 at

[R] How does predict.lm work?

2008-09-09 Thread Williams, Robin
Hi, Please could someone explain how this element of predict.lm works? From the help file ` newdata An optional data frame in which to look for variables with which to predict. If omitted, the fitted values are used. ' Does this dataframe (newdata) need to have the same variable names as

Re: [R] How does predict.lm work?

2008-09-09 Thread Gabor Grothendieck
Just try it: BOD # built in data frame Time demand 118.3 22 10.3 33 19.0 44 16.0 55 15.6 67 19.8 BOD.lm - lm(demand ~ Time, BOD) predict(BOD.lm, list(Time = 10)) 1 25.73571 predict(BOD.lm, list(10)) Error in eval(expr, envir, enclos) : object

Re: [R] How does predict.lm work?

2008-09-09 Thread Marc Schwartz
on 09/09/2008 09:59 AM Williams, Robin wrote: Hi, Please could someone explain how this element of predict.lm works? From the help file ` newdata An optional data frame in which to look for variables with which to predict. If omitted, the fitted values are used. ' Does this

[R] printing all rows

2008-09-09 Thread ANJAN PURKAYASTHA
Hi, my data table has 38939 rows. R prints the first 1 columns and then prints an error message:[ reached getOption(max.print) -- omitted 27821 rows ]]. is it possible to set the maxprint parameter so that R prints all the rows? tia, anjan -- = anjan

Re: [R] Hardwarefor R cpu 64 vs 32, dual vs quad

2008-09-09 Thread Prof Brian Ripley
On Tue, 9 Sep 2008, Nic Larson wrote: Need to buy fast computer for running R on. Today we use 2,8 MHz intel D cpu and the calculations takes around 15 days. Is it possible to get the same calculations down to minutes/hours by only changing the hardware? No: you would need to arrange to

[R] puzzle about contrasts

2008-09-09 Thread Kenneth Knoblauch
Hi, I'm trying to redefine the contrasts for a linear model. With a 2 level factor, x, with levels A and B, a two level factor outputs A and B - A from an lm fit, say lm(y ~ x). I would like to set the contrasts so that the coefficients output are -0.5 (A + B) and B - A, but I can't get the sign

Re: [R] exporting tapply objects to csv-files

2008-09-09 Thread hadley wickham
On Tue, Sep 9, 2008 at 3:48 AM, Kunzler, Andreas [EMAIL PROTECTED] wrote: Dear Everyone, I try to create a cvs-file with different results form the table function. Imagine a data-frame with two vectors a and b where b is of the class factor. I use the tapply function to count a for the

Re: [R] PCA and % variance explained

2008-09-09 Thread ngottlieb
I did PCA stuff years there is a thing that is called a scree score Which will give an indication of the number of PCA's and the variance explained. Might want to web search on scree score and PCA. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of

[R] Linear Modeling the best alternative

2008-09-09 Thread stephen sefick
I have a data set of mean velocity, discharge, and mean depth. I need to find out which model best fits them out of log linear, linear, some other kind of model... Using excel I have found that linear is not that bad and log10(discharge) vs. the other two variables (I am trying to predict

Re: [R] puzzle about contrasts

2008-09-09 Thread Prof Brian Ripley
-0.5*(A+B) is not a contrast, which is the seat of your puzzlement. All you can get from y ~ x is an intercept (a column of ones) and a single 'contrast' column for 'x'. If you use y ~ 0+x you can get two columns for 'x', but R does not give you an option of what columns in the case: see the

[R] passing graph image data from remote Rserve

2008-09-09 Thread Patil, Prasad
Hello, I am using Rserve to create a dedicated computational back-engine. I generate and pass an array of data to a java application on a separate server. I was wondering if the same is possible for an image. I believe that Rserve supports passing certain R objects and JRclient can cast these

[R] Modality Test

2008-09-09 Thread Amin W. Mugera
Dear Readers: I have two issues in nonparametric statistical analysis that i need help: First, does R have a package that can implement the multimodality test, e.g., the Silverman test, DIP test, MAP test or Runt test. I have seen an earlier thread (sometime in 2003) where someone was trying to

Re: [R] passing graph image data from remote Rserve

2008-09-09 Thread Patil, Prasad
I believe I have found my solution, so please disregard. Thanks [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide

Re: [R] Linear Modeling the best alternative

2008-09-09 Thread Ben Bolker
stephen sefick ssefick at gmail.com writes: I have a data set of mean velocity, discharge, and mean depth. I need to find out which model best fits them out of log linear, linear, some other kind of model... Using excel I have found that linear is not that bad and log10(discharge) vs. the

Re: [R] puzzle about contrasts

2008-09-09 Thread Peter Dalgaard
Prof Brian Ripley skrev: -0.5*(A+B) is not a contrast, which is the seat of your puzzlement. All you can get from y ~ x is an intercept (a column of ones) and a single 'contrast' column for 'x'. If you use y ~ 0+x you can get two columns for 'x', but R does not give you an option of what

[R] Help with 'spectrum'

2008-09-09 Thread rkevinburton
For the command 'spectrum' I read: The spectrum here is defined with scaling 1/frequency(x), following S-PLUS. This makes the spectral density a density over the range (-frequency(x)/2, +frequency(x)/2], whereas a more common scaling is 2π and range (-0.5, 0.5] (e.g., Bloomfield) or 1 and

Re: [R] puzzle about contrasts

2008-09-09 Thread Peter Dalgaard
Peter Dalgaard skrev: Prof Brian Ripley skrev: -0.5*(A+B) is not a contrast, which is the seat of your puzzlement. All you can get from y ~ x is an intercept (a column of ones) and a single 'contrast' column for 'x'. If you use y ~ 0+x you can get two columns for 'x', but R does not

[R] : writeMat

2008-09-09 Thread erola pairo
I write a .mat file using the writeMat() command, but when i try to load it in Matlab it says that file may be corrupt. I did it a month ago and it worked. It exists any option that I can change for making the file readable to Matlab? A - c(1:10) dim(A) - c(2,5) library(R.matlab)

[R] help on wavelet

2008-09-09 Thread giov
Hi, I have little experience using wavelet and I would like to know if it is possible,using R wavelet package, to have a plot of frequency versus time. thank you giov -- View this message in context: http://www.nabble.com/help-on-wavelet-tp19395583p19395583.html Sent from the R help mailing

[R] binomial(link=inverse)

2008-09-09 Thread Ben Bolker
this may be a better question for r-devel, but ... Is there a particular reason (and if so, what is it) that the inverse link is not in the list of allowable link functions for the binomial family? I initially thought this might have something to do with the properties of canonical vs

Re: [R] printing all rows

2008-09-09 Thread Adam D. I. Kramer
options(max.print) $max.print [1] 9 options(max.print=10) options(max.print) $max.print [1] 1e+05 ...so check what your max.print is, and figure out whether you need to set it to nrow, ncol, or nrow*ncol of your data frame...then do so...though of course, this is a global variable,

Re: [R] help on wavelet

2008-09-09 Thread stephen sefick
It depends on what you want to do. In wavelet speak frequency is scale. these are the libraries: wmtsa - wavCWT (make sure that you pick the wavelet. I suggest morlet because it is compactly supported (disappears to zero quickly)) I would also suggest the fields packages for the tim.colors

[R] creating table of averages

2008-09-09 Thread Lawrence Hanser
Dear Colleagues, I have a dataframe with variables: [1] ID category a11a12 a13a21 [7] a22a23a31a32 b11b12 [13] b13b21b31b32 b33b41 [19] b42

[R] randomForest

2008-09-09 Thread Kate Behrman
I am combining many different random forest objects run on the same data set using the combine ( ) function. After combining the forests I am not sure whether the variable importance, local importance, and rsq predictors are recalculated for the new random forest object or are calculated

Re: [R] Compiling date

2008-09-09 Thread Dr Eberhard W Lisse
Is this Month-Day or Day-Month or a mixture of both? I still think using the Format - Cell - Date will work much better... el On 09 Sep 2008, at 11:21 , David Scott wrote: On Mon, 8 Sep 2008, Megh Dal wrote: Hi, I have following kind of dataset (all are dates) in my Excel sheet.

Re: [R] creating table of averages

2008-09-09 Thread Adam D. I. Kramer
Maybe something like this: by(df[,c(77,81,86,90,94,98,101,106)],df$category,apply,2,mean) ...which would then need to be reformatted into a data frame (there is probably an easy way to do this which I don't know). aggregate seems like a more reasonable choice, but the function for aggregate

Re: [R] creating table of averages

2008-09-09 Thread Duncan Murdoch
On 9/9/2008 2:12 PM, Adam D. I. Kramer wrote: Maybe something like this: by(df[,c(77,81,86,90,94,98,101,106)],df$category,apply,2,mean) ...which would then need to be reformatted into a data frame (there is probably an easy way to do this which I don't know). sparseby() in the reshape

Re: [R] creating table of averages

2008-09-09 Thread Lawrence Hanser
Perfect! Thanks. On Tue, Sep 9, 2008 at 11:27 AM, Duncan Murdoch [EMAIL PROTECTED]wrote: On 9/9/2008 2:12 PM, Adam D. I. Kramer wrote: Maybe something like this: by(df[,c(77,81,86,90,94,98,101,106)],df$category,apply,2,mean) ...which would then need to be reformatted into a data frame

Re: [R] Hardwarefor R cpu 64 vs 32, dual vs quad

2008-09-09 Thread Henrik Bengtsson
On Tue, Sep 9, 2008 at 6:31 AM, Nic Larson [EMAIL PROTECTED] wrote: Need to buy fast computer for running R on. Today we use 2,8 MHz intel D cpu and the calculations takes around 15 days. Is it possible to get the same calculations down to minutes/hours by only changing the hardware? Should I

Re: [R] cluster/snow question

2008-09-09 Thread tolga . i . uzuner
Hi Markus, Many thanks. Is the cluster variable you mention below available in the environment of the nodes ? Specifically, within that environment, how could one identify the rank of that specific node ? My code would use that information to partition the problem. Thanks, Tolga Markus

[R] Information on the number of CPU's

2008-09-09 Thread tolga . i . uzuner
Dear R Users, I am on Windows XP SP2 platform, using R version 2.7.2 . I was wondering if there is a way to find out, within R, the number of CPU's on my machine ? I would use this information to set the number of nodes in a cluster, depending on the machine. Sys.info() and .Platform do not

[R] Binning

2008-09-09 Thread Felipe Carrillo
Dear List: I have a dataset with over 5000 records and I would like to put the Count in bins based on the ForkLength. e.g. Forklength Count 32-34? 35-37? 38-40? and so on... and lastly I would like to plot (scatterplot) including the

Re: [R] cluster/snow question

2008-09-09 Thread Luke Tierney
On Tue, 9 Sep 2008, [EMAIL PROTECTED] wrote: Hi Markus, Many thanks. Is the cluster variable you mention below available in the environment of the nodes ? Specifically, within that environment, how could one identify the rank of that specific node ? No -- that isn't the way snow works. With

[R] splitting time vector into days

2008-09-09 Thread Alexy Khrabrov
Greetings -- I have a dataframe a with one element a vector, time, of POSIXct values. What's a good way to split the data frame into periods of a$time, e.g. days, and apply a function, e.g. mean, to some other column of the dataframe, e.g. a$value? Cheers, Alexy

Re: [R] Modality Test

2008-09-09 Thread roger koenker
the diptest package, perhaps? url:www.econ.uiuc.edu/~rogerRoger Koenker email[EMAIL PROTECTED]Department of Economics vox: 217-333-4558University of Illinois fax: 217-244-6678Champaign, IL 61820 On Sep 9, 2008, at

Re: [R] cluster/snow question

2008-09-09 Thread tolga . i . uzuner
Understood, that's what I'll do. I'm thinking of exporting the number of nodes to all nodes and passing in the node rank as 1:nonodes through clusterApply. Thanks all, Tolga Luke Tierney [EMAIL PROTECTED] 09/09/2008 20:11 To [EMAIL PROTECTED] cc [EMAIL PROTECTED], r-help@r-project.org

Re: [R] Information on the number of CPU's

2008-09-09 Thread Prof Brian Ripley
On Tue, 9 Sep 2008, [EMAIL PROTECTED] wrote: Dear R Users, I am on Windows XP SP2 platform, using R version 2.7.2 . I was wondering if there is a way to find out, within R, the number of CPU's on my machine ? I would use this information to set the number of nodes in a cluster, depending on the

Re: [R] Information on the number of CPU's

2008-09-09 Thread tolga . i . uzuner
Many thanks, that's very helpful. Regards, Tolga - Original Message - From: Prof Brian Ripley [EMAIL PROTECTED] Sent: 09/09/2008 20:57 CET To: Tolga Uzuner Cc: r-help@r-project.org Subject: Re: [R] Information on the number of CPU's On Tue, 9 Sep 2008, [EMAIL PROTECTED] wrote: Dear

Re: [R] Modality Test

2008-09-09 Thread Mark Difford
Hi Amin, First, does R have a package that can implement the multimodality test, e.g., the Silverman test, DIP test, MAP test or Runt test. Jeremy Tantrum (a Ph.D. student of Werner Steutzle's, c. 2003/04) did some work on this. There is some useful code on Steutzle's website:

Re: [R] Modality Test

2008-09-09 Thread Mark Difford
Whoops! I think that should be Stuetzle --- though I very much doubt that he reads the list. Mark Difford wrote: Hi Amin, First, does R have a package that can implement the multimodality test, e.g., the Silverman test, DIP test, MAP test or Runt test. Jeremy Tantrum (a Ph.D. student

Re: [R] Information on the number of CPU's

2008-09-09 Thread Luke Tierney
The wmic command line utility can also be used to query this; on a dual-core Vista laptop I get C:\Users\lukewmic cpu get NumberOfCores,NumberOfLogicalProcessors NumberOfCores NumberOfLogicalProcessors 2 2 luke -- Luke Tierney University of Iowa

Re: [R] Modality Test

2008-09-09 Thread Mark Difford
Hi Amin, And I have just remembered that there is a function called curveRep in Frank Harrell's Hmisc package that might be useful, even if not quite in the channel of your enquiry. curveRep was added to the package after my struggles, so I never used it and so don't know how well it performs

[R] NMDS and varimax rotation

2008-09-09 Thread Bernd Panassiti
hello, subsequently to a NMDS analysis (performed with metaMDS or isoMDS) is it possible to rotate the axis through a varimax-rotation? Thanks in advance. Bernd Panassiti __ R-help@r-project.org mailing list

[R] csaps in R?

2008-09-09 Thread Dr Carbon
Is there is function in R equivalent to Matlab's csaps? I need a spline function with the same calculation of the smoothing parameter in csaps to compare some results. AFAIK, the spar in smooth.spline is related but not the same. __ R-help@r-project.org

[R] tsdiag error

2008-09-09 Thread rkevinburton
Does anyone know why I get the following error when trying tsdiag? Error in UseMethod(tsdiag) : no applicable method for tsdiag I am invoking it as: tsdiag(mar). Thank you. Kevin __ R-help@r-project.org mailing list

Re: [R] splitting time vector into days

2008-09-09 Thread stephen sefick
?aggregate ?window.zoo ?rollapply anyway have a look at package zoo On Tue, Sep 9, 2008 at 3:25 PM, Alexy Khrabrov [EMAIL PROTECTED] wrote: Greetings -- I have a dataframe a with one element a vector, time, of POSIXct values. What's a good way to split the data frame into periods of a$time,

Re: [R] NMDS and varimax rotation

2008-09-09 Thread stephen sefick
have you looked at the vegan viginette- I know there is proscrutes rotation. On Tue, Sep 9, 2008 at 3:54 PM, Bernd Panassiti [EMAIL PROTECTED] wrote: hello, subsequently to a NMDS analysis (performed with metaMDS or isoMDS) is it possible to rotate the axis through a varimax-rotation?

[R] building a package that contains S4 classes and methods

2008-09-09 Thread Marie Pierre Sylvestre
Hello R users, I am trying to make a my first package and I get an error that I can understand. The package is build out of three files (one for functions, 1 for s4 classes and 1 for s4 methods). Once I source them I run package.skeleton( name=TDC ) within a R session and I get Creating

Re: [R] splitting time vector into days

2008-09-09 Thread jim holtman
-09-08 14:00:00 3 4 2008-09-08 21:00:00 4 $`20080909` dates values 5 2008-09-09 04:00:00 5 6 2008-09-09 11:00:00 6 7 2008-09-09 18:00:00 7 $`20080910` dates values 8 2008-09-10 01:00:00 8 9 2008-09-10 08:00:00 9 10 2008-09-10 15

Re: [R] Help with 'spectrum'

2008-09-09 Thread Prof Brian Ripley
This is why some help pages have references: please use them (Venables Ripley explain the exact formulae used in R). On Tue, 9 Sep 2008, [EMAIL PROTECTED] wrote: For the command 'spectrum' I read: The spectrum here is defined with scaling 1/frequency(x), following S-PLUS. This makes the

Re: [R] Binning

2008-09-09 Thread jim holtman
This should do what you want. #--x - read.table('clipboard', header=TRUE, as.is=TRUE) # convert dates x$date - as.POSIXct(strptime(x$SampleDate, %m/%d/%Y)) # put ForkLength into bins x$bins - cut(x$ForkLength, breaks=c(32, 34, 37, 40), include.lowest=TRUE) # count the bins tapply(x$Count, x$bins,

Re: [R] naive variance in GEE

2008-09-09 Thread Thomas Lumley
On Mon, 8 Sep 2008, Qiong Yang wrote: Hi, The standard error from logistic regression is slightly different from the naive SE from GEE under independence working correlation structure. Yes Shouldn't they be identical? Anyone has insight about this? No, they shouldn't. They are different

Re: [R] naive variance in GEE

2008-09-09 Thread Thomas Lumley
Sorry, I misread your message. Prof Ripley is right, as usual -- the estimates use different stopping criteria and so are just numerically different. -thomas On Tue, 9 Sep 2008, Thomas Lumley wrote: On Mon, 8 Sep 2008, Qiong Yang wrote: Hi, The standard error from logistic

[R] [R-pkgs] survey package

2008-09-09 Thread Thomas Lumley
Version 3.9 of the survey package is now on CRAN. Since the last announcement (version 3.6-11, about a year ago) the main changes are - Database-backed survey objects: the data can live in a SQLite (or other DBI-compatible) database and be loaded as needed. - Ordinal logistic regression -

  1   2   >