Re: [R] How to Read a Large CSV into a Database with R

2010-11-15 Thread Lee Hachadoorian
On Mon, 15 Nov 2010 13:28:40 -0500, Anthony Damico wrote: Do you have any other ideas as to how I might diagnose what's going on here? Or, alternatively, is there some workaround that would get this giant CSV into a database? If you think there's a reasonable way to use the IMPORT command

[R] Fwd: indexing lists

2010-11-15 Thread Chris Carleton
Sorry folks, I keep forgetting to switch to my r-help email to send the replies so they get unintentionally sent to a moderator (particularly sorry for that moderators...) C -- Forwarded message -- From: Chris Carleton w.ccarle...@gmail.com Date: 15 November 2010 17:07 Subject:

Re: [R] Executing Command on Multiple R Objects

2010-11-15 Thread David Winsemius
On Nov 15, 2010, at 4:43 PM, patze003 wrote: Hello Everyone - I want to print a number of results from lme function objects out to a txt file. How could I do this more efficiently than what you see here: out2 - capture.output(summary(mod2a)) out3 - capture.output(summary(mod3)) out4 -

[R] Cannot install packages in R 2.12.0 on Windows 7

2010-11-15 Thread Dick Beyer
Hi, I am unable to install packages on my R 2.12.0 Windows 7 machine. Here are the relevant lines: sessionInfo() R version 2.12.0 (2010-10-15) Platform: x86_64-pc-mingw32/x64 (64-bit) locale: [1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252

Re: [R] hclust, does order of data matter?

2010-11-15 Thread Peter Langfelder
On Mon, Nov 15, 2010 at 2:07 PM, rchowdhury rchowdh...@alumni.upenn.edu wrote: Hello, I am using the hclust function to cluster some data.  I have two separate files with the same data.  The only difference is the order of the data in the file.  For some reason, when I run the two files

Re: [R] How to plot effect of x1 while controlling for x2

2010-11-15 Thread Joshua Wiley
Hi Mark, If the cumbersome part is that you have to create new data to use predict, then I think the answer is no, there is not an easier way. However, we can consider easy ways to make new data that fit with certain constraints (e.g., variables = their mean). Here's an example: ## original

Re: [R] hclust, does order of data matter?

2010-11-15 Thread Christian Hennig
I don't know how the hclust function is implemented, but generally in hierarchical clustering the result can be ambiguous if there are several distances of identical value in the dataset (or identical between-cluster distances occur when aggregating clusters). The role of the order of the data

Re: [R] hclust, does order of data matter?

2010-11-15 Thread Reshmi Chowdhury
Here is the code I am using: m - read.csv(data_unsorted.csv,header=TRUE) m - na.omit(m) cs - hclust(dist(t(m),method=euclidean),method=complete) ds - as.dendrogram(cs) In this case, m is a 106x40 matrix of doubles. When I change the order of the columns, I get different results... Thanks, RC

Re: [R] rotate column names in large matrix

2010-11-15 Thread Chris Stubben
You could display the matrix as an image then add column names (rotated used srt) if you really want them. x - cor(matrix(rnorm(600), 60, 100)) # set margins with extra space at top and xpd=TRUE to write outside plot region op-par(mar=c(1,1,5,1), xpd=TRUE) # display image without the 90

Re: [R] How to plot effect of x1 while controlling for x2

2010-11-15 Thread Greg Snow
Look at Predict.Plot (and TkPredict) from the TeachingDemos package. -- Gregory (Greg) L. Snow Ph.D. Statistical Data Center Intermountain Healthcare greg.s...@imail.org 801.408.8111 -Original Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r- project.org] On

[R] Sampling problem

2010-11-15 Thread wangwallace
Hey, I am hoping someone can help me with a sampling question. I have a data frame of 8 variables (the first column is the subjects' id): SubIDCSE1 CSE2 CSE3 CSE4 WSE1 WSE2 WSE3 WSE4 1 6 5 6 2 6 22 4 2 6 4

[R] Proportional hazard model with weibull baseline hazard

2010-11-15 Thread Lorena Avendaño
Dear R-users, I would like to fit a fully parametric proportional hazard model with a weibull baseline hazard and a logit link function. This is, the hazard function is: lambda_i (t) = lambda_0 (t) psi (x_i* beta) where lambda_0 is a weibull distribution and psi a logistic distribution. Does

Re: [R] indexing lists

2010-11-15 Thread Chris Carleton
Thanks for the suggestions. The issue for me is that the top level index is also like a database key so it might be a bit annoying to coerce it to char() so that I can reference it with a $ and then I would have to still be able to find out what the name was automatically. I've got a function

[R] Converting numbers to percentages for boxplots

2010-11-15 Thread daniel carawan
Hi, I need to convert general numbers to percentages in R to create a boxplot for purposes of comparison. Can you please tell me how I can do that? My data is attached as a csv file. Thanks. Daniel __ R-help@r-project.org mailing list

Re: [R] hclust, does order of data matter?

2010-11-15 Thread Peter Langfelder
On Mon, Nov 15, 2010 at 2:19 PM, Reshmi Chowdhury rchowdh...@alumni.upenn.edu wrote: Here is the code I am using: m - read.csv(data_unsorted.csv,header=TRUE) m - na.omit(m) cs - hclust(dist(t(m),method=euclidean),method=complete) ds - as.dendrogram(cs) As Christian said, you may want to

Re: [R] How to Read a Large CSV into a Database with R

2010-11-15 Thread Gabor Grothendieck
On Mon, Nov 15, 2010 at 1:28 PM, Anthony Damico ajdam...@gmail.com wrote: Hi Gabor, Thank you for your willingness to help me through this.  The code you sent works on my machine exactly the same way as it does on yours. Unfortunately, when I run the same code on the 1.3GB file, it creates

Re: [R] indexing lists

2010-11-15 Thread David Winsemius
On Nov 15, 2010, at 5:07 PM, Chris Carleton wrote: Thanks for the suggestions. The issue for me is that the top level index is also like a database key so it might be a bit annoying to coerce it to char() so that I can reference it with a $ and then I would have to still be able to find

Re: [R] How to plot effect of x1 while controlling for x2

2010-11-15 Thread Bert Gunter
... and/or perhaps ?coplot in base R graphics or ?xyplot in trellis. -- Bert Gunter On Mon, Nov 15, 2010 at 2:25 PM, Greg Snow greg.s...@imail.org wrote: Look at Predict.Plot (and TkPredict) from the TeachingDemos package. -- Gregory (Greg) L. Snow Ph.D. Statistical Data Center

Re: [R] comparing levels of aggregation with negative binomial models

2010-11-15 Thread Ben Bolker
Ben Raymond Ben.Raymond at rhul.ac.uk writes: Dear R community, I would like to compare the degree of aggregation (or dispersion) of bacteria isolated from plant material. My data are discrete counts from leaf washes. While I do have xy coordinates for each plant, it is

Re: [R] Converting numbers to percentages for boxplots

2010-11-15 Thread David Winsemius
On Nov 15, 2010, at 5:33 PM, daniel carawan wrote: Hi, I need to convert general numbers to percentages in R to create a boxplot for purposes of comparison. Can you please tell me how I can do that? My data is attached as a csv file. No, it's not. Please read the Posting Guide. (.csv is

Re: [R] Zero truncated Poisson distribution amp; R2WinBUGS

2010-11-15 Thread Ben Bolker
julien martin julemartin0320 at gmail.com writes: (2) The second part of the script calls WinBUGS and run the binomial mixture models on the count data. In this case the count matrix y was converted to a vector C1 before being passed over to BUGS Any idea how to create a zero truncated

[R] Integrating functions / vector arithmetic

2010-11-15 Thread Eduardo de Oliveira Horta
Hello, I was trying to build some functions which I would like to integrate over an interval using the function 'integrate' from the 'stats' package. As an example, please consider the function h(u)=sin(pi*u) + sqrt(2)*sin(pi*2*u) + sqrt(3)*sin(pi*3*u) + 2*sin(pi*4*u) Two alternative ways to

Re: [R] About upgrade R

2010-11-15 Thread csrabak
Em 14/11/2010 20:42, csrabak escreveu: Em 14/11/2010 18:24, Tal Galili escreveu: Hi John, thank you for that input. It could be that the code I wrote here: http://www.r-statistics.com/2010/04/changing-your-r-upgrading-strategy-and-the-r-code-to-do-it-on-windows/ Should be updated so every

Re: [R] Bootstrap error message: Error in statistic(data, origina l, ...) : unused argument(s) ( ...) [Broadcast]

2010-11-15 Thread German
Hey everybody. I found the solution to the problem - at least for univariate statistics. In order to use the bootstrap method for an arbitrary statistic, the syntax for the function has to be in the form: Given a univariate formula with argument 'x': myfunction - function(x,p){formula(x[p])}

[R] Aggregate on identical link attributes

2010-11-15 Thread LCOG1
Hello R community, I need to do some aggregation based on the test data below. The below code works ok, but when its applied to my real data which includes over 9,000 records the process runs for over an hour. I know there is a more efficient way of doing this. I want to Sum the below

[R] Batch Processing Files

2010-11-15 Thread Nate Miller
Hi All! I have some experience with R, but less experience writing scripts using R and have run into a challenge that I hope someone can help me with. I have multiple .csv files of data each with the same 3 columns of data, but potentially of varying lengths (some data files are from short

Re: [R] Integrating functions / vector arithmetic

2010-11-15 Thread David Winsemius
On Nov 15, 2010, at 7:18 PM, Eduardo de Oliveira Horta wrote: Hello, I was trying to build some functions which I would like to integrate over an interval using the function 'integrate' from the 'stats' package. As an example, please consider the function h(u)=sin(pi*u) +

Re: [R] Integrating functions / vector arithmetic

2010-11-15 Thread Eduardo de Oliveira Horta
Thanks! Seems to work just fine! Best regards, Eduardo Horta On Mon, Nov 15, 2010 at 10:37 PM, David Winsemius dwinsem...@comcast.netwrote: On Nov 15, 2010, at 7:18 PM, Eduardo de Oliveira Horta wrote: Hello, I was trying to build some functions which I would like to integrate over an

Re: [R] indexing lists

2010-11-15 Thread Chris Carleton
Thanks for the suggestions, but 'cat' is not causing name space conflicts for me and since I'm not packaging the code for anyone else to use, I'm less than concerned about potential conflicts. I did type that too quickly, and I have resolved my problem using a workaround that does not involve

Re: [R] Sampling problem

2010-11-15 Thread Michael Bedward
Hello, Is this what you want ? sampleX - function(X, nGrp1, nsamples) # X is matrix or data.frame with cols for two groups of variables # with grp1 in cols 2:5 and grp2 in cols 6:9 # # nGrp1 - number of variables to sample from group 1 # # nsamples - number of rows in output matrix if (nGrp1

Re: [R] indexing lists

2010-11-15 Thread David Winsemius
On Nov 15, 2010, at 7:58 PM, Chris Carleton wrote: Thanks for the suggestions, but 'cat' is not causing name space conflicts for me install.packages( library(fortunes) fortune(dog) and since I'm not packaging the code for anyone else to use, I'm less than concerned about potential

Re: [R] Proportional hazard model with weibull baseline hazard

2010-11-15 Thread Dennis Murphy
Hi: Perhaps this post from earlier today will be useful: http://r.789695.n4.nabble.com/Re-interpretation-of-coefficients-in-survreg-AND-obtaining-the-hazard-function-td3043160.html#a3043160 HTH, Dennis 2010/11/15 Lorena Avendaño mlorena.avend...@gmail.com Dear R-users, I would like to fit a

[R] Debugging segfault in foreach

2010-11-15 Thread Steve Lianoglou
Hi, I'm using R-2.12 on a linux 64bit machine. When I run a chunk of code inside a foreach() %do% { ...} or %dopar% {...} (with doMC backend) I keep getting a segfault. Running the *same* code within lapply(something, function(x) ... ) doesn't result in any segfaults. I'll paste the output

[R] AOV/LME

2010-11-15 Thread janesense
Hi everyone, I'm having a little trouble with working out what formula is better to use for a repeated measures two way anova. I have two factors, L (five levels) and T (two levels). L and T are both crossed factors (all participants do all combinations). So, I do:

Re: [R] How to Read a Large CSV into a Database with R

2010-11-15 Thread Seth Falcon
Hi Anthony, On Mon, Nov 15, 2010 at 7:07 AM, Anthony Damico ajdam...@gmail.com wrote: Hi, I'm working in R 2.11.1 x64 on Windows x86_64-pc-mingw32.  I'm trying to insert a very large CSV file into a SQLite database.  I'm pretty new to working with databases in R, so I apologize if I'm

Re: [R] Batch Processing Files

2010-11-15 Thread Dennis Murphy
Hi: See inline. On Mon, Nov 15, 2010 at 4:26 PM, Nate Miller natemille...@gmail.com wrote: Hi All! I have some experience with R, but less experience writing scripts using R and have run into a challenge that I hope someone can help me with. I have multiple .csv files of data each with

[R] Aligning points and boxes in legend

2010-11-15 Thread mark wallace
Hi list, I have been trying to create a legend for a plot that includes polygons, lines, and points. I've included an example below. I have two problems with this legend. 1. I can't seem to change the colour of the box that represents the polygon in the legend. 2. The boxes and points in the

Re: [R] About upgrade R

2010-11-15 Thread Tal Galili
Hello Cesar, Thank you for the reply. Another question I have is if it is possible to detect the library path of an old R install, from the terminal of the new R install. Cheers, Tal Contact Details:--- Contact me:

Re: [R] Cannot install packages in R 2.12.0 on Windows 7

2010-11-15 Thread Prof Brian Ripley
The error seems to be a BioC problem[*] and associated with having a package installed with an empty DESCRIPTION file. So please take a look at your installed packages and remove any such. The following R code may help: for(lib in .libPaths()) { descs - Sys.glob(file.path(lib, *,

<    1   2