Re: [R] Function to locate points in 3d octants or points on two axes
On Tue, Feb 01, 2011 at 08:30:22PM -0500, Bryan Hanson wrote: [Sorry, resending with a proper subject line!] Hi Guru's... I have a set of points that may lie along any of the x, y and z axes in a Cartesian coordinate system. I am hoping that a function exists which will determine if any two selected points are on different axes, i.e, if the one of the points is on x and the other on y or z, not elsewhere on the x axis. Put another way, I need to determine if the triangle formed by the two points and the origin lies in the xy, xz or yz planes. This might be as simple as testing if any particular value is zero, i.e. if the x coordinate is zero, then the points must be on the z and y axes and the triangle in the yz plane. But, I'm looking for a fairly general solution, one that also returns the appropriate plane as the answer. Very closely related to this, I could use a function that determines which of the 8 octants a point lies in. Seems like the cross product might be part of this, but I'm a little rusty on how to apply it. I hope this is clear enough, and someone has a suggestion to point me in the right direction. Before writing my own klunky version, I thought I'd ask. Hi. I think that for suggesting an appropriate solution it may be needed to know, which data structure is used for the input pairs of points. For example, it may a single matrix n times 3 with points as rows and a pair is represented by two indices of the points. Alternatively, the input may be a single matrix n times 6, where rows are pairs of points. In any case, the input may be simplified using sign() function. For example a - as.matrix(expand.grid(x=c(-1.1, 0, 1.1), y=c(0, 1.2), z=c(0, 1.3))) a x y z [1,] -1.1 0.0 0.0 [2,] 0.0 0.0 0.0 [3,] 1.1 0.0 0.0 [4,] -1.1 1.2 0.0 [5,] 0.0 1.2 0.0 [6,] 1.1 1.2 0.0 [7,] -1.1 0.0 1.3 [8,] 0.0 0.0 1.3 [9,] 1.1 0.0 1.3 [10,] -1.1 1.2 1.3 [11,] 0.0 1.2 1.3 [12,] 1.1 1.2 1.3 sign(a) x y z [1,] -1 0 0 [2,] 0 0 0 [3,] 1 0 0 [4,] -1 1 0 [5,] 0 1 0 [6,] 1 1 0 [7,] -1 0 1 [8,] 0 0 1 [9,] 1 0 1 [10,] -1 1 1 [11,] 0 1 1 [12,] 1 1 1 This output represents a classification of the points into a finite number of regions and keeps the information needed for any of the tasks, which you mention. Hope this helps. Petr Savicky. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Hidden environment
On Tue, 1 Feb 2011, Joel wrote: Hi I wondering if its possible to make a hidden environment for a package, so the user cant see whats in it but the diffrent functions inside the package can use the variables in the environment. Up to a point: 1) See the manual about the concept of a name space. 2) A package can also have one or more private environments, using new.env() or local(): grDevices is an example. However, R is Open Source so a determined user can find such environments: hence 'the user cant see whats in it' (is your ' key broken?) is not really achievable. Hope you can understand what Im after. Joel -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Finding the maximum in a particular group in a dataframe
Hello I am trying to find a way to find the max value, for only a subset of a dataframe, depending on how the data is grouped for example, How would I find the maxmium responce, for all the GPR119a condition below: I've tried tapply tapply(GPR119data$responce, GPR119data$GPR119a, max) Error in tapply(GPR119data$responce, GPR119data$GPR119a, max) : arguments must have same length responce,mouce,condition 0.105902,KO,con 0.232018561,KO,con 0.335008375,KO,con 0.387025433,KO,GPR119a 0.576769897,KO,GPR119a 0.645120419,KO,GPR119a 0.2538608,KO,GPR119b 0.183061952,KO,GPR119b 0.824035587,KO,GPR119b 0.399201597,KO,GPR119c 0.417006618,KO,GPR119c 0.572958834,KO,GPR119c 0.229467444,KO,GPR119d 0.294089745,KO,GPR119d 0.309964445,KO,GPR119d 0.30474325,KO,GPR119e 0.159374839,KO,GPR119e 0.467726848,KO,GPR119e 1.01841912,KO,GPR119f 0.423028621,KO,GPR119f 0.223588597,KO,GPR119f Thank [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] combining vectors into list
Hi R users I have these two vectors: Row - sample(1:25, 10) Col - sample(1:25, 10) Is there a way to combine them into a list, whose first component is a vector containing the first element of Row and the first element of Col, the second component is a vector containing the second element of Row and the second element of Col, and so on... Thanks you Lorenzo [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Why my package is not being generate?
Hi all, I have commited a new release of my package and waited for more than one day. After that install.packages(tests,repos=http://r-forge.r-projet.org;) still returns package 'tests' is not available. I have done the local build check and the package seems ok. Also have mofified the Title in the DESCRIPTION file. What can be wrong? Is there any tool or recomendation to validade my package before submit it? -- Cristiano Strieder São José dos Campos - SP [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] error in scan(...
I know it's a common error and there is a lot of help available but still can't resolve the issue: all i am trying to do is to read a csv file from my folder and this is what i get: Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, : scan() expected 'a real', got '1,m,a,F,165,240,26.5,31,0.738,0.704,1.095,0.606,0.847' Can you help? -- View this message in context: http://r.789695.n4.nabble.com/error-in-scan-tp3253487p3253487.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] reshape data, adding rows to dataframe
Hello everyone, I have a data set like this: head( fish_transect) ID_TRANSECT ID_PROJECT DE_ZONE DE_LOCALITY DE_SECTOR MES 1 42 MB TarragonaCreixell Control I 9 2 42 MB TarragonaCreixell Control I 9 3 42 MB TarragonaCreixell Control I 9 4 42 MB TarragonaCreixell Control I 9 5 42 MB TarragonaCreixell Control I 9 6 42 MB TarragonaCreixell Control I 9 ID_SPECIES WEIGHT SIZE NFAMILIA 1 Spondyliosoma cantharus15.64 10 1Sparidae 2 Symphodus melanocercus 11.21 10 1 Labridae 3 Diplodus vulgaris 30.20 10 2 Sparidae 4 Diplodus vulgaris 52.24 12 2 Sparidae 5 Diplodus sargus221.4114 5Sparidae 6 Diplodus annularis 3.47 6 1Sparidae I have been trying to duplicate the rows where N 1, that is I want a row for each animal. Right now as you can see I have for example 5 D. sargus which are 14 cm length and so on. How can I get 1 row for each animal? I've been trying the reshape function without success. I also tried in access but couldn't do it either. Thanks a lot, Lucia -- View this message in context: http://r.789695.n4.nabble.com/reshape-data-adding-rows-to-dataframe-tp3253640p3253640.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Counter in a For Loop - Efficiency Issue
I have a 32000 x 14 matrix (M) where entry (x, y) corresponds to person x at a discrete time y. I then have a matrix (M2) of 6 x 2 where each entry is a an event by a person that is in the above 32000 and at a time that is in the range of the discrete time points above. I want to populate the another matrix (M3) such that (x, y) is the number of events of person x between times y-1 and y. This is simple to code, (I just put the discrete time points and the 32000 people in their own vectors, V1 and V2): for (i in 1:32000){ for (j in 1:6){ for (k in 2:14){ if (M2[1, j] == V2[i] M2[2, j] V1[k-1] M2[2, j] = V1[k]) M3[i, k] = M3[i, k]+1 }}} This would work im sure, the problem is that it takes a very long time. Any advice on helping it run faster would be greatly appreciated. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Preparing dataset for glmnet: factors to dummies
Hi Frank, I believe that glmnet scales variables by their standard deviations. This would not be appropriate for categorical predictors. That's an excellent point, which many are likely to forget (including me) since one is using a model matrix. The default argument is to standardize inputs, but there is an option to turn it off. (One could then standardize continuous inputs on different scales oneself.) Regards, Mark. -- View this message in context: http://r.789695.n4.nabble.com/Preparing-dataset-for-glmnet-factors-to-dummies-tp3250791p3253538.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] error in density plot
Hi, I have the code for the density plot j - 8 plot(density(diff_in_sample[,1]), main = list.files()[j]) for(i in 1:25){ lines(density(diff_in_sample[,i])) } This gives me an error Error in density.default(diff_in_sample[, i]) : 'x' contains missing values I am not quite sure what to do. I have the same code doing plots for otherfiles but just one is not working. what xactly it means missing values even if there is there how can i tell it to overlook those any help would be appreciated Thanks Ramya -- View this message in context: http://r.789695.n4.nabble.com/error-in-density-plot-tp3253489p3253489.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] How to Plot Two Curves Into One Page
On Tue, 1 Feb 2011 14:20:51 +0900 Gundala Viswanath gunda...@gmail.com wrote: I have a R script that contain these lines for plotting: plot(foo,lwd=2,lty=3,col=red, main=); plot(bar,lwd=2,lty=3,col=blue); legend(0.6,0.6,c('Default','Probabilistic'), col=c('red','blue'),lwd=3); But it generate 1 file (Rplot.pdf) with two pages. Each page for 1 plot. Is there a way I can put them together in to one page? - G.V. Hello Gundala, If these are fairly simple plots (i.e. curve, points, bar, step) you can arrange your data as columns in a matrix and use matplot(). matplot(vectorOfxValues, FooBarAsMatrix, lwd=2, lty=3, col=c(red, blue)) legend(0.6,0.6,c('Default','Probabilistic'), col=c('red','blue'),lwd=3) Regards, Edwin -- Dr. Edwin Groot, postdoctoral associate AG Laux Institut fuer Biologie III Schaenzlestr. 1 79104 Freiburg, Deutschland +49 761-2032945 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] RMySQL, cant find MySQL
Hi I'm having some trouble with getting RMySQL installed on my Ubuntu system. As the apt-get install version of R is 2.10.x I've reinstalled 2.12 the old fashion way by ./conf and make. But now when I shall install RMySQL I get Configuration error: could not find the MySQL installation include and/or library directories. Manually specify the location of the MySQL libraries and the header files and re-run R CMD INSTALL. And in truth I cant find the headers or libraries anywhere on my comp, but I have MySQL up and running. I did install MySQL via apt-get so if anyone know where it put the different files I would love to know :). Thx for the help Joel -- View this message in context: http://r.789695.n4.nabble.com/RMySQL-cant-find-MySQL-tp3253689p3253689.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] help
We don't need a loop! require(Rmpfr) factorial(mpfr(1:500,3800)) 2011/2/2 Waclaw Kusnierczyk w...@idi.ntnu.no library(bc) factorial = function(n) bc(sprintf(' define factorial(n) { if (n 2) return (1) f = 2 i = 2 while (i n) f *= ++i return (f) } factorial(%d)', n)) -- Mi³ego dnia [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] combining vectors into list
Hi, Try this: mapply(FUN=c, Row, Col, SIMPLIFY=FALSE) HTH, Ivan Le 2/2/2011 08:55, Lorenzo Cattarino a écrit : Hi R users I have these two vectors: Row- sample(1:25, 10) Col- sample(1:25, 10) Is there a way to combine them into a list, whose first component is a vector containing the first element of Row and the first element of Col, the second component is a vector containing the second element of Row and the second element of Col, and so on... Thanks you Lorenzo [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Ivan CALANDRA PhD Student University of Hamburg Biozentrum Grindel und Zoologisches Museum Abt. Säugetiere Martin-Luther-King-Platz 3 D-20146 Hamburg, GERMANY +49(0)40 42838 6231 ivan.calan...@uni-hamburg.de ** http://www.for771.uni-bonn.de http://webapp5.rrz.uni-hamburg.de/mammals/eng/1525_8_1.php __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] reshape data, adding rows to dataframe
Hi, Not sure what you want to do, but take a look at ?duplicated, ?unique, and maybe ?aggregate; it might be what you're looking for. HTH, Ivan Le 2/2/2011 10:13, Lucia Rueda a écrit : Hello everyone, I have a data set like this: head( fish_transect) ID_TRANSECT ID_PROJECT DE_ZONE DE_LOCALITY DE_SECTOR MES 1 42 MB TarragonaCreixell Control I 9 2 42 MB TarragonaCreixell Control I 9 3 42 MB TarragonaCreixell Control I 9 4 42 MB TarragonaCreixell Control I 9 5 42 MB TarragonaCreixell Control I 9 6 42 MB TarragonaCreixell Control I 9 ID_SPECIES WEIGHT SIZE NFAMILIA 1 Spondyliosoma cantharus15.64 10 1Sparidae 2 Symphodus melanocercus 11.21 10 1 Labridae 3 Diplodus vulgaris 30.20 10 2 Sparidae 4 Diplodus vulgaris 52.24 12 2 Sparidae 5 Diplodus sargus221.4114 5Sparidae 6 Diplodus annularis 3.47 6 1Sparidae I have been trying to duplicate the rows where N 1, that is I want a row for each animal. Right now as you can see I have for example 5 D. sargus which are 14 cm length and so on. How can I get 1 row for each animal? I've been trying the reshape function without success. I also tried in access but couldn't do it either. Thanks a lot, Lucia -- Ivan CALANDRA PhD Student University of Hamburg Biozentrum Grindel und Zoologisches Museum Abt. Säugetiere Martin-Luther-King-Platz 3 D-20146 Hamburg, GERMANY +49(0)40 42838 6231 ivan.calan...@uni-hamburg.de ** http://www.for771.uni-bonn.de http://webapp5.rrz.uni-hamburg.de/mammals/eng/1525_8_1.php __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] RMySQL, cant find MySQL
On Wed, 2 Feb 2011, Joel wrote: Hi I'm having some trouble with getting RMySQL installed on my Ubuntu system. As the apt-get install version of R is 2.10.x I've reinstalled 2.12 the old fashion way by ./conf and make. But now when I shall install RMySQL I get Configuration error: could not find the MySQL installation include and/or library directories. Manually specify the location of the MySQL libraries and the header files and re-run R CMD INSTALL. And in truth I cant find the headers or libraries anywhere on my comp, but I have MySQL up and running. I did install MySQL via apt-get so if anyone know where it put the different files I would love to know :). Well, you should ask such questions on R-sig-debian or R-sig-db, but I presume you forgot that you would need MySQL-dev (or whatever it is called on Ubuntu), the development part of the MySQL installation. Thx for the help Joel -- View this message in context: http://r.789695.n4.nabble.com/RMySQL-cant-find-MySQL-tp3253689p3253689.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html PLEASE do as we ask. -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Help in getting info from a DataFrame
Hi r-help-boun...@r-project.org napsal dne 31.01.2011 15:51:16: snip Hi thanks. I am not sure if I have understood 100% but at least I know now that there's a diference. About this: lapsales Store Prod qtd 5 Aveiro Lapiseira 3 8 Coimbra Lapiseira 1 9 Setubal Lapiseira 1 lapsales[max(lapsales$qtd),Store] [1] Setubal Levels: Aveiro Coimbra Evora lx Setubal why Setubal and not Aveiro? Can you explain me the result? Probably qtd is not what you think it is. I get dados- data.frame(Store = c (Setubal,lx,Aveiro,Coimbra,Aveiro,Evora,Aveiro,Coimbra,Setubal),Prod=c(BlocoDesenho,BlocoDesenho,Tinteiro,Régua,Lapiseira,Regua,Tinteiro,Lapiseira,Lapiseira),qtd=c(2,1,2,1,3,1,1,1,1)) lapsales - dados[dados$Prod==Lapiseira, ] lapsales[which.max(lapsales[ , qtd ]), Store] [1] Aveiro Levels: Aveiro Coimbra Evora lx Setubal and str(lapsales) 'data.frame': 3 obs. of 3 variables: $ Store: Factor w/ 5 levels Aveiro,Coimbra,..: 1 2 5 $ Prod : Factor w/ 5 levels BlocoDesenho,..: 2 2 2 $ qtd : num 3 1 1 Maybe you get something different. Regards Petr thanks AD -- View this message in context: http://r.789695.n4.nabble.com/Help-in-getting- info-from-a-DataFrame-tp3247740p3248883.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Counter in a For Loop - Efficiency Issue
On Wed, Feb 02, 2011 at 09:35:28AM +0200, Leendert Punt wrote: I have a 32000 x 14 matrix (M) where entry (x, y) corresponds to person x at a discrete time y. I then have a matrix (M2) of 6 x 2 where each entry is a an event by a person that is in the above 32000 and at a time that is in the range of the discrete time points above. I want to populate the another matrix (M3) such that (x, y) is the number of events of person x between times y-1 and y. This is simple to code, (I just put the discrete time points and the 32000 people in their own vectors, V1 and V2): for (i in 1:32000){ for (j in 1:6){ for (k in 2:14){ if (M2[1, j] == V2[i] M2[2, j] V1[k-1] M2[2, j] = V1[k]) M3[i, k] = M3[i, k]+1 }}} Consider the following approach, which uses M2, V1 and V2 as used in the code above. V1 - c(0, 10, 20, 30, 40) V2 - 1:3 # list of persons in the required order M2 - rbind( c(1, 2, 1, 1, 3, 1, 3, 1), c(11, 21, 21, 21, 11, 21, 11, 31)) person - factor(M2[1, ], levels=V2) interval - cut(M2[2, ], breaks=V1) M3 - table(person, interval) M2 [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [1,]12113131 [2,] 11 21 21 21 11 21 11 31 M3 interval person (0,10] (10,20] (20,30] (30,40] 1 0 1 3 1 2 0 0 1 0 3 0 2 0 0 Does this approach work for your data? Petr Savicky. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] sum the values in a vector as a complete number
Hi r-help-boun...@r-project.org napsal dne 01.02.2011 07:02:50: Hi AD, You might try the following: # data a - c(2,3,5) b - c(8,7) # you got this wrong ;) # option 1 foo - function(x) as.numeric(paste(x, sep = , collapse = )) # examples foo(a) # [1] 235 foo(b) # [1] 87 foo(a) + foo(b) # [1] 322 # option 2 foo2 - function(x, y) foo(x) + foo(y) # example foo2(a, b) # [1] 322 See ?paste and ?as.numeric for more information. HTH, Jorge Above solutions are maybe quicker but here is one plain numeric. fff-function(x) rev(10^(0:(length(x)-1))) sum(a*fff(a))+sum(b*fff(b)) [1] 322 or you can put the whole operation inside a function fff-function(x) sum(x*rev(10^(0:(length(x)-1 fff(a)+fff(b) Regards Petr On Mon, Jan 31, 2011 at 11:22 PM, ADias wrote: Hi I am trying to create a function that is able to calculate this sum: a-c(2,3,5) b-(8,7) with a meaning 235 and b 87. So the result of this sum would be 235 + 87 = 322. I've searched a function like strsplit but that worked for integers and in reverse - not spliting but combining. Can you give me a hand on this please? thanks AD -- View this message in context: http://r.789695.n4.nabble.com/sum-the-values-in-a-vector-as-a-complete- number-tp3250470p3250470.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] reshape data, adding rows to dataframe
On Wed, Feb 02, 2011 at 01:13:11AM -0800, Lucia Rueda wrote: Hello everyone, I have a data set like this: head( fish_transect) ID_TRANSECT ID_PROJECT DE_ZONE DE_LOCALITY DE_SECTOR MES 1 42 MB TarragonaCreixell Control I 9 2 42 MB TarragonaCreixell Control I 9 3 42 MB TarragonaCreixell Control I 9 4 42 MB TarragonaCreixell Control I 9 5 42 MB TarragonaCreixell Control I 9 6 42 MB TarragonaCreixell Control I 9 ID_SPECIES WEIGHT SIZE NFAMILIA 1 Spondyliosoma cantharus15.64 10 1Sparidae 2 Symphodus melanocercus 11.21 10 1 Labridae 3 Diplodus vulgaris 30.20 10 2 Sparidae 4 Diplodus vulgaris 52.24 12 2 Sparidae 5 Diplodus sargus221.4114 5Sparidae 6 Diplodus annularis 3.47 6 1Sparidae I have been trying to duplicate the rows where N 1, that is I want a row for each animal. Right now as you can see I have for example 5 D. sargus which are 14 cm length and so on. How can I get 1 row for each animal? I've been trying the reshape function without success. I also tried in access but couldn't do it either. Hello. Let me use a simpler example for testing. dat - data.frame(animal=c(a, b, c), N=c(2, 1, 3)) dat animal N 1 a 2 2 b 1 3 c 3 Is the following operation what you want in terms of this small example? expand - dat[rep(1:nrow(dat), times=dat$N), ] rownames(expand) - NULL expand animal N 1 a 2 2 a 2 3 b 1 4 c 3 5 c 3 6 c 3 Hope this helps. Petr Savicky. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] How column names/row names are preserved in matrix calculation?
Can somebody tell me that, if I do some arithmetic calculation over 2 matrices then how the column names and row names are preserved? It seems that, for multiplication, column names and row names of the 2nd matrix are preserved and for additional, there seems not having any explicit rule: set.seed(1) dat1 - matrix(rnorm(25), 5); colnames(dat1) = rownames(dat1) = paste(mat1, 1:5, sep=) dat2 - matrix(rnorm(25), 5); colnames(dat2) = rownames(dat2) = paste(mat2, 1:5, sep=) dat3 - matrix(rnorm(25), 5); colnames(dat3) = rownames(dat3) = paste(mat3, 1:5, sep=) dat1 mat11 mat12 mat13 mat14 mat15 mat11 -0.6264538 -0.8204684 1.5117812 -0.04493361 0.91897737 mat12 0.1836433 0.4874291 0.3898432 -0.01619026 0.78213630 mat13 -0.8356286 0.7383247 -0.6212406 0.94383621 0.07456498 mat14 1.5952808 0.5757814 -2.2146999 0.82122120 -1.98935170 mat15 0.3295078 -0.3053884 1.1249309 0.59390132 0.61982575 dat2 mat21 mat22 mat23 mat24 mat25 mat21 -0.05612874 1.35867955 -0.4149946 -0.1645236 -0.7074952 mat22 -0.15579551 -0.10278773 -0.3942900 -0.2533617 0.3645820 mat23 -1.47075238 0.38767161 -0.0593134 0.6969634 0.7685329 mat24 -0.47815006 -0.05380504 1.1000254 0.5566632 -0.1123462 mat25 0.41794156 -1.37705956 0.7631757 -0.6887557 0.8811077 dat1 %*% dat2 + dat3 mat21 mat22 mat23 mat24 mat25 mat11 -1.2567987 0.5365897 3.5473394 0.89542630 2.5962144 mat12 -0.9370075 -1.0928568 0.2483355 -2.23467503 0.3284073 mat13 0.7665543 -2.6496778 1.8774045 1.45703219 0.9533289 mat14 0.7245671 4.5147054 -1.3445549 0.02866729 -5.356 mat15 -0.2173109 -0.1053570 0.3000135 2.88350373 -0.2541405 Can somebody finalize me what is the **rule** for preserving column names and row names for matrix arithmatics? Thanks, [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Help in getting info from a DataFrame
Hi r-help-boun...@r-project.org napsal dne 02.02.2011 11:14:18: Hi r-help-boun...@r-project.org napsal dne 31.01.2011 15:51:16: snip Hi thanks. I am not sure if I have understood 100% but at least I know now that there's a diference. About this: lapsales Store Prod qtd 5 Aveiro Lapiseira 3 8 Coimbra Lapiseira 1 9 Setubal Lapiseira 1 lapsales[max(lapsales$qtd),Store] [1] Setubal Levels: Aveiro Coimbra Evora lx Setubal why Setubal and not Aveiro? Can you explain me the result? Sorry I did not look to your code and used the code I sent you as a solution What do you expect max(lapsales$qtg) give you as a result? If you changed the number qtd to 4 instead of 3 you will get lapsales[max(lapsales$qtd),Store] [1] NA Levels: Aveiro Coimbra Evora lx Setubal So R give you what you ask for. You asked for third line and you got it. If you asked for which position is the maximum number lapsales[which.max(lapsales$qtd),Store] [1] Aveiro Levels: Aveiro Coimbra Evora lx Setubal You will get desired result. Regards Petr Probably qtd is not what you think it is. I get dados- data.frame(Store = c (Setubal,lx,Aveiro,Coimbra,Aveiro,Evora,Aveiro,Coimbra,Setubal),Prod=c (BlocoDesenho,BlocoDesenho,Tinteiro,Régua,Lapiseira,Regua,Tinteiro,Lapiseira,Lapiseira),qtd=c (2,1,2,1,3,1,1,1,1)) lapsales - dados[dados$Prod==Lapiseira, ] lapsales[which.max(lapsales[ , qtd ]), Store] [1] Aveiro Levels: Aveiro Coimbra Evora lx Setubal and str(lapsales) 'data.frame': 3 obs. of 3 variables: $ Store: Factor w/ 5 levels Aveiro,Coimbra,..: 1 2 5 $ Prod : Factor w/ 5 levels BlocoDesenho,..: 2 2 2 $ qtd : num 3 1 1 Maybe you get something different. Regards Petr thanks AD -- View this message in context: http://r.789695.n4.nabble.com/Help-in-getting- info-from-a-DataFrame-tp3247740p3248883.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] kmeans: number of cluster centres must lie between 1 and nrow(x)
Thank you for the suggestion and it is exactly as you said only one observation in each cluster. I know I can avoid this anyway and I am just out of curiosity of the error. I am writing a special algorithm to cluster some datasets with different numbers of observations. For some particular datasets, there is only one observation(e.g. people died of a rare disease). kmeans() thus will not work at this situation. Feng 2011/2/2 Rafael Björk rafael.bj...@gmail.com If you change the algorithm, the function allow you to do this: kmeans(a, 20,algorithm=Lloyd) Maybe i'm missing something here, but why would you want to create as many clusters as there are observations? Won't the outcome just be one observation in each cluster? -- Feng Li Department of Statistics Stockholm University 106 91 Stockholm, Sweden http://feng.li/ [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Odp: Read a similarity matrix from excel
Hi r-help-boun...@r-project.org napsal dne 01.02.2011 10:20:48: Good morning, I have an excel spreadsheet with similarities among objects. The format of the file is the following: 1st row: empty cell,object-1-name,object-name-2,...,object-N 2nd row: object-name-1,0,s1,2,s1,3,...,s1,N 3rd row: object-name-2,s2,1,0,s2,3,...,s2,N The table is symmetrical and contains the similarities among the objects (sx,k stands for similarity among x and k). The first row contains the name of the objects (strings of characters) in the same order that these same names are contained in the first column. The cell 1,1 is empty. How can I read this in R so that after reading it I can perform a hierarchical clustering. The table is 78x78 so it is relatively easy to manually change something if it makes it easier to read. Select your table in Excel Press Ctrl-C In R do test-read.delim(clipboard) you shall get something like that test X a b c 1 a 1 23 3 2 b 4 5 6 3 c 7 8 9 you has to get rid of first column mat-as.matrix(test[,-1]) and if you want you can put row.names tou your matrix row.names(mat)-colnames(mat) mat a b c a 1 23 3 b 4 5 6 c 7 8 9 row.names(mat)-test$X mat a b c a 1 23 3 b 4 5 6 c 7 8 9 Regards Petr I found many articles about reading a data table with variables and then constructing the distance matrix from within R, but I have the similarity matrix instead. Thank you for your help George __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] pass nrow(x) to dots in function(x){plot(x,...)}
Dear Rers, I have a function to barplot() a matrix, eg myfun - function(x, ...) { barplot(x , ... )} (The real function is more complicated, it does things to the matrix first.) So I can do: m1 - matrix(1:20,4) myfun(m1) myfun(m1, main=My title) I'd like to be able to add the number of rows of the matrix passed to the function to the ... argument, eg myfun(m1, main=paste(n=,ns)) where 'ns' would be nrow(m1) I've tried this but it doesn't work: myfun - function(x, ...) { ns - nrow(x) barplot(x , ... ) } myfun(m1, main=paste(n = ,ns) ) ns is not found So, basically, how do I assign an object inside a function that I can then access in the dots when executing the function? Many thanks Marianne -- Marianne Promberger PhD, King's College London http://promberger.info R version 2.12.0 (2010-10-15) Ubuntu 9.04 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Counter in a For Loop - Efficiency Issue
without a reproducible sample, it is hard to tell, but I will give it a shot. Maybe it's possible to merge your M with M2: merge(M, M2) If you only want to count times, you can use seq_along(x) in a by function eg: dat - data.frame(person=rep(c(1,2,3), each=5), time=rnorm(15)) by(dat$person, dat$person, function(x) seq_along(x)) Ofcourse you can use any of the other vectorisation functions (lapply, apply, aggregate, ) HTH Bart -- View this message in context: http://r.789695.n4.nabble.com/Counter-in-a-For-Loop-Efficiency-Issue-tp3253674p3253698.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] reshape data, adding rows to dataframe
Hi Ivan, Thanks for your reply. This is what I want to do: Imagine my dataset looks like this: Species N Size Coris julis 1 8 Coris julis 3 10 D.vulgaris 2 12 I have 1 C.julis of 8 cm, 3 C. julis of 10 cm and 2 D.vulgaris of 12 cm. I want 1 row for each animal, something like this: Species N Size Coris julis 1 8 Coris julis 1 10 Coris julis 1 10 Coris julis 1 10 D.vulgaris 1 12 D.vulgaris 1 12 I hope that helps. I tried reshape but couldn't do it and from what I've seen I don't think that aggregate, unique or duplicated would do it... but I'm not good with R so I can be wrong. Cheers! Lucia -- View this message in context: http://r.789695.n4.nabble.com/reshape-data-adding-rows-to-dataframe-tp3253640p3253780.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Manipulating XTS data
Hi, I'm pulling in data for the past 30 days or so. I'm going to slice each day into 5 minute time intervals, and I want to find the average value across all days for the given slice of time. That is, I want to know what the average value at 8:00 was for the 30 day period, the average value at 8:05, and so on. Is there a quick and easy way to do this with XTS. I figured i could skip loops somehow if I knew how to better use this package. Thanks! -Greg __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] clustering with finite mixture model
Dear R-help, I am doing clustering via finite mixture model. Please suggest some packages in R to find clusters via finite mixture model with continuous variables. And also I wish to verify the distributional properties of the mixture distributions by fitting the model with lognormal, gamma, exponentials etc,. Thanks in advance, warm regards,Ms.Karunambigai M PhD Scholar Dept. of Biostatistics NIMHANS Bangalore India [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] reshape data, adding rows to dataframe
Try this: transform(fish[rep(seq(nrow(fish)), fish$N),], N = 1) On Wed, Feb 2, 2011 at 8:58 AM, Lucia Rueda lucia.ru...@ba.ieo.es wrote: Hi Ivan, Thanks for your reply. This is what I want to do: Imagine my dataset looks like this: Species N Size Coris julis 1 8 Coris julis 3 10 D.vulgaris 2 12 I have 1 C.julis of 8 cm, 3 C. julis of 10 cm and 2 D.vulgaris of 12 cm. I want 1 row for each animal, something like this: Species N Size Coris julis 1 8 Coris julis 1 10 Coris julis 1 10 Coris julis 1 10 D.vulgaris 1 12 D.vulgaris 1 12 I hope that helps. I tried reshape but couldn't do it and from what I've seen I don't think that aggregate, unique or duplicated would do it... but I'm not good with R so I can be wrong. Cheers! Lucia -- View this message in context: http://r.789695.n4.nabble.com/reshape-data-adding-rows-to-dataframe-tp3253640p3253780.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Henrique Dallazuanna Curitiba-Paraná-Brasil 25° 25' 40 S 49° 16' 22 O [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] reshape data, adding rows to dataframe
Exactly!! Thanks a lot Petr. It worked! Thansk to you as well Ivan! -- View this message in context: http://r.789695.n4.nabble.com/reshape-data-adding-rows-to-dataframe-tp3253640p3253793.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Error of 'memory not mapped' in ff Package with VirtualBox
Dear R Helpers, I would like to report on an error in the ff package here. The ff package is an R package which enables us to store large data on disk systematically and have fast access to the database. I used the package in Linux as a guest OS of VirtualBox, and executed the following commands. library(ff) x-ff(1:5,length=5,update=TRUE,vmode=double,filename=header.ff,overwrite=TRUE) Afterwards, I got the following error message. *** caught segfault *** address 0x, cause 'memory not mapped' Traceback: 1: .Call(get_vector, .ffmode[vmode], attr(x, physical), index, nreturn, PACKAGE = ff) 2: `[.ff`(ret, 1) 3: ret[1] 4: as.vector(ret[1]) 5: identical(as.vector(initdata[1]), as.vector(ret[1])) 6: ff(1:5, length = 5, update = TRUE, vmode = double, filename = header.ff, overwrite = TRUE) Possible actions: 1: abort (with core dump, if enabled) 2: normal R exit 3: exit R without saving workspace 4: exit R saving workspace Selection: Can anyone give an advice to solve the problem? Best Regards, Ryan - Wonsang You Leibniz Institute for Neurobiology -- View this message in context: http://r.789695.n4.nabble.com/Error-of-memory-not-mapped-in-ff-Package-with-VirtualBox-tp3253856p3253856.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] multicore + xeon ?
Is there any reason to expect a problem ? i'm running this script on the cluster down the hall: module load R/2.11.0 R library(multicore) fxx-function(ll) runif(1) mclapply(1:10,fxx) i get: Error in fork() : Unable to fork. less /proc/cpuinfo yields: processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 15 model name : Intel(R) Xeon(R) CPU X5365 @ 3.00GHz stepping: 11 cpu MHz : 2992.502 cache size : 4096 KB physical id : 0 siblings: 4 core id : 0 cpu cores : 4 fpu : yes fpu_exception : yes cpuid level : 10 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx lm constant_tsc pni monito r ds_cpl vmx est tm2 cx16 xtpr dca lahf_lm bogomips: 5990.06 clflush size: 64 cache_alignment : 64 address sizes : 38 bits physical, 48 bits virtual -- View this message in context: http://r.789695.n4.nabble.com/multicore-xeon-tp3253882p3253882.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] help
On Tue, Feb 01, 2011 at 12:51:18PM -0800, Kiogou Lydie wrote: PLEASE HELP I actually want to do the following: a[j] = (1/(j!))*?? (i-1-d), j = 500, ?? means product i = 1 to j ?? Yet, j! will stop at 170 and ?? (i-1-d) at 172; so, a[j] will not exceed 170. I would like to have at least 200 a[j]. ?? WHAT SHOULD I DO? Computing factorial of 171 and larger numbers may be done using extended arithmetic in Rmpfr as already suggested. Depending on the purpose of computing factorial of large numbers, consider also the function lfactorial(), which computes the logarithm in the standard numeric type. lfactorial(1000) [1] 5912.128 Hope this helps. Petr Savicky. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] String to Matrix
Hello, How to convert x into y? x [1] 15, 23, 2, 21, 11, 5 y [,1] [,2] [1,] 15 23 [2,] 232 [3,]221 [4,] 21 11 [5,] 115 Thanks a lot! Romildo [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] String to Matrix
Try this: embed(scan(textConnection(x), sep = ,), 2) On Wed, Feb 2, 2011 at 10:12 AM, Romildo Martins romildo.mart...@gmail.comwrote: Hello, How to convert x into y? x [1] 15, 23, 2, 21, 11, 5 y [,1] [,2] [1,] 15 23 [2,] 232 [3,]221 [4,] 21 11 [5,] 115 Thanks a lot! Romildo [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Henrique Dallazuanna Curitiba-Paraná-Brasil 25° 25' 40 S 49° 16' 22 O [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Strange result from sort: sort(c(aa, ff)) gives ff aa with R.2.12.1 on windows 7
'Strange' to have no response on this. Can a knowledgeable Danish writer please confirm that this is how the OSes are supposed to handle Danish collation? On Mon, 24 Jan 2011, Prof Brian Ripley wrote: On Mon, 24 Jan 2011, Søren Højsgaard wrote: Dear list, Please consider the following call of sort sort(c(a,f)) [1] a f sort(c(f,a)) [1] a f sort(c(aa,ff)) [1] ff aa sort(c(ff,aa)) [1] ff aa The last two results look strange to me. Is that a bug??? It seems that you and your OS disagree about Danish, and I'm in no position to know which is correct. But this is not an R issue: the sorting is done by OS services. The result seems to come from calls to order: order(c(a,f)) [1] 1 2 order(c(f,a)) [1] 2 1 order(c(aa,ff)) [1] 2 1 order(c(ff,aa)) [1] 1 2 I get the same results on R.2.12.1, R.2.11.1 and R.2.13.0 on Windows 7. However on Linux, I get the right answer (the answer I expected). From the help pages I get the impression that there might be an issue about locale, but I didn't understand the details. Can anyone tell me what goes on here, please I recall that 'aa' used to sort at the end of the alphabet in Danish telephone books, so it seems the sort used on Windows thinks so too. See ?Comparison for some further details. What I don't understand is that someone resident in Denmark finds this strange I get exactly the same in a Danish locale on Mac OS X, for example: sort(c(aa,ff)) [1] ff aa and also on my Linux box (Fedora 14 with LC_COLLATE=da_DK.utf8) sort(c(aa,ff)) [1] ff aa en_DK is not a Danish locale (in is English in Denmark). If you want an English sort, try an English locale for LC_COLLATE (there may well be several, hence 'an'). Regards Søren sessionInfo() R version 2.12.1 Patched (2010-12-27 r53883) Platform: i386-pc-mingw32/i386 (32-bit) locale: [1] LC_COLLATE=Danish_Denmark.1252 LC_CTYPE=Danish_Denmark.1252 [3] LC_MONETARY=Danish_Denmark.1252 LC_NUMERIC=C [5] LC_TIME=Danish_Denmark.1252 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] SHDtools_1.0 sessionInfo() R version 2.12.1 (2010-12-16) Platform: i686-pc-linux-gnu (32-bit) locale: [1] LC_CTYPE=en_DK.utf8 LC_NUMERIC=C [3] LC_TIME=en_DK.utf8LC_COLLATE=en_DK.utf8 [5] LC_MONETARY=C LC_MESSAGES=en_DK.utf8 [7] LC_PAPER=en_DK.utf8 LC_NAME=C [9] LC_ADDRESS=C LC_TELEPHONE=C [11] LC_MEASUREMENT=en_DK.utf8 LC_IDENTIFICATION=C attached base packages: [1] stats graphics grDevices utils datasets methods base __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595__ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Error of 'memory not mapped' in ff Package with VirtualBox
This is clearly a problem occuring in the C code of the ff package. Please report to the package maintainer, Cced here (with the 'at a minimum' information requested in the posting guide, most importantly the Linux architecture and the versions involved). FWIW it works for me on sessionInfo() R version 2.12.1 (2010-12-16) Platform: x86_64-unknown-linux-gnu (64-bit) ... other attached packages: [1] ff_2.2-1 bit_1.1-6 If possible, try running this under valgrind (see 'Writing R Extensions') to get more info on the cause: at least run under gdb to get more precise info on what was not mapped. You don't say what the host OS of VirtualBox was. We've found I/O under VirtualBox with a x86_64 Linux host to be slow, but R to run reliably. On Wed, 2 Feb 2011, Wonsang You wrote: Dear R Helpers, I would like to report on an error in the ff package here. The ff package is an R package which enables us to store large data on disk systematically and have fast access to the database. I used the package in Linux as a guest OS of VirtualBox, and executed the following commands. library(ff) x-ff(1:5,length=5,update=TRUE,vmode=double,filename=header.ff, overwrite=TRUE) Afterwards, I got the following error message. *** caught segfault *** address 0x, cause 'memory not mapped' Traceback: 1: .Call(get_vector, .ffmode[vmode], attr(x, physical), index, nreturn, PACKAGE = ff) 2: `[.ff`(ret, 1) 3: ret[1] 4: as.vector(ret[1]) 5: identical(as.vector(initdata[1]), as.vector(ret[1])) 6: ff(1:5, length = 5, update = TRUE, vmode = double, filename = header.ff, overwrite = TRUE) Possible actions: 1: abort (with core dump, if enabled) 2: normal R exit 3: exit R without saving workspace 4: exit R saving workspace Selection: Can anyone give an advice to solve the problem? Best Regards, Ryan - Wonsang You Leibniz Institute for Neurobiology -- View this message in context: http://r.789695.n4.nabble.com/Error-of-memory-not-mapped-in-ff-Package-with-VirtualBox-tp3253856p3253856.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Strange result from sort: sort(c(aa, ff)) gives ff aa with R.2.12.1 on windows 7
Not sure if I qualify as being knowledgeable, but... You write I recall that 'aa' used to sort at the end of the alphabet in Danish telephone books, so it seems the sort used on Windows thinks so too. See ?Comparison for some further details. What I don't understand is that someone resident in Denmark finds this strange Yes, I can confirm that aa resides at the end of the Danish alphabet (as an old way of writing the letter which in modern writing is å). But what should one then do if one wants aa to mean an a followed by another a and not aa (=å) when calling sort?? Regards Søren Højsgaard -Oprindelig meddelelse- Fra: Prof Brian Ripley [mailto:rip...@stats.ox.ac.uk] Sendt: 2. februar 2011 13:21 Til: Søren Højsgaard Cc: r-h...@stat.math.ethz.ch Emne: Re: [R] Strange result from sort: sort(c(aa, ff)) gives ff aa with R.2.12.1 on windows 7 'Strange' to have no response on this. Can a knowledgeable Danish writer please confirm that this is how the OSes are supposed to handle Danish collation? On Mon, 24 Jan 2011, Prof Brian Ripley wrote: On Mon, 24 Jan 2011, Søren Højsgaard wrote: Dear list, Please consider the following call of sort sort(c(a,f)) [1] a f sort(c(f,a)) [1] a f sort(c(aa,ff)) [1] ff aa sort(c(ff,aa)) [1] ff aa The last two results look strange to me. Is that a bug??? It seems that you and your OS disagree about Danish, and I'm in no position to know which is correct. But this is not an R issue: the sorting is done by OS services. The result seems to come from calls to order: order(c(a,f)) [1] 1 2 order(c(f,a)) [1] 2 1 order(c(aa,ff)) [1] 2 1 order(c(ff,aa)) [1] 1 2 I get the same results on R.2.12.1, R.2.11.1 and R.2.13.0 on Windows 7. However on Linux, I get the right answer (the answer I expected). From the help pages I get the impression that there might be an issue about locale, but I didn't understand the details. Can anyone tell me what goes on here, please I recall that 'aa' used to sort at the end of the alphabet in Danish telephone books, so it seems the sort used on Windows thinks so too. See ?Comparison for some further details. What I don't understand is that someone resident in Denmark finds this strange I get exactly the same in a Danish locale on Mac OS X, for example: sort(c(aa,ff)) [1] ff aa and also on my Linux box (Fedora 14 with LC_COLLATE=da_DK.utf8) sort(c(aa,ff)) [1] ff aa en_DK is not a Danish locale (in is English in Denmark). If you want an English sort, try an English locale for LC_COLLATE (there may well be several, hence 'an'). Regards Søren sessionInfo() R version 2.12.1 Patched (2010-12-27 r53883) Platform: i386-pc-mingw32/i386 (32-bit) locale: [1] LC_COLLATE=Danish_Denmark.1252 LC_CTYPE=Danish_Denmark.1252 [3] LC_MONETARY=Danish_Denmark.1252 LC_NUMERIC=C [5] LC_TIME=Danish_Denmark.1252 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] SHDtools_1.0 sessionInfo() R version 2.12.1 (2010-12-16) Platform: i686-pc-linux-gnu (32-bit) locale: [1] LC_CTYPE=en_DK.utf8 LC_NUMERIC=C [3] LC_TIME=en_DK.utf8LC_COLLATE=en_DK.utf8 [5] LC_MONETARY=C LC_MESSAGES=en_DK.utf8 [7] LC_PAPER=en_DK.utf8 LC_NAME=C [9] LC_ADDRESS=C LC_TELEPHONE=C [11] LC_MEASUREMENT=en_DK.utf8 LC_IDENTIFICATION=C attached base packages: [1] stats graphics grDevices utils datasets methods base __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Problem with getFX function
Hi everyone, Following this post: http://r.789695.n4.nabble.com/currency-conversion-function-tt906056.html#a906061 I was trying to run the code: foo - function(from, to, date){ url - http://www.oanda.com/convert/classic?script=..%2Fconvert%2Fclassiclanguage=envalue=1 params - sprintf(%sdate=%sexch=%sexch2=margin_fixed=0expr=%sexpr2=SUBMIT=Convert+Nowlang=endate_fmt=us, url, format(as.Date(date), %m/%d/%y), from, to) Lines - readLines(params) value - gsub(.*([0-9]\\.+[0-9]+).*, \\1, grep(nl, grep(from, grep(to, Lines, value = TRUE), value = TRUE), value = TRUE)) as.numeric(value) } foo('BRL', 'USD', '2010-10-14') But it gives me the following: numeric(0) Warning message: In readLines(params) : incomplete final line found on ' http://www.oanda.com/convert/classic?script=..%2Fconvert%2Fclassiclanguage=envalue=1date=10/14/10exch=BRLexch2=margin_fixed=0expr=USDexpr2=SUBMIT=Convert+Nowlang=endate_fmt=us ' Does anyone what the problem is about? If I type simply: getFX(EUR/USD,from=2011-01-01) Then I get similar error: [1] EURUSD Warning message: In readLines(tmp) : incomplete final line found on 'xxx\AppData\Local\Temp\RtmpI1HUzb\file27f7532' (I've hide the path) Thanks [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] error in scan(...
It is clear that you have letters in the data you are trying to process; that is the cause of the erro. What was the 'scan' statement you were using? Were you expecting all numerics in the string? The error message is consistent with your input. On Wed, Feb 2, 2011 at 2:07 AM, gked grigoriy.lyuks...@gmail.com wrote: I know it's a common error and there is a lot of help available but still can't resolve the issue: all i am trying to do is to read a csv file from my folder and this is what i get: Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, : scan() expected 'a real', got '1,m,a,F,165,240,26.5,31,0.738,0.704,1.095,0.606,0.847' Can you help? -- View this message in context: http://r.789695.n4.nabble.com/error-in-scan-tp3253487p3253487.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Jim Holtman Data Munger Guru What is the problem that you are trying to solve? __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Strange result from sort: sort(c(aa, ff)) gives ff aa with R.2.12.1 on windows 7
On Wed, 2 Feb 2011, Søren Højsgaard wrote: Not sure if I qualify as being knowledgeable, but... You write I recall that 'aa' used to sort at the end of the alphabet in Danish telephone books, so it seems the sort used on Windows thinks so too. See ?Comparison for some further details. What I don't understand is that someone resident in Denmark finds this strange Yes, I can confirm that aa resides at the end of the Danish alphabet (as an old way of writing the letter which in modern writing is å). But what should one then do if one wants aa to mean an a followed by another a and not aa (=å) when calling sort?? Set Sys.setlocale(LC_COLLATE, ) appropriately (sorry, that's very OS-specific but 'C' and 'en' or 'English' probably work on Windows). On platforms using ICU (most, but not Windows), see also ?icuSetCollate for further ways to tweak collation: that has aarhus in its examples. Regards Søren Højsgaard -Oprindelig meddelelse- Fra: Prof Brian Ripley [mailto:rip...@stats.ox.ac.uk] Sendt: 2. februar 2011 13:21 Til: Søren Højsgaard Cc: r-h...@stat.math.ethz.ch Emne: Re: [R] Strange result from sort: sort(c(aa, ff)) gives ff aa with R.2.12.1 on windows 7 'Strange' to have no response on this. Can a knowledgeable Danish writer please confirm that this is how the OSes are supposed to handle Danish collation? On Mon, 24 Jan 2011, Prof Brian Ripley wrote: On Mon, 24 Jan 2011, Søren Højsgaard wrote: Dear list, Please consider the following call of sort sort(c(a,f)) [1] a f sort(c(f,a)) [1] a f sort(c(aa,ff)) [1] ff aa sort(c(ff,aa)) [1] ff aa The last two results look strange to me. Is that a bug??? It seems that you and your OS disagree about Danish, and I'm in no position to know which is correct. But this is not an R issue: the sorting is done by OS services. The result seems to come from calls to order: order(c(a,f)) [1] 1 2 order(c(f,a)) [1] 2 1 order(c(aa,ff)) [1] 2 1 order(c(ff,aa)) [1] 1 2 I get the same results on R.2.12.1, R.2.11.1 and R.2.13.0 on Windows 7. However on Linux, I get the right answer (the answer I expected). From the help pages I get the impression that there might be an issue about locale, but I didn't understand the details. Can anyone tell me what goes on here, please I recall that 'aa' used to sort at the end of the alphabet in Danish telephone books, so it seems the sort used on Windows thinks so too. See ?Comparison for some further details. What I don't understand is that someone resident in Denmark finds this strange I get exactly the same in a Danish locale on Mac OS X, for example: sort(c(aa,ff)) [1] ff aa and also on my Linux box (Fedora 14 with LC_COLLATE=da_DK.utf8) sort(c(aa,ff)) [1] ff aa en_DK is not a Danish locale (in is English in Denmark). If you want an English sort, try an English locale for LC_COLLATE (there may well be several, hence 'an'). Regards Søren sessionInfo() R version 2.12.1 Patched (2010-12-27 r53883) Platform: i386-pc-mingw32/i386 (32-bit) locale: [1] LC_COLLATE=Danish_Denmark.1252 LC_CTYPE=Danish_Denmark.1252 [3] LC_MONETARY=Danish_Denmark.1252 LC_NUMERIC=C [5] LC_TIME=Danish_Denmark.1252 attached base packages: [1] stats graphics grDevices utils datasets methods base other attached packages: [1] SHDtools_1.0 sessionInfo() R version 2.12.1 (2010-12-16) Platform: i686-pc-linux-gnu (32-bit) locale: [1] LC_CTYPE=en_DK.utf8 LC_NUMERIC=C [3] LC_TIME=en_DK.utf8LC_COLLATE=en_DK.utf8 [5] LC_MONETARY=C LC_MESSAGES=en_DK.utf8 [7] LC_PAPER=en_DK.utf8 LC_NAME=C [9] LC_ADDRESS=C LC_TELEPHONE=C [11] LC_MEASUREMENT=en_DK.utf8 LC_IDENTIFICATION=C attached base packages: [1] stats graphics grDevices utils datasets methods base __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44
Re: [R] error in density plot
Ramya ramya.victory at gmail.com writes: I have the code for the density plot j - 8 plot(density(diff_in_sample[,1]), main = list.files()[j]) for(i in 1:25){ lines(density(diff_in_sample[,i])) } This gives me an error Error in density.default(diff_in_sample[, i]) : 'x' contains missing values [snip] ?NA lines(na.omit(density(diff_in_sample[,i]))) Consider replacing the loop by apply() [no efficiency gain, but more compact code] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Function to locate points in 3d octants or points on two axes
Thanks Petr, the sign function will be of help. I was not aware of it. Bryan On Feb 2, 2011, at 3:21 AM, Petr Savicky wrote: On Tue, Feb 01, 2011 at 08:30:22PM -0500, Bryan Hanson wrote: [Sorry, resending with a proper subject line!] Hi Guru's... I have a set of points that may lie along any of the x, y and z axes in a Cartesian coordinate system. I am hoping that a function exists which will determine if any two selected points are on different axes, i.e, if the one of the points is on x and the other on y or z, not elsewhere on the x axis. Put another way, I need to determine if the triangle formed by the two points and the origin lies in the xy, xz or yz planes. This might be as simple as testing if any particular value is zero, i.e. if the x coordinate is zero, then the points must be on the z and y axes and the triangle in the yz plane. But, I'm looking for a fairly general solution, one that also returns the appropriate plane as the answer. Very closely related to this, I could use a function that determines which of the 8 octants a point lies in. Seems like the cross product might be part of this, but I'm a little rusty on how to apply it. I hope this is clear enough, and someone has a suggestion to point me in the right direction. Before writing my own klunky version, I thought I'd ask. Hi. I think that for suggesting an appropriate solution it may be needed to know, which data structure is used for the input pairs of points. For example, it may a single matrix n times 3 with points as rows and a pair is represented by two indices of the points. Alternatively, the input may be a single matrix n times 6, where rows are pairs of points. In any case, the input may be simplified using sign() function. For example a - as.matrix(expand.grid(x=c(-1.1, 0, 1.1), y=c(0, 1.2), z=c(0, 1.3))) a x y z [1,] -1.1 0.0 0.0 [2,] 0.0 0.0 0.0 [3,] 1.1 0.0 0.0 [4,] -1.1 1.2 0.0 [5,] 0.0 1.2 0.0 [6,] 1.1 1.2 0.0 [7,] -1.1 0.0 1.3 [8,] 0.0 0.0 1.3 [9,] 1.1 0.0 1.3 [10,] -1.1 1.2 1.3 [11,] 0.0 1.2 1.3 [12,] 1.1 1.2 1.3 sign(a) x y z [1,] -1 0 0 [2,] 0 0 0 [3,] 1 0 0 [4,] -1 1 0 [5,] 0 1 0 [6,] 1 1 0 [7,] -1 0 1 [8,] 0 0 1 [9,] 1 0 1 [10,] -1 1 1 [11,] 0 1 1 [12,] 1 1 1 This output represents a classification of the points into a finite number of regions and keeps the information needed for any of the tasks, which you mention. Hope this helps. Petr Savicky. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Error of 'memory not mapped' in ff Package with VirtualBox
Dear Prof. Brian Ripley, Thank you for your kind advice. I have to report to you that I attempted to save the ff file at a shared directory between guest OS (Linux) and host OS (Windows 7). I tried to change the working directory as an unshared folder, and then I tried the same commands. In this case, I had no problem as you did. In conclusion, the ff package does not support to assign memory for shared folders between guest OS and host OS. For your reference, my experiments were done in the following environment. R version 2.12.1 (2010-12-16) Platform: i686-pc-linux-gnu (32-bit) I appreciate you for your help. Best Regards, Ryan 2011/2/2 Prof Brian Ripley rip...@stats.ox.ac.uk This is clearly a problem occuring in the C code of the ff package. Please report to the package maintainer, Cced here (with the 'at a minimum' information requested in the posting guide, most importantly the Linux architecture and the versions involved). FWIW it works for me on sessionInfo() R version 2.12.1 (2010-12-16) Platform: x86_64-unknown-linux-gnu (64-bit) ... other attached packages: [1] ff_2.2-1 bit_1.1-6 If possible, try running this under valgrind (see 'Writing R Extensions') to get more info on the cause: at least run under gdb to get more precise info on what was not mapped. You don't say what the host OS of VirtualBox was. We've found I/O under VirtualBox with a x86_64 Linux host to be slow, but R to run reliably. On Wed, 2 Feb 2011, Wonsang You wrote: Dear R Helpers, I would like to report on an error in the ff package here. The ff package is an R package which enables us to store large data on disk systematically and have fast access to the database. I used the package in Linux as a guest OS of VirtualBox, and executed the following commands. library(ff) x-ff(1:5,length=5,update=TRUE,vmode=double,filename=header.ff, overwrite=TRUE) Afterwards, I got the following error message. *** caught segfault *** address 0x, cause 'memory not mapped' Traceback: 1: .Call(get_vector, .ffmode[vmode], attr(x, physical), index, nreturn, PACKAGE = ff) 2: `[.ff`(ret, 1) 3: ret[1] 4: as.vector(ret[1]) 5: identical(as.vector(initdata[1]), as.vector(ret[1])) 6: ff(1:5, length = 5, update = TRUE, vmode = double, filename = header.ff, overwrite = TRUE) Possible actions: 1: abort (with core dump, if enabled) 2: normal R exit 3: exit R without saving workspace 4: exit R saving workspace Selection: Can anyone give an advice to solve the problem? Best Regards, Ryan - Wonsang You Leibniz Institute for Neurobiology -- View this message in context: http://r.789695.n4.nabble.com/Error-of-memory-not-mapped-in-ff-Package-with-VirtualBox-tp3253856p3253856.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Brian D. Ripley, rip...@stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] subset in a BIG matrix
I have a matrix with a lot of values inside.. when I execute the folowing command matrix2=subset(martix, condition.) it works... but after the previous command I execute another subset matrix3=subset(martix2, condition2.) and appears the following error: (subscript) logical subscript too long How can I solve this??? I think the probles is the size of the matrix... Thanks.. -- View this message in context: http://r.789695.n4.nabble.com/subset-in-a-BIG-matrix-tp3254115p3254115.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Lattice nb/wlist help
Hi all, I have a grid of points at regular intervals (taken from a raster). I want to create a listw matrix which opperates like this 1 1 1 1 1 1 1 1 1 and also like this 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 and so on I want it to work with localG from package spdep: localG(x, listw, zero.policy=NULL, spChk=NULL) Any help is much appreciated. Thanks Gary -- View this message in context: http://r.789695.n4.nabble.com/Lattice-nb-wlist-help-tp3253962p3253962.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] reshape data, adding rows to dataframe
Thanks Henrique! It solves the problem of having the column N with unreal number of individuals since N=1 with Petr's example: expand - transform(dat[rep(1:nrow(dat), times=dat$N), ] ,N=1) rownames(expand) - NULL expand expand animal N 1 a 1 2 a 1 3 b 1 4 c 1 5 c 1 6 c 1 Cheers! -- View this message in context: http://r.789695.n4.nabble.com/reshape-data-adding-rows-to-dataframe-tp3253640p3254038.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Significant codes in mtable
Hi all, Does anyone know a way to change the significant stars in mtable (package memisc)? The default is Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1, however I need it to be Signif. codes: 0 '***' 0.01 '**' 0.05 '*' 0.1 ' ' 1 Kind regards, Erich [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Finding the dominant factor in an unbalanced group
Hi all. This is perhaps more a statistics question but I'm hoping someone can help me. I have a group of patients for whom I'm looking at beat to beat RR interval changes. I have plotted the difference between one beat length and the next against the difference between the previous beat length and the current one. This gives me a plot with four quadrants: the bottom left corresponding to successively shorter beats, the top right to succesively longer, the top left to a shorter followed by a longer beat and the bottom right to a longer by a shorter. In theory if successive changes in beat length are random there should be an approximately equal number of counts in each quadrant of my plot. I have a dataframe which for each of my patients lists the number of counts in each quadrant: (dput data at the end of this mail) I can determine whether the distribution is balanced or not with a Chi Squared (chisq.test) However what I would like to do is determine whether there is a dominant quadrant (eg. CBP06118 in the example data), or a dominant pair of quadrants (eg CBP06036 in the example data) and if so which they are. If my dataset were only 10 patients it probably wouldn't be a problem (although I'm not certain what statistical check I could do beyond re-applying chisquared tests with only the relevant quadrants which sounds dodgy to me) the problem occurs because my data set is a couple of orders of magnitude bigger, Can anyone help? dput data is: structure(list(basestudy = structure(1:10, .Label = c(CBP06036, CBP06095, CBP06098, CBP06100, CBP06112, CBP06118, CBP06127, CBP06158, CBP06163, CBP06166), class = factor), tl = c(302L, 211L, 347L, 223L, 178L, 230L, 243L, 278L, 391L, 252L), tr = c(99L, 134L, 171L, 210L, 158L, 252L, 89L, 247L, 258L, 168L), br = c(305L, 212L, 346L, 223L, 178L, 231L, 244L, 277L, 388L, 254L), bl = c(142L, 288L, 284L, 191L, 144L, 360L, 147L, 184L, 164L, 186L)), .Names = c(basestudy, tl, tr, br, bl), row.names = c(NA, 10L), class = data.frame) Many thanks -- Sandy Small Clinical Physicist NHS Greater Glasgow and Clyde and NHS Forth Valley This message may contain confidential information. If yo...{{dropped:21}} __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Error w tokenizer .jcall unless run as root
Hi all, I've got an R script that loads the following libraries library(rJava) library(RWeka) library(tm) At some when I try to run the tm tokenizer as MyTokenizer - function(x) NGramTokenizer(x, Weka_control(min = 1, max = 4)) ECONOMIST_TDM - TermDocumentMatrix(ECONOMIST_CORPUS,control=list(tokenize=MyTokenizer)) I get the following error unless I'm running R as root... Error in .jcall(RWekaInterfaces, [S, tokenize, .jcast(tokenizer, : RcallMethod: cannot determine object class ECONOMIST_TDM - TermDocumentMatrix(ECONOMIST_CORPUS,control=list(tokenize=MyTokenizer)) Error in .jcall(RWekaInterfaces, [S, tokenize, .jcast(tokenizer, : RcallMethod: cannot determine object class I first noticed this problem in the summer, and just left it thinking I would stumble across a solution. I do realize this means I need to set permissions somewhere properly -- but I haven't stumbled across the affected file and even if I don't know what the proper permisions/owner/group should be. Does anyone know what I need to do? I'm running R on Lucid Lynx. Regards, Nick [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Acf of Frima
Hello, I am trying to calculate the autocovariance matrix for any general farima(p,d,q) with p,q 1. Could anyone give an idea how to implement in R or if there is any package for this? thank you beforehand. Jose. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Significant codes in mtable
Take a look on symnum function: symnum(runif(30), cutpoints = c(0, 0.01, 0.05, 0.1, 1), symbols = c(***, **, *, )) On Wed, Feb 2, 2011 at 11:25 AM, Striessnig, Erich erich.striess...@wu.ac.at wrote: Hi all, Does anyone know a way to change the significant stars in mtable (package memisc)? The default is Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1, however I need it to be Signif. codes: 0 '***' 0.01 '**' 0.05 '*' 0.1 ' ' 1 Kind regards, Erich [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Henrique Dallazuanna Curitiba-Paraná-Brasil 25° 25' 40 S 49° 16' 22 O [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Finding the maximum in a particular group in a dataframe
On Feb 2, 2011, at 3:48 AM, Asan Ramzan wrote: Hello I am trying to find a way to find the max value, for only a subset of a dataframe, depending on how the data is grouped for example, How would I find the maxmium responce, for all the GPR119a condition below: I've tried tapply tapply(GPR119data$responce, GPR119data$GPR119a, max) Error in tapply(GPR119data$responce, GPR119data$GPR119a, max) : arguments must have same length You need to use tapply on the names of the columns , not on the values: tapply(GPR119data$responce, GPR119data$condition, max) tapply(GPR119data$responce, GPR119data$condition, max) con GPR119a GPR119b GPR119c GPR119d GPR119e GPR119f 0.3350084 0.6451204 0.8240356 0.5729588 0.3099644 0.4677268 1.0184191 Then you can choose the GPR119a value for further examination or display; tapply(GPR119data$responce, GPR119data$condition, max)[GPR119a] GPR119a 0.6451204 responce,mouce,condition I'm guessing from this display that when you read this data in you used the default separator which is white-space and that does not include commas. In addition to the above, try adding sep=, in your read.table function. 0.105902,KO,con 0.232018561,KO,con 0.335008375,KO,con 0.387025433,KO,GPR119a 0.576769897,KO,GPR119a 0.645120419,KO,GPR119a 0.2538608,KO,GPR119b 0.183061952,KO,GPR119b 0.824035587,KO,GPR119b 0.399201597,KO,GPR119c 0.417006618,KO,GPR119c 0.572958834,KO,GPR119c 0.229467444,KO,GPR119d 0.294089745,KO,GPR119d 0.309964445,KO,GPR119d 0.30474325,KO,GPR119e 0.159374839,KO,GPR119e 0.467726848,KO,GPR119e 1.01841912,KO,GPR119f 0.423028621,KO,GPR119f 0.223588597,KO,GPR119f Thank [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] help
hello I have recently started using the LOCFIT package, together with Clive Loader's book. I need to implement some method for automatic (plug-in) bandwidth selection in a multivariate kernel regression. From the book, and the LOCFIT documentation, it is not clear whether this is possible. As far as I can see, the only time the various automated procedures, e.g., SJPI, are used is in the kdeb function, which is for preparing a summary comparison of various selection methods, and only for a *univariate kernel density. *Does anyone here know of a way to implement automatic (plug-in) bandwidth selection for kernel regression, maybe with some R function that calls LOCFIT? Or can you suggest a good alternative package for this, either in R/Matlab or a stand-alone? Thanks a lot, and regards __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] clustering with finite mixture model
There are quite a few packages that work with finite mixtures, as evidenced by the descriptions here: http://cran.r-project.org/web/packages/index.html These might be useful: http://cran.r-project.org/web/packages/flexmix/index.html http://cran.r-project.org/web/packages/mclust/index.html -Matt On 02/02/2011 04:28 AM, karuna m wrote: Dear R-help, I am doing clustering via finite mixture model. Please suggest some packages in R to find clusters via finite mixture model with continuous variables. And also I wish to verify the distributional properties of the mixture distributions by fitting the model with lognormal, gamma, exponentials etc,. Thanks in advance, warm regards,Ms.Karunambigai M PhD Scholar Dept. of Biostatistics NIMHANS Bangalore India [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Matthew S Shotwell Assistant Professor School of Medicine Department of Biostatistics Vanderbilt University __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Help in getting info from a DataFrame
On Jan 31, 2011, at 3:44 AM, ADias wrote: David Winsemius wrote: On Jan 30, 2011, at 5:27 PM, ADias wrote: dados- data .frame (Store = c (Setubal ,lx ,Aveiro ,Coimbra ,Aveiro,Evora,Aveiro,Coimbra,Setubal),Prod=c(Bloco Desenho,Bloco Desenho ,Tinteiro ,Régua ,Lapiseira ,Regua ,Tinteiro,Lapiseira,Lapiseira),qtd=c(2,1,2,1,3,1,1,1,1)) See if this is a more useful approach: lapsales - dados[dados$Prod==Lapiseira, ] lapsales[which.max(lapsales[ , qtd ]), Store] [1] Aveiro Levels: Aveiro Coimbra Evora lx Setubal -- David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Hi, thank you for the help. Two questions I would like to make. Suppose I have put your answer on an object like this: r-lapsales[which.max(lapsales[ , qtd ]), Store] so r [1] Aveiro Levels: Aveiro Coimbra Evora lx Setubal and suppoese I wish to add this year result to previous years results: results-c(Lx,Setubal) so I did: results-c(results,r) and what I get is: results [1] Lx Setubal 1 Right. You appended a factor variable to a character variable and got the numeric representation for the Aveiro values which is 1 and it then got coerced into 1 because the firts element offered to the c() function was character. why 1 and not Aveiro? Second question is: lapsales Store Prod qtd 5 Aveiro Lapiseira 3 8 Coimbra Lapiseira 1 9 Setubal Lapiseira 1 lapsales[max(lapsales$qtd),Store] [1] Setubal Levels: Aveiro Coimbra Evora lx Setubal The max of lapsales$qtd is 3 to it is looking up the third store == Setubal. -- David. why Setubal and not Aveiro? many thanks AD -- View this message in context: http://r.789695.n4.nabble.com/Help-in-getting-info-from-a-DataFrame-tp3247740p3248276.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Applying multiple functions to one object
Dear list members, I recall seeing a convenience function for applying multiple functions to one object (i.e., almost the opposite of 'mapply’) somewhere. Example: If the function was named ’fun’ the output of fun(3.14, mode, typeof, class) would be identical to the output of c(mode(3.14), typeof(3.14), class(3.14)) Is my memory failing me, or does such a function already exists in a package? Of course, it’s not difficult to define a summary function and apply this to the object, but writing, for example, fun(x, mean, median, sd, mad) to quickly show the relevant information is much more *convient*. It would be even nicer with a function that could also handle vectors and lists of values, and output the result as data frames or matrices. Example: x = c(foo, bar, foobar) fun(x, nchar, function(st) substr(st, 1 ,2) ) y = list(3, 3L, 3.14, factor(3)) fun(x, mode, typeof, class) -- Karl Ove Hufthammer __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Applying multiple functions to one object
Hi Karl, same to me. Much of the times when coding I think, 'damn it, I have seen that before, but where...' ... and so the following is from scratch, not from memory. fun-function(x,...){ mthd-list(...) lapply(mthd,function(m) do.call(m,list(x))) } fun(3.14, mode, typeof, class) there is no error-catching for non-existing functions, no naming of results and so on, but it may be a start. hth. Am 02.02.2011 14:59, schrieb Karl Ove Hufthammer: Dear list members, I recall seeing a convenience function for applying multiple functions to one object (i.e., almost the opposite of 'mapply’) somewhere. Example: If the function was named ’fun’ the output of fun(3.14, mode, typeof, class) would be identical to the output of c(mode(3.14), typeof(3.14), class(3.14)) Is my memory failing me, or does such a function already exists in a package? Of course, it’s not difficult to define a summary function and apply this to the object, but writing, for example, fun(x, mean, median, sd, mad) to quickly show the relevant information is much more *convient*. It would be even nicer with a function that could also handle vectors and lists of values, and output the result as data frames or matrices. Example: x = c(foo, bar, foobar) fun(x, nchar, function(st) substr(st, 1 ,2) ) y = list(3, 3L, 3.14, factor(3)) fun(x, mode, typeof, class) -- Eik Vettorazzi Institut für Medizinische Biometrie und Epidemiologie Universitätsklinikum Hamburg-Eppendorf Martinistr. 52 20246 Hamburg T ++49/40/7410-58243 F ++49/40/7410-57790 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] General Solution to Drawing a Spline Curve in 3d?
Hello Again... I know that R has plenty of functions for drawing splines between two points in 2d, and plenty of ways to fit a spline curve among various points in 2d or 3d (I think), but, is there a function which can be used to draw a simple spline curve between any two points in 3d space? Since such a spline curve would be fundamentally a 2d object, I imagine that if a 3d version exists it would have to have some sort of argument specifying where that 2d plane should be drawn (maybe in cylindrical coordinates?). What I'm looking for would be the 3d analog of grid.curve or xspline. What I'm trying to do is related to the idea of great circles, as implemented in package geosphere, but the curves I'd like to draw are not on the surface of a sphere nor are they parts of a circle. Thanks for any suggestions! Bryan *** Bryan Hanson Professor of Chemistry Biochemistry __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] error in density plot
On Feb 2, 2011, at 8:22 AM, Ben Bolker wrote: Ramya ramya.victory at gmail.com writes: I have the code for the density plot j - 8 plot(density(diff_in_sample[,1]), main = list.files()[j]) for(i in 1:25){ lines(density(diff_in_sample[,i])) } This gives me an error Error in density.default(diff_in_sample[, i]) : 'x' contains missing values [snip] ?NA lines(na.omit(density(diff_in_sample[,i]))) I don't know if that will work, but since Bolker wrtote it, it's got a reasonable probability of being correct code. I generally use the na.rm argument inside the density function rather than na.omit wrapped around it. lines(density(diff_in_sample[,i], na.rm=TRUE)) Although if there are a lot of NA's then the registration of the lines results are going to be way off from the original plot axis. -- David. Consider replacing the loop by apply() [no efficiency gain, but more compact code] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] update not working
R-help, I'm using the update command for a multiple regression model and it is just not working: update(model1, . ~ . temp:wind:rad,data=ozone.pollution) Error: unexpected input in model2-update(model1, . ~ . summary(model1) Call: lm(formula = ozone ~ temp * wind * rad + I(rad^2) + I(temp^2) + I(wind^2), data = ozone.pollution) Residuals: Min 1Q Median 3Q Max -38.894 -11.205 -2.736 8.809 70.551 Coefficients: Estimate Std. Error t value Pr(|t|) (Intercept)5.683e+02 2.073e+02 2.741 0.00725 ** temp -1.076e+01 4.303e+00 -2.501 0.01401 * wind -3.237e+01 1.173e+01 -2.760 0.00687 ** rad -3.117e-01 5.585e-01 -0.558 0.57799 I(rad^2) -3.619e-04 2.573e-04 -1.407 0.16265 I(temp^2) 5.833e-02 2.396e-02 2.435 0.01668 * I(wind^2) 6.106e-01 1.469e-01 4.157 6.81e-05 *** temp:wind 2.377e-01 1.367e-01 1.739 0.08519 . temp:rad 8.403e-03 7.512e-03 1.119 0.26602 wind:rad 2.054e-02 4.892e-02 0.420 0.67552 temp:wind:rad -4.324e-04 6.595e-04 -0.656 0.51358 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1 Residual standard error: 17.82 on 100 degrees of freedom Multiple R-squared: 0.7394, Adjusted R-squared: 0.7133 F-statistic: 28.37 on 10 and 100 DF, p-value: 2.2e-16 Can anyone help with this? Thanks in advance [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Regression type 2, x measured with error
Excellent!! thanks for all! misil 2011/2/1 mbedward [via R] ml-node+3253006-114577237-210...@n4.nabble.comml-node%2b3253006-114577237-210...@n4.nabble.com Another search term is geometric mean regression. For simple models you can try the lmodel2 package. Michael On 2 February 2011 04:31, David Winsemius [hidden email]http://user/SendEmail.jtp?type=nodenode=3253006i=0 wrote: On Feb 1, 2011, at 10:41 AM, misil wrote: I wanna to do a Regression type 2 or Regression with X measured with erroranybody knows how can i make it in R?? thanks! There are several prior posts in the archives ... although perhaps not the Nabble archives since they start throwing away stuff after about a year. Search on orthogonal regression, total least sqaures regression or Deming regression. David Winsemius, MD West Hartford, CT __ [hidden email] http://user/SendEmail.jtp?type=nodenode=3253006i=1mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ [hidden email] http://user/SendEmail.jtp?type=nodenode=3253006i=2mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- If you reply to this email, your message will be added to the discussion below: http://r.789695.n4.nabble.com/Regression-type-2-x-measured-with-error-tp3251875p3253006.html To unsubscribe from Regression type 2, x measured with error, click herehttp://r.789695.n4.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_codenode=3251875code=anVhcGluQGdtYWlsLmNvbXwzMjUxODc1fDE5NjM3ODYxNTY=. -- View this message in context: http://r.789695.n4.nabble.com/Regression-type-2-x-measured-with-error-tp3251875p3254308.html Sent from the R help mailing list archive at Nabble.com. [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] matrix and a function - apply function
Hi I have this function and this matrix: function(x,y) x+y/x m-matrix(c(1,2,4,2,10,8),3,2) m [,1] [,2] [1,]12 [2,]2 10 [3,]48 each row represent a point (x,y) in a chart and I want via my fucntion to calculate the image in order to get this results: for point (1,2) I would get 1+2/1 = 3 for point (2,10) I would get 2+10/2 = 7 for point (4,8) I would get 4+8/4 = 6 I have tried using sapply here but I get this: sapply(m,function(x,y) x+y/x) Error in y/x : 'y' is missing what I am doing wrong? thanks ADias -- View this message in context: http://r.789695.n4.nabble.com/matrix-and-a-function-apply-function-tp3254271p3254271.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Need help subsetting time series data
Hi all, I have multiple datasets of time series data taken from GPS collars. The collars are supposed to take a fix every hour on the half hour, i.e., 0:30, 1:30, 2:30...23:30, (because it sometimes takes longer for the collars to acquire a location the minute of these locations vary from 30-34) but because of a software glitch in the collars, at random times the collars start taking multiple fixes between programmed fixes, i.e., 22:31, 23:31, 0:31, 1:26, 1:29, 1:30, 1:31, 1:32, 1:33, 1:35, 1:35, 1:35, 1:35, 1:36, 1:36, 1:36, 2:30. These glitches occur approximately once a day throughout the 24 hour and 60 minute cycle. I want to remove all these extra locations from my dataset, but am new to R and haven't figured out a way to do so. I've tried some inelegant solutions involving verbose code, but haven't been able to come up with something that works correctly. Some things I've tried: #Subsetting out by minute value: MR1001=read.csv(etc) datetime-paste(MR1001$date,MR1001$time) datetime-as.POSIXlt(strptime(as.character(datetime), tz=UTC, %m/%d%Y %H:%M:%S)) MR1001$min-datetime$min t1-subset(MR1001,min==30|min==31|min==32|min==33|min==34) This works for most of the data, but when the unwanted fixes occur during the 30-34 minute mark of an hour (see example above) they are kept, which I don't want. To deal with this I tried to incorporate the time between fixes in an attempt to write an ifelse statement and subset the data that way: MR1001=read.csv(etc) MR1001=read.csv(etc) datetime-paste(MR1001$date,MR1001$time) datetime-as.POSIXlt(strptime(as.character(datetime), tz=UTC, %m/%d%Y %H:%M:%S)) MR1001$min-datetime$min t1-subset(MR1001,min==30|min==31|min==32|min==33|min==34) datetime-paste(MR1001$date,MR1001$time) datetime-as.POSIXct(strptime(as.character(datetime), tz=UTC, %m/%d%Y %H:%M:%S)) datetime2-datetime[-1] datetime2[length(datetime)]-datetime2[length(datetime)-1]+3600 datetime3-datetime2-datetime datetime4-datetime3/60 datetime5-as.numeric(datetime4) t1$diff-datetime5 This didn't work either though, because when an unwanted fix occurred after a wanted fix the value in the diff column was small, and I couldn't figure out how to subset the data in that format (I want to keep the 1st, 2nd, 3rd, and last fix in the example columns below (23:31, 00:31, 01:30, 2:30)): timediff 23:31 60 00:31 58.78 01:30 1.07 01:31 1.07 01:32 1.07 01:33 1.07 01:34 1.08 2:30 60 I hope this explanation is clear and that someone with more experience than me can help with a solution. Thanks very much in advance for your time and help! Nathaniel -- View this message in context: http://r.789695.n4.nabble.com/Need-help-subsetting-time-series-data-tp3254236p3254236.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Help me apply mapply
Hello all I would like to ask your help use mapply. I have a function called findCell that takes two arguments(x,sr) where x is a vector of size two (e.g x-c(2,3) and sr is a matrix. I would like to call many times the findCell function (thus I need mapply) for different x inputs but always for the same sr. as x is a vector of size two (two cells) I want to pass inside inside the following 2*10 matrix. -The first input is the two elements of the first row cbin(test[1,1],test[1,2]) -the second input is cbind(test[2,1],test[2,2] -the third input is cbind(test[3,1],test[3,2] and so on This is the str: str(cbind(test[,1],test[,2])) num [1:30, 1:2] -1 -0.667 -0.333 0 0.333 ... so I tried the following: mapply(findCell,x=cbind(test[,1],test[,2]),sr=sr) Error in if (!is.finite(length.out) || length.out 0L) stop(length must be non-negative number) else if (length.out == : missing value where TRUE/FALSE needed Calls: mapply - .Call - Anonymous - seq - seq.default I have spend time reading the ?mapply but I am not sure what is the problem here. Could you please help me understand what is missing? Best Regards Alex __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] SVM Prediction and Plot
Hi I'm trying to predict using a model I fitted with SVM. I constructed the model (called Svm) using a training set, and now I want to use a test set (called BankTest) for prediction. The response variable is in the first column of BankTest. SvmPred = predict(Svm, BankTest[,-1], probability=TRUE) SvmPredRes = table(Pred = SvmPred, True = BankTest[,1]) I get this error: Error in table(Pred = SvmPred, True = BankTest) : all arguments must have the same length I checked the length of both BankTest[,1] and SvmPredict. length(SvmPred) [1] 2282 length(BankTest[,1]) [1] 2598 How are these two columns not the same size? Surely the prediction should try to predict all the values of BankTest[,1]? Any help would be much appreciated. -- View this message in context: http://r.789695.n4.nabble.com/SVM-Prediction-and-Plot-tp3254421p3254421.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] matrix and a function - apply function
On Feb 2, 2011, at 9:12 AM, ADias wrote: Hi I have this function and this matrix: function(x,y) x+y/x m-matrix(c(1,2,4,2,10,8),3,2) m [,1] [,2] [1,]12 [2,]2 10 [3,]48 each row represent a point (x,y) in a chart and I want via my fucntion to calculate the image in order to get this results: for point (1,2) I would get 1+2/1 = 3 for point (2,10) I would get 2+10/2 = 7 for point (4,8) I would get 4+8/4 = 6 I have tried using sapply here but I get this: sapply(m,function(x,y) x+y/x) Error in y/x : 'y' is missing I'm not sure what sapply does with a matrix argument. I've only used t with vectors and lists. I suspect that it would straighten out the argument to a length = 6 vector. (And then, of course, the y wouldn't be there.) what I am doing wrong? Two things: instead use apply() and realize that the argument is passed as a vector apply(m, 1, function(x) x[1] +x[2]/x[1] ) -- David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] SVM Prediction and Plot
Hi, On Wed, Feb 2, 2011 at 9:56 AM, Brian ctto...@hotmail.com wrote: Hi I'm trying to predict using a model I fitted with SVM. I constructed the model (called Svm) using a training set, and now I want to use a test set (called BankTest) for prediction. The response variable is in the first column of BankTest. SvmPred = predict(Svm, BankTest[,-1], probability=TRUE) SvmPredRes = table(Pred = SvmPred, True = BankTest[,1]) I get this error: Error in table(Pred = SvmPred, True = BankTest) : all arguments must have the same length I checked the length of both BankTest[,1] and SvmPredict. length(SvmPred) [1] 2282 length(BankTest[,1]) [1] 2598 How are these two columns not the same size? Surely the prediction should try to predict all the values of BankTest[,1]? What do you get when you type: R head(SvmPred) -- Steve Lianoglou Graduate Student: Computational Systems Biology | Memorial Sloan-Kettering Cancer Center | Weill Medical College of Cornell University Contact Info: http://cbio.mskcc.org/~lianos/contact __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] matrix and a function - apply function
Hi r-help-boun...@r-project.org napsal dne 02.02.2011 16:05:21: On Feb 2, 2011, at 9:12 AM, ADias wrote: Hi I have this function and this matrix: function(x,y) x+y/x m-matrix(c(1,2,4,2,10,8),3,2) m [,1] [,2] [1,]12 [2,]2 10 [3,]48 each row represent a point (x,y) in a chart and I want via my fucntion to calculate the image in order to get this results: for point (1,2) I would get 1+2/1 = 3 for point (2,10) I would get 2+10/2 = 7 for point (4,8) I would get 4+8/4 = 6 I have tried using sapply here but I get this: sapply(m,function(x,y) x+y/x) Error in y/x : 'y' is missing I'm not sure what sapply does with a matrix argument. I've only used t with vectors and lists. I suspect that it would straighten out the argument to a length = 6 vector. (And then, of course, the y wouldn't be there.) what I am doing wrong? Two things: instead use apply() and realize that the argument is passed as a vector apply(m, 1, function(x) x[1] +x[2]/x[1] ) Maybe apply is not necessary here m[,1]+m[,2]/m[,1] [1] 3 7 6 Regards Petr -- David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] matrix and a function - apply function
there is no need for 'apply' here, because R can handle vectors. ord-m[,1]+m[,2]/m[,1] Am 02.02.2011 15:12, schrieb ADias: Hi I have this function and this matrix: function(x,y) x+y/x m-matrix(c(1,2,4,2,10,8),3,2) m [,1] [,2] [1,]12 [2,]2 10 [3,]48 each row represent a point (x,y) in a chart and I want via my fucntion to calculate the image in order to get this results: for point (1,2) I would get 1+2/1 = 3 for point (2,10) I would get 2+10/2 = 7 for point (4,8) I would get 4+8/4 = 6 I have tried using sapply here but I get this: sapply(m,function(x,y) x+y/x) Error in y/x : 'y' is missing what I am doing wrong? thanks ADias -- Eik Vettorazzi Institut für Medizinische Biometrie und Epidemiologie Universitätsklinikum Hamburg-Eppendorf Martinistr. 52 20246 Hamburg T ++49/40/7410-58243 F ++49/40/7410-57790 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] grey scale graphs
Hi everyone, Does anyone know how to get black and white theme (grey scale,, I would say) graphs using lattice or ggplot2, as it is shown in this webpage: http://lmdvr.r-forge.r-project.org/figures/figures.html? I am using Sweave, and I cannot get that color configuration. I have added the following option: trellis.device(color=FALSE) but I got a pdf file with color graphs. Thank in advance. -- Sebastián Daza sebastian.d...@gmail.com __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] update not working
Hi Luis, Well, when I copied your code I discovered that On Wed, Feb 2, 2011 at 9:42 AM, Luis Ridao luri...@gmail.com wrote: R-help, I'm using the update command for a multiple regression model and it is just not working: update(model1, . ~ . – temp:wind:rad,data=ozone.pollution) ^ this is not a minus sign! Are you using a word processer to write your code? update(ozone ~ temp * wind * rad + I(rad^2) + I(temp^2) + I(wind^2), . ~ . -temp:wind:rad) works for me. Best, Ista Error: unexpected input in model2-update(model1, . ~ . – summary(model1) Call: lm(formula = ozone ~ temp * wind * rad + I(rad^2) + I(temp^2) + I(wind^2), data = ozone.pollution) Residuals: Min 1Q Median 3Q Max -38.894 -11.205 -2.736 8.809 70.551 Coefficients: Estimate Std. Error t value Pr(|t|) (Intercept) 5.683e+02 2.073e+02 2.741 0.00725 ** temp -1.076e+01 4.303e+00 -2.501 0.01401 * wind -3.237e+01 1.173e+01 -2.760 0.00687 ** rad -3.117e-01 5.585e-01 -0.558 0.57799 I(rad^2) -3.619e-04 2.573e-04 -1.407 0.16265 I(temp^2) 5.833e-02 2.396e-02 2.435 0.01668 * I(wind^2) 6.106e-01 1.469e-01 4.157 6.81e-05 *** temp:wind 2.377e-01 1.367e-01 1.739 0.08519 . temp:rad 8.403e-03 7.512e-03 1.119 0.26602 wind:rad 2.054e-02 4.892e-02 0.420 0.67552 temp:wind:rad -4.324e-04 6.595e-04 -0.656 0.51358 --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 Residual standard error: 17.82 on 100 degrees of freedom Multiple R-squared: 0.7394, Adjusted R-squared: 0.7133 F-statistic: 28.37 on 10 and 100 DF, p-value: 2.2e-16 Can anyone help with this? Thanks in advance [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Ista Zahn Graduate student University of Rochester Department of Clinical and Social Psychology http://yourpsyche.org __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] help
On 2/2/11 3:59 AM, Łukasz Ręcławowicz wrote: We don't need a loop! require(Rmpfr) factorial(mpfr(1:500,3800)) This is very good! I get an unexpected warning, though: Warning message: In if (mpfr.is.integer(x)) round(r) else r : the condition has length 1 and only the first element will be used vQ __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] grey scale graphs
Well, if you're using pdf, you could set the gray option there, e.g., pdf.options(colormodel=gray) Or you could just use a gray theme. In ggplot2 use + theme_gray() Not sure in lattice, but I think it also has a themeing system. Best, Ista 2011/2/2 Sebastián Daza sebastian.d...@gmail.com: Hi everyone, Does anyone know how to get black and white theme (grey scale,, I would say) graphs using lattice or ggplot2, as it is shown in this webpage: http://lmdvr.r-forge.r-project.org/figures/figures.html? I am using Sweave, and I cannot get that color configuration. I have added the following option: trellis.device(color=FALSE) but I got a pdf file with color graphs. Thank in advance. -- Sebastián Daza sebastian.d...@gmail.com __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Ista Zahn Graduate student University of Rochester Department of Clinical and Social Psychology http://yourpsyche.org __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] drawing from one cell to another using layout() - possible?
Is it possible to cross the cell boundaries set by layout using base graphics? I.e. I want to draw e.g. a line from one layout cell to another. Is there a way to do that? layout(matrix(c(1,2), byrow=TRUE, ncol=2)) plot.new() text(0,0,paste(rep(a, 200), collapse=), xpd=T) layout.show(2) I would like the a's to not end at the layout borders of the left cell. Thanks in advance, Mark PS. I need to use base not groid graphics, though it may be simpler... -- Mark Heckmann (Dipl. Wirt.-Ing.) phone +49 (0) 421/1614618 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] help
Hi, maybe you rethink your calculations. Just for curiosity, what's the purpose? First of all, using prod(1:j-1-.25) #and prod(1:j) instead of a loop is much more efficent. If you need all values for a[1] up to a[j] you can use cumprod But this doesn't get you rid of numeric overflow but 1/(1*2*3*4*..*j) * (-d)(1-d)(2-d)...(j-1-d) is the same as (-d)/1 * (1-d)/2 * (2-d)/3 ... * (j-1-d)/j (multiplication is commutative) j-1:500 a-cumprod((j-1-.25)/j) or am I missing something? Am 01.02.2011 21:51, schrieb Kiogou Lydie: PLEASE HELP I actually want to do the following: a[j] = (1/(j!))*Π (i-1-d), j = 500, Π means product i = 1 to j Yet, j! will stop at 170 and Π (i-1-d) at 172; so, a[j] will not exceed 170. I would like to have at least 200 a[j]. WHAT SHOULD I DO? PLEASE SEE MY CODE FOR DETAIL!! R CODE: ### d = .25 # generate j! v=j=1 for (i in 1:200){ v[i] = i for (i in 2:200){ j[1]=1 j[i] = j[i-1]*v[i] } } j # generate aj l=A=a=0 for (k in 1:200){ l[k] = (k-1-d) for (i in 2:200){ A[1] = l[1] A[i] = A[i-1]*l[i] for ( i in 1:200){ a[i]= (1/j[i])*A[i] } } } a END CODE ## # With GOD, everything is POSSIBLE. Avec DIEU, tout est POSSIBLE. [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Eik Vettorazzi Institut für Medizinische Biometrie und Epidemiologie Universitätsklinikum Hamburg-Eppendorf Martinistr. 52 20246 Hamburg T ++49/40/7410-58243 F ++49/40/7410-57790 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Finding the maximum in a particular group in a dataframe
Hi: Using df as your data frame, max(subset(df, condition == 'GPR119a', select = responce)) [1] 0.6451204 You asked this question yesterday with several correct responses. What was wrong with them? Dennis On Wed, Feb 2, 2011 at 12:48 AM, Asan Ramzan asanram...@yahoo.com wrote: Hello I am trying to find a way to find the max value, for only a subset of a dataframe, depending on how the data is grouped for example, How would I find the maxmium responce, for all the GPR119a condition below: I've tried tapply tapply(GPR119data$responce, GPR119data$GPR119a, max) Error in tapply(GPR119data$responce, GPR119data$GPR119a, max) : arguments must have same length responce,mouce,condition 0.105902,KO,con 0.232018561,KO,con 0.335008375,KO,con 0.387025433,KO,GPR119a 0.576769897,KO,GPR119a 0.645120419,KO,GPR119a 0.2538608,KO,GPR119b 0.183061952,KO,GPR119b 0.824035587,KO,GPR119b 0.399201597,KO,GPR119c 0.417006618,KO,GPR119c 0.572958834,KO,GPR119c 0.229467444,KO,GPR119d 0.294089745,KO,GPR119d 0.309964445,KO,GPR119d 0.30474325,KO,GPR119e 0.159374839,KO,GPR119e 0.467726848,KO,GPR119e 1.01841912,KO,GPR119f 0.423028621,KO,GPR119f 0.223588597,KO,GPR119f Thank [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] SMA and EMA in package TTR
Hi Anyi, That's not currently possible. You could use the rollmean function in the development version of the zoo package (on R-forge) to calculate the simple moving average as you described. I would like to add this as an option to TTR functions, but it is not high on my priority list. I would appreciate patches. Best, -- Joshua Ulrich | FOSS Trading: www.fosstrading.com On Sat, Jan 29, 2011 at 7:39 PM, Anyi Zhu anyi@gmail.com wrote: Hi, Just wondering for the SMA and EMA in package TTR, is it possible to me to code it so that, say if I need to calculate SMA (x, n=100), when the sample size is less than 100, it will give me the SMA (x, k) where k is the sample size of the data? Right now it only gives me an invalid n error. Thanks! [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Function to locate points in 3d octants or points on twoaxes
-Original Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Bryan Hanson Sent: Wednesday, February 02, 2011 5:23 AM To: Petr Savicky Cc: r-help@r-project.org Subject: Re: [R] Function to locate points in 3d octants or points on twoaxes Thanks Petr, the sign function will be of help. I was not aware of it. Bryan On Feb 2, 2011, at 3:21 AM, Petr Savicky wrote: On Tue, Feb 01, 2011 at 08:30:22PM -0500, Bryan Hanson wrote: [Sorry, resending with a proper subject line!] Hi Guru's... I have a set of points that may lie along any of the x, y and z axes in a Cartesian coordinate system. I am hoping that a function exists which will determine if any two selected points are on different axes, i.e, if the one of the points is on x and the other on y or z, not elsewhere on the x axis. Put another way, I need to determine if the triangle formed by the two points and the origin lies in the xy, xz or yz planes. This might be as simple as testing if any particular value is zero, i.e. if the x coordinate is zero, then the points must be on the z and y axes and the triangle in the yz plane. But, I'm looking for a fairly general solution, one that also returns the appropriate plane as the answer. Very closely related to this, I could use a function that determines which of the 8 octants a point lies in. Seems like the cross product might be part of this, but I'm a little rusty on how to apply it. I hope this is clear enough, and someone has a suggestion to point me in the right direction. Before writing my own klunky version, I thought I'd ask. For a general solution you also need to consider how you want to deal with boundary conditions. For example, if the x and y coordinates are zero for both points then the points lie in both the xz and yz planes. And if one of the coordinates of a point is zero, then how do you decide which quadrant it is in? Dan Daniel Nordlund Bothell, WA USA __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Acf of Frima
What's farima? functional arima or fractional arima? If the former, the fda package includes a function cor.fd for correlation matrix from functional data object(s). If the latter, I suggest you use sos as follows: library(sos) (fr - findFn('fractional arima')) findFn searches a database of help pages of contributed packages. This particular call returned 13 links in 5 packages and opened the results in a table in a web browser with hot links , sorted to put first the package with the most matches. Hope this helps. Spencer On 2/2/2011 5:43 AM, Chuse chuse wrote: Hello, I am trying to calculate the autocovariance matrix for any general farima(p,d,q) with p,q 1. Could anyone give an idea how to implement in R or if there is any package for this? thank you beforehand. Jose. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Spencer Graves, PE, PhD President and Chief Operating Officer Structure Inspection and Monitoring, Inc. 751 Emerson Ct. San José, CA 95126 ph: 408-655-4567 __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Function to locate points in 3d octants or points on twoaxes
Thanks Dan for pointing that out. My question really arose from the need to draw splines between arbitrary 3d pairs of points, so I posted a new question to the list addressing that more specifically. While the issue of 3d splines must have been dealt with in graphics/ animation oriented programs/languages, I think I may have to grow my own and I'll need the suggestions offered by you and Petr. Thanks, Bryan On Feb 2, 2011, at 11:55 AM, Daniel Nordlund wrote: -Original Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org ] On Behalf Of Bryan Hanson Sent: Wednesday, February 02, 2011 5:23 AM To: Petr Savicky Cc: r-help@r-project.org Subject: Re: [R] Function to locate points in 3d octants or points on twoaxes Thanks Petr, the sign function will be of help. I was not aware of it. Bryan On Feb 2, 2011, at 3:21 AM, Petr Savicky wrote: On Tue, Feb 01, 2011 at 08:30:22PM -0500, Bryan Hanson wrote: [Sorry, resending with a proper subject line!] Hi Guru's... I have a set of points that may lie along any of the x, y and z axes in a Cartesian coordinate system. I am hoping that a function exists which will determine if any two selected points are on different axes, i.e, if the one of the points is on x and the other on y or z, not elsewhere on the x axis. Put another way, I need to determine if the triangle formed by the two points and the origin lies in the xy, xz or yz planes. This might be as simple as testing if any particular value is zero, i.e. if the x coordinate is zero, then the points must be on the z and y axes and the triangle in the yz plane. But, I'm looking for a fairly general solution, one that also returns the appropriate plane as the answer. Very closely related to this, I could use a function that determines which of the 8 octants a point lies in. Seems like the cross product might be part of this, but I'm a little rusty on how to apply it. I hope this is clear enough, and someone has a suggestion to point me in the right direction. Before writing my own klunky version, I thought I'd ask. For a general solution you also need to consider how you want to deal with boundary conditions. For example, if the x and y coordinates are zero for both points then the points lie in both the xz and yz planes. And if one of the coordinates of a point is zero, then how do you decide which quadrant it is in? Dan Daniel Nordlund Bothell, WA USA __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] subset in a BIG matrix
To solve it, a little more data would help. At least provide exactly the statements you are using and an 'str' of the objects; e.g., str(matrix2) str(condition2) error message might indicate that the length of condition2 is larger than the subset of matrix2 that you want. On Wed, Feb 2, 2011 at 8:30 AM, alcesgabbo alcesga...@hotmail.com wrote: I have a matrix with a lot of values inside.. when I execute the folowing command matrix2=subset(martix, condition.) it works... but after the previous command I execute another subset matrix3=subset(martix2, condition2.) and appears the following error: (subscript) logical subscript too long How can I solve this??? I think the probles is the size of the matrix... Thanks.. -- View this message in context: http://r.789695.n4.nabble.com/subset-in-a-BIG-matrix-tp3254115p3254115.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Jim Holtman Data Munger Guru What is the problem that you are trying to solve? __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] subset in a BIG matrix
Hi, On Wed, Feb 2, 2011 at 8:30 AM, alcesgabbo alcesga...@hotmail.com wrote: I have a matrix with a lot of values inside.. when I execute the folowing command matrix2=subset(martix, condition.) it works... but after the previous command I execute another subset matrix3=subset(martix2, condition2.) and appears the following error: (subscript) logical subscript too long How can I solve this??? I think the probles is the size of the matrix... Without your actual code, it's hard to tell, but if I were to take a guess, I think you're generating your `condition2` logic vector based on your original `matrix` maybe? If you aren't already doing so, put your subsetting logic into your `subset` call, eg: matrix3 - subset(matrix2, some.column = some.value) ## or whatever -steve -- Steve Lianoglou Graduate Student: Computational Systems Biology | Memorial Sloan-Kettering Cancer Center | Weill Medical College of Cornell University Contact Info: http://cbio.mskcc.org/~lianos/contact __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] grey scale graphs
On Feb 2, 2011, at 10:26 AM, Sebastián Daza wrote: Hi everyone, Does anyone know how to get black and white theme (grey scale,, I would say) graphs using lattice or ggplot2, as it is shown in this webpage: http://lmdvr.r-forge.r-project.org/figures/figures.html? I am using Sweave, and I cannot get that color configuration. I have added the following option: trellis.device(color=FALSE) but I got a pdf file with color graphs. Maybe you should use postscript as a device for which the default is for color=FALSE. ?trellis.device Thank in advance. -- Sebastián Daza sebastian.d...@gmail.com __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Why my package is not being generated?
Hi all, I have commited a new release of my package and waited for more than one day. After that install.packages(tests,repos=http://r-forge.r-projet.org;) still returns package 'tests' is not available. I have done the local build check and the package seems ok. Also have mofified the Title in the DESCRIPTION file. What can be wrong? Is there any tool or recomendation to validade my package before submit it? -- Cristiano Strieder São José dos Campos - SP __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] SVM Prediction and Plot
Thanks for the reply. When I run this: head(SvmPred) 3570 2361 5406 2041 3440 4123 NNYYYY Levels: N Y -- View this message in context: http://r.789695.n4.nabble.com/SVM-Prediction-and-Plot-tp3254421p3254570.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Efficient way to determine if a data frame has missing observations
I have a data set covering a large number of cities with values for characteristics such as land area, population, and employment. The problem I have is that some cities lack observations for some of the characteristics and I'd like a quick way to determine which cities have missing data. For example: city-c(A,A,A,B,B,C) var-c(sqmi,pop,emp,pop,emp,pop) value-c(10,100,40,30,10,20) df-data.frame(city,var,value) In this data frame, city A has complete data for the three variables, while city B is missing land area, and city C only has population data. In the full data frame, my approach to finding the missing observations has been to create a data frame with all combinations of 'city' and 'var', merge this onto the original data frame, and then extract the observations with missing data for 'value': city_unq-c(A,B,C) var_unq-c(sqmi,pop,emp) comb-expand.grid(city=city_unq,var=var_unq) mrg-merge(comb,df,by=c(city,var),all=T) missing-mrg[is.na(mrg$value),] This works, but on a large dataset it gets slow and I'm looking for a a more efficient way to achieve this same result. Any suggestions would be much appreciated. Cheers [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] exact logistic regression
Hello to R people Does anybody know to calculate exact logistic regression in R? Does such option exist anywhere? Surprisingly, could not find it using search engine. It is hard to believe, however, that such useful function is not implemented in R yet? Could you help, please Thank you Denis __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Merging by factor variables
I'm wondering about the behavior of the merge function when using factors as by variables. I know that when you combine two factors using c() the results can be odd, as in: c(factor(1:5),factor(6:10)) which prints: [1] 1 2 3 4 5 1 2 3 4 5 I presume this is because factors are actually stored as integers, with 6,7,8,9,10 stored internally as 1,2,3,4,5. This concerns me somewhat, as I often merge data frames using factors as the by variables. From what I can tell, the merge function creates matches based on factor labels (i.e. the result of as.character(factor_var)) and not the internally stored integers, but I'm wondering if there are particular lurking problems that I should be aware of? I'm especially curious as to how R recalculates the levels of the by variables in outer joins where not every observation is matched, as in: df1-data.frame(a=factor(c(a,b)),b=1:2) df2-data.frame(a=factor(c(b,c)),c=2:3) df3-merge(df1,df2,by=a,all=T) Many thanks! [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Average of several line plots
Hi, I have several data sets which are all approximately within the same values as each other (both X and Y) and all of these data sets more or less overlap each other when plotted on the same graph. However, although each data set varies between approximately the same range, there are vastly different numbers of data points within each data set (due to the nature of how these values had to be measured). Currently, I am trying to figure out a way to produce a graph showing all these data sets and a line showing the average of all of the data sets. Given the different number of data points, I can't think of an easy way to do this. Can anybody help? Thanks! -- View this message in context: http://r.789695.n4.nabble.com/Average-of-several-line-plots-tp3254850p3254850.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Function to locate points in 3d octants or points on twoaxes
On Feb 2, 2011, at 12:28 PM, Bryan Hanson wrote: Thanks Dan for pointing that out. My question really arose from the need to draw splines between arbitrary 3d pairs of points, so I posted a new question to the list addressing that more specifically. While the issue of 3d splines must have been dealt with in graphics/animation oriented programs/languages, I think I may have to grow my own and I'll need the suggestions offered by you and Petr. Thanks, Bryan I fit crossed cubic regression splines using the rms package, which might provide this functionality. However, I get the idea you want an exact fit rather than a fit that is constrained to a particular functional form with parameters determined by a minimization metric involving a large number of points. -- David On Feb 2, 2011, at 11:55 AM, Daniel Nordlund wrote: -Original Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org ] On Behalf Of Bryan Hanson Sent: Wednesday, February 02, 2011 5:23 AM To: Petr Savicky Cc: r-help@r-project.org Subject: Re: [R] Function to locate points in 3d octants or points on twoaxes Thanks Petr, the sign function will be of help. I was not aware of it. Bryan On Feb 2, 2011, at 3:21 AM, Petr Savicky wrote: On Tue, Feb 01, 2011 at 08:30:22PM -0500, Bryan Hanson wrote: [Sorry, resending with a proper subject line!] Hi Guru's... I have a set of points that may lie along any of the x, y and z axes in a Cartesian coordinate system. I am hoping that a function exists which will determine if any two selected points are on different axes, i.e, if the one of the points is on x and the other on y or z, not elsewhere on the x axis. Put another way, I need to determine if the triangle formed by the two points and the origin lies in the xy, xz or yz planes. This might be as simple as testing if any particular value is zero, i.e. if the x coordinate is zero, then the points must be on the z and y axes and the triangle in the yz plane. But, I'm looking for a fairly general solution, one that also returns the appropriate plane as the answer. Very closely related to this, I could use a function that determines which of the 8 octants a point lies in. Seems like the cross product might be part of this, but I'm a little rusty on how to apply it. I hope this is clear enough, and someone has a suggestion to point me in the right direction. Before writing my own klunky version, I thought I'd ask. For a general solution you also need to consider how you want to deal with boundary conditions. For example, if the x and y coordinates are zero for both points then the points lie in both the xz and yz planes. And if one of the coordinates of a point is zero, then how do you decide which quadrant it is in? Dan Daniel Nordlund Bothell, WA USA __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. David Winsemius, MD West Hartford, CT __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Merging by factor variables
H Roark wrote: I'm wondering about the behavior of the merge function when using factors as by variables. I know that when you combine two factors using c() the results can be odd, as in: c(factor(1:5),factor(6:10)) which prints: [1] 1 2 3 4 5 1 2 3 4 5 I presume this is because factors are actually stored as integers, with 6,7,8,9,10 stored internally as 1,2,3,4,5. This concerns me somewhat, as I often merge data frames using factors as the by variables. From what I can tell, the merge function creates matches based on factor labels (i.e. the result of as.character(factor_var)) and not the internally stored integers, but I'm wondering if there are particular lurking problems that I should be aware of? I'm especially curious as to how R recalculates the levels of the by variables in outer joins where not every observation is matched, as in: df1-data.frame(a=factor(c(a,b)),b=1:2) df2-data.frame(a=factor(c(b,c)),c=2:3) df3-merge(df1,df2,by=a,all=T) As far as I know, there is no reason to be concerned when using merge as you do. The magic that ?merge is performing is actually being done in ?rbind, and you should read the help for that, particularly under Data frame methods. You can also study the code of base.rbind.data.frame to see what it's actually doing. --Erik __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Efficient way to determine if a data frame has missing observations
H Roark wrote: I have a data set covering a large number of cities with values for characteristics such as land area, population, and employment. The problem I have is that some cities lack observations for some of the characteristics and I'd like a quick way to determine which cities have missing data. For example: city-c(A,A,A,B,B,C) var-c(sqmi,pop,emp,pop,emp,pop) value-c(10,100,40,30,10,20) df-data.frame(city,var,value) In this data frame, city A has complete data for the three variables, while city B is missing land area, and city C only has population data. In the full data frame, my approach to finding the missing observations has been to create a data frame with all combinations of 'city' and 'var', merge this onto the original data frame, and then extract the observations with missing data for 'value': city_unq-c(A,B,C) var_unq-c(sqmi,pop,emp) comb-expand.grid(city=city_unq,var=var_unq) mrg-merge(comb,df,by=c(city,var),all=T) missing-mrg[is.na(mrg$value),] Perhaps the following, or a variation thereof? subset(as.data.frame(table(city = df$city, var = df$var)), Freq == 0) __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] SVM Prediction and Plot
Hi, On Wed, Feb 2, 2011 at 10:35 AM, Brian ctto...@hotmail.com wrote: Thanks for the reply. When I run this: head(SvmPred) 3570 2361 5406 2041 3440 4123 N N Y Y Y Y Levels: N Y Hmmm ... I think we'll need more info. It looks like it should be working. By your output, your `probabilites=TRUE` parameter in your call to `predict` isn't actually doing anything. You need to train the svm and set `probability=TRUE` if you're really after class probabilties. Also, maybe there are some NA's in your data? Also, to be extra careful, the second argument to predict should be a matrix, I guess it already is, but ... what happens if you call predict like so: R SvmPred = predict(Svm, as.matrix(BankTest[,-1]), na.omit=na.fail) -steve -- Steve Lianoglou Graduate Student: Computational Systems Biology | Memorial Sloan-Kettering Cancer Center | Weill Medical College of Cornell University Contact Info: http://cbio.mskcc.org/~lianos/contact __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Efficient way to determine if a data frame has missing observations
Try this: subset(as.data.frame(xtabs( ~ city + var, df)), !Freq) On Wed, Feb 2, 2011 at 1:49 PM, H Roark hrbuil...@hotmail.com wrote: I have a data set covering a large number of cities with values for characteristics such as land area, population, and employment. The problem I have is that some cities lack observations for some of the characteristics and I'd like a quick way to determine which cities have missing data. For example: city-c(A,A,A,B,B,C) var-c(sqmi,pop,emp,pop,emp,pop) value-c(10,100,40,30,10,20) df-data.frame(city,var,value) In this data frame, city A has complete data for the three variables, while city B is missing land area, and city C only has population data. In the full data frame, my approach to finding the missing observations has been to create a data frame with all combinations of 'city' and 'var', merge this onto the original data frame, and then extract the observations with missing data for 'value': city_unq-c(A,B,C) var_unq-c(sqmi,pop,emp) comb-expand.grid(city=city_unq,var=var_unq) mrg-merge(comb,df,by=c(city,var),all=T) missing-mrg[is.na(mrg$value),] This works, but on a large dataset it gets slow and I'm looking for a a more efficient way to achieve this same result. Any suggestions would be much appreciated. Cheers [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Henrique Dallazuanna Curitiba-Paraná-Brasil 25° 25' 40 S 49° 16' 22 O [[alternative HTML version deleted]] __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Average of several line plots
On 2011-02-02 08:44, mattnixon wrote: Hi, I have several data sets which are all approximately within the same values as each other (both X and Y) and all of these data sets more or less overlap each other when plotted on the same graph. However, although each data set varies between approximately the same range, there are vastly different numbers of data points within each data set (due to the nature of how these values had to be measured). Currently, I am trying to figure out a way to produce a graph showing all these data sets and a line showing the average of all of the data sets. Given the different number of data points, I can't think of an easy way to do this. Have a look at ?approxfun Peter Ehlers Can anybody help? Thanks! __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] Indexing from two variables
Hello, thank you all for your patience and time I am essentially trying to get disorganised data into long form for linear modelling. I have 2 dataframes rec and book Each row in book needs to be pasted onto the end of several of the rows of rec according to two variables in the row: MRN and COURSE which match. I have tried the following and variations thereon to no avail: /code #for each line of the recovery dataframe #insert the one line of the book dataset that corresponds to the MRN AND the course-of-treatment (COURSE) #get the mrn and course from the first line of the recovery dataframe (rec) i=1 newlist=list() colnames(newlist)=colnames(book) for ( i in 1:dim(rec)[1]) { mrn=as.numeric(as.vector(rec$MRN[i])); course=as.character(rec$COURSE[i]); ## find the corresponding row in the book dataframe ## by generating a logical vector and using ## it to access book get.vector-as.vector(((as.numeric(as.vector(book$MRN))==mrn) (as.character(book$COURSE)==course))) #gives you a vector of logicals (works) newlist[i]-book[get.vector,] ### (doesn't work) i=i+1; } /code If anyone has any suggestions on 1)getting this to work 2) making it more elegant (or perhaps just less clumsy) If I have been unclear in any way I beg your pardons. I do understand I haven't combined any data above, I think if I can generate a long-format dataframe I can combine them all on my own. Apologies to anybody who receives this twice, I had a bounced reply the first time, and assumed it was unsuccessful. Ross Dunne MB MRCPsych ross.du...@tcd.ie -- View this message in context: http://r.789695.n4.nabble.com/Indexing-from-two-variables-tp3255405p3255405.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Average of several line plots
It is not clear from your message whether these data represent functions or such. Could you perhaps bring up an example? I'm guessing each data set contains evaluation points and function values at those points, and that you would like to plot these functions and the mean function all on the same graph, is that right? If the points in the domain differ from each other, it's not as immediate to build a mean function. A quick recipe would be to use splinefun on each data set, generate a new grid for the domain, evaluate each obtained spline on this grid to get new data sets and build the mean function from there. But without an example it's really just guessing. Hope this helps anyway. ?splinefun Best regards, Eduardo On Wed, Feb 2, 2011 at 2:44 PM, mattnixon m.r.ni...@ex.ac.uk wrote: Hi, I have several data sets which are all approximately within the same values as each other (both X and Y) and all of these data sets more or less overlap each other when plotted on the same graph. However, although each data set varies between approximately the same range, there are vastly different numbers of data points within each data set (due to the nature of how these values had to be measured). Currently, I am trying to figure out a way to produce a graph showing all these data sets and a line showing the average of all of the data sets. Given the different number of data points, I can't think of an easy way to do this. Can anybody help? Thanks! -- View this message in context: http://r.789695.n4.nabble.com/Average-of-several-line-plots-tp3254850p3254850.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Indexing from two variables
Hard to know exactly without seeing the structrure of rec and book, but I would start with ?merge Best, Ista On Wed, Feb 2, 2011 at 2:06 PM, dunner ross.du...@tcd.ie wrote: Hello, thank you all for your patience and time I am essentially trying to get disorganised data into long form for linear modelling. I have 2 dataframes rec and book Each row in book needs to be pasted onto the end of several of the rows of rec according to two variables in the row: MRN and COURSE which match. I have tried the following and variations thereon to no avail: /code #for each line of the recovery dataframe #insert the one line of the book dataset that corresponds to the MRN AND the course-of-treatment (COURSE) #get the mrn and course from the first line of the recovery dataframe (rec) i=1 newlist=list() colnames(newlist)=colnames(book) for ( i in 1:dim(rec)[1]) { mrn=as.numeric(as.vector(rec$MRN[i])); course=as.character(rec$COURSE[i]); ## find the corresponding row in the book dataframe ## by generating a logical vector and using ## it to access book get.vector-as.vector(((as.numeric(as.vector(book$MRN))==mrn) (as.character(book$COURSE)==course))) #gives you a vector of logicals (works) newlist[i]-book[get.vector,] ### (doesn't work) i=i+1; } /code If anyone has any suggestions on 1)getting this to work 2) making it more elegant (or perhaps just less clumsy) If I have been unclear in any way I beg your pardons. I do understand I haven't combined any data above, I think if I can generate a long-format dataframe I can combine them all on my own. Apologies to anybody who receives this twice, I had a bounced reply the first time, and assumed it was unsuccessful. Ross Dunne MB MRCPsych ross.du...@tcd.ie -- View this message in context: http://r.789695.n4.nabble.com/Indexing-from-two-variables-tp3255405p3255405.html Sent from the R help mailing list archive at Nabble.com. __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Ista Zahn Graduate student University of Rochester Department of Clinical and Social Psychology http://yourpsyche.org __ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.