Hello,
I'm trying to use the lsa (latent semantic analysis) package, and running
into a problem that seems to be related to the number of documents being
processed. Here's the code I'm running (after loading the lsa and rstem
packages), and the error message:
SnippetsPath -
Even in R 2.6.2 the symbol is Rf_KillAllDevices, so this is an rpy
installation issue (one that I cannot reproduce).
Rpy has a mailing list, so why ask here and not there? (In any case, the
convention is that questions about the development version of R should be
asked on the R-devel list.)
FC == Felipe Carrillo [EMAIL PROTECTED]
on Mon, 24 Mar 2008 18:42:36 -0700 (PDT) writes:
FC Thanks everyone for your help the qt function worked
FC like a dream. I ended up using it like this; T -
FC qt((1-0.05/2),7) to estimate 95% intervals divided by
FC 2. 7 would be the
works fine by me with the data you supplied:
x
V1 V2 V3 V4
1 8.30440e-02 3.75276e+02 680220majority
2 5.50816e-09 2.48914e-05 26377conformation
3 1.69618e-04 7.66505e-01 1546938 interaction
4 3.90425e-05 1.76433e-01 1655338
Dear Pedro,
Sorting your data before plotting it will probably sort your problem.
plotdata2 - plotdata2[order(plotdata2$group), ]
HTH,
Thierry
ir. Thierry Onkelinx
Instituut voor natuur- en bosonderzoek / Research
Dear all, I am having some problems with the folowing task and would
appreciate any thoughts on the matter.
I have a sliding window time series of rainfall. This is 24 hour window
created from an hourly record. Every hour, a new twenty four hour total
is calculated. I would like to extract the
Hi Shirley,
You can use the function sort_df() from the reshape package to sort an
entire data.frame based on one column.
cheers,
Paul
Shirley Wu wrote:
Hello,
I have a data frame consisting of four columns and would like to sort
based on the first column and then write the sorted data
Hi everybody,
Does R graphs Tippett plots?
Thank you.
__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal,
Spiros Stavropoulos (DT) wrote:
Hi everybody,
Does R graphs Tippett plots?
Hi Spiros,
You can get pretty much what you want with:
# make up probability and likelihood data
p1-log(cumsum(abs(rnorm(20
l1-rev(rescale(cumsum(abs(rnorm(20))),c(0,1)))
# plot it in stairstep mode
Jason Q. McClintic wrote:
Dear List:
In short, I am writing a number of functions as building blocks for
other functions and have some questions about scoping and passing arguments.
Suppose I have functions foo1, foo2, and foo3 such that:
foo1-function(a=1,b=TRUE,c=FALSE){#do
Try this:
aggregate(list(Number=x$Number), by=list(Gene_Name=x$Gene_Name), sum)
On 25/03/2008, Charles Murtaugh [EMAIL PROTECTED] wrote:
Hi--
This is a question with a trivial and obvious answer, I'm sure, but I can't
seem to find it in the help files and books that I have handy. I have
Maybe:
xyplot(values ~ ind|ind, data = stack(split(m, 1:4)))
On 24/03/2008, Ron Bonner [EMAIL PROTECTED] wrote:
Sorry if this is an FAQ, but I haven't found the answer (yet)...
I'm trying to plot each row of a simple numeric matrix in a separate
panel using the layout features of lattice,
Hi,
I am working with CART regression now. Could anyone tell me in which cases
it is better to use mean square error for splitting nodes and when mean
absolute error should be preferred.
I am now using the default (MSE) version and I can see that the obtained
optimal tree is very different from
my.mat[rownames(my.mat) in c(5951, 236, 6306, 5950, 145742, 1123),]
or something close to this. Need to read introduction to R and subsetting.
On 3/24/08, dinesh kumar [EMAIL PROTECTED] wrote:
Dear R users
I have a big matrix like
60211188790290117410151990
Dear R-helpers,
I´m in the context of writing a general function for error propagation
in R.
There are somehow a few questions I would like to ask (discuss), as my
statistical knowledge is somewhat restricted.
Below is the function I wrote, the questions are marked.
Many thanks in advance.
Andrej-Nikolai Spiess a.spiess at uke.uni-hamburg.de writes:
Dear R-helpers,
I´m in the context of writing a general function for error propagation
in R.
There are somehow a few questions I would like to ask (discuss), as my
statistical knowledge is somewhat restricted.
Below is the
I would like to as for advice for the use of “lmer” (package ‘lme4’) and
writing the proper syntax to best describe my data using a mixed-effects model.
I have just started to use these models, and although I have read some good
examples (Extending the Linear Model with R, Faraway 2005; and the
Dear Thierry,
Thanks, it worked. I though the algorithm behind the plotting uses
some approach like unique(), so that the order of the plot is that of
the first occurrence of each code, but I could not get into the code
to modify this.
Anyway, Hadley has already gently volunteered to look into
Kevin, thanks for writing. Yes, sorry, I forgot to mention that this is
a Windows XP Professional system running GNU Emacs 22.1.1
(i386-mingw-nt5.1.2600) from Vincent Goulet, and R 2.6.2 Windows
version. I pasted in the sessionInfo() output from ESS inside of Emacs
to the end of this note.
Was
hello everyone,
I would like to as for advice for the use of lmer
(package lme4) and writing the proper syntax to best
describe my data using a mixed-effects model.
I have just started to use these models, and although
I have read some good examples (Extending the Linear
Model with R,
One other reference is the vignette in the Brobgingnag
package; this is a cookbook approach that gives step-by-step
instructions on how to build a simple package using S4 methods.
rksh
On 21 Mar 2008, at 17:29, Martin Morgan wrote:
Hi Christophe --
In terms of documentation, see
See the reshape package.
library(reshape)
yy - melt(xx, id=c(Gene.name))
cast(yy, Gene.name~variable, sum)
--- Charles Murtaugh [EMAIL PROTECTED]
wrote:
Hi--
This is a question with a trivial and obvious
answer, I'm sure, but I can't seem to find it in the
help files and books that I
Hi Kevin,
I usually install R in e.g., C:\R-2.6.2 to avoid such problems.
Best,
Jim
Zembower, Kevin wrote:
Kevin, thanks for writing. Yes, sorry, I forgot to mention that this is
a Windows XP Professional system running GNU Emacs 22.1.1
(i386-mingw-nt5.1.2600) from Vincent Goulet, and R
On 3/25/2008 10:06 AM, James W. MacDonald wrote:
Hi Kevin,
I usually install R in e.g., C:\R-2.6.2 to avoid such problems.
Explicitly including
\usepackage{Sweave}
may also help, so that Sweave doesn't try to give an explicit path.
Then again, it may not, depending on which LaTeX you're
At 10:48 PM 3/24/2008, Chistos wrote:
John,
There is a peak finding algorithm attributed to Prof. Ripley in the R-help
archive. It has a span parameter which might give you something close to
what you seem to be looking for.
http://finzi.psych.upenn.edu/R/Rhelp02a/archive/33097.html
-Christos
I'm having a little trouble with the glm.predict function. I read the other
answers related to this question (the lm.predict question), but i'm still
receiving the same warnings, then i thought i didnt got it right.
Warning:
'newdata' had 87 rows but variable(s) found have 263 rows
Commands:
I am not sure how to implement this in R, but beam forming, used in
geophysics, sounds like an alternative. We are currently looking at
this with the help of a matlab guru to look at the shift in the
dissolved oxygen peaks as you go down river. This assumes that you
have an array of reading
Dear All,
After having overcome the issue of legends (thanks, Thierry, once
more), I am trying to use facetting, but here also I can not find how
to do this. I do not want to use qplot, but rather the more flexible
options. However, it seems I am doing still something pretty stupid,
because I
Hi R,
I have an excel file in which the third column is date and others are
character and numeric.
Number of columns are 12
If I use this to read the file in R: x = read.xls(D:\\file.xls)
The problem is that my date column is read in julian dates.
So I am using:
Hi Pedro,
I think that the code below gives you the plot you want. Notice that you
don't have to add everthing seperatly. But you can if you feel the need
to do so.
ggplot(plotdata2, aes(x = x, y = y)) + geom_point() + facet_grid(group ~
.)
plot0 - ggplot(plotdata2, aes(x = x, y = y))
layer1 -
On Tue, Mar 25, 2008 at 9:49 AM, Pedro de Barros [EMAIL PROTECTED] wrote:
Dear All,
After having overcome the issue of legends (thanks, Thierry, once
more), I am trying to use facetting, but here also I can not find how
to do this. I do not want to use qplot, but rather the more flexible
I would like to compare two data sets saved as text files (example below) to
determine if both sets are identical(or if dat2 is missing information that is
included in dat1) and if they are not identical list what information is
different between the two sets(ie output a1, a3 as the differing
Hi R,
I have a list,
l=list(list(c(1,2),c(4,5),c(6,7)),list(c(11,22,33),c(44,55,66)))
l
[[1]]
[[1]][[1]]
[1] 1 2
[[1]][[2]]
[1] 4 5
[[1]][[3]]
[1] 6 7
[[2]]
[[2]][[1]]
[1] 11 22 33
[[2]][[2]]
[1] 44 55 66
How do I know the dimensions of this list?...
Here is how to find the values in common:
dat1 - paste('a', 1:6, sep='')
dat2 - paste('a', c(2,4:6), sep='')
# find the data in common
intersect(dat1, dat2)
[1] a2 a4 a5 a6
On 3/25/08, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote:
I would like to compare two data sets saved as text files
Hi all,
Found some comments on this on the FAQ under 2.24 but someone may have
some more explanations ... and I might then understand.
Operating system: Vista Ultimate (SP1 test version)
Installed R: R version 2.6.2 (2008-02-08)
When I try to install packages which need to write into
Hi,
Shubha Vishwanath Karanth wrote:
l=list(list(c(1,2),c(4,5),c(6,7)),list(c(11,22,33),c(44,55,66)))
How do I know the dimensions of this list?... In other words, how many
sub-lists, sub-sub-lists etc...are there in l? How do I know this?
Is this what you want?
l=list(list(c(1,2),c(4,5),c(6,7)),list(c(11,22,33),c(44,55,66)))
str(l)
List of 2
$ :List of 3
..$ : num [1:2] 1 2
..$ : num [1:2] 4 5
..$ : num [1:2] 6 7
$ :List of 2
..$ : num [1:3] 11 22 33
..$ : num [1:3] 44 55 66
On 3/25/08, Shubha Vishwanath Karanth
Thanks a lot for your reply...Exactly...I need the same...But can I store these
numbers and hence can access further? In other words how do I extract the
numbers/dimensions from this class?
Shubha Karanth | Amba Research
Ph +91 80 3980 8031 | Mob +91 94 4886 4510
Bangalore * Colombo * London *
Hello,
I want to be able to make a plot that has several series with
different color and linetype.
Online documentation suggest that this is possible, but I haven't found how:
We can also create redundant mappings, mapping the same variable to
multiple aesthetics. This is most useful when
Dear All,
I'm struggling a little with the behaviour of R with GLM interactions. In
particular, I have a dataset with two factors - call them factor A and
factor B, where I would like to fit a GLM that is factor A + (grouped
factor A):factor B.
To try to isolate this, I've ignored the
I an R-beginner.
I would like to use/call some functions/objects of R from
microsoft .NET environment. I was wondering whether or not
it is possible. If yes, could you please give me some suggestions
on how to approach it (any documentation on this)? Thanks!
-James
[[alternative
In case someone else runs into this, I found the problem, it was related to
having some zero-length text files. Make sure you have valid (non-empty)
data files for loading into the document-term matrix.
Alex
-- Forwarded message --
From: Alex McKenzie [EMAIL PROTECTED]
Date: Mar
On Mon, 24 Mar 2008, Shirley Wu wrote:
Hello,
I have a data frame consisting of four columns and would like to sort
based on the first column and then write the sorted data frame to a
file.
df - read.table(file.txt, sep=\t)
where file.txt is simply a tab-delimited file containing 4
Thanks Henrique - that didn't exactly solve my problem but gave me enough to
do so!
In your solution each row is plotted as a vertical row of points at the
value of 'ind' while I wanted to plot the values against the column numbers,
i.e. 1 to 5.
The solution was to create a vector of column
Maybe something like this:
lis - lapply(l, lapply, length)
names(lis) - lapply(l, length)
On 25/03/2008, Shubha Vishwanath Karanth [EMAIL PROTECTED] wrote:
Thanks a lot for your reply...Exactly...I need the same...But can I store
these numbers and hence can access further? In other words how
On Tue, Mar 25, 2008 at 11:48 AM, Tribo Laboy [EMAIL PROTECTED] wrote:
Hello,
I want to be able to make a plot that has several series with
different color and linetype.
Online documentation suggest that this is possible, but I haven't found how:
We can also create redundant mappings,
On 3/25/08, Ron Bonner [EMAIL PROTECTED] wrote:
Thanks Henrique - that didn't exactly solve my problem but gave me enough to
do so!
In your solution each row is plotted as a vertical row of points at the
value of 'ind' while I wanted to plot the values against the column numbers,
i.e. 1
But what can I do in case I don't know the number of columns in my
file??
I mean is there any way I can specify the colClass of only third column
and for other columns it can take the default classes??
Not supported, sorry (I put it on the list).
x = read.xls( D:\\file.xls, colClasses =
Hello All,
When I use bw.SJ (based on Sheather Jones, 1991) in R to estimate
the bandwidth for a highly skewed data, I get the following message:
sample is too sparse to find TD. I played around with the parameters
such as no. of bins (nb), lower, upper (range over which to minimize) to
no
Dear all and Mikis
I have the opportunity to compare fits with the 'classical' glm and
gamlss and no smoother of any kind just the same model formula (both
with the binomial family). I get exactly the same coefficients but
very different residuals, gamlss giving residuals which are
Utkarsh Singhal [EMAIL PROTECTED] wrote in message
news:[EMAIL PROTECTED]
Hi R,
I have an excel file in which the third column is date and others are
character and numeric.
If I use this to read the file in R: x = read.xls(D:\\file.xls)
The problem is that my date column is read in
I have a problem, would really appreciate if anyone could help me. I am
trying to use bootstrap to resample a dataframe n then use that sample of
dataframe to calcuate parameter estimates using betabinomial model. I am
kind of lost coz the bootstrap function requires us to give
For quasi-Poisson regression, I have tried to compare the output of the
quasipois function in the aod package to the output of the glm function
with family=quasi(link=log, variance=mu^2) in the stats package.
The standard errors are about 32.5 times larger in the former than in
the latter. If
Try this, but I don't understand what you are doing. I am at a loss.
y-c(1,0,2)
n-rep(2,length(y))
z1-rep(1,length(y))
z2-c(1,0,1)
mydata-as.data.frame(cbind(n,y,z1,z2))
--
Let's not spend our time and resources thinking about things that are
so little or so large that all they really do
why can't you know the number of columns if it is an excel file?
On Tue, Mar 25, 2008 at 2:36 PM, Earl F. Glynn
[EMAIL PROTECTED] wrote:
Utkarsh Singhal [EMAIL PROTECTED] wrote in message
news:[EMAIL PROTECTED]
Hi R,
I have an excel file in which the third column is date and others are
You need to think through the application of your model. Is it more important
to get more cases classified correctly, or to avoid bigger errors versus a
probability prediction? You should optimize your choice of a loss function
so that it is appropriate to the way in which the model will be used.
Does Hadley's response to the following post still hold for the most recent
version of ggplot2?
http://tolstoy.newcastle.edu.au/R/e2/help/07/07/21347.html#21379qlink2
I'm trying to accomplish in ggplot2 what the relation component of
scales does in lattice, e.g.,
stripplot(yield ~ variety |
On Tue, Mar 25, 2008 at 3:06 PM, Law, Jason
[EMAIL PROTECTED] wrote:
Does Hadley's response to the following post still hold for the most recent
version of ggplot2?
http://tolstoy.newcastle.edu.au/R/e2/help/07/07/21347.html#21379qlink2
Yes, it does. It's still on my todo list, but I'm
Zembower, Kevin kzembowe at jhuccp.org writes:
\usepackage{C:/PROGRA~1/R/R-26~1.2/share/texmf/Sweave}
Hi !
Just ran into the same problem. Reason is the line above that is inserted into
the .Rnw file by Sweave. Cause is the blank in the dir name. I solved the
problem by moving the R
Hi
Today i tried to install
package R-base-2.6.1-3.1.i586.rpm
at suse 10.3.
When i run the program and I do:
linux:~ # R
.
.
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.
demo(graphics)
demo(graphics)
Type
Hi the list.
I am writing a tutorial for my student using LaTeX and sweave. I include
some example that work (obiously) but I would also like to include some
example that do NOT work (for pedagogie)...
Is it possible ?
At this point, I find that :
- if there is a error in the code, Sweave
Even if you don't know the number of columns, you probably know the name
of the date variables. You can read the Excel file as is and later
convert dates in R:
Reading the date from Excel gives the daynumber. There is the difference
between day zero for R (1. 1. 1970) and Excel (31. 12. 1899) and
pMikis' answer below... As guessed by Pr. Ripley, I was comparing
apples and oranges indeed, and would have been better inspired looking
at methods !/ppi Patrick
br /
br /a href=mailto:[EMAIL PROTECTED]
class=moz-txt-link-abbreviated[EMAIL PROTECTED]/a wrote:
br /blockquote type=citeDear all
Hi Guys,
I am trying to show a plot of means with two groups. i.e.
plotMeans(s$CON, s$GENERAL.STARTING.DEPT, s$Cat, error.bars=se)
and I get the error in Rcmdr:
ERROR: need finite 'ylim' values
Pasting the code in R gets the following error:
x) : no non-missing arguments to min; returning
A new package 'randomLCA' is available on CRAN.
Its main purpose is to fit latent class models with random effects,
such as those used in diagnostic testing. This methodology can also
be applied in other areas. It also fits standard latent class and will
plot.
Ken
Hi all,
I've released a new package to CRAN called 'yaml'. It allows R to
parse YAML documents (http://yaml.org) into R objects, and there are
a few methods to convert R objects into YAML. The package uses the
Syck YAML parser (http://whytheluckystiff.net/syck/), the same parser
that Ruby
Thanks everyone for the input. It turns out that either read.table or
write.table was having trouble with single quotes in the data
(read.table has an option for specifying quote characters or ignoring
them, but as far as I could tell, the data frame was still sorting
correctly and the
Please read this first:
http://www.perceptualedge.com/articles/visual_business_intelligence/dual-scaled_axes.pdf
It's a reasoned discussion of why it's a bad idea and proposes some
alternative methods.
Another good article is:
K. W. Haemer. Double scales are dangerous. The American Statistician,
I have 2 tables,T1 and T2. There are 4 columns of T1 and 2 columns of T2
T2
1. 990334. G21
2. 844047. G25
3. 281739. G37
4 280903. G002615
5 418546. G004657
The data in T1
V1V2V3 V4
1 T00015 AFP1 human G004657
2 T00035AP-2alphaA
Try ?merge
Ted.
On 25-Mar-08 22:32:29, Kris Ghosh wrote:
I have 2 tables,T1 and T2. There are 4 columns of T1 and 2 columns of
T2
T2
1. 990334. G21
2. 844047. G25
3. 281739. G37
4 280903. G002615
5 418546. G004657
The data in T1
V1V2V3
Hi R-helpers,
I have a dataframe (called data) with 100 columns, the columns of which are
named with integers ranging from 1900 to 1999.
I wish to extract those columns which names are =1950 and =1970.
I tried:
data2-subset(data,select=(names(data)=1950 names(data)=1970))
but that doesn't
[EMAIL PROTECTED] wrote:
Hi R-helpers,
I have a dataframe (called data) with 100 columns, the columns of which are
named with integers ranging from 1900 to 1999.
I wish to extract those columns which names are =1950 and =1970.
I tried:
data2-subset(data,select=(names(data)=1950
That type of error often means a data problem, for example
dividing by standard deviation zero, or something similar.
Please look at your data values. Do you have enough items at
each level of x that standard deviations can be calculated?
__
[EMAIL PROTECTED] wrote:
Hi R-helpers,
I have a dataframe (called data) with 100 columns, the columns of which are
named with integers ranging from 1900 to 1999.
I wish to extract those columns which names are =1950 and =1970.
I tried:
data2-subset(data,select=(names(data)=1950
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA256
Sir:
Thank you for your assistance. I am most interested in your suggestion
| You can use list(...) in foo3, and manually split the args to those
| that
| belong in foo1 and those that belong in foo2, and then construct calls
| from them. (This
I am seeking to hire a hardware/software consultant to
help me build a stem with a 64 bit OS, R, and a
SQL-compatible DBMS. Please contact me if you know of
anyone qualified.
Details:
I am in need of a robust system that will run R, and a
SQL compatible database. I will be working with large
On Tue, 25 Mar 2008, [EMAIL PROTECTED] wrote:
Dear all and Mikis
I have the opportunity to compare fits with the 'classical' glm and
gamlss and no smoother of any kind just the same model formula (both
with the binomial family). I get exactly the same coefficients but
very different
[EMAIL PROTECTED] wrote in
news:[EMAIL PROTECTED]:
I would like to compare two data sets saved as text files (example
below) to determine if both sets are identical(or if dat2 is missing
information that is included in dat1) and if they are not identical
list what information is different
Dear Max,
As Rich Heiberger has suggested, it's difficult to diagnose the error
without the data, and it's especially curious that this previously worked (I
assume from your message with the same data) and doesn't now.
Regards,
John
John Fox, Professor
Hi,
I have a similar query (how to compare 2 datasets), but my dataset is a bit
different.
I want to compare each data in dataset 1 to data in dataset 2 and get the
data which is common to both datasets.
For example;
I have a a file (named mysample).
V1 V2
YBL064C YBR067C
YBL064C YBR204C
Here is one way to find the common rows. You can then use the 'keys'
gotten back to reconstruct a new data frame:
f1 - read.table(textConnection(V1 V2
+ YBL064C YBR067C
+ YBL064C YBR204C
+ YBL064C YDR368W
+ YBL064C YJL067W
+ YBL064C YPR160W
+ YBR053C YGL089C
+ YBR053C YHR113W
+ YBR053C
Hi:
I have been able to finally crunch my data by
importing it by week(thank you all for your help),but
here we go again..
Now I'am trying to do it for the the whole year.
Since the
dataset is huge I'm only making a 3 weeks dataframe.
- I want to get the mean of pd by week
- I want to count
Hi,
Thanks for the feedback. I have tried it on the small size sample and ref
and it works. Now I want to use a larger dataset for myref (the reference
file) . The reference file contains 112189 rows. Can I use the same approach
that works for the small example? Or are there other alternatives
On Fri, 2008-03-21 at 22:37 -0700, Sherri Rose wrote:
Dear R-Help List,
I'm trying to simulate data from a conditional distribution, and
haven't been able to modify my existing code to do so. I searched
the archives, but didn't find any previous post that matched my
question.
Sorry but I was interested in reading as date format from the excel
itself. Is there any way of doing this?
Regards
Utkarsh
-Original Message-
From: Gabor Grothendieck [mailto:[EMAIL PROTECTED]
Sent: Tuesday, March 25, 2008 8:43 PM
To: Utkarsh Singhal
Cc: [EMAIL
Hello together,
Is there a tool to test the statistical differences between parameter estimates
of a nlsList fit?
I fitted degradation data using the nlsList method and want to find out whether
derived rate constants are significantly different from each other at the
grouping factors soil
I wonder how long it will take until metereologists will see the light.
http://www.zoolex.org/walter.html
__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
Le mar. 25 mars à 10:19, Duncan Murdoch a écrit :
On 3/25/2008 10:06 AM, James W. MacDonald wrote:
Hi Kevin,
I usually install R in e.g., C:\R-2.6.2 to avoid such problems.
Explicitly including
\usepackage{Sweave}
may also help, so that Sweave doesn't try to give an explicit path.
Then
88 matches
Mail list logo