date:20080512

Re: [R] problem configuring package udunits

2008-05-12 Thread Prof Brian Ripley

Please ask the maintainer.  The 'udunits' R package seems to require its 
libraries in particular places.  Note that it is looking for 
/usr/lib/libudunits.a, that is a static library in a 32-bit library 
directory, whereas you most likely have /usr/lib64/libudunits.so or 
/usr/lib64/libudunits.a  (This only works for me because I have both 32- 
and 64-bit versions of udunits installed.)


However, that is if you have udunits installed, since 
/usr/include/udunits.h is missing.  You do need both the udunits and 
udunits-dev packages installed.


On Sun, 11 May 2008, Bradley Christoffersen wrote:



Hi R Users,

I am new to running R on a Linux platform (I'm used to Windows)


On Windows someone else struggled with this badly designed package for 
you.



- I'm running R
2.7.0 on Ubuntu 7.10 (Gutsy) as sudo (without Emacs).  My architecture is
Pentium D (x86_64).

I am having problems successsfully configuring the downloaded package 
'udunits'. When I execute



 install.packages(udunits, lib=/usr/local/lib/R/library)


I get the following result:


trying URL 'http://cran.cnr.Berkeley.edu/src/contrib/udunits_1.3.tar.gz'
Content type 'application/x-gzip' length 29985 bytes (29 Kb)
opened URL
==
downloaded 29 Kb

* Installing *source* package 'udunits' ...
creating cache ./config.cache
checking how to run the C preprocessor... gcc -std=gnu99 -E
checking for gcc... gcc -std=gnu99
checking whether the C compiler (gcc -std=gnu99 -g -O2 ) works... yes
checking whether the C compiler (gcc -std=gnu99 -g -O2 ) is a cross-compiler...
no
checking whether we are using GNU C... yes
checking whether gcc -std=gnu99 accepts -g... yes
checking for /usr/local/include/udunits.h... no
checking for /usr/include/udunits.h... no
checking for /home/saleskalab/include/udunits.h... no
checking for /usr/local/lib/libudunits.a... no
checking for /usr/lib/libudunits.a... no
checking for /lib/libudunits.a... no
checking for /home/saleskalab/lib/libudunits.a... no
***
***
NOTE: udunits package not found!  Either install it in a standard place (/usr or
/usr/local), or edit the file udunits_1.0/udunits/src/Makevars.in and put in the
location where the package is installed.
***
***
exit: 1162: Illegal number: -1
ERROR: configuration failed for package 'udunits'
** Removing '/usr/local/lib/R/library/udunits'

The downloaded packages are in
   /tmp/RtmpmhqM2D/downloaded_packages
Updating HTML index of packages in '.Library'
Warning message:
In install.packages(udunits, lib = /usr/local/lib/R/library) :
 installation of package 'udunits' had non-zero exit status


I have tried downloading the package manually and running ./configure, but with
the same result.  As the error message suggested, I looked at the file
udunits_1.0/udunits/src/Makevars.in, (contents below), but am not sure what to
modify.

##PKG_CPPFLAGS=-I/path/to/udunits/header
##PKG_LIBS=-L/path/to/udunits/lib -ludunits

[EMAIL PROTECTED]@ [EMAIL PROTECTED]@
[EMAIL PROTECTED]@


[[elided Hotmail spam]]
Brad
_
Get Free (PRODUCT) RED™  Emoticons, Winks and Display Pics.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



--
Brian D. Ripley,  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Cumulative lattice histograms

2008-05-12 Thread Ola Caster

Dear all,

It's fairly straightforward to plot cumulative histograms using the hist()
function. You do something like:

h - hist(rnorm(100), plot=FALSE)
h$counts- cumsum(h$counts)
plot(h)

However, I have failed to find any example where this is done using the
lattice histogram() function. I realize I need to slightly alter the panel
function panel.histogram. Specifially I would like to add the following line
in red, just like I did above:

function (x, breaks, equal.widths = TRUE, type = density, nint =
round(log2(length(x)) +
1), alpha = plot.polygon$alpha, col = plot.polygon$col, border =
plot.polygon$border,
lty = plot.polygon$lty, lwd = plot.polygon$lwd, ...)
{
plot.polygon - trellis.par.get(plot.polygon)
xscale - current.panel.limits()$xlim
panel.lines(x = xscale[1] + diff(xscale) * c(0.05, 0.95),
y = c(0, 0), col = border, lty = lty, lwd = lwd, alpha = alpha)
if (length(x)  0) {
if (is.null(breaks)) {
breaks - if (is.factor(x))
seq_len(1 + nlevels(x)) - 0.5
else if (equal.widths)
do.breaks(range(x, finite = TRUE), nint)
else quantile(x, 0:nint/nint, na.rm = TRUE)
}
h - hist.constructor(x, breaks = breaks, ...)

h$counts- cumsum(h$counts)

y - if (type == count)
h$counts
else if (type == percent)
100 * h$counts/length(x)
else h$intensities
breaks - h$breaks
nb - length(breaks)
if (length(y) != nb - 1)
warning(problem with 'hist' computations)
if (nb  1) {
panel.rect(x = breaks[-nb], y = 0, height = y, width =
diff(breaks),
col = col, alpha = alpha, border = border, lty = lty,
lwd = lwd, just = c(left, bottom))
}
}
}
environment: namespace:lattice


My problem is I'm too unexperienced in handling these panel functions to
achieve this. Simply copying the panel function, appending the line, and
giving it another name obviously doesn't work (it won't find the function
hist.constructor). I would very much appreciate help on how this could be
done, or some other way to draw cumulative lattice histograms.

Thanks in advance,
Ola Caster

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Standard error of combination of parameter estimates

2008-05-12 Thread Mark Donoghoe

Hi,

 

Is there a simple command for computing the standard error of a
combination of parameter estimates in a GLM?

 

For example:

 

riskdata$age1 - riskdata$age

riskdata$age2 - ifelse(riskdata$age67,0,riskdata$age-67)

 

model - glm(death~age1+age2+ldl,
data=riskdata,family=binomial(link=logit))

 

And we want to find the standard error of the partial linear predictor
for the combination of the two age parameters, age1+age2, at some age X,
without needing to separately use the covariance matrix?

 

Thanks,

 

Mark Donoghoe


#
This e-mail message has been scanned for Viruses and Content and cleared 
by MailMarshal
#



IMPORTANT NOTICE: This e-mail and any attachment to it are intended only to be 
read or used by the named addressee. 
It is confidential and may contain legally privileged information. No 
confidentiality or privilege is waived or lost 
by any mistaken transmission to you. The CTC is not responsible for any 
unauthorised alterations to this e-mail or 
attachment to it. Views expressed in this message are those of the individual 
sender, and are not necessarily the 
views of the CTC. If you receive this e-mail in error, please immediately 
delete it and notify the sender. You must 
not disclose, copy or use any part of this e-mail if you are not the intended 
recipient.

#

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 11 May 2008, at 22:45, Andrew Robinson wrote:


lme(y ~ selection * males, random = ~1|replica/selection/males,  
mydata)


forgive me, but I seem to see nesting in the random statement.   
That is

what happens when we separate factors with a '/'; they are nested.  We
would expect that statement to not provide the correct df for the
bog-standard fully crossed design.


Please read page 23/24 of the Pinheiro and Bates book, Mixed-Effects  
Models in S and S-PLUS. It might prove enlightening.


F

--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Cumulative lattice histograms

2008-05-12 Thread Richard . Cotton

 It's fairly straightforward to plot cumulative histograms using the 
hist()
 function. You do something like:
 
 h - hist(rnorm(100), plot=FALSE)
 h$counts- cumsum(h$counts)
 plot(h)
 
 However, I have failed to find any example where this is done using the
 lattice histogram() function. I realize I need to slightly alter the 
panel
 function panel.histogram. Specifially I would like to add the following 
line
 in red, just like I did above:
 
 function (x, breaks, equal.widths = TRUE, type = density, nint =
 round(log2(length(x)) +
 1), alpha = plot.polygon$alpha, col = plot.polygon$col, border =
 plot.polygon$border,
 lty = plot.polygon$lty, lwd = plot.polygon$lwd, ...)
 {
 plot.polygon - trellis.par.get(plot.polygon)
 xscale - current.panel.limits()$xlim
 panel.lines(x = xscale[1] + diff(xscale) * c(0.05, 0.95),
 y = c(0, 0), col = border, lty = lty, lwd = lwd, alpha = alpha)
 if (length(x)  0) {
 if (is.null(breaks)) {
 breaks - if (is.factor(x))
 seq_len(1 + nlevels(x)) - 0.5
 else if (equal.widths)
 do.breaks(range(x, finite = TRUE), nint)
 else quantile(x, 0:nint/nint, na.rm = TRUE)
 }
 h - hist.constructor(x, breaks = breaks, ...)
 
 h$counts- cumsum(h$counts)
 
 y - if (type == count)
 h$counts
 else if (type == percent)
 100 * h$counts/length(x)
 else h$intensities
 breaks - h$breaks
 nb - length(breaks)
 if (length(y) != nb - 1)
 warning(problem with 'hist' computations)
 if (nb  1) {
 panel.rect(x = breaks[-nb], y = 0, height = y, width =
 diff(breaks),
 col = col, alpha = alpha, border = border, lty = lty,
 lwd = lwd, just = c(left, bottom))
 }
 }
 }

You are nearly there!  You just need to:

1. Specify the package of hist.constructor so R's search mechanism finds 
it.
h - lattice:::hist.constructor(x, breaks = breaks, ...)

2. Give the new function a name.
panel.cumul.histogram - function (x, breaks, etc.

3. Specify the function to use for the panel in your call to histogram.
histogram(some params, panel=panel.cumul.histogram)

4. You'll also need to play about with the y axis limits.
histogram(some params, ylim=c(0, something))

Here's an example with the singer data:

panel.cumul.histogram - function (x, breaks, equal.widths = TRUE, type = 
density, nint =
round(log2(length(x)) +
1), alpha = plot.polygon$alpha, col = plot.polygon$col, border =
plot.polygon$border,
lty = plot.polygon$lty, lwd = plot.polygon$lwd, ...)
{
plot.polygon - trellis.par.get(plot.polygon)
xscale - current.panel.limits()$xlim
panel.lines(x = xscale[1] + diff(xscale) * c(0.05, 0.95),
y = c(0, 0), col = border, lty = lty, lwd = lwd, alpha = alpha)
if (length(x)  0) {
if (is.null(breaks)) {
breaks - if (is.factor(x))
seq_len(1 + nlevels(x)) - 0.5
else if (equal.widths)
do.breaks(range(x, finite = TRUE), nint)
else quantile(x, 0:nint/nint, na.rm = TRUE)
}
h - lattice:::hist.constructor(x, breaks = breaks, ...)

h$counts- cumsum(h$counts)

y - if (type == count)
h$counts
else if (type == percent)
100 * h$counts/length(x)
else h$intensities
breaks - h$breaks
nb - length(breaks)
if (length(y) != nb - 1)
warning(problem with 'hist' computations)
if (nb  1) {
panel.rect(x = breaks[-nb], y = 0, height = y, width =
diff(breaks),
col = col, alpha = alpha, border = border, lty = lty,
lwd = lwd, just = c(left, bottom))
}
}
}

histogram( ~ height | voice.part, 
data = singer, 
nint = 17,
endpoints = c(59.5, 76.5), 
layout = c(2,4), 
aspect = 1,
xlab = Height (inches),
panel=panel.cumul.histogram, 
type=count, 
ylim=c(0,max(summary(singer$voice.part)+1)))

Having said all this, I'm not entirely convinced that a cumulative 
histogram is as useful as your standard issue empirical cumulative density 
function.

Regards,
Richie.

Mathematical Sciences Unit
HSL




ATTENTION:

This message contains privileged and confidential inform...{{dropped:20}}

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Ken Beath


On 12/05/2008, at 4:52 AM, Federico Calboli wrote:


On 10 May 2008, at 07:36, Kingsford Jones wrote:

Federico,

I think you'll be more likely to receive the type of response you're
looking for if you formulate your question more clearly.  The
inclusion of commented, minimal, self-contained, reproducible code
(as is requested at the bottom of every email sent by r-help) is an
effective way to clarify the issues.  Also, when asking a question
about fitting a model it's helpful to describe the specific research
questions you want the model to answer.


snip

I apprecciate that my description of the *full* model is not 100%  
clear, but my main beef was another.


The main point of my question is, having a 3 way anova (or ancova,  
if you prefer), with *no* nesting, 2 fixed effects and 1 random  
effect, why is it so boneheaded difficult to specify a bog standard  
fully crossed model? I'm not talking about some rarified esoteric  
model here, we're talking about stuff tought in a first year Biology  
Stats course here[1].


Now, to avoid any chances of being misunderstood in my use of the  
words 'fully crossed model', what I mean is a simple


y ~ effect1 * effect2 * effect3

with effect3 being random (all all the jazz that comes from this  
fact). I fully apprecciate that the only reasonable F-tests would be  
for effect1, effect2 and effect1:effect2, but there is no way I can  
use lme to specify such simple thing without getting the *wrong*  
denDF. I need light on this topic and I'd say it's a general enough  
question not to need much more handholding than this.




There is only one random effect, so where does the crossing come  
from ? The fixed effects vary across blocks, but they are fixed so are  
just covariates. For this type of data the usual model in lme4 is  
y~fixed1+fixed2+1|group and for lme split into fixed and random parts.



Having said that, I did look at the mixed-effects mailing list  
before posting here, and it looks like it was *not* the right place  
to post anyway:


'This mailing list is primarily for useRs and programmeRs interested  
in *development* and beta-testing of the lme4 package.'


although the R-Me is now CC'd in this.

I fully apprecciate that R is developed for love, not money, and if  
I knew how to write an user friendly frontend for nlme and lme4 (and  
I knew how to actually get the model I want) I'd be pretty happy to  
do so and submit it as a library. In any case, I feel my complaint  
is pefectly valid, because specifying such basic model should  
ideally not such a chore, and I think the powers that be might  
actually find some use from user feedback.




The problems seems to be that you want lme to work in the same way as  
an ANOVA table and it doesn't. The secret with lme and lme4 is to  
think about the structure of the data and describe with an equation.  
Then each term in the equation corresponds to part of the model  
definition in R.



Once I have sorted how to specify such trivial model I'll face the  
horror of the nesting, in any case I attach a toy dataset I created  
especially to test how to specify the correct model (silly me).




I'm a bit lost with your data file, it has 4 covariates, which is more  
than enough for 2 fixed effects, assuming block is the grouping and y  
the outcome.


Ken


Best,

Federico Calboli

[1] So much bog standard that the Zar, IV ed, gives a nice table of  
how to compute the F-tests correctly, taking into account that one  
of the 3  effects is randon (I'll send the exact page and table  
number tomorrow, I don't have the book at home).


testdat.txt
--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com



___
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli



On 12 May 2008, at 01:05, Andrew Robinson wrote:


On Mon, May 12, 2008 at 10:34:40AM +1200, Rolf Turner wrote:


On 12/05/2008, at 9:45 AM, Andrew Robinson wrote:


On Sun, May 11, 2008 at 07:52:50PM +0100, Federico Calboli wrote:


The main point of my question is, having a 3 way anova (or  
ancova, if
you prefer), with *no* nesting, 2 fixed effects and 1 random  
effect,

why is it so boneheaded difficult to specify a bog standard fully
crossed model? I'm not talking about some rarified esoteric model
here, we're talking about stuff tought in a first year Biology  
Stats

course here[1].


That may be so, but I've never needed to use one.


So what?  This is still a standard, common, garden-variety
model that you will encounter in exercises in many (if not
all!) textbooks on experimental design and anova.


To reply in similar vein, so what?  Why should R-core or the R
community feel it necessary to reproduce every textbook example?  How
many times have *you* used such a model in real statistical work,
Rolf?


There is a very important reason why R (or any other stats package)  
should *easily* face the challenge of bog standard models: because it  
is a *tool* for an end (i.e. the analysis of data to figure out what  
the heck they tell us) rather than a end in itself.


Bog standard models are *likely* to be used over and over again  
because they are *bog standard*, and they became such by being used  
*lots*.


If someone with a relatively easy model cannot use R for his job s/he  
will use something else, and the R community will *not* increase in  
numbers. Since R is a *community driven project*, you do the math on  
what that would mean in the long run.


Regards,

Federico

--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] mgcv::gam shrinkage of smooths

2008-05-12 Thread Simon Wood

On Tuesday 06 May 2008 23:34, David Katz wrote:
 In Dr. Wood's book on GAM, he suggests in section 4.1.6 that it might be
 useful to shrink a single smooth by adding S=S+epsilon*I to the penalty
 matrix S. The context was the need to be able to shrink the term to zero if
 appropriate. I'd like to do this in order to shrink the coefficients
 towards zero (irrespective of the penalty for wiggliness) - but not
 necessarily all the way to zero. IE, my informal prior is to keep the
 contribution of a specific term small.

 1) Is adding eps*I to the penalty matrix an effective way to achieve this
 goal?

 2) How do I accomplish this in practice using mgcv::gam?

Are you saying that you would like to specify the amount of shrinkage 
directly, with the degree of shrinkage set `by hand' or do you want mgcv::gam 
to estimate the degree of shrinkage? 

If you want to supply a fixed ridge penalty then use argument `H' of 
mgcv::gam, which is designed for just this sort of purpose. 

If you want what's suggested in section 4.1.6 then try s(...,bs=ts) or 
s(...,bs=cs). 

If you want to add an extra ridge penalty to a smooth and have mgcv::gam 
estimate its smoothing parameter, then you would need to add write a smoother 
class to do this (by modifying one of the existing ones). See ?p.spline, or 
exercise 8, chapter 5 of Wood (2006) GAMs:An Intro with R.

best,
Simon

-- 
 Simon Wood, Mathematical Sciences, University of Bath, Bath, BA2 7AY UK
 +44 1225 386603  www.maths.bath.ac.uk/~sw283

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Standard error of combination of parameter estimates

2008-05-12 Thread Søren Højsgaard

You can use the esticon function in the doBy package. 
Regards
Søren



Fra: [EMAIL PROTECTED] på vegne af Mark Donoghoe
Sendt: ma 12-05-2008 09:11
Til: r-help@r-project.org
Emne: [R] Standard error of combination of parameter estimates



Hi,



Is there a simple command for computing the standard error of a
combination of parameter estimates in a GLM?



For example:



riskdata$age1 - riskdata$age

riskdata$age2 - ifelse(riskdata$age67,0,riskdata$age-67)



model - glm(death~age1+age2+ldl,
data=riskdata,family=binomial(link=logit))



And we want to find the standard error of the partial linear predictor
for the combination of the two age parameters, age1+age2, at some age X,
without needing to separately use the covariance matrix?



Thanks,



Mark Donoghoe


#
This e-mail message has been scanned for Viruses and Content and cleared
by MailMarshal
#



IMPORTANT NOTICE: This e-mail and any attachment to it are intended only to be 
read or used by the named addressee.
It is confidential and may contain legally privileged information. No 
confidentiality or privilege is waived or lost
by any mistaken transmission to you. The CTC is not responsible for any 
unauthorised alterations to this e-mail or
attachment to it. Views expressed in this message are those of the individual 
sender, and are not necessarily the
views of the CTC. If you receive this e-mail in error, please immediately 
delete it and notify the sender. You must
not disclose, copy or use any part of this e-mail if you are not the intended 
recipient.

#

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 12 May 2008, at 09:29, Dieter Menne wrote:


Federico:

First, mixed models are different from standard 101 Anova, and  
quite a lot

of the nesting stuff I used to ponder about 30 year ago when I started
teaching this is no longer relevant and works implicitely when you  
code the

parameters correctly.


with effect3 being random (all all the jazz that comes from this  
fact). I
fully apprecciate that the only reasonable F-tests would be for  
effect1,
effect2 and effect1:effect2, but there is no way I can use lme to  
specify

such simple thing without getting the *wrong* denDF. 

Good to know that you are sure what is right; probably == SAS.  
Since most

people active in the lme-business have read

http://wiki.r-project.org/rwiki/doku.php?id=guides:lmer-tests

http://finzi.psych.upenn.edu/R/Rhelp02a/archive/76742.html


carefully, you might be rather lonely.


I will. While I do, feel free to have a look at Appendix A.3 (page  
App6, at the end of the book) of the Zar 'Biostatistical Analysis',  
IV ed, second table from the top. That's where I get the feeling for  
what's right or wrong. I surely cannot get it from SAS because I  
never had it. I never had the budget for it, so much so I had to lear  
how to use R from the start because it was free and that was the  
budget of my department had for stats software


All in all, if you feel statistical analysis has moved forth from  
such humble beginnings (the book I mean, not SAS), and you can  
convince of that every ref for every paper you submit, please do tell  
me how you do it, it would be more valuable than knowing how to fit  
my model.


Cheers,

Federico





--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] rpart with circular data?

2008-05-12 Thread Rainer M Krug

Hi

I am planning to use a CART analysis with rpart() to analyse the
impact of, among others, slope, altitude and aspect on mortality
rates.
My question is:
Is there a p[roblem with using aspect as a predictor as it is circular?
And if it is a problem (which I suspect), is there a transformation I
could use to transform aspect?

Thanks

Rainer

-- 
Rainer M. Krug, PhD (Conservation Ecology, SUN), MSc (Conservation
Biology, UCT), Dipl. Phys. (Germany)

Plant Conservation Unit
Department of Botany
University of Cape Town
Rondebosch 7701
South Africa

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Cumulative lattice histograms

2008-05-12 Thread Dieter Menne

Ola Caster ola.caster at gmail.com writes:

 It's fairly straightforward to plot cumulative histograms using the hist()
 function. You do something like:
 
 h - hist(rnorm(100), plot=FALSE)
 h$counts- cumsum(h$counts)
 plot(h)
 
 However, I have failed to find any example where this is done using the
 lattice histogram() function. 


This is not a full solution to your problem with the red line, but at least
comes close (hope so) to your original hist with lattic

library(lattice)
h = sort(rnorm(100),decreasing=TRUE)
df = data.frame(h=h,cum=cumsum(h))
histogram(h~cum,data=df)


Dieter

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 11 May 2008, at 23:34, Rolf Turner wrote:


It doesn't seem to me to be a complaint as such.  It is a
request for insight.  I too would like some insight as to
what on earth is going on.  And why do you say Federico
shows no evidence of having searched the archives?  One can
search till one is blue in the face and come away no wiser
on this issue.

cheers,

Rolf Turner


Cheers for the support Rolf. I have searched the archives, and have  
the Pinheiro and Bates book in front of my nose (plus MASS4 and many  
others).


The bottom line here is, I'm pretty cool with RTFM and all that, my  
problem is that I do not have a clear FM to read about my issue, and  
hence I have to pester (because that how people seem to feel) the  
list. I apologise for asking an inconvenient question.


Fede

--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 12 May 2008, at 10:05, Ken Beath wrote:
There is only one random effect, so where does the crossing come  
from ? The fixed effects vary across blocks, but they are fixed so  
are just covariates. For this type of data the usual model in lme4  
is y~fixed1+fixed2+1|group and for lme split into fixed and random  
parts.


First off, whoa, an helpful reply! thanks for that, I hope I won't  
sound sarcastic or aggressive because I do not mean to be either.


Regarding your comment, the experiment was replicated three times, in  
3 different months. I would argue that for the fixed effects to be  
meaningful, they must have an effect over an above the effect:month  
interaction (because each fixed effect, and their interaction, might  
vary between each replicate). I would then argue I need to calculate


1) fixed.effect1:random.effect
2) fixed.effect2:random.effect
3) fixed.effect1:fixed.effect2:random.effect

to test if fixed.effect1 is meaningful (and use 1) as the error); if  
fixed.effect2 has is meaningful (and use 2) as the error);  
fixed.effect1:fixed.effect2 is meaningful (and use 3) as the error).


I'm happy to be correct if I am wrong here.

The problems seems to be that you want lme to work in the same way  
as an ANOVA table and it doesn't. The secret with lme and lme4 is  
to think about the structure of the data and describe with an  
equation. Then each term in the equation corresponds to part of  
the model definition in R.


I'll try to do that.



Once I have sorted how to specify such trivial model I'll face the  
horror of the nesting, in any case I attach a toy dataset I  
created especially to test how to specify the correct model (silly  
me).




I'm a bit lost with your data file, it has 4 covariates, which is  
more than enough for 2 fixed effects, assuming block is the  
grouping and y the outcome.


In the data file, 'selection' and 'males' are fixed effects, and  
'month' is the effect I am using for the model we are discussing  
here. The y was generatde with runif() just to have something, I'm  
not expecting any intersting result, just to understand how to fit  
the right model.


In the dataset 'line' is nested within 'selection' and 'block' is  
nested within 'month'. That's the nesting I will have to take into  
account once I get the more straightforward (sic!) model we're  
discussing right.


Best,

Federico


--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] lme nesting/interaction advice

2008-05-12 Thread Andrew Robinson

On Mon, May 12, 2008 at 10:50:03AM +0100, Federico Calboli wrote:
 
 On 12 May 2008, at 01:05, Andrew Robinson wrote:
 
 On Mon, May 12, 2008 at 10:34:40AM +1200, Rolf Turner wrote:
 
 On 12/05/2008, at 9:45 AM, Andrew Robinson wrote:
 
 On Sun, May 11, 2008 at 07:52:50PM +0100, Federico Calboli wrote:
 
 The main point of my question is, having a 3 way anova (or  
 ancova, if
 you prefer), with *no* nesting, 2 fixed effects and 1 random  
 effect,
 why is it so boneheaded difficult to specify a bog standard fully
 crossed model? I'm not talking about some rarified esoteric model
 here, we're talking about stuff tought in a first year Biology  
 Stats
 course here[1].
 
 That may be so, but I've never needed to use one.
 
 So what?  This is still a standard, common, garden-variety
 model that you will encounter in exercises in many (if not
 all!) textbooks on experimental design and anova.
 
 To reply in similar vein, so what?  Why should R-core or the R
 community feel it necessary to reproduce every textbook example?  How
 many times have *you* used such a model in real statistical work,
 Rolf?
 
 There is a very important reason why R (or any other stats package)  
 should *easily* face the challenge of bog standard models: because it  
 is a *tool* for an end (i.e. the analysis of data to figure out what  
 the heck they tell us) rather than a end in itself.

But a tool that mostly (entirely?) appears in textbooks.  
 
 Bog standard models are *likely* to be used over and over again  
 because they are *bog standard*, and they became such by being used  
 *lots*.

Well.  I have documentation relevant to nlme that goes back about 10
years.  I don't know when it was first added to S-plus, but I assume
that it was about then.  Now, do you think that if the thing that you
want to do was really bog standard, that noone would have raised a
fuss or solved it within 10 years?
 
 If someone with a relatively easy model cannot use R for his job s/he  
 will use something else, and the R community will *not* increase in  
 numbers. Since R is a *community driven project*, you do the math on  
 what that would mean in the long run.

Fewer pestering questions?  ;)

Andrew

-- 
Andrew Robinson  
Department of Mathematics and StatisticsTel: +61-3-8344-6410
University of Melbourne, VIC 3010 Australia Fax: +61-3-8344-4599
http://www.ms.unimelb.edu.au/~andrewpr
http://blogs.mbs.edu/fishing-in-the-bay/

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Quadratic Constraints

2008-05-12 Thread Shubha Vishwanath Karanth

Hi R,

 

A quick question How can I optimize the objective function
constrained to quadratic constraints? Which function of R is useful for
quadratic constraints?

 

Many Thanks,

Shubha

 

This e-mail may contain confidential and/or privileged i...{{dropped:13}}

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 12 May 2008, at 11:16, Andrew Robinson wrote:

Well.  I have documentation relevant to nlme that goes back about 10
years.  I don't know when it was first added to S-plus, but I assume
that it was about then.  Now, do you think that if the thing that you
want to do was really bog standard, that noone would have raised a
fuss or solved it within 10 years?


I'm pretty unpleasant, more so in person, so I'll tell you this. If  
people raised the issue and got the answer I got, I would not be  
surprised if they'd migrated to 'any other stats software' in droves.  
I have no doubt that, given the cryptic and sparse nature of the  
documentation of the issue, most people migrated well before asking -- 
on the grounds most people have a job to do, papers to publish,  
grants to write, kids to pick up from school and pretty little time  
for RTFM and all that sanctimonious attitude.


Once people stop nagging about 'whatever', the reason is because they  
finally got the message things ain't gonna improve, so cut your  
losses and look elsewhere.


Being unpleasant, thick skinned and cheap I will keep nagging and use  
R (the fact I do like it very much might be a factor). But given the  
selection users go through, it will be Vogon time sooner or later ;).


/F

--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Insert a recorde into a table using SQL

2008-05-12 Thread Gabor Grothendieck

Create an encoding function which replaces single quotes with
two single quotes:

# first string is a single character consisting of single quote
# second string is two characters consisting of two single quotes
enc - function(x) gsub(', '', x)
dbGetQuery(con,sprintf(insert into dd (txt) values ('%s'), enc(dd[2,1])))


On Mon, May 12, 2008 at 1:54 AM, ronggui [EMAIL PROTECTED] wrote:
 Dear list,

 I want to insert a recorde into a SQLite table by dbGetQuery(), but
 there is a problem when the value contains quotation mark.

  dd-data.frame(txt=c(having both ' and \ in character.,OK))
  library(RSQLite)
 Loading required package: DBI
  con-dbConnect(dbDriver(SQLite),:memory:)
  dbWriteTable(con,dd,dd,over=T)
 [1] TRUE
  dbGetQuery(con,sprintf(insert into dd (txt) values (\%s\),dd[2,1]))
 NULL
  dbGetQuery(con,sprintf(insert into dd (txt) values (\%s\),dd[1,1]))
 Error in sqliteExecStatement(con, statement, bind.data) :
  RS-DBI driver: (error in statement: unrecognized token: ))

 How can I insert a (key, value) pair into a table by dbGetQuery?  Thanks.

 --
 HUANG Ronggui, Wincent
 Bachelor of Social Work, Fudan University, China
 Master of sociology, Fudan University, China
 Ph.D. Candidate, CityU of HK.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Quadratic Constraints

2008-05-12 Thread Shubha Vishwanath Karanth

Hello R,

By any chance, Rdonlp2 package of R defined in
http://arumat.net/Rdonlp2/tutorial.html#SECTION0002
serves the below purpose?

BR, Shubha
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
On Behalf Of Shubha Vishwanath Karanth
Sent: Monday, May 12, 2008 3:27 PM
To: [EMAIL PROTECTED]
Subject: [R] Quadratic Constraints

Hi R,

 

A quick question How can I optimize the objective function
constrained to quadratic constraints? Which function of R is useful for
quadratic constraints?

 

Many Thanks,

Shubha

 

This e-mail may contain confidential and/or privileged
i...{{dropped:13}}

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
This e-mail may contain confidential and/or privileged i...{{dropped:10}}

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Insert a recorde into a table using SQL

2008-05-12 Thread ronggui

It works. Thanks very much.

Best

On Mon, May 12, 2008 at 7:02 PM, Gabor Grothendieck
[EMAIL PROTECTED] wrote:
 Create an encoding function which replaces single quotes with
 two single quotes:

 # first string is a single character consisting of single quote
 # second string is two characters consisting of two single quotes
 enc - function(x) gsub(', '', x)
 dbGetQuery(con,sprintf(insert into dd (txt) values ('%s'), enc(dd[2,1])))


 On Mon, May 12, 2008 at 1:54 AM, ronggui [EMAIL PROTECTED] wrote:
 Dear list,

 I want to insert a recorde into a SQLite table by dbGetQuery(), but
 there is a problem when the value contains quotation mark.

  dd-data.frame(txt=c(having both ' and \ in character.,OK))
  library(RSQLite)
 Loading required package: DBI
  con-dbConnect(dbDriver(SQLite),:memory:)
  dbWriteTable(con,dd,dd,over=T)
 [1] TRUE
  dbGetQuery(con,sprintf(insert into dd (txt) values (\%s\),dd[2,1]))
 NULL
  dbGetQuery(con,sprintf(insert into dd (txt) values (\%s\),dd[1,1]))
 Error in sqliteExecStatement(con, statement, bind.data) :
  RS-DBI driver: (error in statement: unrecognized token: ))

 How can I insert a (key, value) pair into a table by dbGetQuery?  Thanks.

 --
 HUANG Ronggui, Wincent
 Bachelor of Social Work, Fudan University, China
 Master of sociology, Fudan University, China
 Ph.D. Candidate, CityU of HK.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.





-- 
HUANG Ronggui, Wincent
Bachelor of Social Work, Fudan University, China
Master of sociology, Fudan University, China
Ph.D. Candidate, CityU of HK,
http://www.cityu.edu.hk/sa/psa_web2006/students/rdegree/huangronggui.html

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Nick Isaac

I *think* the syntax for the model Federico wants is this:

lmer(y~selection*males+ (selection|month) + (males|month))

My lme syntax is a bit rusty, so I'm not confident how to recode with nested
random effects, as in PB p24.


Two quick points:
1. I think Federico has caused some confusion on account of the way he used
the term 'crossing'2. Whilst interesting reading, some of the content of
this thread does not conform to the list's rules and regs.

Best wishes, Nick

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Optimization problem, to minimize the length(rle(B)$lengths) for all the rows and columns

2008-05-12 Thread Ng Stanley

Hi,

how can I order the rows and columns of a matrix A to generate B, in order
to minimize the length(rle(B)$lengths) for all the rows and columns ?

 set.seed(5)
 a - matrix(rnorm(200), nrow=20)
 a[a=0] - 0
 a[a0] - 1
 a
  [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10]
 [1,]011010100 1
 [2,]110101101 0
 [3,]010011011 1
 [4,]111101011 1
 [5,]110010100 0
 [6,]001001100 0
 [7,]010010010 0
 [8,]010000100 0
 [9,]000000000 1
[10,]100001000 0
[11,]111010110 1
[12,]011001011 1
[13,]011111000 0
[14,]010111010 1
[15,]010110100 0
[16,]010011111 1
[17,]001100101 1
[18,]000101010 1
[19,]100101101 0
[20,]001000110 1

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] rpart with circular data?

2008-05-12 Thread Bálint Czúcz

Hi Rainer,

In a similar situation I used the two components of the normal vector
of the surface (northing  easting). I.e. for a horizontal plane
both are 0, for a vertical slope facing south northing=-1and
easting=0, etc. This descartian decomposition of the slope vector
avoids the problem of circularity present in the widespreadly used
polar (aspect, slope) decomposition, and thus seems to suit ecological
problems much better to me. However I have not looked into this much,
I am also very interested in the opinion of others.

Bálint


On Mon, May 12, 2008 at 11:35 AM, Rainer M Krug [EMAIL PROTECTED] wrote:
 Hi

  I am planning to use a CART analysis with rpart() to analyse the
  impact of, among others, slope, altitude and aspect on mortality
  rates.
  My question is:
  Is there a p[roblem with using aspect as a predictor as it is circular?
  And if it is a problem (which I suspect), is there a transformation I
  could use to transform aspect?

  Thanks

  Rainer

  --
  Rainer M. Krug, PhD (Conservation Ecology, SUN), MSc (Conservation
  Biology, UCT), Dipl. Phys. (Germany)

  Plant Conservation Unit
  Department of Botany
  University of Cape Town
  Rondebosch 7701
  South Africa

  __
  R-help@r-project.org mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.




-- 
Bálint Czúcz
Institute of Ecology and Botany of the Hungarian Academy of Sciences
H-2163 Vácrátót, Alkotmány u. 2-4. HUNGARY
Tel: +36 28 360122/157 +36 70 7034692

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Compact Indicator Matrices

2008-05-12 Thread Douglas Bates

On Sun, May 11, 2008 at 9:49 AM, amarkos [EMAIL PROTECTED] wrote:
 On May 11, 4:47 pm, Douglas Bates [EMAIL PROTECTED] wrote:

 Do you mean that you want to collapse similar rows into a single row
 and perhaps a count of the number of times that this row occurs?

 Let me rephrase the problem by providing an example.

 Input:

 A =
  [,1] [,2]
  [1,]11
  [2,]13
  [3,]21
  [4,]12
  [5,]21
  [6,]12
  [7,]11
  [8,]12
  [9,]13
 [10,]21

An important question here is do you start with two or more variables
like the columns of your matrix A?  If so, there is a more direct
method of getting the answers that you want.  The natural way to store
such variables in R is as factors.  I prefer to use letters instead of
numbers to represent the levels of a factor (that way I don't confuse
a factor with a numeric variable when I look at rows)  so I would
create a data frame with two factors instead of a matrix.

 V1 - factor(c(1,1,2,1,2,1,1,1,1,2), labels = LETTERS[1:2])
 V2 - factor(c(1,3,1,2,1,2,1,2,3,1), labels = letters[1:3])
 df - data.frame(f1 = V1, f2 = V2)
 df
   f1 f2
1   A  a
2   A  c
3   B  a
4   A  b
5   B  a
6   A  b
7   A  a
8   A  b
9   A  c
10  B  a

You could produce the indicator matrix and check for unique rows, etc.
- I will show that below - but all you need is the interaction of the
two factors

 df$f12 - with(df, f1:f2)[drop = TRUE]
 df
   f1 f2 f12
1   A  a A:a
2   A  c A:c
3   B  a B:a
4   A  b A:b
5   B  a B:a
6   A  b A:b
7   A  a A:a
8   A  b A:b
9   A  c A:c
10  B  a B:a
 str(df)
'data.frame':   10 obs. of  3 variables:
 $ f1 : Factor w/ 2 levels A,B: 1 1 2 1 2 1 1 1 1 2
 $ f2 : Factor w/ 3 levels a,b,c: 1 3 1 2 1 2 1 2 3 1
 $ f12: Factor w/ 4 levels A:a,A:b,A:c,..: 1 3 4 2 4 2 1 2 3 4
 table(df$f12)

A:a A:b A:c B:a
  2   3   2   3
 as.numeric(df$f12)
 [1] 1 3 4 2 4 2 1 2 3 4

Notice that this shows you that there are four distinct combinations
that occur 2, 3, 2 and 3 times respectively; the first combination
occurs in rows 1 and 7, it consists of the first level of f1 and the
first level of f2, etc.

If you really do want the indicator matrix you could generate it as

 (ind - cbind(model.matrix(~ 0 + f1, df), model.matrix(~ 0 + f2, df)))
   f1A f1B f2a f2b f2c
11   0   1   0   0
21   0   0   0   1
30   1   1   0   0
41   0   0   1   0
50   1   1   0   0
61   0   0   1   0
71   0   1   0   0
81   0   0   1   0
91   0   0   0   1
10   0   1   1   0   0
 unique(ind)
  f1A f1B f2a f2b f2c
1   1   0   1   0   0
2   1   0   0   0   1
3   0   1   1   0   0
4   1   0   0   1   0

but working with the factors is generally much simpler than working
with the indicators.

 # Indicator matrix
 A - data.frame(lapply(data.frame(obj), as.factor))

 nocases - dim(obj)[1]
 novars  - dim(obj)[2]

 # variable levels
 levels.n - sapply(obj, nlevels)
 n- cumsum(levels.n)

 # Indicator matrix calculations
 Z- matrix(0, nrow = nocases, ncol = n[length(n)])
 newdat   - lapply(obj, as.numeric)
 offset   - (c(0, n[-length(n)]))
 for (i in 1:novars)
  Z[1:nocases + (nocases * (offset[i] + newdat[[i]] - 1))] - 1

 ###

 Output:

 Z =

[,1] [,2] [,3] [,4] [,5]
  [1,]10100
  [2,]10001
  [3,]01100
  [4,]10010
  [5,]01100
  [6,]10010
  [7,]10100
  [8,]10010
  [9,]10001
 [10,]01100


 Z is an indicator matrix in the Multiple Correspondence Analysis
 framework.
 My problem is to collapse identical rows (e.g. 2 and 9) into a single
 row and
 store the row ids.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Doran, Harold

   But that avoids the question as to *why* it isn't very well
   set up for crossed random effects?  What's the problem?
   What are the issues?  The model is indeed bog-standard.
   It would seem not unreasonable to expect that it could be
   fitted in a straightforward manner, and it is irritating to
   find that it cannot be.  If SAS and Minitab can do it at
   the touch of a button, why can't R do it?

I haven't followed this thread carefully, so apologies if I'm too off
base. But, in response to Rolf's questions/issues. First, SAS cannot
handle models with crossed random effects (at least well at all). SAS is
horribly incapable of handling even the simplest of models (especially
generalized linear mixed models). I can cite numerous (recent) examples
of SAS coming to a complete halt (proc nlmixed) for an analyses we were
recently working on. R (and Ubuntu) was the only solution to our
problem.

Now, lme is not optimized for crossed random effects, but lmer is. That
is why lmer is supported and lme is not really supported much. lmer is
optimized for models with nested random effects and crossed random
effects. 

When working with models with nested random effects, and software
optimized for those problems (e.g., HLM, SAS, mlWin) the
variance/covariance matrix forms a special, and simple structure that
can be easily worked with. This is not the case for models with crossed
random effects.

Software packages designed for nested random effects can be tricked into
handling models with crossed random effects, but this kludge is slow and
really inefficient.

If you want complete transparency into the why and how, here is a
citation for your review.

Best
Harold

@article{Doran:Bates:Bliese:Dowling:2007:JSSOBK:v20i02,
  author =  Harold  Doran and Douglas  Bates and Paul  Bliese and
Maritza   Dowling,
  title =   Estimating the Multilevel Rasch Model: With the lme4
Package,
  journal = Journal of Statistical Software,
  volume =  20,
  number =  2,
  pages =   1--18,
  day = 22,
  month =   2,
  year =2007,
  CODEN =   JSSOBK,
  ISSN =1548-7660,
  bibdate = 2007-02-22,
  URL = http://www.jstatsoft.org/v20/i02;,
  accepted =2007-02-22,
  acknowledgement = ,
  keywords =,
  submitted =   2006-10-01,
}

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Collection of lm()s

2008-05-12 Thread Chip Barnaby


Hello,

I would like to create a subscriptable collection (presumably a list) 
of lm() models.


I have a data frame DX containing 6 groups of data.  The general idea 
is (NOT RUN) ...


for (i in 1:6)
{   DXS = subset( DX, whatever);
LMX[ i] = lm( formula, data = DXS);
}

Now access model results by subscript ... e.g. coefficients( LMX[ 
2]).  Or would it be [[ 2]]?


I have experimented with various schemes, attempting to 
pre-allocate a list etc. without success.


Also, I assume that lm() does not make a copy of input data?  That 
is, if I want predict( LMX[ i]), I would have to retain a copy the 
associated DXS subsets?


TIA,

Chip Barnaby



-
Chip Barnaby   [EMAIL PROTECTED]
Vice President of Research
Wrightsoft Corp.   781-862-8719 x118 voice
131 Hartwell Ave   781-861-2058 fax
Lexington, MA 02421 www.wrightsoft.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] combining bar and column graphs?

2008-05-12 Thread Michael Kubovy

Perhaps
?mosaic
or
?mosaicplot
_
Professor Michael Kubovy
University of Virginia
Department of Psychology
USPS: P.O.Box 400400Charlottesville, VA 22904-4400
Parcels:Room 102Gilmer Hall
 McCormick RoadCharlottesville, VA 22903
Office:B011+1-434-982-4729
Lab:B019+1-434-982-4751
Fax:+1-434-982-4766
WWW:http://www.people.virginia.edu/~mk9y/




On May 11, 2008, at 11:56 AM, Me wrote:

 Hi, I'm hoping to find out whether R, or an R add-on, can generate a  
 particular type of graph. And, more basically, whether such a type  
 of graph even makes sense.

 I'm looking for something resembling both a column chart and a bar  
 chart, where the basic visual unit is a solid rectangle of color  
 that can be extended either horizontally, vertically, or both. The  
 data that needs to be graphed consists of the relative contributions  
 of a number (6 or 8) of companies (entities, whatever) to a common  
 fund, over the course of a number of years (say, 1990-2008).  I'm  
 picturing years on the X-axis, and dollar amounts on the Y-axis  
 (say, $0-$100,000).  From a temporal perspective, every year will  
 have at least one contributor, starting with dollar zero, but some  
 years will have multiple contributors.  From a company  
 perspective, some companies will contribute, e.g., dollars $1,001- 
 $5,000 for several years running, visually forming a horizontal  
 block riding on top of whatever happens to be below.

 So as a simple example, between the years 2000 and 2001, Company A  
 might inhabit a solid block extending from dollar zero to dollar  
 1000, two years wide. In year 2000, Company B might contribute  
 dollars $1,001-$2,000, while right next door in year 2001, a  
 different Company C might contribute dollars $1,001-$10,000.

 Is it possible to have this sort of horizontal/vertical chart  
 generated automatically, or is this impossible? Do I need to  
 generate a year-on-year column graph and manually elide the  
 boundaries between companies' contributions in successive years,  
 thus forming the horizontal blocks I have in mind manually?  Is  
 there perhaps another software tool that would be good for this?

 Thanks very much - this is a long question...

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Mathematical annotation in lattice strip: Is it possible?

2008-05-12 Thread Andrewjohnclose


I have tried without success to find a way including the square root symbol
in lattice strips as part of my conditioning labels. I have tried
supplementing by creating a list of vectors using the var.name function
coupled with the expression function used in xlab/ylab.

xyplot(adjusted_Rand_index~cluster|distance_measure, main=Level of
agreement between partitions: Wards Method, ylab=Coefficient value
(adjusted rand index), xlab=number of clusters, type=l, data=randA1,
strip=strip.custom(varnames=c(expression(sqrt(Bray-Curtis)))

Is there a way of generating the square root symbol inside the strip or am I
wasting my time. 

Thank you very much

Regards

Andrew

http://www.nabble.com/file/p17187888/randA1.csv randA1.csv 
-- 
View this message in context: 
http://www.nabble.com/Mathematical-annotation-in-lattice-strip%3A-Is-it-possible--tp17187888p17187888.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] how to not sort factors when plotting

2008-05-12 Thread Lydia N. Slobodian

Hello.  I'm trying find the ratios between each of the integers in a
vector.  I have:

for (n in x) {
   ratio - (x[n]/x[n-1])
   ratio.all - c(ratio.all, ratio)
}

Of course this doesn't work, nor does diff(n)/diff(n-1).  Is there a
way to specify a pair of integers in a vector?

Thank you,

Lydia

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Collection of lm()s

2008-05-12 Thread Prof Brian Ripley


On Mon, 12 May 2008, Chip Barnaby wrote:


Hello,

I would like to create a subscriptable collection (presumably a list) of lm() 
models.


I have a data frame DX containing 6 groups of data.  The general idea is (NOT 
RUN) ...


for (i in 1:6)
{   DXS = subset( DX, whatever);
   LMX[ i] = lm( formula, data = DXS);
}

Now access model results by subscript ... e.g. coefficients( LMX[ 2]).  Or 
would it be [[ 2]]?


I have experimented with various schemes, attempting to pre-allocate a list 
etc. without success.


You want

LMX - list(6)
...
LMX[[1]] - lm( formula, data = DX, subset=whatever)

(No trailing ';' needed, [[]] not [] for list elements.)



Also, I assume that lm() does not make a copy of input data?


It may.  This is controlled by the 'model' argument which defaults to TRUE 
(in recent versions of R: I think it once defaulted to FALSE).


That is, if I 
want predict( LMX[ i]), I would have to retain a copy the associated DXS 
subsets?


It does retain the fitted values:  see the 'Value' section of ?lm.



TIA,

Chip Barnaby



-
Chip Barnaby   [EMAIL PROTECTED]
Vice President of Research
Wrightsoft Corp.   781-862-8719 x118 voice
131 Hartwell Ave   781-861-2058 fax
Lexington, MA 02421 www.wrightsoft.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



--
Brian D. Ripley,  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Collection of lm()s

2008-05-12 Thread Dimitris Rizopoulos


try something like the following:

x - runif(1000, -3, 3)
y - 2 + 3 * x + rnorm(1000)
f - gl(10, 100)

lm.lis - vector(list, 10)
for (i in 1:10) {
lm.lis[[i]] - lm(y ~ x, subset = f == as.character(i))
}

sapply(lm.lis, coef)
sapply(lm.lis, fitted)


I hope it helps.

Best,
Dimitris


Dimitris Rizopoulos
Biostatistical Centre
School of Public Health
Catholic University of Leuven

Address: Kapucijnenvoer 35, Leuven, Belgium
Tel: +32/(0)16/336899
Fax: +32/(0)16/337015
Web: http://med.kuleuven.be/biostat/
 http://www.student.kuleuven.be/~m0390867/dimitris.htm


Quoting Chip Barnaby [EMAIL PROTECTED]:


Hello,

I would like to create a subscriptable collection (presumably a list)
of lm() models.

I have a data frame DX containing 6 groups of data.  The general idea
is (NOT RUN) ...

for (i in 1:6)
{   DXS = subset( DX, whatever);
LMX[ i] = lm( formula, data = DXS);
}

Now access model results by subscript ... e.g. coefficients( LMX[ 2]).
Or would it be [[ 2]]?

I have experimented with various schemes, attempting to pre-allocate
a list etc. without success.

Also, I assume that lm() does not make a copy of input data?  That is,
if I want predict( LMX[ i]), I would have to retain a copy the
associated DXS subsets?

TIA,

Chip Barnaby



-
Chip Barnaby   [EMAIL PROTECTED]
Vice President of Research
Wrightsoft Corp.   781-862-8719 x118 voice
131 Hartwell Ave   781-861-2058 fax
Lexington, MA 02421 www.wrightsoft.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.




Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] ggplot2: font size mismatch for pdf output

2008-05-12 Thread Michael Friendly


Hi

In the following, the graph I see on the screen and the .png output
coincide.  However, in the .pdf file, the fonts seem to be scaled
fairly larger, resulting in the label for the top legend disappearing.
Is this an infelicity or bug, or is there something I've missed?

More generally, how do I control the size of fonts used in legends
and axis labels?

library(car)
library(ggplot2)

qp -qplot (education , income , shape=type , size=women , colour=prestige ,
 xlab=Education , ylab=Income, data=Prestige)
   + scale_y_continuous(limits=c(NA, 2))
qp + scale_size(to=c(1,8))

ggsave(file=prestige-ggplot.png, width=6, height=5)  # OK
ggsave(file=prestige-ggplot.pdf, width=6, height=5)  # fonts too large

-Michael

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 12 May 2008, at 12:21, Nick Isaac wrote:



I *think* the syntax for the model Federico wants is this:

lmer(y~selection*males+ (selection|month) + (males|month))


I'll try and check against some back of the envelope calculations -- 
as I said, the model is, per se, nothing really new, and my data is  
fully balanced.


My lme syntax is a bit rusty, so I'm not confident how to recode  
with nested random effects, as in PB p24.



Two quick points:
1. I think Federico has caused some confusion on account of the way  
he used the term 'crossing'


Sorry about that, I'll try to avoid causing such confusion in the  
future.


Cheers,

/F

--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 12 May 2008, at 14:37, Doran, Harold wrote:


I haven't followed this thread carefully, so apologies if I'm too off
base. But, in response to Rolf's questions/issues. First, SAS cannot
handle models with crossed random effects (at least well at all).  
SAS is

horribly incapable of handling even the simplest of models (especially
generalized linear mixed models). I can cite numerous (recent)  
examples
of SAS coming to a complete halt (proc nlmixed) for an analyses we  
were

recently working on. R (and Ubuntu) was the only solution to our
problem


First off, let's keep SAS out of this. I never used it, never wanted  
to use it and did not mention anywhere I wanted to get SAS-like  
results! Although, seeing how easily it creeps up, I can sympathise  
with those who have strog feelings about it! [for those with strong  
feelings about me, this is meant to be something joke-like]


Now, lme is not optimized for crossed random effects, but lmer is.  
That

is why lmer is supported and lme is not really supported much. lmer is
optimized for models with nested random effects and crossed random
effects.

When working with models with nested random effects, and software
optimized for those problems (e.g., HLM, SAS, mlWin) the
variance/covariance matrix forms a special, and simple structure that
can be easily worked with. This is not the case for models with  
crossed

random effects.

Software packages designed for nested random effects can be tricked  
into
handling models with crossed random effects, but this kludge is  
slow and

really inefficient.

If you want complete transparency into the why and how, here is a
citation for your review.


Thank you very much. I'll read the paper and hopefully get the  
answers I was looking for.


Best,

Federico




Best
Harold

@article{Doran:Bates:Bliese:Dowling:2007:JSSOBK:v20i02,
  author =  Harold  Doran and Douglas  Bates and Paul  Bliese and
Maritza   Dowling,
  title =   Estimating the Multilevel Rasch Model: With the lme4
Package,
  journal = Journal of Statistical Software,
  volume =  20,
  number =  2,
  pages =   1--18,
  day = 22,
  month =   2,
  year =2007,
  CODEN =   JSSOBK,
  ISSN =1548-7660,
  bibdate = 2007-02-22,
  URL = http://www.jstatsoft.org/v20/i02;,
  accepted =2007-02-22,
  acknowledgement = ,
  keywords =,
  submitted =   2006-10-01,
}

___
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models


--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] rpart with circular data?

2008-05-12 Thread Dylan Beaudette

On Mon, May 12, 2008 at 6:29 AM, Bálint Czúcz [EMAIL PROTECTED] wrote:
 Hi Rainer,

 In a similar situation I used the two components of the normal vector
 of the surface (northing  easting). I.e. for a horizontal plane
 both are 0, for a vertical slope facing south northing=-1and
 easting=0, etc. This descartian decomposition of the slope vector
 avoids the problem of circularity present in the widespreadly used
 polar (aspect, slope) decomposition, and thus seems to suit ecological
 problems much better to me. However I have not looked into this much,
 I am also very interested in the opinion of others.

 Bálint


 On Mon, May 12, 2008 at 11:35 AM, Rainer M Krug [EMAIL PROTECTED] wrote:
 Hi

  I am planning to use a CART analysis with rpart() to analyse the
  impact of, among others, slope, altitude and aspect on mortality
  rates.
  My question is:
  Is there a p[roblem with using aspect as a predictor as it is circular?
  And if it is a problem (which I suspect), is there a transformation I
  could use to transform aspect?

  Thanks

  Rainer

  --
  Rainer M. Krug, PhD (Conservation Ecology, SUN), MSc (Conservation
  Biology, UCT), Dipl. Phys. (Germany)

  Plant Conservation Unit
  Department of Botany
  University of Cape Town
  Rondebosch 7701
  South Africa

  __
  R-help@r-project.org mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.




I would recommend going straight to the source-- modeling solar
radiation. This gives a purely physical-based generalization of
terrain-induced variation in microclimate. Keep an eye out for this
paper, soon to be published in Soil. Sci, Soc, America:

D.E. Beaudette, and A.T. O'Geen. Quantifying the Aspect Effect. Soil
Sci. Soc. Am. J. March 3, 2008. (In Review)

This paper uses a GRASS-based solution to the problem.

Cheers

Dylan

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] RPM-style install (SLED 10.1)

2008-05-12 Thread Stas Kolenikov

I am trying to install R on a SLED 10.1 machine.
R-base-2.7.0-7.1-i586.rpm fails with

[EMAIL PROTECTED]:~/RPMs rpm -Uvh R-base-2.7.0-7.1.i586.rpm
warning: R-base-2.7.0-7.1.i586.rpm: Header V3 DSA signature: NOKEY,
key ID 14ec5930
error: Failed dependencies:
libgfortran.so.1 is needed by R-base-2.7.0-7.1.i586

I tried to trick it into believing there's the library by setting up the link:

[EMAIL PROTECTED]:~/RPMs ls -l /usr/lib/libgf*
lrwxrwxrwx 1 root root  29 2008-05-08 17:45
/usr/lib/libgfortran.so.1 - /usr/lib/libgfortran.so.3.0.0
lrwxrwxrwx 1 root root  20 2008-05-08 17:37
/usr/lib/libgfortran.so.3 - libgfortran.so.3.0.0
-rwxr-xr-x 1 root root 2251139 2008-05-01 09:43 /usr/lib/libgfortran.so.3.0.0

but that did not work, either. I went on and installed GCC's gfortran,
but it did not provide libgfortran.so.1 either:

[EMAIL PROTECTED]:~/RPMs ls -l /usr/irun/lib/libgf*
-rw-r--r-- 1 1005 1011 4330512 2008-03-02 02:29 /usr/irun/lib/libgfortran.a
-rwxr-xr-x 1 1005 10111009 2008-03-02 02:29 /usr/irun/lib/libgfortran.la
-rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29 /usr/irun/lib/libgfortran.so
-rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29 /usr/irun/lib/libgfortran.so.3
-rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29
/usr/irun/lib/libgfortran.so.3.0.0

Any reason R wants that package instead of the newer one? Of course
rpm --nodeps was an option, but then it fails to load r-stats with the
same message about the missing library:

Error in dyn.load(file, DLLpath = DLLpath, ...) :
  unable to load shared library '/usr/lib/R/library/stats/libs/stats.so':
  /usr/lib/R/library/stats/libs/stats.so: undefined symbol: _gfortran_pow_r8_i4
During startup - Warning message:
package stats in options(defaultPackages) was not found

I am stuck... I am trying to compile R from 2.6.2 sources in the
meantime, but that's painful for a Linux newbie like me on a brand-new
computer that does not necessarily have all the packages. I almost
want to switch to Ubuntu as R installation back there was a single
click :))

-- 
Stas Kolenikov, also found at http://stas.kolenikov.name
Small print: Please do not reply to my Gmail address as I don't check
it regularly.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] help with calculating the differences between dates

2008-05-12 Thread Tom Cohen


Dear list, 
  How can I calculate the difference in days between the eventdate and basedate 
in the below dataset? 
  
 id basedate   outcome.3 eventdate daydiff
1  1001 1999-09-28 2 1999-10-013
2  1002 1999-09-22 1   
3  1003 2000-01-19 1   
4  1004 2004-01-25 2 2004-02-039
5  1005 2005-08-11 1   
6  1006 2000-07-04 1 2001-05-29
7  1007 2004-02-12 1 2004-11-18
8  1008 2006-01-18 2 2006-02-02
9  1009 2005-04-29 2 2005-06-14
10 1010 2006-03-17 2 2006-03-31
11 1011 2000-03-21 2 2000-03-28
12 1012 2004-07-12 1 2006-11-28
13 1013 2000-02-24 1   
14 1014 2003-04-17 1   
15 1015 2000-04-05 1 
  
Thanks for any help,
Tom

   
-
Går det långsamt? Skaffa dig en snabbare bredbandsuppkoppling.

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] inserting mathematical symbols in lattice strip

2008-05-12 Thread Andrewjohnclose


I have tried without success to find a way including the square root symbol
in lattice strips as part of my conditioning labels. I have tried
supplementing by creating a list of vectors using the var.name function
coupled with the expression function used in xlab/ylab. 

xyplot(adjusted_Rand_index~cluster|distance_measure, main=Level of
agreement between partitions: Wards Method, ylab=Coefficient value
(adjusted rand index), xlab=number of clusters, type=l, data=randA1, 
strip=strip.custom(varnames=c(expression(sqrt(Bray-Curtis))) 

Is there a way of generating the square root symbol inside the strip or am I
wasting my time. i.e. converting Bray-Curtis to ... [sqrt symbol]
Bray-Curtis.

Thank you very much 

Regards 

Andrew 
http://www.nabble.com/file/p17188291/randA1.csv randA1.csv 
-- 
View this message in context: 
http://www.nabble.com/inserting-mathematical-symbols-in-lattice-strip-tp17188291p17188291.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] ggplot2: font size mismatch for pdf output

2008-05-12 Thread Prof Brian Ripley

What version of R and OS is this?  Prior to R 2.7.0 there was little 
attempt to match output dimensions from various devices, and one of the 
png devices in 2.7.0 has an error in doing so, fixed in R-patched (see 
NEWS).


On Mon, 12 May 2008, Michael Friendly wrote:


Hi

In the following, the graph I see on the screen and the .png output
coincide.  However, in the .pdf file, the fonts seem to be scaled
fairly larger, resulting in the label for the top legend disappearing.
Is this an infelicity or bug, or is there something I've missed?

More generally, how do I control the size of fonts used in legends
and axis labels?

library(car)
library(ggplot2)

qp -qplot (education , income , shape=type , size=women , colour=prestige ,
 xlab=Education , ylab=Income, data=Prestige)
  + scale_y_continuous(limits=c(NA, 2))


Hmm, you can't break the line before '+'.


qp + scale_size(to=c(1,8))

ggsave(file=prestige-ggplot.png, width=6, height=5)  # OK
ggsave(file=prestige-ggplot.pdf, width=6, height=5)  # fonts too large


I would not expect you to be able to specify a smaller size without also 
reducing 'pointsize'.  E.g. dev.print() does so, but ggsave seems not to.



-Michael

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



--
Brian D. Ripley,  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Fonts in Quartz Devices

2008-05-12 Thread Arcadio Rubio García


Hi,

I'm new to R. I'm using a Mac OS X 10.5 and R 2.7.0. I'm trying to  
change the font family for a plot in a quartz device.


Simply passing the desired font by using the family argument works  
with other devices, but not with quartz. Am I missing anything? I've  
already checked the docs for quartz and quartzFonts, but didn't find a  
solution.


Any help much appreciated.


Regards,

A. Rubio

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] help with calculating the differences between dates

2008-05-12 Thread Uwe Ligges




Tom Cohen wrote:
Dear list, 
  How can I calculate the difference in days between the eventdate and basedate in the below dataset? 
  
 id basedate   outcome.3 eventdate daydiff

1  1001 1999-09-28 2 1999-10-013
2  1002 1999-09-22 1   
3  1003 2000-01-19 1   
4  1004 2004-01-25 2 2004-02-039
5  1005 2005-08-11 1   
6  1006 2000-07-04 1 2001-05-29
7  1007 2004-02-12 1 2004-11-18

8  1008 2006-01-18 2 2006-02-02
9  1009 2005-04-29 2 2005-06-14
10 1010 2006-03-17 2 2006-03-31
11 1011 2000-03-21 2 2000-03-28
12 1012 2004-07-12 1 2006-11-28
13 1013 2000-02-24 1   
14 1014 2003-04-17 1   
15 1015 2000-04-05 1 



Given they are in some appropriate time format (e.g. when reading 
specify colClasses = POSIXct for those columns or use strptime() on 
the colummns later on):


difftime(x$eventdate, x$basedate, days)

Uwe Ligges





Thanks for any help,
Tom

   
-

Går det långsamt? Skaffa dig en snabbare bredbandsuppkoppling.

[[alternative HTML version deleted]]





__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] ggplot2: font size mismatch for pdf output

2008-05-12 Thread Michael Friendly


Sorry for not providing this in my initial posting.
I'm using R version 2.7.0 (2008-04-22), on Win XP Pro.

As I said, the .png output matched what I saw on screen.
It was the .pdf output for which the font size was noticeably
larger, enough to make the legend run off the screen.

-Michael

Prof Brian Ripley wrote:
What version of R and OS is this?  Prior to R 2.7.0 there was little 
attempt to match output dimensions from various devices, and one of the 
png devices in 2.7.0 has an error in doing so, fixed in R-patched (see 
NEWS).


On Mon, 12 May 2008, Michael Friendly wrote:


Hi

In the following, the graph I see on the screen and the .png output
coincide.  However, in the .pdf file, the fonts seem to be scaled
fairly larger, resulting in the label for the top legend disappearing.
Is this an infelicity or bug, or is there something I've missed?

More generally, how do I control the size of fonts used in legends
and axis labels?

library(car)
library(ggplot2)

qp -qplot (education , income , shape=type , size=women , 
colour=prestige ,

 xlab=Education , ylab=Income, data=Prestige)
  + scale_y_continuous(limits=c(NA, 2))


Hmm, you can't break the line before '+'.


qp + scale_size(to=c(1,8))

ggsave(file=prestige-ggplot.png, width=6, height=5)  # OK
ggsave(file=prestige-ggplot.pdf, width=6, height=5)  # fonts too large


I would not expect you to be able to specify a smaller size without also 
reducing 'pointsize'.  E.g. dev.print() does so, but ggsave seems not to.



-Michael

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide 
http://www.R-project.org/posting-guide.html

and provide commented, minimal, self-contained, reproducible code.






--
Michael Friendly Email: friendly AT yorku DOT ca
Professor, Psychology Dept.
York University  Voice: 416 736-5115 x66249 Fax: 416 736-5814
4700 Keele Streethttp://www.math.yorku.ca/SCS/friendly.html
Toronto, ONT  M3J 1P3 CANADA

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] how to not sort factors when plotting

2008-05-12 Thread Jorge Ivan Velez

Hi Lydia,

Try this:

# Function
ratio=function(x){
temp=NULL
for (n in 1:length(x)) temp=c(temp,x[n]/x[n-1])
temp
}

# Example
x=c(1,2,3,2,1,2,3)

ratio(x)
[1] 2.000 1.500 0.667 0.500 2.000 1.500


HTH,

Jorge


On Mon, May 12, 2008 at 9:52 AM, Lydia N. Slobodian [EMAIL PROTECTED] wrote:

 Hello.  I'm trying find the ratios between each of the integers in a
 vector.  I have:

 for (n in x) {
   ratio - (x[n]/x[n-1])
   ratio.all - c(ratio.all, ratio)
 }

 Of course this doesn't work, nor does diff(n)/diff(n-1).  Is there a
 way to specify a pair of integers in a vector?

 Thank you,

 Lydia

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Fonts in Quartz Devices

2008-05-12 Thread Arcadio Rubio García


In relation to my previous email, I've discovered that

quartz(family = Monaco)

works fine with 2.6.2. However, it doesn't with 2.7.0.

Moreover, with 2.6.2 I get lot's of warnings, that previously I  
didn't. Maybe an update of the OS has broken some code leading to a bug?


El 12/05/2008, a las 16:43, Arcadio Rubio García escribió:


Hi,

I'm new to R. I'm using a Mac OS X 10.5 and R 2.7.0. I'm trying to  
change the font family for a plot in a quartz device.


Simply passing the desired font by using the family argument works  
with other devices, but not with quartz. Am I missing anything? I've  
already checked the docs for quartz and quartzFonts, but didn't find  
a solution.


Any help much appreciated.


Regards,

A. Rubio


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] how to not sort factors when plotting

2008-05-12 Thread Dimitris Rizopoulos


this can be more efficiently coded using indexing, e.g.,

x - c(1,2,3,2,1,2,3)

n - length(x)
x[2:n] / x[1:(n-1)]


Best,
Dimitris


Dimitris Rizopoulos
Biostatistical Centre
School of Public Health
Catholic University of Leuven

Address: Kapucijnenvoer 35, Leuven, Belgium
Tel: +32/(0)16/336899
Fax: +32/(0)16/337015
Web: http://med.kuleuven.be/biostat/
 http://www.student.kuleuven.be/~m0390867/dimitris.htm


Quoting Jorge Ivan Velez [EMAIL PROTECTED]:


Hi Lydia,

Try this:

# Function
ratio=function(x){
temp=NULL
for (n in 1:length(x)) temp=c(temp,x[n]/x[n-1])
temp
}

# Example
x=c(1,2,3,2,1,2,3)

ratio(x)
[1] 2.000 1.500 0.667 0.500 2.000 1.500


HTH,

Jorge


On Mon, May 12, 2008 at 9:52 AM, Lydia N. Slobodian [EMAIL PROTECTED] wrote:


Hello.  I'm trying find the ratios between each of the integers in a
vector.  I have:

for (n in x) {
  ratio - (x[n]/x[n-1])
  ratio.all - c(ratio.all, ratio)
}

Of course this doesn't work, nor does diff(n)/diff(n-1).  Is there a
way to specify a pair of integers in a vector?

Thank you,

Lydia

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.






Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] ggplot2: font size mismatch for pdf output

2008-05-12 Thread hadley wickham

 More generally, how do I control the size of fonts used in legends
 and axis labels?

There is no general way (yet) - it is on my customisation to do list,
which I hope to make progress on over summer.

Hadley

-- 
http://had.co.nz/

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] how to not sort factors when plotting

2008-05-12 Thread Jorge Ivan Velez

Hi Lydia,

I compared my ratio function with Dimitris and Phil's suggestions. Please do
NOT use my approach because it's painfully slow for a large vector (as Phil
told me). Here is why (using Win XP SP2, Intel Core- 2 Duo 2.4 GHz, R 2.7.0
Patched):


# Vector
x=rnorm(10,0,1)

# Suggestion
new.ratio=function(x) x[2:length(x)]/x[1:(length(x)-1)]

# My horrible function
my.ratio=function(x){
 temp=NULL
 for (n in 1:length(x)) temp=c(temp,x[n]/x[n-1])
 temp
 }

 # System time
t=system.time(my.ratio(x))
tnr=system.time(new.ratio(x))
 t
   user  system elapsed
  38.790.06   39.31
 tnr
   user  system elapsed
  0   0   0


Thanks to all,

Jorge



On Mon, May 12, 2008 at 11:15 AM, Phil Spector [EMAIL PROTECTED]
wrote:

 Another alternative would be to take advantage of R's vectorization:

  x=c(1,2,3,2,1,2,3)
  x[2:length(x)]/x[1:(length(x)-1)]
 
 [1] 2.000 1.500 0.667 0.500 2.000 1.500

 The solution using your ratio function will be painfully slow
 for a large vector.

   - Phil Spector
 Statistical Computing Facility
 Department of Statistics
 UC Berkeley
 [EMAIL PROTECTED]



[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] inserting mathematical symbols in lattice strip

2008-05-12 Thread Richard . Cotton

 I have tried without success to find a way including the square root 
symbol
 in lattice strips as part of my conditioning labels. I have tried
 supplementing by creating a list of vectors using the var.name function
 coupled with the expression function used in xlab/ylab. 
 
 xyplot(adjusted_Rand_index~cluster|distance_measure, main=Level of
 agreement between partitions: Wards Method, ylab=Coefficient value
 (adjusted rand index), xlab=number of clusters, type=l, 
data=randA1, 
 strip=strip.custom(varnames=c(expression(sqrt(Bray-Curtis))) 
 
 Is there a way of generating the square root symbol inside the strip or 
am I
 wasting my time. i.e. converting Bray-Curtis to ... [sqrt symbol]
 Bray-Curtis.

You want factor.level, not var.name in strip.custom.

Here's an example with the iris data.  Apologies for the second line; 
there's almost certainly an easier way to do it, but you (hopefully) get 
the idea.

flevels - levels(iris$Species)
foo - paste(c(, paste(expression(paste(, flevels, , sqrt(, 1:3, 
))), sep=, collapse=,), )) 
xyplot(Sepal.Length ~ Petal.Length | Species, 
data = iris, 
strip=strip.custom(factor.levels=eval(parse(text=foo

Regards,
Richie.

Mathematical Sciences Unit
HSL



ATTENTION:

This message contains privileged and confidential inform...{{dropped:20}}

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Random number generation

2008-05-12 Thread Greg Snow

 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of Esmail Bonakdarian
 Sent: Sunday, May 11, 2008 7:25 AM
 To: Prof Brian Ripley
 Cc: [EMAIL PROTECTED]
 Subject: Re: [R] Random number generation

[snip]

 What I read doesn't seem to be incorrect however (it may even
 have been an archived message here), the *language* itself
 does not seem to support block *comments*. Using conditional
 constructs, or an IDE/editor to achieve similar results is a
 work around - but not the same. I don't mean to nitpick, but
 as a computer scientist I see this as different :-)

I am not a computer scientist, so correct me if I am wrong, but from what I 
remember (and a quick glance at my copy of Kernighan Ritchie), the C *language* 
itself does not support block *comments*, rather the preproccessor replaces the 
comments with a single space and the compilor does not even see them.

Since R is optimized for interactive use rather than compilation, running 
everything through a preproccessor is not really an option.  However as an 
additional work around you could always run your R scripts through the C 
preproccessor and have it strip the block comments for you.

Given the complexity of implementing block commenting (even deciding on the 
syntax) and the ease of current work arounds, the cost benefit ratio probably 
puts this very near the bottom of the priority list.

--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
[EMAIL PROTECTED]
(801) 408-8111

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Left censored responses in mixed effects models

2008-05-12 Thread Bert Gunter

Dear R Fellow-Travellers:

What is your recommended way of dealing with a left-censored response
(non-detects) in (linear Gaussian) mixed effects models?

Specifics: Response is a numeric positive measurement (of volume, actually);
but when it falls below some unknown and slightly random value (depending on
how the sample is prepared and measured), it cannot be measured and is
recorded as 0. 

There is some statistical literature on this, but I was unable to find
anything that appeared to me to implement a strategy in any R package. If it
matters, I am less interested in inference than in removing possible bias in
estimation.

Feel free to respond off-list if you feel that this would not be of general
interest.

Cheers,

Bert Gunter
Genentech

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Fundamental formula and dataframe question.

2008-05-12 Thread Greg Snow


I would have thought that:

 lm( C1 ~ M^2, data=DF )

Would give the main effects and 2 way interaction(s) (but a quick test did not 
match my expectation).  Possibly a feature request is in order if people plan 
to use this a lot.

--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
[EMAIL PROTECTED]
(801) 408-8111



 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of Ted Harding
 Sent: Sunday, May 11, 2008 2:07 PM
 To: Myers, Brent
 Cc: r-help@r-project.org
 Subject: Re: [R] Fundamental formula and dataframe question.

 On 11-May-08 18:58:45, Myers, Brent wrote:
  There is a very useful and apparently fundamental feature
 of R (or of
  the package pls) which I don't understand.
 
  For datasets with many independent (X) variables such as
 chemometric
  datasets there is a convenient formula and dataframe
 construction that
  allows one to access the entire X matrix with a single term.
 
  Consider the gasoline dataset available in the pls package. For the
  model statement in the plsr function one can write: Octane ~ NIR
 
  NIR refers to a (wide) matrix which is a portion of a
 dataframe. The
  naming of the columns is of the form: 'NIR. nm'
 
  names(gasoline) returns...
 
  $names
  [1] octane NIR
 
  instead of...
 
  $names
  [1] octane NIR.1000 nm NIR.1001 nm ...
 
  How do I construct and manipulate such dataframes and the
 column names
  that go with?
 
  Does the use of these types of formulas and dataframes
 generalize to
  other modeling functions?
 
  Some specific clues on a help search might be enough, I've
 tried many.
 
  Regards,
  Brent

 I don't have the 'gasoline' dataset to hand, but I can
 produce something to which your descrption applies as follows:

   C1 - c(1.1,1.2,1.3,1.4)
   C2 - c(2.1,2.2,2.3,2.4)
M - cbind(M1=c(11.1,11.2,11.3,11.4),
   M2=c(12.1,12.2,12.3,12.4))
   DF - data.frame(C1=C1,C2=C2,M=M)
   DF
 #C1  C2 M.M1 M.M2
 # 1 1.1 2.1 11.1 12.1
 # 2 1.2 2.2 11.2 12.2
 # 3 1.3 2.3 11.3 12.3
 # 4 1.4 2.4 11.4 12.4

 so the two columns C1 and C2 have gone in as named, and the
 matrix M (with named columns M1 and M2) has gone in with
 columns M.M1, M.M2

 Now let's fuzz the numbers a bit, so that the lm() fit makes sense:

   C1 - C1 + round(0.1*runif(4),2)
   C1 - C1 + round(0.1*runif(4),2)
M - cbind(M1=c(11.1,11.2,11.3,11.4),
   M2=c(12.1,12.2,12.3,12.4)) +
 round(0.1*runif(8),2)
   DF - data.frame(C1=C1,C2=C2,M=M)
   DF
 # C1  C2  M.M1  M.M2
 # 1 1.21 2.1 11.19 12.13
 # 2 1.34 2.2 11.23 12.23
 # 3 1.38 2.3 11.36 12.30
 # 4 1.50 2.4 11.43 12.48

   summary(lm(C1 ~ M),data=DF)
 # Call:
 # lm(formula = C1 ~ M)
 # Residuals:
 #1234
 # -0.02422  0.02448  0.01309 -0.01335
 # Coefficients:
 # Estimate Std. Error t value Pr(|t|)
 # (Intercept) -8.284352.48952  -3.3280.186
 # MM1 -0.054110.66909  -0.0810.949
 # MM2  0.834630.50687   1.6470.347
 # Residual standard error: 0.03919 on 1 degrees of freedom
 # Multiple R-Squared: 0.9642, Adjusted R-squared: 0.8925
 # F-statistic: 13.46 on 2 and 1 DF,  p-value: 0.1893

 In other words, a perfectly standard LM fit, equivalent to

   summary(lm(C1 ~ M[,1]+M[,2]))

 (as you can check). So all that looks straightforward.

 One thing, however, is not clear to me in this scenario.
 Suppose, for example, that the columns M1 and M2 of M were
 factors (and that you had more rows than I've used above, so
 that the fit is non-trivial).

 Then, in the standard specification of an LM, you could write

   summary(lm(C1 ~ M[,1]*M[,2]))

 and get the main effects and interactions. But how would you
 do that in the other type of specification:

 Where you used
   summary(lm(C1 ~ M, data=DF))
 to get the equivalent of
   summary(lm(C1 ~ M[,1]+M[,2]))
 what would you use to get the equivalent of
   summary(lm(C1 ~ M[,1]*M[,2]))??

 Would you have to spell out the interaction term[s] in
 additional columns of M?

 Hmmm, interesting! I hadn't been aware of this aspect of
 formula and dataframe construction for modellinng, until you
 pointed it out!

 Best wishes,
 Ted.

 
 E-Mail: (Ted Harding) [EMAIL PROTECTED]
 Fax-to-email: +44 (0)870 094 0861
 Date: 11-May-08   Time: 21:06:49
 -- XFMail --

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented,

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Douglas Bates

I'm entering this discussion late so I may be discussing issues that
have already been addressed.

As I understand it, Federico, you began by describing a model for data
in which two factors have a fixed set of levels and one factor has an
extensible, or random, set of levels and you wanted to fit a model
that you described as

y ~ effect1 * effect2 * effect3

The problem is that this specification is not complete.  An
interaction of factors with fixed levels and a factor with random
levels can mean, in the lmer specification,

lmer(y ~ effect1 * effect2 + (1| effect3) + (1|effect1:effect2:effect3), ...)

or

lmer(y ~ effect1 * effect2 + (effect1*effect2 | effect3), ...)

or other variations.  When you specify a random effect or an random
interaction term you must, either explicitly or implicitly, specify
the form of the variance-covariance matrix associated with those
random effects.

The advantage that other software may provide for you is that it
chooses the model for you but that, of course, means that you only
have the one choice.

If you can describe how many variance components you think should be
estimated in your model and what they would represent then I think it
will be easier to describe how to fit the model.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Mathematical annotation in lattice strip: Is it possible?

2008-05-12 Thread Deepayan Sarkar

On 5/12/08, Andrewjohnclose [EMAIL PROTECTED] wrote:

  I have tried without success to find a way including the square root symbol
  in lattice strips as part of my conditioning labels. I have tried
  supplementing by creating a list of vectors using the var.name function
  coupled with the expression function used in xlab/ylab.

  xyplot(adjusted_Rand_index~cluster|distance_measure, main=Level of
  agreement between partitions: Wards Method, ylab=Coefficient value
  (adjusted rand index), xlab=number of clusters, type=l, data=randA1,
  strip=strip.custom(varnames=c(expression(sqrt(Bray-Curtis)))

You say 'var.names', but use 'varnames'. ?strip.custom also says:

var.name: vector of character strings or expressions as long as the
  number of conditioning variables.  The contents are
  interpreted as names for the conditioning variables.  Whether
  they are shown on the strip depends on the values of
  'strip.names' and 'style' (see below).  By default, the names
  are shown for shingles, but not for factors.

This seems to work as expected:

xyplot(1 ~ 1 | gl(1, 1),
   strip = strip.custom(var.name = expression(sqrt(Bray-Curtis)),
strip.names = TRUE, strip.levels = FALSE))

-Deepayan

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Fonts in Quartz Devices

2008-05-12 Thread Prof Brian Ripley


On Mon, 12 May 2008, Arcadio Rubio García wrote:


Hi,

I'm new to R. I'm using a Mac OS X 10.5 and R 2.7.0. I'm trying to change the 
font family for a plot in a quartz device.


Simply passing the desired font by using the family argument works with other 
devices, but not with quartz. Am I missing anything? I've already checked the 
docs for quartz and quartzFonts, but didn't find a solution.


Can you tell us what you have tried?  Some families worked not long before 
release, at least.


Such MacOS-specific questions are more likely to get an informed answer on 
R-sig-mac.




Any help much appreciated.

Regards,

A. Rubio


--
Brian D. Ripley,  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Federico Calboli


On 12 May 2008, at 17:09, Douglas Bates wrote:


I'm entering this discussion late so I may be discussing issues that
have already been addressed.

As I understand it, Federico, you began by describing a model for data
in which two factors have a fixed set of levels and one factor has an
extensible, or random, set of levels and you wanted to fit a model
that you described as

y ~ effect1 * effect2 * effect3

The problem is that this specification is not complete.


My apologies for that, I thought that the above formula was the  
shorthand for what I would call the 'full' model, i.e. the single  
factors and the 2 and 3 ways interactions.

An
interaction of factors with fixed levels and a factor with random
levels can mean, in the lmer specification,

lmer(y ~ effect1 * effect2 + (1| effect3) + (1| 
effect1:effect2:effect3), ...)


or

lmer(y ~ effect1 * effect2 + (effect1*effect2 | effect3), ...)

or other variations.  When you specify a random effect or an random
interaction term you must, either explicitly or implicitly, specify
the form of the variance-covariance matrix associated with those
random effects.


I'll play around with this and see what I can get.


The advantage that other software may provide for you is that it
chooses the model for you but that, of course, means that you only
have the one choice.


I'm more than happy to stick to R, and to put more legwork into my  
models


If you can describe how many variance components you think should be
estimated in your model and what they would represent then I think it
will be easier to describe how to fit the model.


I'll work on that. Incidentally, what/where is the most comprehensive  
and up to date documentation for lme4? the pdfs coming with the  
package? I suspect knowing which are the right docs will help a lot  
in keeping me within the boundaries of civility and prevent me from  
annoying anyone (which is not something I sent forth to do on purpose).


Best regards,

Federico

--
Federico C. F. Calboli
Department of Epidemiology and Public Health
Imperial College, St. Mary's Campus
Norfolk Place, London W2 1PG

Tel +44 (0)20 75941602   Fax +44 (0)20 75943193

f.calboli [.a.t] imperial.ac.uk
f.calboli [.a.t] gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Fonts in Quartz Devices

2008-05-12 Thread Arcadio Rubio García



El 12/05/2008, a las 18:18, Prof Brian Ripley escribió:


On Mon, 12 May 2008, Arcadio Rubio García wrote:


Hi,

I'm new to R. I'm using a Mac OS X 10.5 and R 2.7.0. I'm trying to  
change the font family for a plot in a quartz device.


Simply passing the desired font by using the family argument works  
with other devices, but not with quartz. Am I missing anything?  
I've already checked the docs for quartz and quartzFonts, but  
didn't find a solution.


Can you tell us what you have tried?  Some families worked not long  
before release, at least.


I tried quartz(family=Monaco) and quartz(family=Courier) with no  
success.


I've just downgraded back to R 2.6.2 and both commands work fine.




Such MacOS-specific questions are more likely to get an informed  
answer on R-sig-mac.




Any help much appreciated.

Regards,

A. Rubio


--
Brian D. Ripley,  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] how to not sort factors when plotting

2008-05-12 Thread Gavin Simpson

On Mon, 2008-05-12 at 11:41 -0400, Jorge Ivan Velez wrote:
 Hi Lydia,

[I'm struggling to see what this has to do with the subject line?]

 
 I compared my ratio function with Dimitris and Phil's suggestions. Please do
 NOT use my approach because it's painfully slow for a large vector (as Phil
 told me). Here is why (using Win XP SP2, Intel Core- 2 Duo 2.4 GHz, R 2.7.0
 Patched):

Jorge,

If you pre-allocate storage for temp rather than the concatenation
approach you use, this is reasonably speedy for the example you quote:

Note: you have 1:length(x) which throws and error, the loop condition
needs to be 2:length(x) (or more robustly: seq_along(x[-1]) + 1)

 x - rnorm(10,0,1)
 my.ratio - function(x){
+ temp - numeric(length(x))
+ for(n in 2:length(x))
+ temp[n] - x[n] / x[n-1]
+ temp
+ }
 system.time(my.ratio(x))
   user  system elapsed 
  0.757   0.000   0.758 
 new.ratio - function(x) x[2:length(x)]/x[1:(length(x)-1)]
 system.time(new.ratio(x))
   user  system elapsed 
  0.011   0.003   0.013 

OK, so it isn't faster than the vectorised approach, but it isn't bad.
For those more familiar with C-type programming than the R vector
approach, you can do reasonably well with a for loop as long as you do
proper allocation of the result vector/object first.

Note that I'm not advocating that people shouldn't bother to learn to
use R to its advantages and code for R rather than as you might have
learnt from other languages (I'm not!), but for some problems a loop is
just fine unless you are the sort of person who needs that extra 0.7 of
a second to do something else with... ;-)

HTH

G

 
 
 # Vector
 x=rnorm(10,0,1)
 
 # Suggestion
 new.ratio=function(x) x[2:length(x)]/x[1:(length(x)-1)]
 
 # My horrible function
 my.ratio=function(x){
  temp=NULL
  for (n in 1:length(x)) temp=c(temp,x[n]/x[n-1])
  temp
  }
 
  # System time
 t=system.time(my.ratio(x))
 tnr=system.time(new.ratio(x))
  t
user  system elapsed
   38.790.06   39.31
  tnr
user  system elapsed
   0   0   0
 
 
 Thanks to all,
 
 Jorge
 
 
 
 On Mon, May 12, 2008 at 11:15 AM, Phil Spector [EMAIL PROTECTED]
 wrote:
 
  Another alternative would be to take advantage of R's vectorization:
 
   x=c(1,2,3,2,1,2,3)
   x[2:length(x)]/x[1:(length(x)-1)]
  
  [1] 2.000 1.500 0.667 0.500 2.000 1.500
 
  The solution using your ratio function will be painfully slow
  for a large vector.
 
- Phil Spector
  Statistical Computing Facility
  Department of Statistics
  UC Berkeley
  [EMAIL PROTECTED]
 
 
 
   [[alternative HTML version deleted]]
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.
-- 
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%
 Dr. Gavin Simpson [t] +44 (0)20 7679 0522
 ECRC, UCL Geography,  [f] +44 (0)20 7679 0565
 Pearson Building, [e] gavin.simpsonATNOSPAMucl.ac.uk
 Gower Street, London  [w] http://www.ucl.ac.uk/~ucfagls/
 UK. WC1E 6BT. [w] http://www.freshwaters.org.uk
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Compact Indicator Matrices

2008-05-12 Thread Douglas Bates

On Mon, May 12, 2008 at 11:27 AM, amarkos [EMAIL PROTECTED] wrote:
 Thanks, it works!

 Could you please provide the direct method you mentioned for the
 multivariate case?

I'm not sure what you mean.  I looked at what I wrote and I don't see
anything that would fit that description.

May I suggest that you continue to cc: the R-help list on the
discussion.  I can't always respond rapidly to requests and there are
many who read the list that can.

 On May 12, 4:30 pm, Douglas Bates [EMAIL PROTECTED] wrote:
 On Sun, May 11, 2008 at 9:49 AM, amarkos [EMAIL PROTECTED] wrote:
  On May 11, 4:47 pm, Douglas Bates [EMAIL PROTECTED] wrote:

  Do you mean that you want to collapse similar rows into a single row
  and perhaps a count of the number of times that this row occurs?

  Let me rephrase the problem by providing an example.

  Input:

  A =
   [,1] [,2]
   [1,]11
   [2,]13
   [3,]21
   [4,]12
   [5,]21
   [6,]12
   [7,]11
   [8,]12
   [9,]13
  [10,]21

 An important question here is do you start with two or more variables
 like the columns of your matrix A?  If so, there is a more direct
 method of getting the answers that you want.  The natural way to store
 such variables in R is as factors.  I prefer to use letters instead of
 numbers to represent the levels of a factor (that way I don't confuse
 a factor with a numeric variable when I look at rows)  so I would
 create a data frame with two factors instead of a matrix.

  V1 - factor(c(1,1,2,1,2,1,1,1,1,2), labels = LETTERS[1:2])
  V2 - factor(c(1,3,1,2,1,2,1,2,3,1), labels = letters[1:3])
  df - data.frame(f1 = V1, f2 = V2)
  df

f1 f2
 1   A  a
 2   A  c
 3   B  a
 4   A  b
 5   B  a
 6   A  b
 7   A  a
 8   A  b
 9   A  c
 10  B  a

 You could produce the indicator matrix and check for unique rows, etc.
 - I will show that below - but all you need is the interaction of the
 two factors

  df$f12 - with(df, f1:f2)[drop = TRUE]
  df

f1 f2 f12
 1   A  a A:a
 2   A  c A:c
 3   B  a B:a
 4   A  b A:b
 5   B  a B:a
 6   A  b A:b
 7   A  a A:a
 8   A  b A:b
 9   A  c A:c
 10  B  a B:a str(df)

 'data.frame':   10 obs. of  3 variables:
  $ f1 : Factor w/ 2 levels A,B: 1 1 2 1 2 1 1 1 1 2
  $ f2 : Factor w/ 3 levels a,b,c: 1 3 1 2 1 2 1 2 3 1
  $ f12: Factor w/ 4 levels A:a,A:b,A:c,..: 1 3 4 2 4 2 1 2 3 4

  table(df$f12)

 A:a A:b A:c B:a
   2   3   2   3 as.numeric(df$f12)

  [1] 1 3 4 2 4 2 1 2 3 4

 Notice that this shows you that there are four distinct combinations
 that occur 2, 3, 2 and 3 times respectively; the first combination
 occurs in rows 1 and 7, it consists of the first level of f1 and the
 first level of f2, etc.

 If you really do want the indicator matrix you could generate it as

  (ind - cbind(model.matrix(~ 0 + f1, df), model.matrix(~ 0 + f2, df)))

f1A f1B f2a f2b f2c
 11   0   1   0   0
 21   0   0   0   1
 30   1   1   0   0
 41   0   0   1   0
 50   1   1   0   0
 61   0   0   1   0
 71   0   1   0   0
 81   0   0   1   0
 91   0   0   0   1
 10   0   1   1   0   0 unique(ind)

   f1A f1B f2a f2b f2c
 1   1   0   1   0   0
 2   1   0   0   0   1
 3   0   1   1   0   0
 4   1   0   0   1   0

 but working with the factors is generally much simpler than working
 with the indicators.



  # Indicator matrix
  A - data.frame(lapply(data.frame(obj), as.factor))

  nocases - dim(obj)[1]
  novars  - dim(obj)[2]

  # variable levels
  levels.n - sapply(obj, nlevels)
  n- cumsum(levels.n)

  # Indicator matrix calculations
  Z- matrix(0, nrow = nocases, ncol = n[length(n)])
  newdat   - lapply(obj, as.numeric)
  offset   - (c(0, n[-length(n)]))
  for (i in 1:novars)
   Z[1:nocases + (nocases * (offset[i] + newdat[[i]] - 1))] - 1

  ###

  Output:

  Z =

 [,1] [,2] [,3] [,4] [,5]
   [1,]10100
   [2,]10001
   [3,]01100
   [4,]10010
   [5,]01100
   [6,]10010
   [7,]10100
   [8,]10010
   [9,]10001
  [10,]01100

  Z is an indicator matrix in the Multiple Correspondence Analysis
  framework.
  My problem is to collapse identical rows (e.g. 2 and 9) into a single
  row and
  store the row ids.

  __
  [EMAIL PROTECTED] mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guidehttp://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.

 __
 [EMAIL PROTECTED] mailing listhttps://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guidehttp://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

 Angelos Markos
 Dr. of Applied Informatics,
 University of

[R] hessian in constrained optimization (constrOptim)

2008-05-12 Thread Carlo Fezzi

Dear helpers,

I am using the function constrOptim to estimate a model with ML with an
inequality constraint using the option method='Nelder-Mead'.

When I specify the option: hessian = TRUE I obtain the response:

Error in f(theta, ...) : unused argument(s) (hessian = TRUE)

I guess the function constrOptim does not allow this argument which, on
the other hand, is allowed in optim.

I would be extremely grateful if anybody could suggest a way I could use to
I obtain the values of the hessian matrix...

Many thanks,

Carlo


**
Carlo Fezzi
Senior Research Associate
Centre for Social Research on the Global Environment (CSERGE)
Department of Environmental Sciences
University of East Anglia
Norwich (UK) NR2 7TJ 
Telephone: +44(0)1603 591408
Fax: +44(0)1603 593739

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Monty Hall simulation

2008-05-12 Thread Farley, Robert

The common Monty Hall problem (where the MC helps out the contestant) is not 
random!  See:
http://en.wikipedia.org/wiki/Monty_Hall_problem




My edits were removed as I had no references.  :-(

Maybe you can verify my statements, included below. :-)


Another analysis considers three types of hosts and three prize levels. The 
Benevolent Host always shows the worst remaining prize after you choose, the 
Random Host randomly picks a remaining door to show, and the Malevolent Host 
always shows the best remaining prize. The prizes are bad, middle, best; e.g. 
Goat, Luggage, Car. The Player is unaware of which prize is which. He may 
expect to be choosing among Pigs, Goats, Blenders, Luggage, Cars, and Houses.

If you always switch, the results for each host are:

Results after switching, expanded behaviors 
Host V /Prize -Bad Middle  Good 
Benevolent Host 0%  33% 67% 
Random Host 33% 33% 33% 
Malevolent Host 67% 33% 0% 

If each host were equally likely, the total probability for each prize would be 
33%-the same as not switching. Without knowing the type of Host and the prize 
mix, you can make no meaningful statement about the success of a switching 
strategy.

The only way for a Player to improve the odds is if he or she can get some 
meaningful information from the prize shown. I.e., if you know what the three 
prizes are and what type of Host you have, then you can develop a winning 
strategy. If you were wrong about either the Host or the prize mix, that 
strategy may be harmful.






 
Robert Farley
Metro
www.Metro.net 
 
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Johannes Huesing
Sent: Sunday, May 11, 2008 01:26
To: r-help@r-project.org
Subject: Re: [R] Monty Hall simulation

cirrus74 [EMAIL PROTECTED] [Sun, May 11, 2008 at 03:44:46AM CEST]:
 
 Is it possible to simulate the Monty Hall problem using R? If so, could
 someone please show me how? Thanks for any help rendered.

The kind of simulation, as any thinking about this seeingly paradoxical
situation, depends on your mindset.

To my mind, 

niter - 999
prize - sample(c(car, car, goat), niter, replace=TRUE)

would be a perfect simulation.

-- 
Johannes Hüsing   There is something fascinating about science. 
  One gets such wholesale returns of conjecture 
mailto:[EMAIL PROTECTED]  from such a trifling investment of fact.  
  
http://derwisch.wikidot.com (Mark Twain, Life on the Mississippi)

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Douglas Bates

On Mon, May 12, 2008 at 11:22 AM, Federico Calboli
[EMAIL PROTECTED] wrote:
 On 12 May 2008, at 17:09, Douglas Bates wrote:

 I'm entering this discussion late so I may be discussing issues that
 have already been addressed.

 As I understand it, Federico, you began by describing a model for data
 in which two factors have a fixed set of levels and one factor has an
 extensible, or random, set of levels and you wanted to fit a model
 that you described as

 y ~ effect1 * effect2 * effect3

 The problem is that this specification is not complete.

 My apologies for that, I thought that the above formula was the shorthand
 for what I would call the 'full' model, i.e. the single factors and the 2
 and 3 ways interactions.

As I indicated, the trick is that the interaction of a fixed factor
and a random factor can be defined in more than one way.

It sounds as if what you want is

lmer(y ~ factor1 * factor2 + (1|factor3) + (1|factor1:factor3) +
(1|factor2:factor3) + (1|factor1:factor2:factor3), ...)

but I'm not sure.

 An interaction of factors with fixed levels and a factor with random
 levels can mean, in the lmer specification,

 lmer(y ~ effect1 * effect2 + (1| effect3) + (1|effect1:effect2:effect3),
 ...)

 or

 lmer(y ~ effect1 * effect2 + (effect1*effect2 | effect3), ...)

 or other variations.  When you specify a random effect or an random
 interaction term you must, either explicitly or implicitly, specify
 the form of the variance-covariance matrix associated with those
 random effects.

 I'll play around with this and see what I can get.

 The advantage that other software may provide for you is that it
 chooses the model for you but that, of course, means that you only
 have the one choice.

 I'm more than happy to stick to R, and to put more legwork into my models

 If you can describe how many variance components you think should be
 estimated in your model and what they would represent then I think it
 will be easier to describe how to fit the model.

 I'll work on that. Incidentally, what/where is the most comprehensive and up
 to date documentation for lme4? the pdfs coming with the package? I suspect
 knowing which are the right docs will help a lot in keeping me within the
 boundaries of civility and prevent me from annoying anyone (which is not
 something I sent forth to do on purpose).

Documentation for lme4 is pretty sketchy at present.  I hope to remedy
that during our summer break.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] how to check linearity in Cox regression

2008-05-12 Thread Terry Therneau

  Use pspline within a Cox model.  It includes a fairly general test for 
nonlinearity, that is similar to GAM models.
Terry Therneau


 coxph(Surv(time, status) ~ ph.ecog + pspline(age), lung)
Call:
coxph(formula = Surv(time, status) ~ ph.ecog + pspline(age), 
data = lung)

 coef   se(coef) se2 Chisq DF   p  
ph.ecog  0.4505 0.11766  0.11723 14.66 1.00 0.00013
pspline(age), linear 0.0112 0.00927  0.00927  1.45 1.00 0.23000
pspline(age), nonlin  2.96 3.08 0.41000

Iterations: 4 outer, 10 Newton-Raphson
 Theta= 0.797 
Degrees of freedom for terms= 1.0 4.1 
Likelihood ratio test=22.7  on 5.07 df, p=0.000412
  n=227 (1 observation deleted due to missingness)

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] predicting from coxph with pspline

2008-05-12 Thread Terry Therneau

 -== begin included message -
 Hello.

I get a bit confused by the output from the predict function when used
on an object from coxph in combination with p-spline, e.g. 

fit - coxph(Surv(time1, time2, status)~pspline(x), Data)
predict(fit, newdata=data.frame(x=1:2))

- end included --

  Yes, you should be confused.  Coxph still retains a well known problem with S 
models, namely that the prediction is incorrect when there are data-dependent 
transformations in the formula such as ns(), poly() or pspline().  That is, the 
set of basis functions chosen by pspline(x) depends on the range of x; for a 
new 
data prediction the basis functions are re-calculated, giving results that are 
wrong (unless the new x happens to have the exact same lower and upper limits).
  
  This is on my to-be-fixed list.  Once a few other things are cleared away 
(actually a long list of other things).   These fixes have been applied to lm 
and others for long enough now, coxph needs to catch up.
   
Terry Therneau

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] what kind of residuals are the ones calculated in coxph?

2008-05-12 Thread Chang Liu


Hi Gurus:
 
In the coxph() objects in Survival package, there is an attribute called 
residuals. Usually, there are several kinds for censored survival data. I can't 
seem to find in the documentation as to which one this is calculating. Anyone 
knows?
 
Karen
_


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Limiting size of pairs plots

2008-05-12 Thread Sébastien


Dear R-users,

This is a follow-up on a quite old post of mine which dealt with margins 
in the pairs function. I thought my problem was solved, but it doesn't 
seem so (see the code below).


I use the pairs function to produce matrix plots, where distinct groups 
are represented by dots of different colors within each panel. My 
troubles start when I add a legend at the bottom of the plot. The steps are:

1- I build my dataframe
2- I plot the graph and define the margins within pairs with oma (I want 
to add 5 + my number of group lines at the bottom)

3- I use gridBase functions to add the legend (symbol + text)

The result is an overlay of the legend on the plot. It looks like the 
lines do not have the same size when using pairs or the gridBase 
functions. Could anyone explain this discrepancy to me?


Thank you in advance for your help.

Sebastien


### START
library(gridBase)

my.Group.number=12

my.df-data.frame(rnorm(my.Group.number*20, mean=5, 
sd=2),rnorm(my.Group.number*20, mean=4, 
sd=1.6),rnorm(my.Group.number*20, mean=8, 
sd=3),rnorm(my.Group.number*20, mean=6, 
sd=1),rep(letters[1:my.Group.number],each=20))

names(my.df)-c(Var1,Var2,Var2,Var2,Group)

pairs(my.df[1:4],
 main = My Title,
 pch = 21,
 col =rainbow(n=my.Group.number)[unclass(my.df$Group)],
 bg = rainbow(n=my.Group.number)[unclass(my.df$Group)],
 oma = c(5 + my.Group.number, 3, 5, 3)
 )

 mylegend = paste(Group ,letters[1:my.Group.number],sep=)
 mylegend.width = strwidth(mylegend[which.max(nchar(mylegend))], 
figure, mylegend.Cex(mylegend,myTarget = 0.90))


 vps - baseViewports()
 pushViewport(vps$inner)

 for (j in 1:my.Group.number) {
 grid.text(mylegend[j],
   x = unit((1-mylegend.width)/2,npc),
   y = unit(1 + my.Group.number - j,lines),
   gp = gpar(cex = 1),
   just = left,
   draw=TRUE)

 grid.points(x = 
unit((1-mylegend.width)/2,npc)-convertUnit(unit(0.5,lines),npc),

 y = unit(1 + my.Group.number - j,lines),
 size = unit(0.5, lines),
 default.units = lines,
 pch = 21,
 gp = gpar(col = rainbow(n=my.Group.number)[j],fill = 
rainbow(n=my.Group.number)[j]),

 draw=TRUE)
   }

 END 

Prof Brian Ripley a écrit :

On Tue, 28 Aug 2007, Sébastien wrote:


Thanks you very much.
I did not thought about calling oma inside the pairs function...

Do you know by any chance why 'pairs' behaves differently from 
'plot', with regards to the plot dimension ?


Well, it is an array of figures so why should it be the same?  See 
chapter 12 of 'An Introduction to R'.




Prof Brian Ripley a écrit :

From ?pairs


 The graphical parameter 'oma' will be set by 'pairs.default'
 unless supplied as an argument.

so try

pairs(iris[1:4], main = Anderson's Iris Data -- 3 species, pch = 21,
  bg = c(red, green3, blue)[unclass(iris$Species)],
  oma = c(8,3,5,3))



On Tue, 28 Aug 2007, Sébastien wrote:


Dear R-users,

I would like to add a legend at the bottom of pairs plots (it's my 
first

use of this function). With the plot function, I usually add some
additional space at the bottom when I define the size of the graphical
device (using mar); grid functions then allows me to draw my legend 
as I

want.
Unfortunatley, this technique does not seem to work with the pairs
function as the generated plots use all the available space on the
device (see below). I guess I am missing a key argument... my attempts
to modify the oma, mar, usr arguments were unsuccesfull, and I 
could not

find any helpful threads on the archives.

As usual, any advice would be greatly appreciated

Sebastien


pdf(file=C:/test.pdf, width=6, height= 6 + 0.2*6)

par(mar=c(5 + 6,4,4,2)+0.1)

pairs(iris[1:4], main = Anderson's Iris Data -- 3 species, pch = 21,
bg = c(red, green3, blue)[unclass(iris$Species)])

dev.off()

__
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide 
http://www.R-project.org/posting-guide.html

and provide commented, minimal, self-contained, reproducible code.









__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Compact Indicator Matrices

2008-05-12 Thread amarkos

Thanks. It works!

I think I found another solution, working straight with the indicator
matrix.

 count - factor(table(apply(ind, 1, paste, collapse=)))

However, that way I can't store the indices of the collapsed rows.

-Angelos Markos

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] how to not sort factors when plotting

2008-05-12 Thread Lydia N. Slobodian

Thanks for all of the help! Everything's working beautifully now, and
I've accomplished in a few hours what it takes most of my colleagues
weeks to do, so I think I'll stick with R after all!

Lydia

On Mon, May 12, 2008 at 4:41 PM, Jorge Ivan Velez
[EMAIL PROTECTED] wrote:


 Hi Lydia,

 I compared my ratio function with Dimitris and Phil's suggestions. Please do
 NOT use my approach because it's painfully slow for a large vector (as Phil
 told me). Here is why (using Win XP SP2, Intel Core- 2 Duo 2.4 GHz, R 2.7.0
 Patched):


 # Vector
 x=rnorm(10,0,1)

 # Suggestion
 new.ratio=function(x) x[2:length(x)]/x[1:(length(x)-1)]

 # My horrible function
 my.ratio=function(x){

  temp=NULL
  for (n in 1:length(x)) temp=c(temp,x[n]/x[n-1])
   temp
  }

  # System time
 t=system.time(my.ratio(x))
 tnr=system.time(new.ratio(x))
  t
user  system elapsed
   38.790.06   39.31
  tnr
user  system elapsed
   0   0   0


 Thanks to all,

 Jorge



 On Mon, May 12, 2008 at 11:15 AM, Phil Spector [EMAIL PROTECTED]
 wrote:
  Another alternative would be to take advantage of R's vectorization:
 
 
  
  
   x=c(1,2,3,2,1,2,3)
   x[2:length(x)]/x[1:(length(x)-1)]
  
 
 
  [1] 2.000 1.500 0.667 0.500 2.000 1.500
 
  The solution using your ratio function will be painfully slow
  for a large vector.
 
- Phil Spector
  Statistical Computing Facility
  Department of Statistics
  UC Berkeley
  [EMAIL PROTECTED]
 
 


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Left censored responses in mixed effects models

2008-05-12 Thread Don MacQueen

I assume you've looked at the NADA package(?) While I don't believe 
it goes as far as dealing the mixed effects models, it might give you 
a starting point, and possibly some additional references.


-Don

At 9:08 AM -0700 5/12/08, Bert Gunter wrote:

Dear R Fellow-Travellers:

What is your recommended way of dealing with a left-censored response
(non-detects) in (linear Gaussian) mixed effects models?

Specifics: Response is a numeric positive measurement (of volume, actually);
but when it falls below some unknown and slightly random value (depending on
how the sample is prepared and measured), it cannot be measured and is
recorded as 0.

There is some statistical literature on this, but I was unable to find
anything that appeared to me to implement a strategy in any R package. If it
matters, I am less interested in inference than in removing possible bias in
estimation.

Feel free to respond off-list if you feel that this would not be of general
interest.

Cheers,

Bert Gunter
Genentech

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



--
--
Don MacQueen
Environmental Protection Department
Lawrence Livermore National Laboratory
Livermore, CA, USA
925-423-1062

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] lexicographic comparison of two vectors

2008-05-12 Thread Gabriel Valiente

Is there any built-in way to lexicographically compare two vectors of  
the same length in R? The textbook algorithm could be coded as follows:


lex.cmp - function (vec1,vec2) {
  for (j in 1:length(vec1)) {
if (vec1[j]  vec2[j]) { return(-1) }
if (vec1[j]  vec2[j]) { return(1) }
  }
  return(0)
}

Thanks,

Gabriel

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] [OT] xemacs on windows vista

2008-05-12 Thread Wensui Liu

Hi, dear all,
I just switch to vista (ultimate) and have heard there is some problem
for the installation of xemacs on vista. Is there any insight or
experience that you could share? I really appreciate any input.
thank you so much!

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] lexicographic comparison of two vectors

2008-05-12 Thread Duncan Murdoch


On 5/12/2008 2:58 PM, Gabriel Valiente wrote:
Is there any built-in way to lexicographically compare two vectors of  
the same length in R? The textbook algorithm could be coded as follows:


lex.cmp - function (vec1,vec2) {
   for (j in 1:length(vec1)) {
 if (vec1[j]  vec2[j]) { return(-1) }
 if (vec1[j]  vec2[j]) { return(1) }
   }
   return(0)
}



I don't think there's any standard function for this.  You could write 
one as above, or slightly faster as


lex.cmp - function(vec1, vec2) {
  index - which.min(vec1 == vec2)  # find the first diff
  sign(vec1[index] - vec2[index])   # assumes numeric
}

If you don't want to assume numeric data, you may need to expand that 
last line to a series of comparisons like yours, but with index in place 
of j, e.g.


  if (vec1[index]  vec2[index]) { return(-1) }
  if (vec1[index]  vec2[index]) { return(1) }
  return(0)

(unless there's a compare function in some package or other.)

Duncan Murdoch

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Left censored responses in mixed effects models

2008-05-12 Thread Gregor Gorjanc

Bert Gunter gunter.berton at gene.com writes:
 Dear R Fellow-Travellers:
 
 What is your recommended way of dealing with a left-censored response
 (non-detects) in (linear Gaussian) mixed effects models?

Your description of the data calls for a tobit model

http://en.wikipedia.org/wiki/Tobit_model

I think you need to take a look in the survival package.

Regards, Gregor

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [OT] xemacs on windows vista

2008-05-12 Thread Vincent Goulet


Le lun. 12 mai à 15:15, Wensui Liu a écrit :


Hi, dear all,
I just switch to vista (ultimate) and have heard there is some problem
for the installation of xemacs on vista. Is there any insight or
experience that you could share?


Yes: go with GNU Emacs. There doesn't seem to be any compelling reason  
to prefer XEmacs nowadays. And if you use my distribution


http://vgoulet.act.ulaval.ca/en/emacs

you get a wizard based installation with up-to-date AUCTeX and ESS  
built-in.


Otherwise, I have never heard of special issues on Vista.

HTH

Vincent



I really appreciate any input.
thank you so much!

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Several questions about MCMClogit

2008-05-12 Thread j t

Hello everybody,



I'm new to MCMClogit.  I'm trying to use MCMClogit to fit a logistic
regression model but I got some warnings I can't understand.



My input data X is 32(tissue sample)*20(genes) matrix, each element in this
matrix corresponds to the expression value of one particular gene in one of
32 samples. And the Y presents the corresponding classes (0-non cancer,
1-cancer) for those 32 samples. The formula is Y~. All other parameters are
default.



This is the output message:

@@

The Metropolis acceptance rate for beta was 0.0

@@

Warning messages:

1: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart =
etastart,  :

  fitted probabilities numerically 0 or 1 occurred

2: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart =
etastart,  :

  fitted probabilities numerically 0 or 1 occurred



Question 1:

What are the meanings of two warning messages?



Question 2:

How could I use the regression coefficients to predict for other data, in
other words, how could I extract those regression coefficients from the
result of MCMClogit.



I know maybe my questions are some basic but it already bothered me for
several days. I hope somebody can give me some hint about them.

BTW, is there any specific manual or examples for using MCMClogit, the
manual in package is not enough to figure out my problem.



Thanks and regards,



Chao

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] help with rpart

2008-05-12 Thread Linus An

Hi,

 I am using rpart as a part of my masters' project. I am trying to print out
the resulting model using plot() function along with text() function. I am
having difficulties with labels being cut-off. In text() function, I am
using use.n=T option to get the number of people in each nodes but the on
the lower and left part of the plot, the numbers get cut off. Thanks!

Linus

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] writing a table in the device (pdf in this case)

2008-05-12 Thread Greg Snow

You probably want to look at Sweave in the utils package or the odfWeave 
package.

Both let you set up a planned set of commands interspersed with text (notes, 
explanations, full report, etc.) and then you process the file and get the 
output (and commands) in either a LaTeX file or an OpenOffice writer document, 
either of which can then be processed to a pdf file.  Both give several options 
for what input/output to return, inclusion of tables and graphs, and much more.

Hope this helps,

--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
[EMAIL PROTECTED]
(801) 408-8111



 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of Julien Colomb
 Sent: Saturday, May 10, 2008 9:47 AM
 To: r-help@r-project.org
 Subject: [R] writing a table in the device (pdf in this case)

 hello all,
 I would like to introduce a summary table into the pdf along
 with the plots (in order to archive my data into single files
 automatically).
 Similarly, It would be great to have the result of the
 statistical analysis (for instance anova) in the same file.

 Is there a way to do that?


 example:
 pdf(example.pdf)
 layout (matrix(1:2,1,2))
 plot (groups, scores)

 Result - cbind (samplesize, mean_score, sem_score)
 #calculated before ???

 dev.off ()
 --
 ___

 Dr. Julien Colomb

 Genes et Dynamique des Systemes de Memoire CNRS UMR 7637
 ESPCI 10 rue Vauquelin
 75005 Paris
 France
 www.gdsm.espci.fr

 tel +33 (0)1 40 79 51 23
 fax +33 (0)1 40 79 52 29

 AIM (ichat) or skype: thrawny1
 [[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] RPM-style install (SLED 10.1)

2008-05-12 Thread Horace Tso

Stas, this doesn't solve your problem but may shed some light on what might 
have gone wrong. Just recently I installed R 2.7.0 under openSuse 10.3 in a 
brand new 64-bit Lenovo ThinkPad. The first install failed because of a 
dependency error just like yours. It complained it couldn't find BLAS and 
libgfortran. I then got on the varous repositories and installed a couple 
flavours (don't remember how many) of fortran and finally it went through. 
Right now, in my /usr/lib64, i have libgfortran.so.2.0.0.

HTH.

Horace




-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Stas Kolenikov
Sent: Monday, May 12, 2008 7:33 AM
To: r-help@r-project.org
Subject: [R] RPM-style install (SLED 10.1)

I am trying to install R on a SLED 10.1 machine.
R-base-2.7.0-7.1-i586.rpm fails with

[EMAIL PROTECTED]:~/RPMs rpm -Uvh R-base-2.7.0-7.1.i586.rpm
warning: R-base-2.7.0-7.1.i586.rpm: Header V3 DSA signature: NOKEY,
key ID 14ec5930
error: Failed dependencies:
libgfortran.so.1 is needed by R-base-2.7.0-7.1.i586

I tried to trick it into believing there's the library by setting up the link:

[EMAIL PROTECTED]:~/RPMs ls -l /usr/lib/libgf*
lrwxrwxrwx 1 root root  29 2008-05-08 17:45
/usr/lib/libgfortran.so.1 - /usr/lib/libgfortran.so.3.0.0
lrwxrwxrwx 1 root root  20 2008-05-08 17:37
/usr/lib/libgfortran.so.3 - libgfortran.so.3.0.0
-rwxr-xr-x 1 root root 2251139 2008-05-01 09:43 /usr/lib/libgfortran.so.3.0.0

but that did not work, either. I went on and installed GCC's gfortran,
but it did not provide libgfortran.so.1 either:

[EMAIL PROTECTED]:~/RPMs ls -l /usr/irun/lib/libgf*
-rw-r--r-- 1 1005 1011 4330512 2008-03-02 02:29 /usr/irun/lib/libgfortran.a
-rwxr-xr-x 1 1005 10111009 2008-03-02 02:29 /usr/irun/lib/libgfortran.la
-rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29 /usr/irun/lib/libgfortran.so
-rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29 /usr/irun/lib/libgfortran.so.3
-rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29
/usr/irun/lib/libgfortran.so.3.0.0

Any reason R wants that package instead of the newer one? Of course
rpm --nodeps was an option, but then it fails to load r-stats with the
same message about the missing library:

Error in dyn.load(file, DLLpath = DLLpath, ...) :
  unable to load shared library '/usr/lib/R/library/stats/libs/stats.so':
  /usr/lib/R/library/stats/libs/stats.so: undefined symbol: _gfortran_pow_r8_i4
During startup - Warning message:
package stats in options(defaultPackages) was not found

I am stuck... I am trying to compile R from 2.6.2 sources in the
meantime, but that's painful for a Linux newbie like me on a brand-new
computer that does not necessarily have all the packages. I almost
want to switch to Ubuntu as R installation back there was a single
click :))

--
Stas Kolenikov, also found at http://stas.kolenikov.name
Small print: Please do not reply to my Gmail address as I don't check
it regularly.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Several questions about MCMClogit

2008-05-12 Thread j t

Hello everybody,

I'm new to MCMClogit.  I'm trying to use MCMClogit to fit a logistic
regression model but I got some warnings I can't understand.

My input data X is 32(tissue sample)*20(genes) matrix, each element in this
matrix corresponds to the expression value of one particular gene in one of
32 samples. And the Y presents the corresponding classes (0-non cancer,
1-cancer) for those 32 samples. The formula is Y~. All other parameters are
default.

This is the output message:
@@
The Metropolis acceptance rate for beta was 0.0
@@
Warning messages:
1: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart =
etastart,  :
  fitted probabilities numerically 0 or 1 occurred
2: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart =
etastart,  :
  fitted probabilities numerically 0 or 1 occurred

Question 1:
What are the meanings of two warning messages?

Question 2:
How could I use the regression coefficients to predict for other data, in
other words, how could I extract those regression coefficients from the
result of MCMClogit.

I know maybe my questions are some basic but it already bothered me for
several days. I hope somebody can give me some hint about them.
BTW, is there any specific manual or examples for using MCMClogit, the
manual in package is not enough to figure out my problem.

Thanks and regards,

Chao

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Converting qqplot2 qplot() to grammar?

2008-05-12 Thread B. Bogart

Hello all,

I've been using the following qplot command:

qplot(pixX,pixY, data=som, geom=tile, fill=rgb) +
scale_fill_identity() + opts(aspect.ratio = .75) + facet_grid(unitX ~ unitY)

Now I would like to convert it into the explicit ggplot grammar, so I
can remove the extras: axes, labels, background, borders, facet labels,
and extra white-space around the plot. (If anyone has suggestions on
removing these please let me know)

I've come up with the following but it is not behaving the same way as
the qplot above:

ggplot(data = somdf, mapping = aes(x = pixX, y = pixY)) +
layer(data = somdf, geom = tile, fill=rgb) +
scale_y_continuous(name= ,breaks= ) +
scale_x_continuous(name= ,breaks= ) +
scale_fill_identity() +
coord_cartesian() +
opts(aspect.ratio = .75) +
facet_grid(unitX ~ unitY,margins=FALSE)

The result is a plot where all tiles are filled with grey50, and not the
data values. I've also tried this variation with the same results:

ggplot(data = somdf, mapping = aes(x = pixX, y = pixY)) +
geom_tile(data = somdf, fill=rgb) +
scale_y_continuous(name= ,breaks= ) +
scale_x_continuous(name= ,breaks= ) +
scale_fill_identity() +
coord_cartesian() +
opts(aspect.ratio = .75) +
facet_grid(unitX ~ unitY,margins=FALSE)

The other issue I'm having is that the above seems to create multiple
plots, another, apparently identical, plot gets drawn after I call
dev.off(). I have no idea why this would be so.

Setting name and breaks to   seems like a hack to get rid of the axis
stuff, is there a better way?

Oh, and I can't find documentation for opts() on the ggplot2 website,
where is it available?

Thanks all, Hadley in particular,
B. Bogart

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Several questions about MCMClogit

2008-05-12 Thread j t

Hello everybody,

I'm new to MCMClogit.  I'm trying to use MCMClogit to fit a logistic
regression model but I got some warnings I can't understand.

My input data X is 32(tissue sample)*20(genes) matrix, each element in this
matrix corresponds to the expression value of one particular gene in one of
32 samples. And the Y presents the corresponding classes (0-non cancer,
1-cancer) for those 32 samples. The formula is Y~. All other parameters are
default.

This is the output message:
@@
The Metropolis acceptance rate for beta was 0.0
@@
Warning messages:
1: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart =
etastart,  :
  fitted probabilities numerically 0 or 1 occurred
2: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart =
etastart,  :
  fitted probabilities numerically 0 or 1 occurred

Question 1:
What are the meanings of two warning messages?

Question 2:
How could I use the regression coefficients to predict for other data, in
other words, how could I extract those regression coefficients from the
result of MCMClogit.

I know maybe my questions are some basic but it already bothered me for
several days. I hope somebody can give me some hint about them.
BTW, is there any specific manual or examples for using MCMClogit, the
manual in package is not enough to figure out my problem.

Thanks and regards,

Chao

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Several questions about MCMClogit

2008-05-12 Thread j t

Hello everybody,

I'm new to MCMClogit.  I'm trying to use MCMClogit to fit a logistic
regression model but I got some warnings I can't understand.

My input data X is 32(tissue sample)*20(genes) matrix, each element in
this matrix corresponds to the expression value of one particular gene
in one of 32 samples. And the Y presents the corresponding classes
(0-non cancer, 1-cancer) for those 32 samples. The formula is Y~. All
other parameters are default.

This is the output message:
@@
The Metropolis acceptance rate for beta was 0.0
@@
Warning messages:
1: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart
= etastart,  :
  fitted probabilities numerically 0 or 1 occurred
2: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart
= etastart,  :
  fitted probabilities numerically 0 or 1 occurred

Question 1:
What are the meanings of two warning messages?

Question 2:
How could I use the regression coefficients to predict for other data,
in other words, how could I extract those regression coefficients from
the result of MCMClogit.

I know maybe my questions are some basic but it already bothered me
for several days. I hope somebody can give me some hint about them.
BTW, is there any specific manual or examples for using MCMClogit, the
manual in package is not enough to figure out my problem.

Thanks and regards,

Chao

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [OT] xemacs on windows vista

2008-05-12 Thread Esmail Bonakdarian


Wensui Liu wrote:

Hi, dear all,
I just switch to vista (ultimate) and have heard there is some problem
for the installation of xemacs on vista. Is there any insight or
experience that you could share? I really appreciate any input.
thank you so much!


Hi,

I don't know about XEmacs, but I am using a specially bundled
version of GNU Emacs by Vincet Goulet which is great. I was
using a plain-vanilla version of GNU Emacs before, but this one
has ESS  and spell check support built in :-)

I am using it under XP .. I believe this also works under Vista

Check out:

   http://vgoulet.act.ulaval.ca/en/ressources/emacs/

HTH,

Esmail

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Several questions about MCMClogit

2008-05-12 Thread j t

Hello everybody,

I'm new to MCMClogit.  I'm trying to use MCMClogit to fit a logistic
regression model but I got some warnings I can't understand.

My input data X is 32(tissue sample)*20(genes) matrix, each element in
this matrix corresponds to the expression value of one particular gene
in one of 32 samples. And the Y presents the corresponding classes
(0-non cancer, 1-cancer) for those 32 samples. The formula is Y~. All
other parameters are default.

This is the output message:
@@
The Metropolis acceptance rate for beta was 0.0
@@
Warning messages:
1: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart
= etastart,  :
 fitted probabilities numerically 0 or 1 occurred
2: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart
= etastart,  :
 fitted probabilities numerically 0 or 1 occurred

Question 1:
What are the meanings of two warning messages?

Question 2:
How could I use the regression coefficients to predict for other data,
in other words, how could I extract those regression coefficients from
the result of MCMClogit.

I know maybe my questions are some basic but it already bothered me
for several days. I hope somebody can give me some hint about them.
BTW, is there any specific manual or examples for using MCMClogit, the
manual in package is not enough to figure out my problem.

Thanks and regards,

Chao

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] test

2008-05-12 Thread j t

Sorry to bother your.
I am trying to post my question for more than 10 times, but I still
didn't see it.
It drives my crazy!!!

It is a test for posting some simple pure text.

Chao

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] help with rpart

2008-05-12 Thread Prof Brian Ripley


Some answers are on the help pages for plot.rpart and text.rpart.

On Mon, 12 May 2008, Linus An wrote:


Hi,

I am using rpart as a part of my masters' project. I am trying to print out
the resulting model using plot() function along with text() function. I am
having difficulties with labels being cut-off. In text() function, I am
using use.n=T option to get the number of people in each nodes but the on
the lower and left part of the plot, the numbers get cut off. Thanks!

Linus

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


PLEASE do.


--
Brian D. Ripley,  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Format integer

2008-05-12 Thread Anh Tran

Hi,
What's one way to convert an integer to a string with preceding 0's?
such that
'13' becomes '013'
to be put into a string

I've tried formatC, but they removes all the zeros and replace it with
blanks

Thanks

-- 
Regards,
Anh Tran

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] is.category

2008-05-12 Thread Applejus


Hello,

Could someone tell me what the SPLUS is.category function do and what is
its equivalent in R?
Thank you, I couldn't find any help elsewhere...

 
-- 
View this message in context: 
http://www.nabble.com/is.category-tp1719p1719.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Format integer

2008-05-12 Thread Anh Tran

Thanks. formatC(flag) works.

But it's awefully slow. I try to do that for 65000 numbers (generating ID
for each item) and it seems like forever.
Is there any faster way?

Thank all.

Anh Tran

On Mon, May 12, 2008 at 2:36 PM, Uwe Ligges 
[EMAIL PROTECTED] wrote:



 Anh Tran wrote:

  Hi,
  What's one way to convert an integer to a string with preceding 0's?
  such that
  '13' becomes '013'
  to be put into a string
 
  I've tried formatC, but they removes all the zeros and replace it with
  blanks
 

 Not so for me:

 formatC(13, digits=10, flag=0)

 Uwe LIgges



  Thanks
 
 


-- 
Regards,
Anh Tran

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] help on plot title

2008-05-12 Thread Lisa

Hi, I am do some plotting and need to add title to each of the plots, Can
anyone help me on how to add the variables to the title?

here it is the program, the title i want should look like, j=1, j=2..,
but if i use title(j=,j), the j will go to the x axis. Can anyone help me
on this?
thanka s lot


for (j in 1:6) {

   y-mx.j(x,j)


   sx-mx.j(x,j)+sigma*ei
   plot(x,y, type=l,xlab=x, ylab=)

   title(j=,j)
}

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] test

2008-05-12 Thread Tony Plate

You probably should check this section in your R-help 
subscription options (via 
https://stat.ethz.ch/mailman/options/r-help/, I think):



Receive your own posts to the list?

Ordinarily, you will get a copy of every message you post to the list. If you don't want to receive this copy, set this option to No. 


I see 5 identical posts with the subject Several questions 
about MCMClogit on R-help recently.


-- Tony Plate

j t wrote:

Sorry to bother your.
I am trying to post my question for more than 10 times, but I still
didn't see it.
It drives my crazy!!!

It is a test for posting some simple pure text.

Chao

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Format integer

2008-05-12 Thread Charilaos Skiadas


On May 12, 2008, at 5:22 PM, Anh Tran wrote:


Hi,
What's one way to convert an integer to a string with preceding 0's?
such that
'13' becomes '013'
to be put into a string

I've tried formatC, but they removes all the zeros and replace it with
blanks


formatC(13, width=10, format=d, flag=0)


Thanks

--
Regards,
Anh Tran


Haris Skiadas
Department of Mathematics and Computer Science
Hanover College

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Format integer

2008-05-12 Thread Uwe Ligges




Anh Tran wrote:

Thanks. formatC(flag) works.

But it's awefully slow. I try to do that for 65000 numbers (generating ID
for each item) and it seems like forever.


On my not that recent laptop:

 system.time(formatC(1:65000, width=10, flag=0))
   user  system elapsed
   1.920.001.94


I think 2 seconds is less than forever.

Uwe Ligges






Is there any faster way?

Thank all.

Anh Tran

On Mon, May 12, 2008 at 2:36 PM, Uwe Ligges 
[EMAIL PROTECTED] wrote:



Anh Tran wrote:


Hi,
What's one way to convert an integer to a string with preceding 0's?
such that
'13' becomes '013'
to be put into a string

I've tried formatC, but they removes all the zeros and replace it with
blanks


Not so for me:

formatC(13, digits=10, flag=0)

Uwe LIgges




Thanks







__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Format integer

2008-05-12 Thread Uwe Ligges




Anh Tran wrote:

Hi,
What's one way to convert an integer to a string with preceding 0's?
such that
'13' becomes '013'
to be put into a string

I've tried formatC, but they removes all the zeros and replace it with
blanks


Not so for me:

formatC(13, digits=10, flag=0)

Uwe LIgges




Thanks



__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Format integer

2008-05-12 Thread Anh Tran

Yea, thanks all. I checked back and I got a few things mistyped.
The array is 650,000 and it took 25 seconds :p. It's acceptable. Just that I
had too many variable at the time I ran it.

Also, seems like sprintf is a little faster.

Thanks all.

Anh Tran


On Mon, May 12, 2008 at 2:55 PM, Uwe Ligges [EMAIL PROTECTED]
wrote:



 Anh Tran wrote:

  Thanks. formatC(flag) works.
 
  But it's awefully slow. I try to do that for 65000 numbers (generating
  ID
  for each item) and it seems like forever.
 

 On my not that recent laptop:

  system.time(formatC(1:65000, width=10, flag=0))
   user  system elapsed
   1.920.001.94


 I think 2 seconds is less than forever.

 Uwe Ligges






  Is there any faster way?
 
  Thank all.
 
  Anh Tran
 
  On Mon, May 12, 2008 at 2:36 PM, Uwe Ligges 
  [EMAIL PROTECTED] wrote:
 
 
   Anh Tran wrote:
  
Hi,
What's one way to convert an integer to a string with preceding 0's?
such that
'13' becomes '013'
to be put into a string
   
I've tried formatC, but they removes all the zeros and replace it
with
blanks
   
 Not so for me:
  
   formatC(13, digits=10, flag=0)
  
   Uwe LIgges
  
  
  
Thanks
   
   
   
 
 


-- 
Regards,
Anh Tran

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Format integer

2008-05-12 Thread Phil Spector


I guess little means different things to different people:


x = sample(1:100,65,replace=TRUE)
system.time(a-formatC(x,digits=10,flag='0'))

   user  system elapsed
 32.854   0.444  34.813

system.time(b-sprintf(%011d,x))

   user  system elapsed
  0.352   0.012   0.363

If you look at the definitions of the functions, you'll see
that formatC is written in R, and sprintf uses a single call
to an .Internal function.   I

   - Phil Spector
 Statistical Computing Facility
 Department of Statistics
 UC Berkeley
 [EMAIL PROTECTED]



On Mon, 12 May 2008, Anh Tran wrote:


Yea, thanks all. I checked back and I got a few things mistyped.
The array is 650,000 and it took 25 seconds :p. It's acceptable. Just that I
had too many variable at the time I ran it.

Also, seems like sprintf is a little faster.

Thanks all.

Anh Tran


On Mon, May 12, 2008 at 2:55 PM, Uwe Ligges [EMAIL PROTECTED]
wrote:




Anh Tran wrote:


Thanks. formatC(flag) works.

But it's awefully slow. I try to do that for 65000 numbers (generating
ID
for each item) and it seems like forever.



On my not that recent laptop:


system.time(formatC(1:65000, width=10, flag=0))

  user  system elapsed
  1.920.001.94


I think 2 seconds is less than forever.

Uwe Ligges






 Is there any faster way?


Thank all.

Anh Tran

On Mon, May 12, 2008 at 2:36 PM, Uwe Ligges 
[EMAIL PROTECTED] wrote:



Anh Tran wrote:

 Hi,

What's one way to convert an integer to a string with preceding 0's?
such that
'13' becomes '013'
to be put into a string

I've tried formatC, but they removes all the zeros and replace it
with
blanks

 Not so for me:


formatC(13, digits=10, flag=0)

Uwe LIgges



 Thanks










--
Regards,
Anh Tran

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [R-sig-ME] lme nesting/interaction advice

2008-05-12 Thread Rolf Turner



On 13/05/2008, at 4:09 AM, Douglas Bates wrote:


I'm entering this discussion late so I may be discussing issues that
have already been addressed.

As I understand it, Federico, you began by describing a model for data
in which two factors have a fixed set of levels and one factor has an
extensible, or random, set of levels and you wanted to fit a model
that you described as

y ~ effect1 * effect2 * effect3

The problem is that this specification is not complete.


At *last* (as Owl said to Rabbit) we're getting somewhere!!!

I always knew that there was some basic fundamental point
about this business that I (and I believe many others) were
simply missing.  But I could not for the life of me get anyone
to explain to me what that point was.  Or to put it another
way, I was never able to frame a question that would illuminate
just what it was that I wasn't getting.

I now may be at a stage where I can start asking the right
questions.


An interaction of factors with fixed levels and a factor with random
levels can mean, in the lmer specification,

lmer(y ~ effect1 * effect2 + (1| effect3) + (1| 
effect1:effect2:effect3), ...)


or

lmer(y ~ effect1 * effect2 + (effect1*effect2 | effect3), ...)

or other variations.  When you specify a random effect or an random
interaction term you must, either explicitly or implicitly, specify
the form of the variance-covariance matrix associated with those
random effects.

The advantage that other software may provide for you is that it
chooses the model for you but that, of course, means that you only
have the one choice.


Now may I start asking what I hope are questions that will lift
the fog a bit?

Let us for specificity consider a three-way model with two
fixed effects and one random effect from the good old Rothamstead style
agricultural experiment context:  Suppose we have a number of
species/breeds of wheat (say) and a number of fertilizers.
These are fixed effects.  And we have a number of fields (blocks)
--- a random effect.  Each breed-fertilizer combination is
applied a number of times in each field.  We ***assume*** that
that the field or block effect is homogeneous throughout.  This
may or may not be a ``good'' assumption, but it's not completely
ridiculous and would often be made in practice.  And probably
*was* made at Rothamstead.  The response would be something like
yield in bushels per acre.

The way that I would write the ``full'' model for this setting,
in mathematical style is:

Y_ijkl = mu + alpha_i + beta_j + (alpha.beta)_ij + C_k + (alpha.C)_ik
+ (beta.C)_jk + (alpha.beta.C)_ijk + E_ijkl

	The alpha_i and beta_j are parameters corresponding to breed and  
fertilizer
	respectively; the C_k are random effects corresponding to fields or  
blocks.

Any effect ``involving'' C is also random.

	The assumptions made by the Package-Which-Must-Not-Be-Named are (I  
think)

that

C_k ~ N(0,sigma_C^2)
(alpha.C)_ik ~ N(0,sigma_aC^2)
(beta.C)jk ~ N(0,sigma_bC^2)
(alpha.beta.C)_ijk ~ N(0,sigma_abC^2)
E_ijkl ~ N(0,sigma^2)

and these random variables are *all independent*.

A ... perhaps I'm on the way to answering my own question.  Is
it this assumption of ``all independent'' which is questionable?  It
seemed innocent enough when I first learned about this stuff, lo these
many years ago.  But  maybe not!

To start with:  What would be the lmer syntax to fit the foregoing
(possibly naive) model?  I am sorry, but I really cannot get my head
around the syntax of lmer model specification, and I've tried.  I
really have.  Hard.  I know I must be starting from the wrong place,
but I haven't a clue as to what the *right* place to start from is.
And if I'm in that boat, I will wager Euros to pretzels that there
are others in it.  I know that I'm not the brightest bulb in the
chandelier, but I'm not the dullest either.

Having got there:  Presuming that I'm more-or-less on the right track
	in my foregoing conjecture that it's the over-simple dependence  
structure
	that is the problem with what's delivered by the Package-Which-Must- 
Not-Be-Named,
	how might one go about being less simple-minded?  I.e. what might be  
some
	more realistic dependence structures, and how would one specify  
these in lmer?

And how would one assess whether the assumed dependence structure gives
a reasonable fit to the data?


If you can describe how many variance components you think should be
estimated in your model and what they would represent then I think it
will be easier to describe how to fit the model.


How

Re: [R] RPM-style install (SLED 10.1)

2008-05-12 Thread Stas Kolenikov

That's what I ended up doing. I've had complaints about blas, too, but
I rpm-ed them out with --nodeps. I've compiled R from the sources (I
had to add some -devel- packages, too), and then copied over the
stats.*.so library from my fully successful 2.6.2 compilation to
semi-successful 2.7.0 installation. So I cannot actually tell if I
have 2.7.0 or 2.6.2 on this computer... and I don't know if any other
problems may appear later.

And yes I have a ThinkPad although it is 32 bit.

On 5/12/08, Horace Tso [EMAIL PROTECTED] wrote:
 Stas, this doesn't solve your problem but may shed some light on what might 
 have gone wrong. Just recently I installed R 2.7.0 under openSuse 10.3 in a 
 brand new 64-bit Lenovo ThinkPad. The first install failed because of a 
 dependency error just like yours. It complained it couldn't find BLAS and 
 libgfortran. I then got on the varous repositories and installed a couple 
 flavours (don't remember how many) of fortran and finally it went through. 
 Right now, in my /usr/lib64, i have libgfortran.so.2.0.0.


  -Original Message-
  From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Stas Kolenikov
  Sent: Monday, May 12, 2008 7:33 AM
  To: r-help@r-project.org
  Subject: [R] RPM-style install (SLED 10.1)

  I am trying to install R on a SLED 10.1 machine.
  R-base-2.7.0-7.1-i586.rpm fails with

  [EMAIL PROTECTED]:~/RPMs rpm -Uvh R-base-2.7.0-7.1.i586.rpm
  warning: R-base-2.7.0-7.1.i586.rpm: Header V3 DSA signature: NOKEY,
  key ID 14ec5930
  error: Failed dependencies:
 libgfortran.so.1 is needed by R-base-2.7.0-7.1.i586

  I tried to trick it into believing there's the library by setting up the 
 link:

  [EMAIL PROTECTED]:~/RPMs ls -l /usr/lib/libgf*
  lrwxrwxrwx 1 root root  29 2008-05-08 17:45
  /usr/lib/libgfortran.so.1 - /usr/lib/libgfortran.so.3.0.0
  lrwxrwxrwx 1 root root  20 2008-05-08 17:37
  /usr/lib/libgfortran.so.3 - libgfortran.so.3.0.0
  -rwxr-xr-x 1 root root 2251139 2008-05-01 09:43 /usr/lib/libgfortran.so.3.0.0

  but that did not work, either. I went on and installed GCC's gfortran,
  but it did not provide libgfortran.so.1 either:

  [EMAIL PROTECTED]:~/RPMs ls -l /usr/irun/lib/libgf*
  -rw-r--r-- 1 1005 1011 4330512 2008-03-02 02:29 /usr/irun/lib/libgfortran.a
  -rwxr-xr-x 1 1005 10111009 2008-03-02 02:29 /usr/irun/lib/libgfortran.la
  -rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29 /usr/irun/lib/libgfortran.so
  -rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29 
 /usr/irun/lib/libgfortran.so.3
  -rwxr-xr-x 1 1005 1011 2509309 2008-03-02 02:29
  /usr/irun/lib/libgfortran.so.3.0.0

  Any reason R wants that package instead of the newer one? Of course
  rpm --nodeps was an option, but then it fails to load r-stats with the
  same message about the missing library:

  Error in dyn.load(file, DLLpath = DLLpath, ...) :
   unable to load shared library '/usr/lib/R/library/stats/libs/stats.so':
   /usr/lib/R/library/stats/libs/stats.so: undefined symbol: 
 _gfortran_pow_r8_i4
  During startup - Warning message:
  package stats in options(defaultPackages) was not found

  I am stuck... I am trying to compile R from 2.6.2 sources in the
  meantime, but that's painful for a Linux newbie like me on a brand-new
  computer that does not necessarily have all the packages. I almost
  want to switch to Ubuntu as R installation back there was a single
  click :))

  --
  Stas Kolenikov, also found at http://stas.kolenikov.name
  Small print: Please do not reply to my Gmail address as I don't check
  it regularly.


 __
  R-help@r-project.org mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.



-- 
Stas Kolenikov, also found at http://stas.kolenikov.name

Small print: Please do not reply to my Gmail address as I don't check
it regularly.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] combining bar and column graphs?

2008-05-12 Thread Me


Michael Kubovy wrote:

Perhaps
?mosaic
or
?mosaicplot


Thanks for these helpful suggestions - I will try them both out -

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [OT] xemacs on windows vista

2008-05-12 Thread David Scott



Just in case you think Vincent is pushing his distribution as opposed to 
some other approach, let me endorse his suggestion as someone who has 
recently made the change to GNU Emacs.


Vincent's distribution is truly very nice. I have installed it on Vista 
without any problems and for anyone who uses TeX and R it is an excellent 
option.


I haven't checked John Fox's advice concerning XEmacs lately but in the 
past it has been the best source of information about XEmacs and R if you 
do still want to stick with XEmacs.


David Scott


On Mon, 12 May 2008, Vincent Goulet wrote:


Le lun. 12 mai à 15:15, Wensui Liu a écrit :


Hi, dear all,
I just switch to vista (ultimate) and have heard there is some problem
for the installation of xemacs on vista. Is there any insight or
experience that you could share?


Yes: go with GNU Emacs. There doesn't seem to be any compelling reason to 
prefer XEmacs nowadays. And if you use my distribution


http://vgoulet.act.ulaval.ca/en/emacs

you get a wizard based installation with up-to-date AUCTeX and ESS built-in.

Otherwise, I have never heard of special issues on Vista.

HTH

Vincent



I really appreciate any input.
thank you so much!

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide 
http://www.R-project.org/posting-guide.html

and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


_
David Scott Department of Statistics, Tamaki Campus
The University of Auckland, PB 92019
Auckland 1142,NEW ZEALAND
Phone: +64 9 373 7599 ext 86830 Fax: +64 9 373 7000
Email:  [EMAIL PROTECTED]

Graduate Officer, Department of Statistics
Director of Consulting, Department of Statistics
__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] help on plot title

2008-05-12 Thread Bert Gunter

Please read the docs (starting with An Introduction to R) . It is
unreasonable to expect to use command line based software without first
reading the how to manuals.

Also, and always, read the man pages: e.g. ?plot, ?plot.default and
appropriate links.

Finally, as a matter of etiquette, signed emails are more likely to get
helpful responses. BTW, is this a homework problem?

Cheers,
Bert

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On
Behalf Of Lisa
Sent: Monday, May 12, 2008 2:45 PM
To: r-help@r-project.org
Subject: [R] help on plot title

Hi, I am do some plotting and need to add title to each of the plots, Can
anyone help me on how to add the variables to the title?

here it is the program, the title i want should look like, j=1, j=2..,
but if i use title(j=,j), the j will go to the x axis. Can anyone help me
on this?
thanka s lot


for (j in 1:6) {

   y-mx.j(x,j)


   sx-mx.j(x,j)+sigma*ei
   plot(x,y, type=l,xlab=x, ylab=)

   title(j=,j)
}

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

1 2 >

1 - 100 of 103 matches

Mail list logo