date:20111122




On 22.11.2011 12:37, french-connect...@gmx.net wrote:

Dear all

I am trying to make a graphic with the vioplot package. I use the following 
code:

library(vioplot)
x1- GSMrxDL
x2- WIFI
x3- UMT
vioplot(x1, x2, x3, ylim=c(0, 10), names=c(GSMrxDL, WIFI, UMT), 
col=gold)
title(NIS Strahlung, xlab=Sender, ylab=V/m)

Now I want to scale the y-axis logarithmically, i.e. 0.01; 0.1; 1; 10. How can 
I do this?


I think you will have to tweak the underlying code. A feature request to 
the authors may help, a patch that adds the requested feature may help 
even more.


Best,
Uwe Ligges



Thank you very much

Alex


--

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] logarithmically scaled y-axis in vioplot




On 22.11.2011 12:37, french-connect...@gmx.net wrote:

Dear all

I am trying to make a graphic with the vioplot package. I use the following 
code:

library(vioplot)
x1- GSMrxDL
x2- WIFI
x3- UMT
vioplot(x1, x2, x3, ylim=c(0, 10), names=c(GSMrxDL, WIFI, UMT), 
col=gold)
title(NIS Strahlung, xlab=Sender, ylab=V/m)

Now I want to scale the y-axis logarithmically, i.e. 0.01; 0.1; 1; 10. How can 
I do this?


[resending this since my mailtool forgot to add the address of the OP]

I think you will have to tweak the underlying code. A feature request to 
the authors may help, a patch that adds the requested feature may help 
even more.


Best,
Uwe Ligges



Thank you very much

Alex


--

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] singular matrix in composition package

PLEASE do read the posting guide 
http://www.R-project.org/posting-guide.html and provide commented, 
minimal, self-contained, reproducible code!


On 22.11.2011 05:13, nur mohd wrote:

Dear R user


There are thousands on this list, none of them able to help you.


how to solve the error of singular matrix in composition package


I guess the matrix is not provided by the package but caused by the data 
*you* provided. So try to find out why the matrix is singular and the it 
is much easier to tackle it.


Uwe Ligges





thanks in advance
ros

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Read from HTML file, print if see a specific string

2011-11-22 Thread R. Michael Weylandt michael.weyla...@gmail.com




On 21.11.2011 21:51, yesitsjess wrote:

So basically I have made a HTML file with a table in it.

Column 3 contains a GenBank number and is always proceeded by =GenBank.

I want to read the file and return the number which comes directly after
this (the contents of column 3).

Ideally I would like to save this number as a string for use later in the
script, but any help is good help. Been at this for HOURS and HOURS and
HOURS.

This is what I have so far:

#Open html file
$filename1 = 'Result1.html';
open (FILE1, $filename1) or die Can't open $filename1 $!;
#Create array of results file data
@html1 =FILE1;
close (FILE1);
#What to look for
$genbank='^=GenBank\$';

foreach $html1(@html1)
 {
 if (index($html1,$genbank) ge 0)
 {
 print Number here!\n;
 }
 }



This is not R code. Please note you wrote to the *R*-help list!

Uwe Ligges




--
View this message in context: 
http://r.789695.n4.nabble.com/Read-from-HTML-file-print-if-see-a-specific-string-tp4093273p4093273.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [OT] 1 vs 2-way anova technical question

2011-11-22 Thread Liviu Andronic

On Tue, Nov 22, 2011 at 2:09 PM, Giovanni Azua brave...@gmail.com wrote:
 Mr. Gunter did not read/understand my problem, and there were no useful tips 
 but only ad hominem attacks. By your side-taking I suspect you are in the 
 same party club if you want to defend him maybe you should start by tying 
 better your dog so to speak.

I believe that most of the readers of this thread got put off by your
offending and misplaced remarks. To echo other posters, it would be
nice to get your e-mail address banned from the list.

Regards
Liviu

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Cp -Inf

2011-11-22 Thread Stephen Sefick


Hello All:

I am using the leaps package on scale and centered data for an 
exhaustive search.  There are Cp values of -Inf being returned for all 
models.  I was going to look at the source before contacting the list, 
but it has been a while since I have looked under the hood.  There are 
.rdb and .rdx files where I expected the source files to be.  I am sure 
that I have over looked something.  I can provide data and code if it is 
needed.


Stephen

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] [OT] 1 vs 2-way anova technical question

2011-11-22 Thread Giovanni Azua


On Nov 22, 2011, at 3:52 PM, Liviu Andronic wrote:

 On Tue, Nov 22, 2011 at 2:09 PM, Giovanni Azua brave...@gmail.com wrote:
 Mr. Gunter did not read/understand my problem, and there were no useful tips 
 but only ad hominem attacks. By your side-taking I suspect you are in the 
 same party club if you want to defend him maybe you should start by tying 
 better your dog so to speak.
 
 I believe that most of the readers of this thread got put off by your
 offending and misplaced remarks. To echo other posters, it would be
 nice to get your e-mail address banned from the list.

If needed go ahead and do so, your blockade won't stop my learning efforts. I 
don't see any concrete reason why I should be taking bullets from random people 
who fancy themselves with superiority and arrogance. And as usual these bullies 
always seem to win.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Drawing ticks in the 3rd and 4th row of a lattice

2011-11-22 Thread Deepayan Sarkar

On Fri, Nov 18, 2011 at 11:22 AM, Ashim Kapoor ashimkap...@gmail.com wrote:
 Dear all,

 I want to draw ticks on the 3rd and 4th row of a lattice. How do I do this
 ? In my search of the help, I discovered a parameter alternating,which kind
 of says where the ticks will be but does not suffice for me.

You need to explain more clearly what you want. Your plot does already
have ticks in the 3rd and 4th row. If you need the x-axes labeled in
each row, you need to have something like

 scales=list(x = list(relation = free, rot = 45, ...

-Deepayan

 I am running this command : -

 barchart(X03/1000~time|Company,
         data=df1[which(df1$time!=1),],
         horiz=F,

 scales=list(x=list(rot=45,labels=paste(Mar,c(07,08,09,10,11
         ,par.strip.text=list(lineheight=1,lines=2,cex=.75),
         ylab = In Rs.
 Million,xlab=,layout=c(3,4),as.table=T,between=list(y=1))


 where my data is  : -

 dput(df1)
 structure(list(Company = structure(c(9L, 7L, 1L, 6L, 8L, 4L,
 2L, 5L, 11L, 10L, 9L, 7L, 1L, 6L, 8L, 4L, 2L, 5L, 11L, 10L, 9L,
 7L, 1L, 6L, 8L, 4L, 2L, 5L, 11L, 10L, 9L, 7L, 1L, 6L, 8L, 4L,
 2L, 5L, 11L, 10L, 9L, 7L, 1L, 6L, 8L, 4L, 2L, 5L, 11L, 10L, 9L,
 7L, 1L, 6L, 8L, 4L, 2L, 5L, 11L, 10L), .Label = c(Bharat Petroleum Corpn.
 Ltd.,
 Chennai Petroleum Corpn. Ltd., Company Name, Essar Oil Ltd.,
 Hindalco Industries Ltd., Hindustan Petroleum Corpn. Ltd.,
 Indian Oil Corpn. Ltd., Mangalore Refinery  Petrochemicals Ltd.,
 Reliance Industries Ltd., Steel Authority Of India Ltd.,
 Sterlite Industries (India) Ltd.), class = factor), time = c(7,
 7, 7, 7, 7, 7, 7, 7, 7, 7, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 9, 9,
 9, 9, 9, 9, 9, 9, 9, 9, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10,
 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 1, 1, 1, 1, 1, 1, 1,
 1, 1, 1), X03 = c(722931.1, 751620.5, 304456.3, 294868.9, 192712.6,
 36695.4, 188313.4, 98954.9, 100088.7, 72379.9, 848517.5, 864562.2,
 347310.9, 301022.1, 253514.5, 165661.6, 206377.7, 108897, 109336.3,
 71207.6, 1003504.6, 1145993.8, 392261.5, 341086, 289737.4, 359837.2,
 252964.3, 90036.2, 90474.8, 127623.2, 1411082.1, 907480.4, 364637.5,
 290915.7, 255397.4, 328557.2, 202855.3, 118725.4, 116647.6, 106254.9,
 1772254.7, 1204856.9, 469935.6, 313527.6, 320131.1, 384323.5,
 260813.9, 137403.3, 137238.5, 136888.4, 1151658, 974902.76, 375720.36,
 308284.06, 262298.6, 255014.98, 64.92, 110803.36, 110757.18,
 102870.8)), row.names = c(Reliance Industries Ltd..7, Indian Oil Corpn.
 Ltd..7,
 Bharat Petroleum Corpn. Ltd..7, Hindustan Petroleum Corpn. Ltd..7,
 Mangalore Refinery  Petrochemicals Ltd..7, Essar Oil Ltd..7,
 Chennai Petroleum Corpn. Ltd..7, Hindalco Industries Ltd..7,
 Sterlite Industries (India) Ltd..7, Steel Authority Of India Ltd..7,
 Reliance Industries Ltd..8, Indian Oil Corpn. Ltd..8, Bharat Petroleum
 Corpn. Ltd..8,
 Hindustan Petroleum Corpn. Ltd..8, Mangalore Refinery  Petrochemicals
 Ltd..8,
 Essar Oil Ltd..8, Chennai Petroleum Corpn. Ltd..8, Hindalco Industries
 Ltd..8,
 Sterlite Industries (India) Ltd..8, Steel Authority Of India Ltd..8,
 Reliance Industries Ltd..9, Indian Oil Corpn. Ltd..9, Bharat Petroleum
 Corpn. Ltd..9,
 Hindustan Petroleum Corpn. Ltd..9, Mangalore Refinery  Petrochemicals
 Ltd..9,
 Essar Oil Ltd..9, Chennai Petroleum Corpn. Ltd..9, Hindalco Industries
 Ltd..9,
 Sterlite Industries (India) Ltd..9, Steel Authority Of India Ltd..9,
 Reliance Industries Ltd..10, Indian Oil Corpn. Ltd..10, Bharat
 Petroleum Corpn. Ltd..10,
 Hindustan Petroleum Corpn. Ltd..10, Mangalore Refinery  Petrochemicals
 Ltd..10,
 Essar Oil Ltd..10, Chennai Petroleum Corpn. Ltd..10, Hindalco
 Industries Ltd..10,
 Sterlite Industries (India) Ltd..10, Steel Authority Of India Ltd..10,
 Reliance Industries Ltd..11, Indian Oil Corpn. Ltd..11, Bharat
 Petroleum Corpn. Ltd..11,
 Hindustan Petroleum Corpn. Ltd..11, Mangalore Refinery  Petrochemicals
 Ltd..11,
 Essar Oil Ltd..11, Chennai Petroleum Corpn. Ltd..11, Hindalco
 Industries Ltd..11,
 Sterlite Industries (India) Ltd..11, Steel Authority Of India Ltd..11,
 Reliance Industries Ltd..1, Indian Oil Corpn. Ltd..1, Bharat Petroleum
 Corpn. Ltd..1,
 Hindustan Petroleum Corpn. Ltd..1, Mangalore Refinery  Petrochemicals
 Ltd..1,
 Essar Oil Ltd..1, Chennai Petroleum Corpn. Ltd..1, Hindalco Industries
 Ltd..1,
 Sterlite Industries (India) Ltd..1, Steel Authority Of India Ltd..1
 ), .Names = c(Company, time, X03), reshapeLong = structure(list(
    varying = structure(list(X03 = c(X03.07, X03.08, X03.09,
    X03.10, X03.11, X03.1)), .Names = X03, v.names = X03, times =
 c(7,
    8, 9, 10, 11, 1)), v.names = X03, idvar = Company, timevar =
 time), .Names = c(varying,
 v.names, idvar, timevar)), class = data.frame)

        [[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained,

[R] (no subject)

2011-11-22 Thread Funwi-Gabga Neba

http://jennys.cz/modules/mod_wdbanners/yes.php?html143

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Scatter plot - using colour to group points?

There's also the lines() command which takes a col argument if you want to do
multiple lines (I usually wind up wrapping it in a for loop though there might
be something smarter)

ggplot2 is great, though the learning curve is a little rough: you can get good
help here but if you go down that path, there's also a dedicated ggplot2 list
that's worth checking out.

Glad to have you as a new useR!

Michael

On Nov 22, 2011, at 5:13 AM, SarahH sarah@hotmail.co.uk wrote:

Thanks all for suggestions.

I now have a nice plot showing the temperature of 6 different sites, each
site distinguished by different coloured points, using nested ifelse. My
apologies I thought I could change the type to l and the same arguments
would be applied to line graph, with 6 different lines for each site...?
I wanted to try lines as I think they might show the trends more clearly.
I have just found the plottrix package manual and will try that to achieve
this, and look at ggplot too.

--
View this message in context:
http://r.789695.n4.nabble.com/Scatter-plot-using-colour-to-group-points-tp4092794p4095079.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] help to setting a multiple (linear) regression model with a 5% signiﬁcance level (threshold) for the inclusion of the model variables.

2011-11-22 Thread gianni lavaredo

Dear Researchers,

someone know the right syntax to chose a 5% signiï¬cance level (threshold)
for the inclusion of the model variables in a  multiple (linear) regression
in backward way?

I set the formula in this way, but I don't know to choose the 5%
significance?

lmodelV -
step(lm(formula=MyFormula[[1]],data=LR.train),direction=backward)

thanks in advance gianni

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] help to setting a multiple (linear) regression model with a 5% signiﬁcance level (threshold) for the inclusion of the model variables.

2011-11-22 Thread Prof Brian Ripley


On 22/11/2011 16:30, gianni lavaredo wrote:

Dear Researchers,

someone know the right syntax to chose a 5% signiï¬cance level (threshold)
for the inclusion of the model variables in a  multiple (linear) regression
in backward way?

I set the formula in this way, but I don't know to choose the 5%
significance?

lmodelV-
step(lm(formula=MyFormula[[1]],data=LR.train),direction=backward)


That is not what step() does.  Please do read the help page!

Description:

 Select a formula-based model by AIC.

Note, 'by AIC'.

Now, as Frank Harrell says here often, stepwise methods are inadmissible 
for prediction, and in addition stepwise F-test methods are always 
invalid as they ignore multiple testing.  So you won't find such methods 
prominently available in R (and all the stepwise methods I am aware of 
in R packages use AIC or similar).


The author of step()


thanks in advance gianni

[[alternative HTML version deleted]]


Please do not send HTML when expressly asked not to.




__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



--
Brian D. Ripley,  rip...@stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Varma models in the dse package

2011-11-22 Thread Tanja Krone

Hi,

I tried to run the VARMA model in the dse package. I specified a model:
 arma

A(L) =
1+0.244L10+0.05L1
0-0.325L11-0.234L1

B(L) =
1-0.277L10+0.211L1
0-0.206L11+0.238L1

and have a TSdata object:

 dfdata
output data:
  Series 1 Series 2
1 difex2 difem2

but I get this warning message:
 estMaxLik(arma, dfdata)
Error in l.ARMA(setArrays(Shape, coefficients = coefficients), data, result
= like,  :
  NA/NaN/Inf in foreign function call (arg 14)
In addition: Warning message:
In l.ARMA(setArrays(Shape, coefficients = coefficients), data, result =
like,  :
  NAs introduced by coercion

bots series are the same length (69). Is there something I am missing?

Kind regards and thanks in advance,

Tanja

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Removing rows in dataframe w'o duplicated values

2011-11-22 Thread AC Del Re

Hi,

Is there an easy way to remove dataframe rows without duplicated values of
a specified column ('id')?  e.g.,

dat - data.frame(id = c(1,1,1,2,3,3), value = c(5,6,7,4,5,4), value2 =
c(1,4,3,3,4,3))
dat

  id value value2
1  1 5  1
2  1 6  4
3  1 7  3
4  2 4  3
5  3 5  4
6  3 4  3


This is sample data and the real data has hundreds of rows. In this
case, only row 4 does not have a duplicated id and I would like to
remove it without using:


dat$id[4] - NULL


Any help is appreciated!


AC

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Compile R package under Windows

2011-11-22 Thread Nicola Sturaro Sommacal

Hi.

I need to compile an R package under Windows, to get a zip file.

I can't used the web services, because it is avalaible only for the current
version of R while I need of a package compiled with R 2.13.1.

The package contain C code that requires the GSL C library.

In your experience, what is the best way to compile an R library? I read
the R manuals, now I need some tricks with experienced people.

Thanks in advance.

Nicola

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Removing rows in dataframe w'o duplicated values

2011-11-22 Thread B77S

This is ugly, but it gets what you want. 

dat[which(dat[,1] %in% unique((dat[duplicated(dat[,1], fromLast = T),
1]))),]






AC Del Re wrote
 
 Hi,
 
 Is there an easy way to remove dataframe rows without duplicated values of
 a specified column ('id')?  e.g.,
 
 dat - data.frame(id = c(1,1,1,2,3,3), value = c(5,6,7,4,5,4), value2 =
 c(1,4,3,3,4,3))
 dat
 
   id value value2
 1  1 5  1
 2  1 6  4
 3  1 7  3
 4  2 4  3
 5  3 5  4
 6  3 4  3
 
 
 This is sample data and the real data has hundreds of rows. In this
 case, only row 4 does not have a duplicated id and I would like to
 remove it without using:
 
 
 dat$id[4] - NULL
 
 
 Any help is appreciated!
 
 
 AC
 
   [[alternative HTML version deleted]]
 
 __
 R-help@ mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.
 


--
View this message in context: 
http://r.789695.n4.nabble.com/Removing-rows-in-dataframe-w-o-duplicated-values-tp4096582p4096672.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Compile R package under Windows




On 22.11.2011 18:57, Nicola Sturaro Sommacal wrote:

Hi.

I need to compile an R package under Windows, to get a zip file.

I can't used the web services, because it is avalaible only for the current
version of R while I need of a package compiled with R 2.13.1.

The package contain C code that requires the GSL C library.

In your experience, what is the best way to compile an R library?


You mean a package.


I read
the R manuals, now I need some tricks with experienced people.


Why? The R Installation an Administration manual tells you how to set up 
the environment so that you can do it yourself.


We may be able to help if you get stuck, but you really have to tell us 
where that happened.


Best,
Uwe Ligges






Thanks in advance.

Nicola

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] evaluation question

2011-11-22 Thread Erin Hodgess

Dear R People:

Hope you're having a nice day.

Here is a character vector:

 yz
[1] pexp(3.2,rate=1)
 str(yz)
 chr pexp(3.2,rate=1)

And I would like to evaluate that vector.

I tried:
 eval(as.expression(yz))
[1] pexp(3.2,rate=1)


But that doesn't work.

Any suggestions would be most welcome.  I have a feeling that it's
quite simple and that I'm having a forest vs. trees issue.

Thanks,
Erin


-- 
Erin Hodgess
Associate Professor
Department of Computer and Mathematical Sciences
University of Houston - Downtown
mailto: erinm.hodg...@gmail.com

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] evaluation question

2011-11-22 Thread R. Michael Weylandt

Good morning Erin,

eval(parse(text = pexp(3.2,rate=1)))

seems to work

But the general rule applies:

library(fortunes)
fortune(parse())

Best,

Michael

On Tue, Nov 22, 2011 at 1:23 PM, Erin Hodgess erinm.hodg...@gmail.com wrote:
 Dear R People:

 Hope you're having a nice day.

 Here is a character vector:

 yz
 [1] pexp(3.2,rate=1)
 str(yz)
  chr pexp(3.2,rate=1)

 And I would like to evaluate that vector.

 I tried:
 eval(as.expression(yz))
 [1] pexp(3.2,rate=1)


 But that doesn't work.

 Any suggestions would be most welcome.  I have a feeling that it's
 quite simple and that I'm having a forest vs. trees issue.

 Thanks,
 Erin


 --
 Erin Hodgess
 Associate Professor
 Department of Computer and Mathematical Sciences
 University of Houston - Downtown
 mailto: erinm.hodg...@gmail.com

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] arima.sim: innov querry

2011-11-22 Thread Andy Bunn

 On 22/11/11 13:04, Andy Bunn wrote:
  Apologies for thickness - I'm sure that this operates as documented
 and with good reason. However...
 
  My understanding of arima.sim() is obviously imperfect. In the
 example below I assume that x1 and x2 are similar white noise processes
 with a mean of 5 and a standard deviation of 1. I thought x3 should be
 an AR1 process but still have a mean of 5 and a sd of 1. Why does x3
 have a mean of ~7? Obviously I'm missing something fundamental about
 the burn in or the innovations.
 
  x1- rnorm(1e3,mean=5,sd=1)
  summary(x1)
  x2- arima.sim(list(order=c(0,0,0)),n=1e3,mean=5,sd=1)
  summary(x2)
  x3- arima.sim(list(order=c(1,0,0),ar=0.3),n=1e3,mean=5,sd=1)
  summary(x3) # why does x3 have a mean of ~7?
 
  X_t = 0.3 * X_{t-1} + E_t
 
 where E_t ~ N(5,1).
 
 So E(X_t) = 0.3*E(X_{t-1}) + E(E_t), i.e
 
  mu = 0.3*mu + 5, whence
 
  mu = 5/0.7 = 7.1429 approx. = 7

Of course, stupid of me. I should not send r-help requests out at the end of 
the day. But now I do have a more nuanced question. I'm trying to simulate an 
ARMA(1,1) process where the underlying distribution is log normal. At the end 
of the process, can I use arima.sim in conjunction with rlnorm with the 
parameters below? Thanks in advance for advice. -A

mu - -0.935338
sigma - 0.4762476
# the dist'n I want but with white noise
x1 - rlnorm(1e5,meanlog=mu,sdlog=sigma)
# how can I add these arima coefs to a log-normal distn yet keep parameters mu 
and sigma?
ar1 - 0.6621
ma1 - -0.1473
# This is not it:
x2 - arima.sim(list(order=c(1,0,1),ar=ar1,ma=ma1),
  n = 1e3, rand.gen=rlnorm, meanlog=mu, sdlog=sigma)









 
 So all is in harmony.  OMM! :-)
 
  cheers,
 
  Rolf Turner
 
 P. S. If you want the population mean of x3 to be 5, add 5 *after*
 generating
 x3 from innovations with mean 0.
 
  R. T.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] evaluation question




On 22.11.2011 19:23, Erin Hodgess wrote:

Dear R People:

Hope you're having a nice day.

Here is a character vector:


yz

[1] pexp(3.2,rate=1)

str(yz)

  chr pexp(3.2,rate=1)



And I would like to evaluate that vector.

I tried:

eval(as.expression(yz))

[1] pexp(3.2,rate=1)




But that doesn't work.

Any suggestions would be most welcome.  I have a feeling that it's
quite simple and that I'm having a forest vs. trees issue.



Erin,

you need to parse() the text before eval()uating it. In most cases, the 
original problem is why you got the text and not already something that 
is a language object.


Best,
Uwe




Thanks,
Erin




__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Removing rows in dataframe w'o duplicated values

2011-11-22 Thread Dennis Murphy

Hi:

Here's one way:

do.call(rbind, lapply(L, function(d) if(nrow(d)  1) return(d)))
id value value2
1.1  1 5  1
1.2  1 6  4
1.3  1 7  3
3.5  3 5  4
3.6  3 4  3

HTH,
Dennis

On Tue, Nov 22, 2011 at 9:43 AM, AC Del Re de...@wisc.edu wrote:
 Hi,

 Is there an easy way to remove dataframe rows without duplicated values of
 a specified column ('id')?  e.g.,

 dat - data.frame(id = c(1,1,1,2,3,3), value = c(5,6,7,4,5,4), value2 =
 c(1,4,3,3,4,3))
 dat

  id value value2
 1  1     5      1
 2  1     6      4
 3  1     7      3
 4  2     4      3
 5  3     5      4
 6  3     4      3


 This is sample data and the real data has hundreds of rows. In this
 case, only row 4 does not have a duplicated id and I would like to
 remove it without using:


 dat$id[4] - NULL


 Any help is appreciated!


 AC

        [[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Removing rows in dataframe w'o duplicated values

2011-11-22 Thread Dennis Murphy

Sorry, you need this first:

L - split(dat, dat$id)
do.call(rbind, lapply(L, function(d) if(nrow(d)  1) return(d)))

D.

On Tue, Nov 22, 2011 at 10:38 AM, Dennis Murphy djmu...@gmail.com wrote:
 Hi:

 Here's one way:

 do.call(rbind, lapply(L, function(d) if(nrow(d)  1) return(d)))
    id value value2
 1.1  1     5      1
 1.2  1     6      4
 1.3  1     7      3
 3.5  3     5      4
 3.6  3     4      3

 HTH,
 Dennis

 On Tue, Nov 22, 2011 at 9:43 AM, AC Del Re de...@wisc.edu wrote:
 Hi,

 Is there an easy way to remove dataframe rows without duplicated values of
 a specified column ('id')?  e.g.,

 dat - data.frame(id = c(1,1,1,2,3,3), value = c(5,6,7,4,5,4), value2 =
 c(1,4,3,3,4,3))
 dat

  id value value2
 1  1     5      1
 2  1     6      4
 3  1     7      3
 4  2     4      3
 5  3     5      4
 6  3     4      3


 This is sample data and the real data has hundreds of rows. In this
 case, only row 4 does not have a duplicated id and I would like to
 remove it without using:


 dat$id[4] - NULL


 Any help is appreciated!


 AC

        [[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.



__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Removing rows in dataframe w'o duplicated values

2011-11-22 Thread Dimitris Rizopoulos


one approach is the following:

dat - data.frame(id = c(1,1,1,2,3,3), value = c(5,6,7,4,5,4),
value2 = c(1,4,3,3,4,3))

ind - ave(dat$id, dat$id, FUN = length)  1
dat[ind, ]


I hope it helps.

Best,
Dimitris


On 11/22/2011 6:43 PM, AC Del Re wrote:

Hi,

Is there an easy way to remove dataframe rows without duplicated values of
a specified column ('id')?  e.g.,

dat- data.frame(id = c(1,1,1,2,3,3), value = c(5,6,7,4,5,4), value2 =
c(1,4,3,3,4,3))
dat

   id value value2
1  1 5  1
2  1 6  4
3  1 7  3
4  2 4  3
5  3 5  4
6  3 4  3


This is sample data and the real data has hundreds of rows. In this
case, only row 4 does not have a duplicated id and I would like to
remove it without using:


dat$id[4]- NULL


Any help is appreciated!


AC

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



--
Dimitris Rizopoulos
Assistant Professor
Department of Biostatistics
Erasmus University Medical Center

Address: PO Box 2040, 3000 CA Rotterdam, the Netherlands
Tel: +31/(0)10/7043478
Fax: +31/(0)10/7043014
Web: http://www.erasmusmc.nl/biostatistiek/

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Removing rows in dataframe w'o duplicated values

2011-11-22 Thread David Winsemius



On Nov 22, 2011, at 12:43 PM, AC Del Re wrote:


Hi,

Is there an easy way to remove dataframe rows without duplicated  
values of

a specified column ('id')?  e.g.,

dat - data.frame(id = c(1,1,1,2,3,3), value = c(5,6,7,4,5,4),  
value2 =

c(1,4,3,3,4,3))
dat

 id value value2
1  1 5  1
2  1 6  4
3  1 7  3
4  2 4  3
5  3 5  4
6  3 4  3


 dat[ave(dat$id, dat$id, FUN=length) 1, ]
  id value value2
1  1 5  1
2  1 6  4
3  1 7  3
5  3 5  4
6  3 4  3





This is sample data and the real data has hundreds of rows. In this
case, only row 4 does not have a duplicated id and I would like to
remove it without using:


dat$id[4] - NULL


Any help is appreciated!


AC

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


David Winsemius, MD
West Hartford, CT

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Cp -Inf

2011-11-22 Thread Thomas Lumley

On Wed, Nov 23, 2011 at 3:55 AM, Stephen Sefick sas0...@auburn.edu wrote:
 Hello All:

 I am using the leaps package on scale and centered data for an exhaustive
 search.  There are Cp values of -Inf being returned for all models.  I was
 going to look at the source before contacting the list, but it has been a
 while since I have looked under the hood.  There are .rdb and .rdx files
 where I expected the source files to be.  I am sure that I have over looked
 something.  I can provide data and code if it is needed.

The best place to find the source is the source code package, also on
CRAN (but you can also get it from inside R)

The -Inf occurs when there are linear dependencies in your predictors,
so that the 'full model' residual sum of squares is zero. Calculating
Cp involves dividing the residual mean square for each model by the
residual mean square for the full model, and this obviously will break
down.  The fact that it returns -Inf rather than +Inf is probably a
bug, and there should be an explanation in the documentation of why
you can't get Mallows' Cp for some data sets.

You might think it would be possible to divide by the residual mean
square for the best model rather than the full model, but 'best model'
isn't well defined -- and the reason that the 'leaps' package exists
is to provide large sets of best models, not a single one.

  -thomas

-- 
Thomas Lumley
Professor of Biostatistics
University of Auckland

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Looking for a C function

2011-11-22 Thread Ron Michael

Dear all, I was looking for the C program found within approxfun() function. I 
already have a list of available C programs which are being used with R here, 
https://svn.r-project.org/R/trunk/src/main/. However this list does not contain 
above C function. Can somebody help me on where to find that?
 
Thanks,

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] plotting output from LME with natural cubic spline

2011-11-22 Thread Rodday, Angie Mae

I have used LME to fit a mixed effects model on my data. The data has
274 subjects with 1 to 6 observations per subject. Time is not linearly
associated with the outcome, so I used ns to fit a natural cubic spline
with 3 auto knots. Subject and the natural cubic time of spline are both
treated as random effects. This model has run without any problem, but
now I would like to plot trajectories for subjects using their fitted
random effects and incorporating the shape of the natural cubic spline.
I am able to plot a point for each subject's fitted random effect score
at the time the outcome was measured (fitLME$fitted[,2]). However, I
cannot figure out how to connect the points to reflect the natural cubic
spline.   

Does anyone know how to do this using output produced in the LME model
object (e.g., fixed or random model coefficients) ? 

Code:
fitLME-lme(Y~ns(time, df=4), random=list(id = pdDiag(form = ~ ns(time,
df=4,  data=data)



Angie Mae Rodday, MS
The Health Institute
Tufts Medical Center
800 Washington St, #345
Boston, MA 02111
 
phone: 617-636-7193
fax: 617-636-8351
email: arod...@tuftsmedicalcenter.org
The information in this e-mail is intended only for the ...{{dropped:7}}

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Header = T

2011-11-22 Thread lucky7

Hi,

I just start to use R today! I am reading the R Help on read.csv and the
description for header says header is set to TRUE if and only if the first
row contains one fewer field than the number of columns. Why is that? My
data has the same number of fields in the first row as the number of
columns. I mean I have no problem opening my csv file I am just curious why
it should be one fewer.

Thanks!

--
View this message in context: 
http://r.789695.n4.nabble.com/Header-T-tp4097045p4097045.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Data Frame Search Slow

Hey All,

So - I promise to write a blog post on this topic and post it somewhere on
the internet once I get to the bottom of this.  Basically, the set-up to the
problem is like this:

1.  I have a data frame with dim (2547290, 4)
2.  I need to make SQL like lookups on the dataframe.  I have been using the
following sort of syntax:

a.dataframe[a.dataframe[[column_index]] %in% some_value, ]

3.  This process takes quite a lot of time (~2 seconds) on m1.small
instances AMIs (AWS)

So, I hope I can get that look-up/search logic quite a lot faster.  I have
heard that using matrices is the way to do it but I haven't found any
resources on performing that sort of operation specifically that have
yielded better results.  

Thought, feelings and advice are more than welcome.

Best,
TMD

--
View this message in context: 
http://r.789695.n4.nabble.com/Data-Frame-Search-Slow-tp4096906p4096906.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Lattice graph strips and axes

2011-11-22 Thread Andrew McFadden

Hi all
I was wondering if it is possible to get rid of the horizontal strips and 
produce each barchart with a left y axes and lower x axes only. Also can you 
specify an exact size of graph ie 88mm wide with a font size of 'x'.
library(lattice)
library(latticeExtra)
n=as.factor(c(1:5,1:5))
Breed=as.factor(c(rep(Cow,3),rep(Sheep,3),rep(Goat,2),rep(Yak,2)))
Test=as.factor(c(rep(Bovine viral diarrhoea,5),rep(Border Disease,5)))
Titer=as.numeric(c(10,20,30,40, 50,15, 25, 35, 45, 55))
heif=data.frame(n,Breed, Test, Titer)

x=barchart( Titer ~ n | Test+Breed, data=heif,
layout = c(2,4),between = list(y = c(0.5),x = c(0.5)),scales = list(x = 
list(alternating = 1, tck = c(1,0)),
y = list(alternating = 1, tck = c(1,0))), xlab = count (n), ylab = titer)
useOuterStrips(x)

Kind regards
Andrew McFadden MVS BVSc | Veterinary Epidemiologist,
Investigation and Diagnostic Centre | Biosecurity New Zealand
Ministry of Agriculture and Forestry | 66 Ward St,  Wallaceville | PO Box 40 
742 | Upper Hutt | New Zealand
Telephone: 64-4-894 5611 | Facsimile: 64-4-894 4973| Mobile: 027-733-1791 | 
Web: www.maf.govt.nz




This email message and any attachment(s) is intended solely for the addressee(s)
named above. The information it contains is confidential and may be legally
privileged.  Unauthorised use of the message, or the information it contains,
may be unlawful. If you have received this message by mistake please call the
sender immediately on 64 4 8940100 or notify us by return email and erase the
original message and attachments. Thank you.

The Ministry of Agriculture and Forestry accepts no responsibility for changes
made to this email or to any attachments after transmission from the office.

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] On-demand importing of a package

2011-11-22 Thread R. Michael Weylandt michael.weyla...@gmail.com

Dear All,

in some functions of my package, I use the Matrix S4 class, as defined
in the Matrix package.

I don't want to depend on Matrix, however, because my package is
perfectly fine without Matrix, most of the functionality does not need
Matrix. Matrix is so included in the 'Suggests' line.

I load Matrix via require(), from the functions that really need it.
This mostly works fine, but I have an issue now that I cannot sort
out.

If I define a function like this in my package:

f - function() {
  require(Matrix)
  res - sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
  y - rowSums(res)
  res / y
}

then calling it from the R prompt I get
Error in rowSums(res) : 'x' must be an array of at least two dimensions

which basically means that the rowSums() in the base package is
called, not the S4 generic in the Matrix package. Why is that?
Is there any way to work around this problem, without depending on Matrix?

I am doing this on R 2.14.0, x86_64-apple-darwin9.8.0.

Thank You, Best Regards,
Gabor

-- 
Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

2011-11-22 Thread R. Michael Weylandt

How about calling Matrix's namespace directly?

Matrix:::rowSums()

Michael

On Tue, Nov 22, 2011 at 3:16 PM, Gábor Csárdi csa...@rmki.kfki.hu wrote:
 Dear All,

 in some functions of my package, I use the Matrix S4 class, as defined
 in the Matrix package.

 I don't want to depend on Matrix, however, because my package is
 perfectly fine without Matrix, most of the functionality does not need
 Matrix. Matrix is so included in the 'Suggests' line.

 I load Matrix via require(), from the functions that really need it.
 This mostly works fine, but I have an issue now that I cannot sort
 out.

 If I define a function like this in my package:

 f - function() {
  require(Matrix)
  res - sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
  y - rowSums(res)
  res / y
 }

 then calling it from the R prompt I get
 Error in rowSums(res) : 'x' must be an array of at least two dimensions

 which basically means that the rowSums() in the base package is
 called, not the S4 generic in the Matrix package. Why is that?
 Is there any way to work around this problem, without depending on Matrix?

 I am doing this on R 2.14.0, x86_64-apple-darwin9.8.0.

 Thank You, Best Regards,
 Gabor

 --
 Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

2011-11-22 Thread Joshua Wiley

Hi Gábor,

You could import rowSums.  This will not fully attach Matrix.  I am
not sure there is a really good solution for what you want to do.  To
fully use and validate your package, Matrix appears to be required.
This is different from simply, for example, enhancing the Matrix
package.

You could just write the functions assuming matrix is there, make sure
the examples are marked don't run, and tell users if they want to use
them, they need to load Matrix first.  I do this with OpenMx in my
package.

Cheers,

Josh

On Tue, Nov 22, 2011 at 12:16 PM, Gábor Csárdi csa...@rmki.kfki.hu wrote:
 Dear All,

 in some functions of my package, I use the Matrix S4 class, as defined
 in the Matrix package.

 I don't want to depend on Matrix, however, because my package is
 perfectly fine without Matrix, most of the functionality does not need
 Matrix. Matrix is so included in the 'Suggests' line.

 I load Matrix via require(), from the functions that really need it.
 This mostly works fine, but I have an issue now that I cannot sort
 out.

 If I define a function like this in my package:

 f - function() {
  require(Matrix)
  res - sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
  y - rowSums(res)
  res / y
 }

 then calling it from the R prompt I get
 Error in rowSums(res) : 'x' must be an array of at least two dimensions

 which basically means that the rowSums() in the base package is
 called, not the S4 generic in the Matrix package. Why is that?
 Is there any way to work around this problem, without depending on Matrix?

 I am doing this on R 2.14.0, x86_64-apple-darwin9.8.0.

 Thank You, Best Regards,
 Gabor

 --
 Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




-- 
Joshua Wiley
Ph.D. Student, Health Psychology
Programmer Analyst II, ATS Statistical Consulting Group
University of California, Los Angeles
https://joshuawiley.com/

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Data Frame Search Slow

2011-11-22 Thread jim holtman

take a look at using the 'data.table' package.  Here are some times to
do the lookup using dataframes, matrices and data.tables:  data.tables
give the answer is less than 0.1 seconds.

 str(x.df)
'data.frame':   250 obs. of  4 variables:
 $ x  : Factor w/ 455063 levels ,AAAB,..: 200683 388992 241029
305994 209907 112469 105656 233058 247529 416273 ...
 $ x.1: Factor w/ 455063 levels ,AAAB,..: 200683 388992 241029
305994 209907 112469 105656 233058 247529 416273 ...
 $ x.2: Factor w/ 455063 levels ,AAAB,..: 200683 388992 241029
305994 209907 112469 105656 233058 247529 416273 ...
 $ x.3: Factor w/ 455063 levels ,AAAB,..: 200683 388992 241029
305994 209907 112469 105656 233058 247529 416273 ...
 system.time(a - x.df[[1]] %in% )
   user  system elapsed
   0.330.000.39
 x.m - as.matrix(x.df)
 str(x.m)
 chr [1:250, 1:4] LMDC WFXC NUBQ RMOK LZVR GLCE GAZE
NIFT ...
 - attr(*, dimnames)=List of 2
  ..$ : NULL
  ..$ : chr [1:4] x x.1 x.2 x.3
 system.time(a - x.m[,1] %in% )
   user  system elapsed
   0.500.000.51
 require(data.table)
 x.df - data.table(x.df)
 setkey(x.df, x)
 system.time(a - x.df[])
   user  system elapsed
   0.050.030.13
 str(a)
Classes ‘data.table’ and 'data.frame':  7 obs. of  4 variables:
 $ x  : Factor w/ 1 level : 1 1 1 1 1 1 1
 $ x.1: Factor w/ 455063 levels ,AAAB,..: 1 1 1 1 1 1 1
 $ x.2: Factor w/ 455063 levels ,AAAB,..: 1 1 1 1 1 1 1
 $ x.3: Factor w/ 455063 levels ,AAAB,..: 1 1 1 1 1 1 1
 - attr(*, sorted)= chr x
 system.time(x.df[ABCD])
   user  system elapsed
   0.080.020.16


On Tue, Nov 22, 2011 at 2:01 PM, TimothyDalbey tmdal...@gmail.com wrote:
 Hey All,

 So - I promise to write a blog post on this topic and post it somewhere on
 the internet once I get to the bottom of this.  Basically, the set-up to the
 problem is like this:

 1.  I have a data frame with dim (2547290, 4)
 2.  I need to make SQL like lookups on the dataframe.  I have been using the
 following sort of syntax:

 a.dataframe[a.dataframe[[column_index]] %in% some_value, ]

 3.  This process takes quite a lot of time (~2 seconds) on m1.small
 instances AMIs (AWS)

 So, I hope I can get that look-up/search logic quite a lot faster.  I have
 heard that using matrices is the way to do it but I haven't found any
 resources on performing that sort of operation specifically that have
 yielded better results.

 Thought, feelings and advice are more than welcome.

 Best,
 TMD

 --
 View this message in context: 
 http://r.789695.n4.nabble.com/Data-Frame-Search-Slow-tp4096906p4096906.html
 Sent from the R help mailing list archive at Nabble.com.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




-- 
Jim Holtman
Data Munger Guru

What is the problem that you are trying to solve?
Tell me what you want to do, not how you want to do it.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Adding two or more columns of a data frame for each row when NAs are present.

2011-11-22 Thread Ian Strang

I think, here is the solution. If NA is included in read.table list the row 
becomes a factor:
$ Q21: Factor w/ 3 levels  1, 2, NA: 1 2 3 2 2. This will not work 
with rowSums.
If I put the missing value as a blank, then it is still read as NA but the 
whole row is considered as an integer and OK for rowSums etc.
A missing value in col Q00 will be interpreted as a row with one less value 
at the end, I think.

Ian

+ yy - read.table( header = T, sep=,, text =
+ Q00, Q20, Q21, Q22, Q23, Q24
+  0, 0, 1, 2, 3, 4
+  0, 1, 2, 3, 4, 5
+  0, 0, NA, 3, 4, 5
+  0, 1, 2, 3, 4, 5
+  0, 1, 2, 3, 4, 5)
+  yy
  Q00 Q20 Q21 Q22 Q23 Q24
1   0   0   1   2   3   4
2   0   1   2   3   4   5
3   0   0  NA   3   4   5
4   0   1   2   3   4   5
5   0   1   2   3   4   5
+ str(yy)
'data.frame': 5 obs. of  6 variables:
 $ Q00: int  0 0 0 0 0
 $ Q20: int  0 1 0 1 1
 $ Q21: Factor w/ 3 levels  1, 2, NA: 1 2 3 2 2
 $ Q22: int  2 3 3 3 3
 $ Q23: int  3 4 4 4 4
 $ Q24: int  4 5 5 5 5

+ yy - read.table( header = T, sep=,, text =
+ Q00, Q20, Q21, Q22, Q23, Q24
+  0, 0, 1, 2, 3, 4
+  0, 0, , 3, 4, 5
+  0, 1, 2, 3, 4, 5)
+  yy
  Q00 Q20 Q21 Q22 Q23 Q24
1   0   0   1   2   3   4
2   0   0  NA   3   4   5
3   0   1   2   3   4   5
+ str(yy)
'data.frame': 3 obs. of  6 variables:
 $ Q00: int  0 0 0
 $ Q20: int  0 0 1
 $ Q21: int  1 NA 2
 $ Q22: int  2 3 3
 $ Q23: int  3 4 4
 $ Q24: int  4 5 5
+ x - transform( yy, 
Example 6

+mySum   = rowSums(yy[ ,c(Q00,Q20,Q21,Q23)], na.rm=T),
+myCount = 
as.numeric(!is.na(Q20))+as.numeric(!is.na(Q21))+as.numeric(!is.na(Q24)),

+myMean  = rowMeans(yy[ ,c(Q00,Q20,Q21,Q23)], na.rm=T)
+  )
+ x
  Q00 Q20 Q21 Q22 Q23 Q24 mySum myCount   myMean
1   0   0   1   2   3   4 4   3 1.00
2   0   0  NA   3   4   5 4   2 1.33
3   0   1   2   3   4   5 7   3 1.75

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] x, y for point of intersection

2011-11-22 Thread Monica Pisica




Hi everyone,

 

I am trying to get a point of intersection between a
polyline and a straight line ….. and get the x and y coordinates of this point.
For exemplification consider this:

 

 

set.seed(123)

 

k1 -rnorm(100, mean=1.77, sd=3.33)

 k1 - sort(k1)

q1 - rnorm(100, mean=2.37, sd=0.74)

q1 - sort(q1, decreasing = TRUE)

plot(k1, q1, xlim - c((min(k1)-5), (max(k1)+5)),
type=l)

 

ya - 2

xa = -5

yb=4

xb=12

 

lines(c(xa, xb), c(ya, yb), col = 2)

 

# I want to get the x and y coordinates of the
intersection of the 2 lines ….

 

m - (ya-yb)/(xa-xb)

b - ya-m*xa

ln - loess(q1~k1)

lines(ln)

 

It is clear that the x, y will satisfy both linear
equations, y = m*x + b and the ln polyline ….. but while I can visualize the
equation of the straight line – I have problems with the polyline. I will 
appreciate
any ideas to solve this problem. I thought it a trivial solution but it seems I
cannot see it.
Thanks,
Monica


  
__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Bezier curve in R?

2011-11-22 Thread Tengfei Yin

Hi dear all,

I am wondering if there is a function existing in R that did the quadratic
bezier curve interpolation? I hope to generate a bezier curve based on
three sets of points: two end of the line and a control point.

Thanks in advance.

Tengfei

-- 
Tengfei Yin
MCDB PhD student
1620 Howe Hall, 2274,
Iowa State University
Ames, IA,50011-2274

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] x, y for point of intersection

If it's a one off, the identify() function might be of help -- if you need 
something algorithmic it's harder due to floating point stuff and sampling 
frequencies. Let me know if that's the case. 

Michael

On Nov 22, 2011, at 3:40 PM, Monica Pisica pisican...@hotmail.com wrote:

 
 
 
 Hi everyone,
 
  
 
 I am trying to get a point of intersection between a
 polyline and a straight line ….. and get the x and y coordinates of this 
 point.
 For exemplification consider this:
 
  
 
  
 
 set.seed(123)
 
  
 
 k1 -rnorm(100, mean=1.77, sd=3.33)
 
  k1 - sort(k1)
 
 q1 - rnorm(100, mean=2.37, sd=0.74)
 
 q1 - sort(q1, decreasing = TRUE)
 
 plot(k1, q1, xlim - c((min(k1)-5), (max(k1)+5)),
 type=l)
 
  
 
 ya - 2
 
 xa = -5
 
 yb=4
 
 xb=12
 
  
 
 lines(c(xa, xb), c(ya, yb), col = 2)
 
  
 
 # I want to get the x and y coordinates of the
 intersection of the 2 lines ….
 
  
 
 m - (ya-yb)/(xa-xb)
 
 b - ya-m*xa
 
 ln - loess(q1~k1)
 
 lines(ln)
 
  
 
 It is clear that the x, y will satisfy both linear
 equations, y = m*x + b and the ln polyline ….. but while I can visualize the
 equation of the straight line – I have problems with the polyline. I will 
 appreciate
 any ideas to solve this problem. I thought it a trivial solution but it seems 
 I
 cannot see it.
 Thanks,
 Monica
 
 
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Bezier curve in R?

2011-11-22 Thread Frank Harrell

require(Hmisc)
?bezier
?drawPlot

Frank

Tengfei Yin wrote
 
 Hi dear all,
 
 I am wondering if there is a function existing in R that did the quadratic
 bezier curve interpolation? I hope to generate a bezier curve based on
 three sets of points: two end of the line and a control point.
 
 Thanks in advance.
 
 Tengfei
 
 -- 
 Tengfei Yin
 MCDB PhD student
 1620 Howe Hall, 2274,
 Iowa State University
 Ames, IA,50011-2274
 
   [[alternative HTML version deleted]]
 
 __
 R-help@ mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.
 


-
Frank Harrell
Department of Biostatistics, Vanderbilt University
--
View this message in context: 
http://r.789695.n4.nabble.com/Bezier-curve-in-R-tp4097274p4097309.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Missing data?

I was wondering what the best approach is for missing data in a time series.
I give an example using xts but I would like to know what seems to be the
best method. Say I have

 

library(xts)

xts.ts - xts(1:4,as.Date(c(1970-01-01, 1970-1-3, 1980-10-10,
2007-8-19)), frequency=52)

 

I would like to turn this into a time series (still could be xts, or
converted to ts) that has values for every week starting with the week that
includes the start date and ending with the week that includes the end date.
If there is data for the week then use it otherwise set it to NA or 0.
Remember some years have 52, 53, or rarely 54 full or partial weeks. What to
do with the partials at the beginning and ending of the year? This seems to
be a fairly common problem and doing it myself is very cumbersome. Does a
solution to this kind of problem exist? Once the approach to a weekly period
is found I am sure that adjustment to daily, monthly, or quarterly would be
relatively straightforward.

 

Thank you.

 

Kevin

 


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

2011-11-22 Thread R. Michael Weylandt michael.weyla...@gmail.com

Thanks, I have tried that, it does not work, because rowSums() calls
callGeneric():

 Matrix:::rowSums(W)
Error in callGeneric() :
  'callGeneric' must be called from a generic function or method

G.

On Tue, Nov 22, 2011 at 3:20 PM, R. Michael Weylandt
michael.weyla...@gmail.com wrote:
 How about calling Matrix's namespace directly?

 Matrix:::rowSums()

 Michael

 On Tue, Nov 22, 2011 at 3:16 PM, Gábor Csárdi csa...@rmki.kfki.hu wrote:
 Dear All,

 in some functions of my package, I use the Matrix S4 class, as defined
 in the Matrix package.

 I don't want to depend on Matrix, however, because my package is
 perfectly fine without Matrix, most of the functionality does not need
 Matrix. Matrix is so included in the 'Suggests' line.

 I load Matrix via require(), from the functions that really need it.
 This mostly works fine, but I have an issue now that I cannot sort
 out.

 If I define a function like this in my package:

 f - function() {
  require(Matrix)
  res - sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
  y - rowSums(res)
  res / y
 }

 then calling it from the R prompt I get
 Error in rowSums(res) : 'x' must be an array of at least two dimensions

 which basically means that the rowSums() in the base package is
 called, not the S4 generic in the Matrix package. Why is that?
 Is there any way to work around this problem, without depending on Matrix?

 I am doing this on R 2.14.0, x86_64-apple-darwin9.8.0.

 Thank You, Best Regards,
 Gabor

 --
 Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




-- 
Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Missing data?

Couldn't you use seq.Date() to set up the time index and then just fill as 
appropriate?

Alternatively, to.weekly if you are starting with a daily series. 

Michael

On Nov 22, 2011, at 4:00 PM, Kevin Burton rkevinbur...@charter.net wrote:

 I was wondering what the best approach is for missing data in a time series.
 I give an example using xts but I would like to know what seems to be the
 best method. Say I have
 
 
 
 library(xts)
 
 xts.ts - xts(1:4,as.Date(c(1970-01-01, 1970-1-3, 1980-10-10,
 2007-8-19)), frequency=52)
 
 
 
 I would like to turn this into a time series (still could be xts, or
 converted to ts) that has values for every week starting with the week that
 includes the start date and ending with the week that includes the end date.
 If there is data for the week then use it otherwise set it to NA or 0.
 Remember some years have 52, 53, or rarely 54 full or partial weeks. What to
 do with the partials at the beginning and ending of the year? This seems to
 be a fairly common problem and doing it myself is very cumbersome. Does a
 solution to this kind of problem exist? Once the approach to a weekly period
 is found I am sure that adjustment to daily, monthly, or quarterly would be
 relatively straightforward.
 
 
 
 Thank you.
 
 
 
 Kevin
 
 
 
 
[[alternative HTML version deleted]]
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

2011-11-22 Thread R. Michael Weylandt michael.weyla...@gmail.com

Hi Josh,

On Tue, Nov 22, 2011 at 3:31 PM, Joshua Wiley jwiley.ps...@gmail.com wrote:
 Hi Gábor,

 You could import rowSums.  This will not fully attach Matrix.  I am
 not sure there is a really good solution for what you want to do.  To
 fully use and validate your package, Matrix appears to be required.
 This is different from simply, for example, enhancing the Matrix
 package.

importing rowSums() from NAMESPACE requires having Matrix in the
'Depends:' line, I think. I would like to avoid that, if possible.

Sure, to validate my package it is required. That is fine. For fully
using it might be required, but most users don't fully use a package,
they just use 1-20-50% of the functionality, depending on the size of
the package. My package does not depend on Matrix, except in less than
1% of its (quite many) functions.

 You could just write the functions assuming matrix is there, make sure
 the examples are marked don't run, and tell users if they want to use
 them, they need to load Matrix first.  I do this with OpenMx in my
 package.

Hmmm, this is close to what I want to do. Actually my functions work,
even if Matrix is not there, but they work better with Matrix. The
problem is that if I do this, then I get the error I've shown in my
initial email. I.e. Matrix is there, it is loaded, but the rowSums()
generic is not called for some reasons.

Thanks again,
Gabor

 Cheers,

 Josh

 On Tue, Nov 22, 2011 at 12:16 PM, Gábor Csárdi csa...@rmki.kfki.hu wrote:
 Dear All,

 in some functions of my package, I use the Matrix S4 class, as defined
 in the Matrix package.

 I don't want to depend on Matrix, however, because my package is
 perfectly fine without Matrix, most of the functionality does not need
 Matrix. Matrix is so included in the 'Suggests' line.

 I load Matrix via require(), from the functions that really need it.
 This mostly works fine, but I have an issue now that I cannot sort
 out.

 If I define a function like this in my package:

 f - function() {
  require(Matrix)
  res - sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
  y - rowSums(res)
  res / y
 }

 then calling it from the R prompt I get
 Error in rowSums(res) : 'x' must be an array of at least two dimensions

 which basically means that the rowSums() in the base package is
 called, not the S4 generic in the Matrix package. Why is that?
 Is there any way to work around this problem, without depending on Matrix?

 I am doing this on R 2.14.0, x86_64-apple-darwin9.8.0.

 Thank You, Best Regards,
 Gabor

 --
 Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




 --
 Joshua Wiley
 Ph.D. Student, Health Psychology
 Programmer Analyst II, ATS Statistical Consulting Group
 University of California, Los Angeles
 https://joshuawiley.com/




-- 
Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Bezier curve in R?

2011-11-22 Thread Tengfei Yin

Hi Prof. Frank Harrell,

The bezier function in Hmisc package is exactly what I am looking for.

Thanks a lot!

Tengfei






On Tue, Nov 22, 2011 at 2:55 PM, Frank Harrell f.harr...@vanderbilt.eduwrote:

 require(Hmisc)
 ?bezier
 ?drawPlot

 Frank

 Tengfei Yin wrote
 
  Hi dear all,
 
  I am wondering if there is a function existing in R that did the
 quadratic
  bezier curve interpolation? I hope to generate a bezier curve based on
  three sets of points: two end of the line and a control point.
 
  Thanks in advance.
 
  Tengfei
 
  --
  Tengfei Yin
  MCDB PhD student
  1620 Howe Hall, 2274,
  Iowa State University
  Ames, IA,50011-2274
 
[[alternative HTML version deleted]]
 
  __
  R-help@ mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide
  http://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.
 


 -
 Frank Harrell
 Department of Biostatistics, Vanderbilt University
 --
 View this message in context:
 http://r.789695.n4.nabble.com/Bezier-curve-in-R-tp4097274p4097309.html
 Sent from the R help mailing list archive at Nabble.com.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




-- 
Tengfei Yin
MCDB PhD student
1620 Howe Hall, 2274,
Iowa State University
Ames, IA,50011-2274
Homepage: www.tengfei.name

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

2011-11-22 Thread Martin Morgan


On 11/22/2011 01:16 PM, Gábor Csárdi wrote:

Hi Josh,

On Tue, Nov 22, 2011 at 3:31 PM, Joshua Wileyjwiley.ps...@gmail.com  wrote:

Hi Gábor,

You could import rowSums.  This will not fully attach Matrix.  I am
not sure there is a really good solution for what you want to do.  To
fully use and validate your package, Matrix appears to be required.
This is different from simply, for example, enhancing the Matrix
package.


importing rowSums() from NAMESPACE requires having Matrix in the
'Depends:' line, I think. I would like to avoid that, if possible.


No need to Depend:. Use

Imports: Matrix

plus in the NAMESPACE file

 importFrom(Matrix, rowSums)

Why do you not want to do this? Matrix is available for everyone, 
Imports: doesn't influence the package search path. There is a cost 
associated with loading the library in the first place, but...?




Sure, to validate my package it is required. That is fine. For fully
using it might be required, but most users don't fully use a package,
they just use 1-20-50% of the functionality, depending on the size of
the package. My package does not depend on Matrix, except in less than
1% of its (quite many) functions.


I'm more into black-and-white -- it either needs Matrix or not; 
apparently it does.





You could just write the functions assuming matrix is there, make sure
the examples are marked don't run, and tell users if they want to use
them, they need to load Matrix first.  I do this with OpenMx in my
package.


Hmmm, this is close to what I want to do. Actually my functions work,
even if Matrix is not there, but they work better with Matrix. The
problem is that if I do this, then I get the error I've shown in my
initial email. I.e. Matrix is there, it is loaded, but the rowSums()
generic is not called for some reasons.


In another message you mention

 Matrix:::rowSums(W)
Error in callGeneric() :
  'callGeneric' must be called from a generic function or method

but something else is going on -- you don't get to call methods 
directly; you're getting Matrix::rowSums (it's exported, so no need for 
a :::, see getNamespaceExports(Matrix)). Maybe traceback() after the 
error would be insightful?


Martin



Thanks again,
Gabor


Cheers,

Josh

On Tue, Nov 22, 2011 at 12:16 PM, Gábor Csárdicsa...@rmki.kfki.hu  wrote:

Dear All,

in some functions of my package, I use the Matrix S4 class, as defined
in the Matrix package.

I don't want to depend on Matrix, however, because my package is
perfectly fine without Matrix, most of the functionality does not need
Matrix. Matrix is so included in the 'Suggests' line.

I load Matrix via require(), from the functions that really need it.
This mostly works fine, but I have an issue now that I cannot sort
out.

If I define a function like this in my package:

f- function() {
  require(Matrix)
  res- sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
  y- rowSums(res)
  res / y
}

then calling it from the R prompt I get
Error in rowSums(res) : 'x' must be an array of at least two dimensions

which basically means that the rowSums() in the base package is
called, not the S4 generic in the Matrix package. Why is that?
Is there any way to work around this problem, without depending on Matrix?

I am doing this on R 2.14.0, x86_64-apple-darwin9.8.0.

Thank You, Best Regards,
Gabor

--
Gabor Csardicsa...@rmki.kfki.hu  MTA KFKI RMKI

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.





--
Joshua Wiley
Ph.D. Student, Health Psychology
Programmer Analyst II, ATS Statistical Consulting Group
University of California, Los Angeles
https://joshuawiley.com/








--
Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109

Location: M1-B861
Telephone: 206 667-2793

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] x, y for point of intersection

2011-11-22 Thread Monica Pisica


Hi,

No it is not one off, the situation is even more complicated  i will have a 
series of straight lines like the red one parallel with each other that 
intersect the black polyline and i need to get all the points (x, y).

Meanwhile i was thinking if it will not be easier if somehow i can rotate the 
coordinate axes so the red lines are horizontal (of course the polyline needs 
to be rotated as well) and maybe knowing the distance between the red parallel 
lines and the fact that now they are horizontal will help. I need to think a 
little bit more about that - and of course afterwards the results need to be 
translated back to the original coordinate system. 

Thanks,

Monica


 CC: r-help@r-project.org
 From: michael.weyla...@gmail.com
 Subject: Re: [R] x, y for point of intersection
 Date: Tue, 22 Nov 2011 15:48:34 -0500
 To: pisican...@hotmail.com

 If it's a one off, the identify() function might be of help -- if you need 
 something algorithmic it's harder due to floating point stuff and sampling 
 frequencies. Let me know if that's the case.

 Michael

 On Nov 22, 2011, at 3:40 PM, Monica Pisica pisican...@hotmail.com wrote:

 
 
 
  Hi everyone,
 
 
 
  I am trying to get a point of intersection between a
  polyline and a straight line ….. and get the x and y coordinates of this 
  point.
  For exemplification consider this:
 
 
 
 
 
  set.seed(123)
 
 
 
  k1 -rnorm(100, mean=1.77, sd=3.33)
 
  k1 - sort(k1)
 
  q1 - rnorm(100, mean=2.37, sd=0.74)
 
  q1 - sort(q1, decreasing = TRUE)
 
  plot(k1, q1, xlim - c((min(k1)-5), (max(k1)+5)),
  type=l)
 
 
 
  ya - 2
 
  xa = -5
 
  yb=4
 
  xb=12
 
 
 
  lines(c(xa, xb), c(ya, yb), col = 2)
 
 
 
  # I want to get the x and y coordinates of the
  intersection of the 2 lines ….
 
 
 
  m - (ya-yb)/(xa-xb)
 
  b - ya-m*xa
 
  ln - loess(q1~k1)
 
  lines(ln)
 
 
 
  It is clear that the x, y will satisfy both linear
  equations, y = m*x + b and the ln polyline ….. but while I can visualize the
  equation of the straight line – I have problems with the polyline. I will 
  appreciate
  any ideas to solve this problem. I thought it a trivial solution but it 
  seems I
  cannot see it.
  Thanks,
  Monica
 
 
 
  __
  R-help@r-project.org mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.
  
__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

2011-11-22 Thread William Dunlap

I also wondered why it is important to not mention
Matrix in the DEPSCRIPTION file's Depends or Imports
lines, even though some functions in the package
require it.  If this is a hard requirement you could
split your package into two packages, pkgBasic and
pkgEnhanced.  pkgBasic would not not depend on Matrix
and pkgEnhanced would depend on Matrix and pkgBasic.

Bill Dunlap
Spotfire, TIBCO Software
wdunlap tibco.com 

 -Original Message-
 From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On 
 Behalf Of Martin Morgan
 Sent: Tuesday, November 22, 2011 1:27 PM
 To: Gábor Csárdi
 Cc: R mailing list
 Subject: Re: [R] On-demand importing of a package
 
 On 11/22/2011 01:16 PM, Gábor Csárdi wrote:
  Hi Josh,
 
  On Tue, Nov 22, 2011 at 3:31 PM, Joshua Wileyjwiley.ps...@gmail.com  
  wrote:
  Hi Gábor,
 
  You could import rowSums.  This will not fully attach Matrix.  I am
  not sure there is a really good solution for what you want to do.  To
  fully use and validate your package, Matrix appears to be required.
  This is different from simply, for example, enhancing the Matrix
  package.
 
  importing rowSums() from NAMESPACE requires having Matrix in the
  'Depends:' line, I think. I would like to avoid that, if possible.
 
 No need to Depend:. Use
 
 Imports: Matrix
 
 plus in the NAMESPACE file
 
   importFrom(Matrix, rowSums)
 
 Why do you not want to do this? Matrix is available for everyone,
 Imports: doesn't influence the package search path. There is a cost
 associated with loading the library in the first place, but...?
 
 
  Sure, to validate my package it is required. That is fine. For fully
  using it might be required, but most users don't fully use a package,
  they just use 1-20-50% of the functionality, depending on the size of
  the package. My package does not depend on Matrix, except in less than
  1% of its (quite many) functions.
 
 I'm more into black-and-white -- it either needs Matrix or not;
 apparently it does.
 
 
  You could just write the functions assuming matrix is there, make sure
  the examples are marked don't run, and tell users if they want to use
  them, they need to load Matrix first.  I do this with OpenMx in my
  package.
 
  Hmmm, this is close to what I want to do. Actually my functions work,
  even if Matrix is not there, but they work better with Matrix. The
  problem is that if I do this, then I get the error I've shown in my
  initial email. I.e. Matrix is there, it is loaded, but the rowSums()
  generic is not called for some reasons.
 
 In another message you mention
 
   Matrix:::rowSums(W)
 Error in callGeneric() :
'callGeneric' must be called from a generic function or method
 
 but something else is going on -- you don't get to call methods
 directly; you're getting Matrix::rowSums (it's exported, so no need for
 a :::, see getNamespaceExports(Matrix)). Maybe traceback() after the
 error would be insightful?
 
 Martin
 
 
  Thanks again,
  Gabor
 
  Cheers,
 
  Josh
 
  On Tue, Nov 22, 2011 at 12:16 PM, Gábor Csárdicsa...@rmki.kfki.hu  wrote:
  Dear All,
 
  in some functions of my package, I use the Matrix S4 class, as defined
  in the Matrix package.
 
  I don't want to depend on Matrix, however, because my package is
  perfectly fine without Matrix, most of the functionality does not need
  Matrix. Matrix is so included in the 'Suggests' line.
 
  I load Matrix via require(), from the functions that really need it.
  This mostly works fine, but I have an issue now that I cannot sort
  out.
 
  If I define a function like this in my package:
 
  f- function() {
require(Matrix)
res- sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
y- rowSums(res)
res / y
  }
 
  then calling it from the R prompt I get
  Error in rowSums(res) : 'x' must be an array of at least two dimensions
 
  which basically means that the rowSums() in the base package is
  called, not the S4 generic in the Matrix package. Why is that?
  Is there any way to work around this problem, without depending on Matrix?
 
  I am doing this on R 2.14.0, x86_64-apple-darwin9.8.0.
 
  Thank You, Best Regards,
  Gabor
 
  --
  Gabor Csardicsa...@rmki.kfki.hu  MTA KFKI RMKI
 
  __
  R-help@r-project.org mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide 
  http://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.
 
 
 
 
  --
  Joshua Wiley
  Ph.D. Student, Health Psychology
  Programmer Analyst II, ATS Statistical Consulting Group
  University of California, Los Angeles
  https://joshuawiley.com/
 
 
 
 
 
 
 --
 Computational Biology
 Fred Hutchinson Cancer Research Center
 1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109
 
 Location: M1-B861
 Telephone: 206 667-2793
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do

[R] Graphics in vector format

2011-11-22 Thread Antonio Rodriges

Hello,

Is there a way to save plots in vector format like SVG or smth else?

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Header = T

2011-11-22 Thread Richard M. Heiberger

   a   b
A 1  2
B 3  4
C 5  6

The assumption above is that the rownames don't have a header and
the columns do.  Therefore the default is header=TRUE.

 r a   b
A 1  2
B 3  4
C 5  6

In the second example, the first column is called r and it is not
clear whether that is a column with a variable or a rowname.
Therefore the default is header=FALSE.



On Tue, Nov 22, 2011 at 2:47 PM, lucky7 wokeliandehotm...@gmail.com wrote:

 Hi,

 I just start to use R today! I am reading the R Help on read.csv and the
 description for header says header is set to TRUE if and only if the first
 row contains one fewer field than the number of columns. Why is that? My
 data has the same number of fields in the first row as the number of
 columns. I mean I have no problem opening my csv file I am just curious why
 it should be one fewer.

 Thanks!

 --
 View this message in context:
 http://r.789695.n4.nabble.com/Header-T-tp4097045p4097045.html
 Sent from the R help mailing list archive at Nabble.com.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.htmlhttp://www.r-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] making scatterplot easier to read

2011-11-22 Thread Szymek Drobniak

Dear R users,

do you know an easy way (other than star plot) of making several points
laying one over another visible? Is it any simple way of increasing such
multipoint symbols - or shifting their positions randomly to make several
points in one place visible?

Cheers,
sz.

-- 
Szymon Drobniak || Population Ecology Group
*Institute of Environmental Sciences, Jagiellonian University
ul. Gronostajowa 7, 30-387 Kraków, POLAND
*tel.: +48 12 664 52 19 fax: +48 12 664 69 12

www.eko.uj.edu.pl/drobniak

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Graphics in vector format

I believe the Cairo package provides SVG faculties. 

Michael

On Nov 22, 2011, at 4:48 PM, Antonio Rodriges antonio@gmail.com wrote:

 Hello,
 
 Is there a way to save plots in vector format like SVG or smth else?
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] x, y for point of intersection

2011-11-22 Thread David Winsemius



On Nov 22, 2011, at 3:40 PM, Monica Pisica wrote:


(edited out excessive white space)
I am trying to get a point of intersection between a
polyline and a straight line ….. and get the x and y coordinates of  
this point.

For exemplification consider this:

set.seed(123)
k1 -rnorm(100, mean=1.77, sd=3.33)
 k1 - sort(k1)
q1 - rnorm(100, mean=2.37, sd=0.74)
q1 - sort(q1, decreasing = TRUE)
plot(k1, q1, xlim - c((min(k1)-5), (max(k1)+5)),
type=l)
ya - 2
xa = -5
yb=4
xb=12

lines(c(xa, xb), c(ya, yb), col = 2)

# I want to get the x and y coordinates of the
# intersection of the 2 lines ….
m - (ya-yb)/(xa-xb)
b - ya-m*xa
ln - loess(q1~k1)
lines(ln)



You should look at:
str(ln)
#  then plot
lines(ln$x, ln$fitted, col=blue)
plot(approxfun(c(xa, xb), c(ya, yb)), add=TRUE, col=green)
plot(approxfun(ln$x, ln$fitted), col=orange, add=TRUE)

And think about minimizing the difference in distances between two  
functions.





It is clear that the x, y will satisfy both linear
equations, y = m*x + b and the ln polyline ….. but while I can  
visualize the
equation of the straight line – I have problems with the polyline. I  
will appreciate
any ideas to solve this problem. I thought it a trivial solution but  
it seems I

cannot see it.
Thanks,
Monica



__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


David Winsemius, MD
West Hartford, CT

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] making scatterplot easier to read

2011-11-22 Thread David Winsemius



On Nov 22, 2011, at 4:58 PM, Szymek Drobniak wrote:


Dear R users,

do you know an easy way (other than star plot) of making several  
points
laying one over another visible? Is it any simple way of increasing  
such
multipoint symbols - or shifting their positions randomly to make  
several

points in one place visible?


You can make them with transparent colors ... or you can make them  
small and jitter the x and y values,... or you can use hexbin plots...  
or you can use 2d density plots or probably other ways.


We don't really have a very good problem description yet, so it is  
difficult to be very helpful. Why not read the Posting Guide and see  
if cam be more complete along its guidelines.


--
David Winsemius, MD
West Hartford, CT

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Missing data?

Thank you for the suggestions.

The only problems I see with 'to.weekly' is converting from the OHLC format
and realizing that the date is the last day of the week rather than the
first day of the week. Very minor compared to doing the whole thing myself.

-Original Message-
From: R. Michael Weylandt michael.weyla...@gmail.com
[mailto:michael.weyla...@gmail.com] 
Sent: Tuesday, November 22, 2011 3:10 PM
To: Kevin Burton
Cc: r-help@r-project.org
Subject: Re: [R] Missing data?

Couldn't you use seq.Date() to set up the time index and then just fill as
appropriate?

Alternatively, to.weekly if you are starting with a daily series. 

Michael

On Nov 22, 2011, at 4:00 PM, Kevin Burton rkevinbur...@charter.net
wrote:

 I was wondering what the best approach is for missing data in a time
series.
 I give an example using xts but I would like to know what seems to be 
 the best method. Say I have
 
 
 
 library(xts)
 
 xts.ts - xts(1:4,as.Date(c(1970-01-01, 1970-1-3, 1980-10-10, 
 2007-8-19)), frequency=52)
 
 
 
 I would like to turn this into a time series (still could be xts, or 
 converted to ts) that has values for every week starting with the week 
 that includes the start date and ending with the week that includes the
end date.
 If there is data for the week then use it otherwise set it to NA or 0.
 Remember some years have 52, 53, or rarely 54 full or partial weeks. 
 What to do with the partials at the beginning and ending of the year? 
 This seems to be a fairly common problem and doing it myself is very 
 cumbersome. Does a solution to this kind of problem exist? Once the 
 approach to a weekly period is found I am sure that adjustment to 
 daily, monthly, or quarterly would be relatively straightforward.
 
 
 
 Thank you.
 
 
 
 Kevin
 
 
 
 
[[alternative HTML version deleted]]
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide 
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Fwd: xtable and sweave: caption placement problem

2011-11-22 Thread Brian Diggs


On 11/18/2011 1:22 AM, ren...@vannieuwkoop.ch wrote:

\documentclass[11pt,a4paper]{article}
\usepackage{Sweave}
\begin{document}
=
x  =  runif(100,  1,  10)
y  =  2  +  3  *  x  + rnorm(100)
@

echo=FALSE,results=tex=
library(xtable)
print(xtable(summary(lm(y~x)),
align=r,
caption=Summary  statistics  for  the  regression  model,
caption.placement=top, label=tab:summary,
digits=2))

@

\end{document}


caption.placement is an argument to print.xtable, not xtable.

print(xtable(summary(lm(y~x)),
 align=r,
 caption=Summary  statistics  for  the  regression  model,
 label=tab:summary,
 digits=2),
  caption.placement=top)


--
Brian S. Diggs, PhD
Senior Research Associate, Department of Surgery
Oregon Health  Science University

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Missing data?

Also with to.weekly there seems to be a problem with when the week starts.
For example:

xts.ts - xts(1:4, c(as.Date(2011-01-01), as.Date(2011-01-10),
as.Date(2011-10-09), as.Date(2011-10-10)), frequency=52)
 to.weekly(xts.ts)
   xts.ts.Open xts.ts.High xts.ts.Low xts.ts.Close
2011-01-01   1   1  11
2011-01-10   2   2  22
2011-10-09   3   3  33
2011-10-10   4   4  44

 xts.ts - xts(1:4, c(as.Date(2011-01-01), as.Date(2011-01-02),
as.Date(2011-10-09), as.Date(2011-10-10)), frequency=52)
 to.weekly(xts.ts)
   xts.ts.Open xts.ts.High xts.ts.Low xts.ts.Close
2011-01-02   1   2  12
2011-10-09   3   3  33
2011-10-10   4   4  44

So in the first case the week ends on January 1st. But the second indicates
that the end of the week is the 2nd but it includes the data from the first.
I would expect that the first column should be consistent.
Notice that 10-09 and 10-10 are properly considered different weeks because
the 9th is a Sunday and the 10th is a Monday (the beginning of the week).

-Original Message-
From: R. Michael Weylandt michael.weyla...@gmail.com
[mailto:michael.weyla...@gmail.com] 
Sent: Tuesday, November 22, 2011 3:10 PM
To: Kevin Burton
Cc: r-help@r-project.org
Subject: Re: [R] Missing data?

Couldn't you use seq.Date() to set up the time index and then just fill as
appropriate?

Alternatively, to.weekly if you are starting with a daily series. 

Michael

On Nov 22, 2011, at 4:00 PM, Kevin Burton rkevinbur...@charter.net
wrote:

 I was wondering what the best approach is for missing data in a time
series.
 I give an example using xts but I would like to know what seems to be 
 the best method. Say I have
 
 
 
 library(xts)
 
 xts.ts - xts(1:4,as.Date(c(1970-01-01, 1970-1-3, 1980-10-10, 
 2007-8-19)), frequency=52)
 
 
 
 I would like to turn this into a time series (still could be xts, or 
 converted to ts) that has values for every week starting with the week 
 that includes the start date and ending with the week that includes the
end date.
 If there is data for the week then use it otherwise set it to NA or 0.
 Remember some years have 52, 53, or rarely 54 full or partial weeks. 
 What to do with the partials at the beginning and ending of the year? 
 This seems to be a fairly common problem and doing it myself is very 
 cumbersome. Does a solution to this kind of problem exist? Once the 
 approach to a weekly period is found I am sure that adjustment to 
 daily, monthly, or quarterly would be relatively straightforward.
 
 
 
 Thank you.
 
 
 
 Kevin
 
 
 
 
[[alternative HTML version deleted]]
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide 
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] x, y for point of intersection

2011-11-22 Thread Ted Harding

On 22-Nov-11 21:25:56, Monica Pisica wrote:
 Hi everyone,
 
 I am trying to get a point of intersection between a
 polyline and a straight line .. and get the x and y
 coordinates of this point.
 For exemplification consider this:
 
 set.seed(123)
 k1 -rnorm(100, mean=1.77, sd=3.33)
 k1 - sort(k1)
 q1 - rnorm(100, mean=2.37, sd=0.74)
 q1 - sort(q1, decreasing = TRUE)
 plot(k1, q1, xlim - c((min(k1)-5), (max(k1)+5)), type=l)
 
 ya - 2
 xa = -5
 yb=4
 xb=12
 lines(c(xa, xb), c(ya, yb), col = 2)
 
# I want to get the x and y coordinates of the
# intersection of the 2 lines.
 
 m - (ya-yb)/(xa-xb)
 b - ya-m*xa
 ln - loess(q1~k1)
 lines(ln)
 
 It is clear that the x, y will satisfy both linear equations,
 y = m*x + b and the ln polyline - .. but while I can visualize
 the equation of the straight line  - I have problems with the
 polyline. I will appreciate any ideas to solve this problem.
 I thought it a trivial solution but it seems I cannot see it.
 Thanks,
 Monica

  ya - 2
  xa = -5
  yb =  4
  xb = 12

These define a line

  y = ya + (x - xa)*(yb - ya)/(xb - xa)

so write this as

  y = A + B*x

Then points above the line satisfy

  y  A + B*X

and points below the line satisfy

  Y  A + B*X

  A - ya - xa*(yb - ya)/(xb - xa)
  B - (yb - ya)/(xb - xa)

So now extract the points (x,y) fron the loess fit:

  x.ln - ln$x
  y.ln - ln$y

and now find the points on 'ln' which are above, and
the points on ln which are below, which will locate the
segment which crosses the (X,Y) line:

  ix.upper - which(y.ln   A + B*ln$y)
  ix.lower - which(y.ln =3D A + B*ln$y)

  ix.upper
  # 1  2  3  4  5  6  7  8  9 10 11 12 13 14 15=20

So now you have the line segment from

  (x.ln[15],y.ln[15])
to
  (x.ln[16],y.ln[16])

and now all you need to do is to find the intersection
of the line from (ln.x[15],ln.y[15]) to (ln.x[16],ln.y[16])
with the line from (xa,ya) to (xb,yb).

(There could be complications if the y-values of ln do not
continually decrease in value; but happily they do decrease
in your example).

Hoping this helps!
Ted.




E-Mail: (Ted Harding) ted.hard...@wlandres.net
Fax-to-email: +44 (0)870 094 0861
Date: 22-Nov-11   Time: 22:49:54
-- XFMail --

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] filtering probesets with Bioconductor?

2011-11-22 Thread baumeist

Hi,

I am relatively new to R and Bioconductor and am trying to filter the
topTable that I generated of differentially expressed genes from my
normlized eset file comprised of ~ 40 HG-133A Affy microarrays .  I would
like to see if particular probesets are represented in this list. 
Alternatively I would like to
generate a topTable of differentially expressed genes using only specified
probesets within my eset 
file.

Can anyone tell me how I would  begin to approach this?  I have looked into
using the genefilter() function but can't figure out if it can take the
right parameters (i.e. specific probe set id's).

Thanks in advance,
-M

This is the code I used to generate my topTable

 fit - lmFit(eset, design)
 cont.matrix - makeContrasts(NormalvsTumor=Tumor-Normal, levels=design)
 fit2 - contrasts.fit(fit, cont.matrix)
 fit2 - eBayes(fit2)
 topTable(fit2, number=100, adjust=BH) 

--
View this message in context: 
http://r.789695.n4.nabble.com/filtering-probesets-with-Bioconductor-tp4097747p4097747.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Data Frame Search Slow

Wow, these specs are fantastic:



 user  system elapsed
0.330.000.39 
 

I wonder how much of that is because of the capacity of the box that you are
running R on.  Can you post pertinent specs?  This suggest to me that
hardware upgrades (RAM specifically) may also be in order.

Investigating data.table now.

Thanks!

--
View this message in context: 
http://r.789695.n4.nabble.com/Data-Frame-Search-Slow-tp4096906p4097278.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Help to inputting a pre-defined correlation structure in a Mixed Model

2011-11-22 Thread albertwschulthess

I'm working in a Gen/Marker-Phenotype association study in wheat and I'm
using a Mixed Model Approach to estimate the effect of the markers. My model
has the atribute measured as y (response variable), the markers and the
blocks (of a complete random block design) as fixed and the genotypes and
the residuals as random. In one hand I'm assuming that there is no
correlation between residuals and that they have a normal distribution with
mean equals to zero and a common variance equals to (sigma e)^2. On the
other hand I'm also asumming that the genotypes are normally distributed
with mean equals to zero but with variance equals to a G matrix that
considers the relatdness between individuals. This G matrix is expressed as
2 * (sigma g)^2 * K, where K is a matrix with the reladness coefficients
(kinship) between all individuals in the study. The K matrix is calculated
outside of R environment, and I'm using it as an input. I'm using the lme
function of the nlme library in R, but I want to know how can I gap the
structures that R offers me (as corrAR1, corrSumm, etc.) as default
structures, because I already have this structure and I only have to
estimate the (sigma g)^2 parameter, possibly using REML. I'm new in the use
of mixed models and I will be very pleased if someone could help me with
this particular question. Thanks a lot in advance.

--
View this message in context:
http://r.789695.n4.nabble.com/Help-to-inputting-a-pre-defined-correlation-structure-in-a-Mixed-Model-tp4097147p4097147.html
Sent from the R help mailing list archive at Nabble.com.

Re: [R] Data Frame Search Slow

So, here is the result time from using the datatable package:



 user  system elapsed 
   0.800   0.012   1.847
 

Here are the methods that I am using:

ush - data.table(read.csv(...))
setkey(ush, product_id)
s1 - subset(ush, product_id == product.id)

Seems like a minor improvement but not enough to get those subsets in less
than ~2 seconds.  Am I doing something wrong I wonder...


--
View this message in context: 
http://r.789695.n4.nabble.com/Data-Frame-Search-Slow-tp4096906p4097441.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Rcmdr numSummary: means of multiple variables without grouping

2011-11-22 Thread Boris Mayer

Hello there,

 

when using the function numSummary in Rcmdr and selecting more than one
variable (without grouping), the grand mean across all variables is returned
for each variable instead of the mean of each single variable. However, this
happens only for the mean, and not for sd, quantiles and na.

 

This is the output:

 

 numSummary(dataset1 [,c(var1, var2)], 

+   statistics=c(mean, sd, quantiles), quantiles=c(0,.25,.5,.75,1))

mean   sd  0%  25% 50% 75% 100%  n NA

var1 15.58491 4.154637 10  14  15 19.5   28 54  0

var2 15.58491 5.904053  5  10  15 18.5   30 52  2

 

 

When using a grouping variable, everything is fine:

 

 

 numSummary(dataset1 [,c(var1, var2)], 

+   groups= dataset1$gender, statistics=c(mean, sd, quantiles), 

+   quantiles=c(0,.25,.5,.75,1))

 

Variable: var1

   mean   sd 0% 25% 50% 75% 100%  n NA

male   16.57895 4.634602 10  14  16  20   28 19  0

female 16.28571 3.937538 10  14  15  18   28 35  0

 

Variable: var2

   mean   sd 0% 25% 50%  75% 100%  n NA

male   14.7 5.442810  8  10  14 15.0   28 18  1

female 14.76471 6.213664  5  10  15 19.5   30 34  1

 

 

 

Any ideas how to get separate means for multiple variables in numSummary
without grouping?

 

I am using R 2.14.0 and Rcmdr 1.7.2 on Windows 7 Professional.

 

Thanks in advance!

 

Boris

 

 


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

On Tue, Nov 22, 2011 at 4:27 PM, Martin Morgan mtmor...@fhcrc.org wrote:
[...]
 No need to Depend:. Use

 Imports: Matrix

 plus in the NAMESPACE file

  importFrom(Matrix, rowSums)

 Why do you not want to do this? Matrix is available for everyone, Imports:
 doesn't influence the package search path. There is a cost associated with
 loading the library in the first place, but...?

Not just loading, installing a package has a cost, too. Dependencies
are bad, they might make my package fail, and I have no control over
them. It's not just 'Matrix', I have this issue with other packages as
well.

Anyway, 'Imports: Matrix' is just a workaround I think. Or is the
example in my initial mail expected to fail? Why is that? Why can I
call some functions from 'Matrix' that way and why can't I call
others?

 I'm more into black-and-white -- it either needs Matrix or not; apparently
 it does.

It's a matter of opinion, I guess. I find it very annoying when I need
to install a bunch of packages from which I don't use any code, just
because some tiny bit of a package I need uses them. I would like to
spare my users from this.

[...]
 In another message you mention

 Matrix:::rowSums(W)
 Error in callGeneric() :
  'callGeneric' must be called from a generic function or method

 but something else is going on -- you don't get to call methods directly;
 you're getting Matrix::rowSums (it's exported, so no need for a :::, see
 getNamespaceExports(Matrix)). Maybe traceback() after the error would be
 insightful?

Another poster suggested this, that's why I tried. It is clear that I
should not call it directly. All I want to do is having a function
like this:

f - function() {
 if (require(Matrix)) {
   res - sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
 } else {
   res - diag(1:5)
 }
 y - rowSums(res)
 res / y
}

Setting the subjective bit, about depending or not, aside, is there
really no solution for this? The code in the manual page examples work
fine without importing the package and just loading it if needed and
available. Why doesn't the code within the package?

Thanks for the patience,
Gabor

 Martin

[...]

-- 
Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Data Frame Search Slow

Update from email outside of this thread:

Justin Haynes writes:


 matrices will help, but two quick solutions:
 
 if you are looking for single items to go in the some_value space, use ==
 instead of %in% and you'll notice speedups.  The second more involved
 option is to take a look at the package data.table.
 
 it provides a wrapper for data.frame that has some impressive
 optimizations as well as allowing for SQL like indexing and syntax.
 
 
 hope that helps!
 

Justin, thanks.  Here's the before and after clock:

Before: 

user  system elapsed 
  0.976   0.164   2.607 

After:

user  system elapsed 
  0.624   0.156   1.810 

You are correct.  I still need to get that elapsed down even more. 
Data.table sounds like a good option.  I'll look into it more.  Thank you.

--
View this message in context: 
http://r.789695.n4.nabble.com/Data-Frame-Search-Slow-tp4096906p4097261.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Data Frame Search Slow

Answer to my own question:

ush - data.table(read.csv(...))
setkey(ush, product_id)
s1 - ush[J[product.id]]



 user  system elapsed 
   0.000   0.000   0.003 
 

It seems like that's the method to use!  Amazing.

--
View this message in context: 
http://r.789695.n4.nabble.com/Data-Frame-Search-Slow-tp4096906p4097576.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Scatter plot - using colour to group points?

2011-11-22 Thread SarahH

Success with the lines command and col argument! I have some nice point and
line plots.
Thanks so much for you help. Ongoing project - I will probably be back!

Sarah 

--
View this message in context: 
http://r.789695.n4.nabble.com/Scatter-plot-using-colour-to-group-points-tp4092794p4097625.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Rcmdr numSummary: means of multiple variables without grouping

2011-11-22 Thread John Fox

Dear Boris,

This bug is traceable to a change in the mean() function in R and is fixed in 
the current version (1.7-3) of the Rcmdr package on CRAN.

Best,
 John


John Fox
Sen. William McMaster Prof. of Social Statistics
Department of Sociology
McMaster University
Hamilton, Ontario, Canada
http://socserv.mcmaster.ca/jfox/

On Tue, 22 Nov 2011 21:26:41 +0100
 Boris Mayer bocab...@googlemail.com wrote:
 Hello there,
 
  
 
 when using the function numSummary in Rcmdr and selecting more than one
 variable (without grouping), the grand mean across all variables is returned
 for each variable instead of the mean of each single variable. However, this
 happens only for the mean, and not for sd, quantiles and na.
 
  
 
 This is the output:
 
  
 
  numSummary(dataset1 [,c(var1, var2)], 
 
 +   statistics=c(mean, sd, quantiles), quantiles=c(0,.25,.5,.75,1))
 
 mean   sd  0%  25% 50% 75% 100%  n NA
 
 var1 15.58491 4.154637 10  14  15 19.5   28 54  0
 
 var2 15.58491 5.904053  5  10  15 18.5   30 52  2
 
  
 
  
 
 When using a grouping variable, everything is fine:
 
  
 
  
 
  numSummary(dataset1 [,c(var1, var2)], 
 
 +   groups= dataset1$gender, statistics=c(mean, sd, quantiles), 
 
 +   quantiles=c(0,.25,.5,.75,1))
 
  
 
 Variable: var1
 
mean   sd 0% 25% 50% 75% 100%  n NA
 
 male   16.57895 4.634602 10  14  16  20   28 19  0
 
 female 16.28571 3.937538 10  14  15  18   28 35  0
 
  
 
 Variable: var2
 
mean   sd 0% 25% 50%  75% 100%  n NA
 
 male   14.7 5.442810  8  10  14 15.0   28 18  1
 
 female 14.76471 6.213664  5  10  15 19.5   30 34  1
 
  
 
  
 
  
 
 Any ideas how to get separate means for multiple variables in numSummary
 without grouping?
 
  
 
 I am using R 2.14.0 and Rcmdr 1.7.2 on Windows 7 Professional.
 
  
 
 Thanks in advance!
 
  
 
 Boris
 
  
 
  
 
 
   [[alternative HTML version deleted]]
 
 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] evaluation question

2011-11-22 Thread Rolf Turner


On 23/11/11 07:31, R. Michael Weylandt wrote:

Good morning Erin,

eval(parse(text = pexp(3.2,rate=1)))

seems to work

But the general rule applies:

library(fortunes)
fortune(parse())

The fortune notwithstanding I find this *specific* use of parse() to be
very, uh, useful! :-)

cheers,

Rolf Turner

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] glht for lme object with significant interaction term

2011-11-22 Thread anord

Dear all, 

I'm working on some data from an experiment on the breeding behavior of
birds. In short, I have been measuring how the time spent on performing a
certain task (variable 'mean_on_active') differs over time (variable 'day',
2 levels) across three experimental categories (variable 'treat'; levels
'C', 'R', 'E'). The model shows a significant interaction between treatment
and day. To have a closer look at this, I would like to do multiple
comparisons for treatment levels within day (i.e. across treatments for each
day in turn). I have browsed the forum for a way of doing this using the
glht function, but haven't found a good solution to my problem. 

Any hints on how to proceed, or on other methods that might be (more)
appropriate? Final model output below, data attached.
http://r.789695.n4.nabble.com/file/n4097865/sub.data.txt sub.data.txt 

Kind regards, 
Andreas


##Final model
m.final-lme(mean.on.active~treat+day+treat:day,random=1|id,na.action=na.omit)
summary(m.final)

Linear mixed-effects model fit by REML
 Data: NULL 
   AIC  BIClogLik
  1051.779 1075.757 -517.8896

Random effects:
 Formula: ~1 | id
(Intercept) Residual
StdDev:6.276309 5.473167

Fixed effects: mean_on_active ~ treat * day 
  Value Std.Error DF   t-value p-value
(Intercept)   24.987123  1.792599 75 13.939049  0.
treatE15.661119  2.327329 73  6.729225  0.
treatR 1.133678  2.228713 73  0.508670  0.6125
day6-7-1.068160  1.758899 73 -0.607289  0.5455
treatE:day6-7 -6.650690  2.335567 73 -2.847570  0.0057
treatR:day6-7 -1.495927  2.273071 73 -0.658108  0.5125
 Correlation: 
  (Intr) treatE treatR day6-7 tE:6-7
treatE-0.736
treatR-0.747  0.572 
day6-7-0.482  0.372  0.389  
treatE:day6-7  0.361 -0.522 -0.292 -0.753   
treatR:day6-7  0.370 -0.281 -0.524 -0.774  0.582

Standardized Within-Group Residuals:
   Min Q1Med Q3Max 
-2.5774244 -0.4585991 -0.1360731  0.3902143  3.9752257 

Number of Observations: 154
Number of Groups: 76 


--
View this message in context: 
http://r.789695.n4.nabble.com/glht-for-lme-object-with-significant-interaction-term-tp4097865p4097865.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] On-demand importing of a package

2011-11-22 Thread Jeff Newmiller

If Suggests doesn't work for you, perhaps you need to put more effort into 
reinventing the wheel, and depend less on other packages.
---
Jeff NewmillerThe .   .  Go Live...
DCN:jdnew...@dcn.davis.ca.usBasics: ##.#.   ##.#.  Live Go...
  Live:   OO#.. Dead: OO#..  Playing
Research Engineer (Solar/BatteriesO.O#.   #.O#.  with
/Software/Embedded Controllers)   .OO#.   .OO#.  rocks...1k
--- 
Sent from my phone. Please excuse my brevity.

Gábor Csárdi csa...@rmki.kfki.hu wrote:

On Tue, Nov 22, 2011 at 4:27 PM, Martin Morgan mtmor...@fhcrc.org
wrote:
[...]
 No need to Depend:. Use

 Imports: Matrix

 plus in the NAMESPACE file

  importFrom(Matrix, rowSums)

 Why do you not want to do this? Matrix is available for everyone,
Imports:
 doesn't influence the package search path. There is a cost associated
with
 loading the library in the first place, but...?

Not just loading, installing a package has a cost, too. Dependencies
are bad, they might make my package fail, and I have no control over
them. It's not just 'Matrix', I have this issue with other packages as
well.

Anyway, 'Imports: Matrix' is just a workaround I think. Or is the
example in my initial mail expected to fail? Why is that? Why can I
call some functions from 'Matrix' that way and why can't I call
others?

 I'm more into black-and-white -- it either needs Matrix or not;
apparently
 it does.

It's a matter of opinion, I guess. I find it very annoying when I need
to install a bunch of packages from which I don't use any code, just
because some tiny bit of a package I need uses them. I would like to
spare my users from this.

[...]
 In another message you mention

 Matrix:::rowSums(W)
 Error in callGeneric() :
  'callGeneric' must be called from a generic function or method

 but something else is going on -- you don't get to call methods
directly;
 you're getting Matrix::rowSums (it's exported, so no need for a :::,
see
 getNamespaceExports(Matrix)). Maybe traceback() after the error
would be
 insightful?

Another poster suggested this, that's why I tried. It is clear that I
should not call it directly. All I want to do is having a function
like this:

f - function() {
 if (require(Matrix)) {
   res - sparseMatrix(dims=c(5, 5), i=1:5, j=1:5, x=1:5)
 } else {
   res - diag(1:5)
 }
 y - rowSums(res)
 res / y
}

Setting the subjective bit, about depending or not, aside, is there
really no solution for this? The code in the manual page examples work
fine without importing the package and just loading it if needed and
available. Why doesn't the code within the package?

Thanks for the patience,
Gabor

 Martin

[...]

-- 
Gabor Csardi csa...@rmki.kfki.hu     MTA KFKI RMKI

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Missing data?