Re: [R] filtering a dataframe with a vector of rownames

2011-01-14 Thread ANJAN PURKAYASTHA
subset(data, row.names(data) %in% name.to.keep
HTH,
Anjan

On Fri, Jan 14, 2011 at 1:25 PM, Jonathan Hughes 
jonathan.hughes...@live.com wrote:



 Hello,

 Here's my problem. I have a large data frame and a vector with some of its
 row names. I'd like to have a new data frame only with those rows that match
 this vector of row names.

 I tried this:

 data-cbind(c(1,2,3,4,5,6),c(2,3,4,5,6,7))
 rownames(data)-c(a, b, c,d,e,f)
 names.to.keep-c(a, c, d)
 rows.to.keep-which(rownames(data)==names.to.keep)

 But it didn't work. Any suggestions?

 thanks a lot.

 Jonathan.

[[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Plotting colour-coded points

2011-01-05 Thread ANJAN PURKAYASTHA
Hi,
I have a file of the following type:

idab
1   0.5   5
2   0.7  15
3   1.6   7
40.5 25


I would like to plot the data in column a on the y-axis and the
corresponding data in column id on the x-axis, so plot(a~id).  However I
would like to colour these points according to the data in column b.
column b data may be colour coded into the following bins: 0-9; 10-19;
20-29.
Any idea on how to accomplish this?
TIA,
Anjan

-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] subset question

2010-12-29 Thread ANJAN PURKAYASTHA
Hi,
I'm having a problem with a step that should be pretty simple.
I have a dataframe, d,  with column names : gene s1 s2 s3. The column gene
stores an Id; the rest of the columns store intensity data.
I would like to extract the rows for gene Ids i1, i2, i3 ( I know a priori
that those rows exist).
So I do this:
subset(d, gene %in% c(i1, i2, i3)).
This does not give me the required data.
Any ideas where I am going wrong?
TIA,
Anjan

-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] subset question

2010-12-29 Thread ANJAN PURKAYASTHA
nope, that did not work.
thanks though.
Anjan

On Wed, Dec 29, 2010 at 5:02 PM, Jonathan Flowers 
jonathanmflow...@gmail.com wrote:

 Try subd - d[, gene == c(i1,i2,i3)]

 On Wed, Dec 29, 2010 at 4:55 PM, ANJAN PURKAYASTHA 
 anjan.purkayas...@gmail.com wrote:

 Hi,
 I'm having a problem with a step that should be pretty simple.
 I have a dataframe, d,  with column names : gene s1 s2 s3. The column
 gene
 stores an Id; the rest of the columns store intensity data.
 I would like to extract the rows for gene Ids i1, i2, i3 ( I know a priori
 that those rows exist).
 So I do this:
 subset(d, gene %in% c(i1, i2, i3)).
 This does not give me the required data.
 Any ideas where I am going wrong?
 TIA,
 Anjan

 --
 ===
 anjan purkayastha, phd.
 research associate
 fas center for systems biology,
 harvard university
 52 oxford street
 cambridge ma 02138
 phone-703.740.6939
 ===

[[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.





-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Error message when creating a dataframe

2010-11-01 Thread ANJAN PURKAYASTHA
Hi,
I'm creating a data frame of 24 columns and 45101 rows.
Here is the error I get:
R(147,0xa04d8720) malloc: *** mmap(size=184320) failed (error code=12)
*** error: can't allocate region
*** set a breakpoint in malloc_error_break to debug
Any idea of what I can do to trouble-shoot?
Attached is the session  info:
R version 2.11.1 (2010-05-31)
i386-apple-darwin9.8.0

locale:
[1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] stats graphics  grDevices utils datasets  methods   base



Thanks in advance,
Anjan


-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] One-class SVM

2010-10-29 Thread ANJAN PURKAYASTHA
Does any R package support one-class SVM?
I'm trying to develop an application to detect anomalies in genome
sequencing.
Thanks in advance.
Anjan

-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Subsetting a dataframe

2010-10-24 Thread ANJAN PURKAYASTHA
Hi,
I have a dataframe with 43 columns and a 1000 rows.  Each entry in the
dataframe can be either P or A.
here is a small chunk:
 c1c2 ...c43
r100  P A   ...  P
r101  A A   ...  A
r102  P P   ...  P

How does one subset this data frame to select those rows that have only P's
in them?

Thanks in advance.
Anjan




-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Subsetting a dataframe

2010-10-24 Thread ANJAN PURKAYASTHA
Thanks all for your help.
Anjan


On Sun, Oct 24, 2010 at 1:38 PM, Jorge Ivan Velez
jorgeivanve...@gmail.comwrote:

 Hi Anjan,

 Please consider the following example:

  x - c(2, rep(1, 10))
  all(x == 1)
 [1] FALSE
  d - replicate(10, sample(x, replace = TRUE))
  d
   [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10]
  [1,]111111211 1
  [2,]111212121 1
  [3,]111111111 1
  [4,]111111112 1
  [5,]111121111 1
  [6,]211111111 1
  [7,]111111112 1
  [8,]112111111 1
  [9,]122121112 1
 [10,]111111111 1
 [11,]111111111 1
  d[apply(d, 1, function(v) all(v==1)), ]
  [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10]
 [1,]111111111 1
 [2,]111111111 1
 [3,]111111111 1


 HTH,
 Jorge


 On Sun, Oct 24, 2010 at 1:32 PM, ANJAN PURKAYASTHA  wrote:

 Hi,
 I have a dataframe with 43 columns and a 1000 rows.  Each entry in the
 dataframe can be either P or A.
 here is a small chunk:
 c1c2 ...c43
 r100  P A   ...  P
 r101  A A   ...  A
 r102  P P   ...  P

 How does one subset this data frame to select those rows that have only
 P's
 in them?

 Thanks in advance.
 Anjan




 --
 ===
 anjan purkayastha, phd.
 research associate
 fas center for systems biology,
 harvard university
 52 oxford street
 cambridge ma 02138
 phone-703.740.6939
 ===

[[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.





-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] How do I tell whether two vectors are identical?

2010-10-12 Thread ANJAN PURKAYASTHA
Hi,
I have two vectors, each of length 45000.
How do I compare the vectors to ascertain if they are identical. Secondly if
they are NOT identical, how do I determine the indices of positions at which
the vectors differ?
Thanks,
Anjan


-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] How do I tell whether two vectors are identical?

2010-10-12 Thread ANJAN PURKAYASTHA
Thanks all for you help.
Anjan

2010/10/12 £ukasz Rêc³awowicz lukasz.reclawow...@gmail.com

 Or just:

 id-seq(1,45e3,1)
 compare-data.frame(id,a,b)
 differences-compare$id[a!=b]

 --
 Mi³ego dnia

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] problem with colors

2010-10-09 Thread ANJAN PURKAYASTHA
Hi Phil and Thomas,
Thanks for your helpful feedback.  I must admit my solution to creating the
vector of colors lacked your elegance.
In brief, I saved the output of colors() into a text file, saved all but 47
colours in that file and read it back as a data frame and used the first
column of the dataframe as a vector of 47 colours. This roundabout method
may have caused the  problem because when I chose colours according to the
commands sent by both of you things seemed to work just fine.
Thank you very much for your feedback.
Anjan

On Thu, Oct 7, 2010 at 3:25 PM, Thomas Stewart tgstew...@gmail.com wrote:

 I would be helpful if you provided a more complete, reproducible example.
  Consider the following code.  It colors the boxes according to the first 47
 colors listed in the color() vector.

 -tgs

 data-as.data.frame(matrix(rnorm(47*23),ncol=47))
 boxplot(data,col=colors()[1:47])




 On Thu, Oct 7, 2010 at 2:22 PM, ANJAN PURKAYASTHA 
 anjan.purkayas...@gmail.com wrote:

 Hi,
 I have a data set of 47 columns. I would like to create a boxplot for each
 column, each boxplot of a different colour.
 So I created a vector col1. This vector has a subset of the colors
 returned by color()- red, cyan, green etc.
 Now I use the command: boxplot(dataset, col= col1) expecting to see 47
 boxplots, each of a different colour.
 Here is the problem: the boxplots are drawn correctly but it seems that
 only
 the first few colours in col1 are being used in a repeated pattern.
 Anybody has any ideas on how to tackle this?
 Thanks in advance,
 Anjan

 --
 ===
 anjan purkayastha, phd.
 research associate
 fas center for systems biology,
 harvard university
 52 oxford street
 cambridge ma 02138
 phone-703.740.6939
 ===

[[alternative HTML version deleted]]

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.





-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] problem with colors

2010-10-07 Thread ANJAN PURKAYASTHA
Hi,
I have a data set of 47 columns. I would like to create a boxplot for each
column, each boxplot of a different colour.
So I created a vector col1. This vector has a subset of the colors
returned by color()- red, cyan, green etc.
Now I use the command: boxplot(dataset, col= col1) expecting to see 47
boxplots, each of a different colour.
Here is the problem: the boxplots are drawn correctly but it seems that only
the first few colours in col1 are being used in a repeated pattern.
Anybody has any ideas on how to tackle this?
Thanks in advance,
Anjan

-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Plotting x-axis labels perpendicular to the axis

2010-10-05 Thread ANJAN PURKAYASTHA
Hi,
The following command: axis(1, at= 1:50, labels= sampleNames(data)) produces
an X-axis with 50 ticks and labels drawn from the sampleNames namespace.
Problem is the labels at each tick are printed parallel to the x-axis and
thus run into each other. How do I specify for the labels to be printed
perpendicular to the x-axis?
Thanks in advance.
Anjan

-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma 02138
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] subsetting

2010-05-30 Thread ANJAN PURKAYASTHA
Hi,
I have a data-frame,  r (column names below), that needs subsetting:
date, time, strain, gene, deltact

When I try to subset r by applying selection criteria on two columns I get
an empty data frame. For example I would like to extract all rows that have
time == 0h and strain == ROC.
So, t - subset(r, (r$time == 0h  r$strain == ROC), select= c(time,
strain, gene, deltact)) returns an empty data-frame.
Is it not possible to subset based on two criteria?
TIA
Anjan

-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] modifying only certain rows in a data-frame

2010-05-29 Thread ANJAN PURKAYASTHA
Hi,
I have a data-frame r (shown below is a subset) with the following column
names:
datetimesamplevctnormhctdeltact

$time has 8 levels: 0h, 0.5h, .24h. $deltact holds real numbers.
I want to subset this dataframe by the levels of $time.  For all the rows
which have r$time == 0h I would like to convert the present value of
r$deltact to 0.
Any pointers to a simple method to accomplish would be greatly appreciated.

TIA,
Anjan


-- 
===
anjan purkayastha, phd.
research associate
fas center for systems biology,
harvard university
52 oxford street
cambridge ma
phone-703.740.6939
===

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] plotting the following data

2010-05-21 Thread ANJAN PURKAYASTHA
Hi,
I need to plot $time on the x-axis and $f on the y-axis for the following
data:
timef
0h0.00
0.5h0.54
1h1.15
2h2.33
4h1.57
6h2.19
18h1.45
24h1.79
I want the order of the data-points to be retained and the x-coordinates of
each of the 8 data-points to be equally spaced.
however, using plot($time, $f) completely scrambles the ordering of the
points.
Any pointers to this newbie on how to plot these data points accurately will
be much appreciated.
Thanks,
Anjan

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] plotting the following data

2010-05-21 Thread ANJAN PURKAYASTHA
Thanks all for your suggestions.
plot(1:8, f, axes=FALSE)
axis(1, at=1:8, labels=time)
axis(2)
That gave me exactly what I wanted.
Anjan

On Fri, May 21, 2010 at 12:27 PM, jim holtman jholt...@gmail.com wrote:

 Not exactly sure what you mean by equally spaced;  here is one way:

  x - read.table(textConnection(timef
 + 0h0.00
 + 0.5h0.54
 + 1h1.15
 + 2h2.33
 + 4h1.57
 + 6h2.19
 + 18h1.45
 + 24h1.79), header=TRUE, as.is=TRUE)
 
  plot(x$f, xaxt='n')
  axis(1, at=1:8, labels=x$time)


 On Fri, May 21, 2010 at 12:15 PM, ANJAN PURKAYASTHA
 anjan.purkayas...@gmail.com wrote:
  Hi,
  I need to plot $time on the x-axis and $f on the y-axis for the following
  data:
  timef
  0h0.00
  0.5h0.54
  1h1.15
  2h2.33
  4h1.57
  6h2.19
  18h1.45
  24h1.79
  I want the order of the data-points to be retained and the x-coordinates
 of
  each of the 8 data-points to be equally spaced.
  however, using plot($time, $f) completely scrambles the ordering of the
  points.
  Any pointers to this newbie on how to plot these data points accurately
 will
  be much appreciated.
  Thanks,
  Anjan
 
 [[alternative HTML version deleted]]
 
  __
  R-help@r-project.org mailing list
  https://stat.ethz.ch/mailman/listinfo/r-help
  PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.html
  and provide commented, minimal, self-contained, reproducible code.
 



 --
 Jim Holtman
 Cincinnati, OH
 +1 513 646 9390

 What is the problem that you are trying to solve?


[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] row-wise means

2009-11-18 Thread ANJAN PURKAYASTHA
I have a dataframe with 3 columns. The first column stores an index. I would
like to calculate the mean of the numbers stored in each of the rest of the
columns.
So,
here is my data matrix:
col1 col2 col3
1 23 34
2 45 56
3 23 56
4 34 68

For each row I would like to calculate the means of the numbers stored in
col2 and col3.
How can this be done in R?
TIA,
Anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Filtering on a dataframe- newbie question

2009-10-19 Thread ANJAN PURKAYASTHA
Thank you all who replied to my post. That cleared things up very well
Anjan

On Sun, Oct 18, 2009 at 11:10 PM, ANJAN PURKAYASTHA 
anjan.purkayas...@gmail.com wrote:

 Hi,
 newbie question. I have a data-frame with 3 named columns: Name, Obs1,
 Obs2.
 The Name column members are made of alphanumeric characters: T1, T2, T3
 etc.
 I would like to acess only that subset of the data-frame with Name == T44.
 X - dataframe[dataframe$Name=='T44'] does not work.
 Any ideas on how to do this?
 I'm sure I'm missing a simple concept here.

 Thanks,
 Anjan


 --
 =
 anjan purkayastha, phd
 bioinformatics analyst
 whitehead institute for biomedical research
 nine cambridge center
 cambridge, ma 02142

 purkayas [at] wi [dot] mit [dot] edu
 703.740.6939




-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Filtering on a dataframe- newbie question

2009-10-18 Thread ANJAN PURKAYASTHA
Hi,
newbie question. I have a data-frame with 3 named columns: Name, Obs1, Obs2.
The Name column members are made of alphanumeric characters: T1, T2, T3 etc.
I would like to acess only that subset of the data-frame with Name == T44.
X - dataframe[dataframe$Name=='T44'] does not work.
Any ideas on how to do this?
I'm sure I'm missing a simple concept here.

Thanks,
Anjan


-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Calculating the average after adding 3 matrices

2009-10-03 Thread ANJAN PURKAYASTHA
Hi all,
Here is my problem:
 I have 3 matrices , A, B, C.
Each is an nXn matrix.
I need to create matrix D such that : D[i,j]= (A[i,j]+B[i,j]+C[i,j])/3.
Being a newbie this is proving to be a challenge.
Any ideas on how best to accomplish this?
Thanks!
Anjan


-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Book on R programming

2009-08-31 Thread ANJAN PURKAYASTHA
Most books on R I come across describe running statistical procedures in R.
Any suggestions on a good book that teaches *programming* in R?
Thanks,
Anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Book on R programming

2009-08-31 Thread ANJAN PURKAYASTHA
To All,
Your prompt and helpful replies are most appreciated. I'll be sure to check
out your suggestions.
Cheers,
Anjan

On Mon, Aug 31, 2009 at 8:24 AM, Ottorino-Luca Pantani 
ottorino-luca.pant...@unifi.it wrote:

 ANJAN PURKAYASTHA ha scritto:

 Most books on R I come across describe running statistical procedures in
 R.
 Any suggestions on a good book that teaches *programming* in R?
 Thanks,
 Anjan



 Here there are a few

 http://www.r-project.org/doc/bib/R-books.html
 [42] and [79] may be good starting points

 --
 Ottorino




-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Plotting question

2009-07-18 Thread ANJAN PURKAYASTHA
Hi I have a data set that needs to be plotted (see below)
When I plot it using the simple plot(read_count ~ Cn), I get box plots for
the read_count numbers plotted according to Cn. The Cn's on the x-axis are
ordered: 1, 100, 1000, 40, 400, 4000...
How do I plot so that Cn is plotted on the x-axis in an ascending order: 40,
60, 80, ...1?

Thanks for your help.
Anjan


ID Cn read_count
1   MJ-2000-79 10,000   6876
2   MJ-2000-80 10,000  23440
3   MJ-2000-87 10,000  18787
4  MJ-2000-100   8000   4775
5   MJ-2000-81   8000   1542
6   MJ-2000-82   8000   1550
7  MJ-2000-101   6000  15322
8   MJ-2000-83   6000   7023
9   MJ-2000-84   6000834
10 MJ-2000-102   4000   4216
11  MJ-2000-85   4000   1174
12  MJ-2000-86   4000404
13 MJ-4000-131   1000   1368
14 MJ-4000-132   1000   2219
15 MJ-4000-125800413
16 MJ-4000-133800   2594
17 MJ-4000-127600   3059
18 MJ-4000-134600   1561
19 MJ-4000-142600848
20 MJ-4000-129400182
21 MJ-4000-130400   1410
22 MJ-4000-135400   2713
23 MJ-7000-182100943
24 MJ-8000-184100318
25 MJ-8000-185 80 92
26 MJ-8000-186 80161
27 MJ-8000-188 60158
28 MJ-8000-191 60103
29 MJ-8000-192 40468
30 MJ-8000-193 40432

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] drawing a plot with SEM

2009-04-22 Thread ANJAN PURKAYASTHA
I have a data matrix:
repeat1repeat2
13034
21223
3 50   13
4 5698
.
.
.

I would like to plot for the mean and standard error of mean of repeat 1 and
repeat2 for each row.
Any pointers on how to do this in R?

TIA,
Anjan
-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Uninstall question

2009-02-17 Thread ANJAN PURKAYASTHA
I need to uninstall R 2.7.1 from my Mac. What is the best way to uninstall
it? Simply delete the R icon in the Applications folder?
Or is it more involved?
TIA,
Anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] question on statistical test

2009-01-31 Thread ANJAN PURKAYASTHA
hi,
this is more of a statistical methodology question than an R question.
however, since there are quite a few expert statisticians in this forum i
think i can expect to get some useful feedback.
here is my problem:
i am studying a set of genes.
for each gene i have a set of probes (there are 2-5 probes per gene).
for each probe i have a set of 3 measurements (under conditions a, b and c).

let us say that for a given gene i have 3 probes X, Y and Z. and for each
probe there are 3 measurements, thus:

   a b c
X   Xa   Xb   Xc
Y   Ya   YbYc
ZZa   ZbZc

for each gene i need to test the correlation among the measurements under
each condition. so, are Xa, Ya, Za correlated AND Xb, Yb, Zb correlated AND
Xc, Yc, Zc correlated?
based on the degree of correlation among these probe measurements i will
choose the genes for further study.
of course i have to do this for each gene in my set of 100.

what statistical tests do i need to perform to accomplish this?

all advice will be appreciated.
tia,
anjan

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] quantile question

2009-01-22 Thread ANJAN PURKAYASTHA
Hi,
A simple quantile question:
I need to calculate the 95% and 5% quantiles (aka percentiles) for the
following data:
67.12
64.51
62.06
55.45
51.41
43.78
10.74
10.14

if I use the formula: 95% quantile point= 95 (8+1)/100, I get the 8.55th
point as the 95% quantile. Which does not make too much sense as I have only
8 data points.
The other option is to use (95*8)/100 = 7.6th data point (which can be found
by interpolation between the 7th and 8th data points).
Reportedly, the second formula is not too accurate. However in my case the
first formula does not make much sense.

Any advice?

TIA,
Anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] quantile question

2009-01-22 Thread ANJAN PURKAYASTHA
Thanks all for your prompt and helpful replies!
Anjan

On Thu, Jan 22, 2009 at 11:52 AM, ANJAN PURKAYASTHA 
anjan.purkayas...@gmail.com wrote:

 Hi,
 A simple quantile question:
 I need to calculate the 95% and 5% quantiles (aka percentiles) for the
 following data:
 67.12
 64.51
 62.06
 55.45
 51.41
 43.78
 10.74
 10.14

 if I use the formula: 95% quantile point= 95 (8+1)/100, I get the 8.55th
 point as the 95% quantile. Which does not make too much sense as I have only
 8 data points.
 The other option is to use (95*8)/100 = 7.6th data point (which can be
 found by interpolation between the 7th and 8th data points).
 Reportedly, the second formula is not too accurate. However in my case the
 first formula does not make much sense.

 Any advice?

 TIA,
 Anjan

 --
 =
 anjan purkayastha, phd
 bioinformatics analyst
 whitehead institute for biomedical research
 nine cambridge center
 cambridge, ma 02142

 purkayas [at] wi [dot] mit [dot] edu
 703.740.6939




-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Perl-R bridge

2009-01-18 Thread ANJAN PURKAYASTHA
Hi,
I'm planning to access R from my perl scripts.
The only noteworthy bridge seems to be
Statistics-R-0.03http://search.cpan.org/%7Ectbrown/Statistics-R/lib/Statistics/R.pm.
Would anyone like to share their experience with this Perl-R bridge?
I'd like to install it in a Mac OS X.
Suggestions on alternate solutions will be appreciated.
Thanks in advance,
Anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Statistics forums

2008-11-15 Thread ANJAN PURKAYASTHA
Hi,
Given that the R forum is mostly a site to discuss issues related to the
language, are there any forums that discuss statistical issues? I am
specifically interested in the statistical issues in experimental design.
TIA,
Anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] package question

2008-10-06 Thread ANJAN PURKAYASTHA
hi,
am new to R. what command do i use to check of certain packages have been
loaded?
Also, it seems that most packages come with vignette- is this a document
that describes the usage of the package?
thanks for your help,
anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] printing all rows

2008-09-09 Thread ANJAN PURKAYASTHA
Hi,
my data table has 38939 rows. R prints  the first 1 columns and then
prints an error message:[ reached getOption(max.print) -- omitted 27821
rows ]].
is it possible to set the maxprint parameter so that R prints all the rows?

tia,
anjan

-- 
=
anjan purkayastha, phd
bioinformatics analyst
whitehead institute for biomedical research
nine cambridge center
cambridge, ma 02142

purkayas [at] wi [dot] mit [dot] edu
703.740.6939

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.