[R] Size of R user base

2004-04-19 Thread Kevin S. Van Horn
I have been trying to determine the size of the R user base, and was 
asked to share my findings with this mailing list.  Although I still 
don't have any definite estimate of this number, I do have some 
interesting and indicative information:

1. It appears that there are about 100,000 S-PLUS users.

Rationale: According to Insightful's 2002 Annual Report, over 100,000 
people use Insightful software; since license revenues from S-PLUS and 
add-on modules accounted for nearly all of their license revenues in 
2002, and their other products are much more costly than S-PLUS, it 
seems that the great majority of users of Insightful software are S-PLUS 
users.

Conclusion: S-PLUS costs $3500 (Windows) or $4500 (Linux/Unix) for an 
individual copy; R is free.  This suggests that there may be more R 
users than S-PLUS users, which suggests  100,000 R users.

Does anyone has any other information that would give some notion as to 
the RELATIVE numbers of R and S-PLUS users?

2. At least one R book has achieved sales of just over 5,000 copies.  (I 
could not find sales figures for other R books, as it appears that 
publishers are closed-mouthed about such figures.  And no, I can't 
reveal which particular book this was, so don't ask.)

Conclusion:  Very few books sell to more than 12% of the population of 
potential buyers, and most books have a far lower penetration -- 1% or 
less is not uncommon.  A 12% penetration for the book in question 
implies 42,000 R users; a more reasonable 5% penetration implies 100,000 
users.  A low 1% penetration implies 500,000 users.

3. There are a total of 3225 unique subscribers to the three R mailing 
lists.

__
[EMAIL PROTECTED] mailing list
https://www.stat.math.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


Re: [R] Size of R user base

2004-04-19 Thread Kevin S. Van Horn
(Ted Harding) wrote:

1. It appears that there are about 100,000 S-PLUS users.
   

[...]

Does anyone has any other information that would give some notion as to
the RELATIVE numbers of R and S-PLUS users?
   

There is one major factor in here. The number of Windows users
in the world is much higher than the number of Unix/Linux users,
especially in the corporate sector. Organisations whose work
needs R/S-PLUS and whose IT is Windows based will (I believe)
mostly go for S-PLUS (I could expand in my reasons for believing
this).
But R is available for Windows, too.  I've downloaded and installed both 
the Linux and Windows versions; neither task was difficult, and the 
Windows version had a rather nicer interface.

__
[EMAIL PROTECTED] mailing list
https://www.stat.math.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


Re: [R] Size of R user base

2004-04-19 Thread Kevin S. Van Horn
Prof Brian Ripley wrote:

Conclusion:  Very few books sell to more than 12% of the population of 
potential buyers, and most books have a far lower penetration -- 1% or 
   

Where did you get that 12% from?

A booklet on assessing financial feasibility in nonfiction book 
publishing.  That's a general figure, so perhaps it doesn't apply if the 
book in question is a must-have, definitive reference for the group in 
question... like the book you mention (if it's the one I think it is).

I have big problems with the definition. What is an `R user'? Someone  
who has ever used R, even for a one-hour practical class? Someone who 
has used R in the last 3 months?


Good question.  I guess I'd lean more towards your second definition, 
with the added caveat of and expects to use it again in the next 3 months.

__
[EMAIL PROTECTED] mailing list
https://www.stat.math.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


Re: [R] Computing a CDF or many quantiles

2003-09-10 Thread Kevin S. Van Horn
Your method looks like a naive reimplementation of integration, and 
won't work so well for distributions that have the great majority of the 
probability mass concentrated in a small fraction of the sample space. 
I was hoping for something that would retain the adaptability of 
integrate().

(Ted Harding) wrote:

If that's all you want to do, then a very straightfoward approach should
be OK. I illustrate with a truncated normal distribution on [-1,1]:
 x - (-1)+(0.001*(0:2000));pdf-dnorm(x); pdf-pdf/(sum(pdf)*0.001)
 CDF-cumsum(pdf)*0.001
 plot(x,pdf,ylim=c(0,1),type=l);lines(x,CDF)
Quantiles:
 N=10;e-CDF[1];
 for(i in (0:10)){
 j-max(which(CDF=i/N+e));print(c(x[j],CDF[j]))
 }
 

__
[EMAIL PROTECTED] mailing list
https://www.stat.math.ethz.ch/mailman/listinfo/r-help


[R] Computing a CDF or many quantiles

2003-09-09 Thread Kevin S. Van Horn
Given f, a pdf over a finite interval, is there any existing R function that 
can efficiently tabulate the cumulative distribution function for f, or 
produce all N+1 quantiles of the form i/N?  Efficiently here means better 
than doing repeated integrations for each point.

__
[EMAIL PROTECTED] mailing list
https://www.stat.math.ethz.ch/mailman/listinfo/r-help