[R] Size of R user base
I have been trying to determine the size of the R user base, and was asked to share my findings with this mailing list. Although I still don't have any definite estimate of this number, I do have some interesting and indicative information: 1. It appears that there are about 100,000 S-PLUS users. Rationale: According to Insightful's 2002 Annual Report, over 100,000 people use Insightful software; since license revenues from S-PLUS and add-on modules accounted for nearly all of their license revenues in 2002, and their other products are much more costly than S-PLUS, it seems that the great majority of users of Insightful software are S-PLUS users. Conclusion: S-PLUS costs $3500 (Windows) or $4500 (Linux/Unix) for an individual copy; R is free. This suggests that there may be more R users than S-PLUS users, which suggests 100,000 R users. Does anyone has any other information that would give some notion as to the RELATIVE numbers of R and S-PLUS users? 2. At least one R book has achieved sales of just over 5,000 copies. (I could not find sales figures for other R books, as it appears that publishers are closed-mouthed about such figures. And no, I can't reveal which particular book this was, so don't ask.) Conclusion: Very few books sell to more than 12% of the population of potential buyers, and most books have a far lower penetration -- 1% or less is not uncommon. A 12% penetration for the book in question implies 42,000 R users; a more reasonable 5% penetration implies 100,000 users. A low 1% penetration implies 500,000 users. 3. There are a total of 3225 unique subscribers to the three R mailing lists. __ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
Re: [R] Size of R user base
(Ted Harding) wrote: 1. It appears that there are about 100,000 S-PLUS users. [...] Does anyone has any other information that would give some notion as to the RELATIVE numbers of R and S-PLUS users? There is one major factor in here. The number of Windows users in the world is much higher than the number of Unix/Linux users, especially in the corporate sector. Organisations whose work needs R/S-PLUS and whose IT is Windows based will (I believe) mostly go for S-PLUS (I could expand in my reasons for believing this). But R is available for Windows, too. I've downloaded and installed both the Linux and Windows versions; neither task was difficult, and the Windows version had a rather nicer interface. __ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
Re: [R] Size of R user base
Prof Brian Ripley wrote: Conclusion: Very few books sell to more than 12% of the population of potential buyers, and most books have a far lower penetration -- 1% or Where did you get that 12% from? A booklet on assessing financial feasibility in nonfiction book publishing. That's a general figure, so perhaps it doesn't apply if the book in question is a must-have, definitive reference for the group in question... like the book you mention (if it's the one I think it is). I have big problems with the definition. What is an `R user'? Someone who has ever used R, even for a one-hour practical class? Someone who has used R in the last 3 months? Good question. I guess I'd lean more towards your second definition, with the added caveat of and expects to use it again in the next 3 months. __ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
Re: [R] Computing a CDF or many quantiles
Your method looks like a naive reimplementation of integration, and won't work so well for distributions that have the great majority of the probability mass concentrated in a small fraction of the sample space. I was hoping for something that would retain the adaptability of integrate(). (Ted Harding) wrote: If that's all you want to do, then a very straightfoward approach should be OK. I illustrate with a truncated normal distribution on [-1,1]: x - (-1)+(0.001*(0:2000));pdf-dnorm(x); pdf-pdf/(sum(pdf)*0.001) CDF-cumsum(pdf)*0.001 plot(x,pdf,ylim=c(0,1),type=l);lines(x,CDF) Quantiles: N=10;e-CDF[1]; for(i in (0:10)){ j-max(which(CDF=i/N+e));print(c(x[j],CDF[j])) } __ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-help
[R] Computing a CDF or many quantiles
Given f, a pdf over a finite interval, is there any existing R function that can efficiently tabulate the cumulative distribution function for f, or produce all N+1 quantiles of the form i/N? Efficiently here means better than doing repeated integrations for each point. __ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-help