Re: [R] popular R packages

2009-03-11 Thread Jim Lemon
Christos Hatzis wrote: Bioconductor already provides download stats for all packages... http://bioconductor.org/packages/stats/bioc/affy.html Maybe if we asked the Bioconductor people _really_ nicely Jim __ R-help@r-project.org mailing list

Re: [R] popular R packages

2009-03-10 Thread Jim Lemon
Gabor Grothendieck wrote: R-Forge already has this but I don't think its used much. R-Forge does allow authors to opt out which seems sensible lest it deter potential authors from submitting packages. I think objective quality metrics are better than ratings, e.g. does package have a vignette,

Re: [R] popular R packages

2009-03-10 Thread Gabor Grothendieck
On Tue, Mar 10, 2009 at 6:14 AM, Jim Lemon j...@bitwrit.com.au wrote: Gabor Grothendieck wrote: R-Forge already has this but I don't think its used much.  R-Forge does allow authors to opt out which seems sensible lest it deter potential authors from submitting packages. I think objective

Re: [R] popular R packages

2009-03-10 Thread Frank E Harrell Jr
Gabor Grothendieck wrote: On Tue, Mar 10, 2009 at 6:14 AM, Jim Lemon j...@bitwrit.com.au wrote: Gabor Grothendieck wrote: R-Forge already has this but I don't think its used much. R-Forge does allow authors to opt out which seems sensible lest it deter potential authors from submitting

Re: [R] popular R packages

2009-03-10 Thread Max Kuhn
If is easy to get the download numbers, we should do it and deal with the interpretation issues. I'd like to know the numbers so I can understand which (of my) packages have the most usage. One other compication about # downloads: I suspect that a package being on teh depends/suggests/imports

Re: [R] popular R packages

2009-03-10 Thread Dylan Beaudette
On Tuesday 10 March 2009, Frank E Harrell Jr wrote: Gabor Grothendieck wrote: On Tue, Mar 10, 2009 at 6:14 AM, Jim Lemon j...@bitwrit.com.au wrote: Gabor Grothendieck wrote: R-Forge already has this but I don't think its used much. R-Forge does allow authors to opt out which seems

Re: [R] popular R packages

2009-03-10 Thread Ajay ohri
Pricing each download at 99 cents ( the same as a song from I Tunes) can measure users more accurately. Thats my 2 cents anyways. On Tue, Mar 10, 2009 at 9:54 PM, Max Kuhn mxk...@gmail.com wrote: If is easy to get the download numbers, we should do it and deal with the interpretation issues.

Re: [R] popular R packages

2009-03-10 Thread Christos Hatzis
To: r-help@r-project.org Subject: Re: [R] popular R packages If is easy to get the download numbers, we should do it and deal with the interpretation issues. I'd like to know the numbers so I can understand which (of my) packages have the most usage. One other compication about

Re: [R] popular R packages

2009-03-10 Thread Tom Backer Johnsen
2100 Seattle, WA 98101 Phone: 206-686-1578 Email: s...@xlsolutions-corp.com web: www.xlsolutions-corp.com --- On Sat, 3/7/09, Spencer Graves spencer.gra...@prodsyse.com wrote: From: Spencer Graves spencer.gra...@prodsyse.com Subject: Re: [R] popular R packages To: Wacek Kusnierczyk

Re: [R] popular R packages

2009-03-09 Thread David Duffy
Given we are talking about statistical software, one bibliometric measure of relative package popularity is scientific citations. Web of Science is not too useful where the citation has been to a website or computer package, but Google Scholar for lme4: Linear mixed-effects models using S4

Re: [R] popular R packages

2009-03-09 Thread Ted Harding
On 10-Mar-09 01:07:54, David Duffy wrote: Given we are talking about statistical software, one bibliometric measure of relative package popularity is scientific citations. Web of Science is not too useful where the citation has been to a website or computer package, but Google Scholar for

Re: [R] popular R packages

2009-03-09 Thread Matthew Keller
Hi all, Put me in the camp that says more information is better than less information - even if imperfect. Interpretation can be left to those using the data. Also, popular can mean many things. An alternative to number of times a package is downloaded would be a ratings system, where R users

Re: [R] popular R packages

2009-03-09 Thread hadley wickham
There was a discussion on this a while back in which Bill Venables said: To me a much more urgent initiative [than rating responders on R listserves] is some kind of user online review system for packages, even something as simple as that used by Amazon.com has for customer review of books. I

Re: [R] popular R packages

2009-03-09 Thread Gabor Grothendieck
R-Forge already has this but I don't think its used much. R-Forge does allow authors to opt out which seems sensible lest it deter potential authors from submitting packages. I think objective quality metrics are better than ratings, e.g. does package have a vignette, has package had a release

Re: [R] popular R packages

2009-03-08 Thread Emmanuel Charpentier
On Sat, 07 Mar 2009 18:04:24 -0500, David Winsemius wrote : [ Snip ... ] Nonetheless, I do think the relative numbers of package downloads might be interpretable, or at the very least, the basis for discussions over beer. *Anything* might be the basis for discussions over beer (obvious

Re: [R] popular R packages

2009-03-08 Thread hadley wickham
More seriously : I don't think relative numbers of package downloads can be interpreted in any reasonable way, because reasons for package download have a very wide range from curiosity (what's this ?), fun (think fortunes...), to vital need tthink lme4 if/when a consensus on denominator DFs

Re: [R] popular R packages

2009-03-08 Thread Gabor Grothendieck
On Sun, Mar 8, 2009 at 10:49 AM, hadley wickham h.wick...@gmail.com wrote: More seriously : I don't think relative numbers of package downloads can be interpreted in any reasonable way, because reasons for package download have a very wide range from curiosity (what's this ?), fun (think

Re: [R] popular R packages

2009-03-08 Thread Duncan Murdoch
On 08/03/2009 10:49 AM, hadley wickham wrote: More seriously : I don't think relative numbers of package downloads can be interpreted in any reasonable way, because reasons for package download have a very wide range from curiosity (what's this ?), fun (think fortunes...), to vital need tthink

Re: [R] popular R packages

2009-03-08 Thread Barry Rowlingson
I think the situation is worse than messy.  If a client comes in with data that doesn't address the question they're interested in, I think they are better served to be told that, than to be given an answer that is not actually valid.  They should also be told how to design a study that

Re: [R] popular R packages

2009-03-08 Thread Ted Harding
On 08-Mar-09 15:14:03, Duncan Murdoch wrote: On 08/03/2009 10:49 AM, hadley wickham wrote: More seriously : I don't think relative numbers of package downloads can be interpreted in any reasonable way, because reasons for package download have a very wide range from curiosity (what's this ?),

Re: [R] popular R packages

2009-03-08 Thread Tal Galili
Hi Ted, Coming to think about your direction - another idea came to mind: The next time a major release is made (there is one scheduled quite soon actually), the core team could add a survey on the downloading page of the R base package asking for just one question please click here if this is

Re: [R] popular R packages

2009-03-08 Thread Spencer Graves
Is this another discussion of what data might be collected and analyzed, and what could and could not be said if we only had such data? Has anyone but me produced any actual data? If so, I missed it. Hadly mentioned the 'fortunes' package. My earlier methodology,

Re: [R] popular R packages

2009-03-08 Thread Duncan Murdoch
On 08/03/2009 12:08 PM, Barry Rowlingson wrote: I think the situation is worse than messy. If a client comes in with data that doesn't address the question they're interested in, I think they are better served to be told that, than to be given an answer that is not actually valid. They should

Re: [R] popular R packages

2009-03-08 Thread Emmanuel Charpentier
Dear Barry, As far as I understand, you're telling us that having a bit of data mining does not harm whatever the data. Your example of pop music charts might support your point (although my ears disagree ...) but I think it is bad policy to indulge in white-noise analysis without a well-reasoned

Re: [R] popular R packages

2009-03-08 Thread Dirk Eddelbuettel
On 8 March 2009 at 13:27, Duncan Murdoch wrote: | But we don't even have that data, since CRAN is distributed across lots | of mirrors. On 8 March 2009 at 19:01, Emmanuel Charpentier wrote: | As far as I can see (but I might be nearsighted), I see no model linking | package download to package

Re: [R] popular R packages

2009-03-08 Thread Jeffrey Horner
Dirk Eddelbuettel wrote: On 8 March 2009 at 13:27, Duncan Murdoch wrote: | But we don't even have that data, since CRAN is distributed across lots | of mirrors. On 8 March 2009 at 19:01, Emmanuel Charpentier wrote: | As far as I can see (but I might be nearsighted), I see no model linking |

Re: [R] popular R packages

2009-03-08 Thread Rolf Turner
On 9/03/2009, at 4:14 AM, Duncan Murdoch wrote: ... analyzing bad data will just give bad conclusions. Fortune? cheers, Rolf Turner ## Attention:\ This e-mail message is privileged and

Re: [R] popular R packages

2009-03-08 Thread Wacek Kusnierczyk
Rolf Turner wrote: On 9/03/2009, at 4:14 AM, Duncan Murdoch wrote: ... analyzing bad data will just give bad conclusions. Fortune? looking for fortunes? got one for you: A key reason that R is a good thing is because it is a language who/where is left as an (easy) exercise. vQ

Re: [R] popular R packages

2009-03-08 Thread Rolf Turner
On 9/03/2009, at 10:23 AM, John Fox wrote: Dear Rolf, Tukey put it nicely: The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from a given body of data. Inasmuch as there are no current fortunes from Tukey, I

Re: [R] popular R packages

2009-03-08 Thread Emmanuel Charpentier
Le dimanche 08 mars 2009 à 13:22 -0500, Dirk Eddelbuettel a écrit : On 8 March 2009 at 13:27, Duncan Murdoch wrote: | But we don't even have that data, since CRAN is distributed across lots | of mirrors. On 8 March 2009 at 19:01, Emmanuel Charpentier wrote: | As far as I can see (but I

Re: [R] popular R packages

2009-03-08 Thread Dirk Eddelbuettel
On 8 March 2009 at 23:45, Emmanuel Charpentier wrote: | Le dimanche 08 mars 2009 13:22 -0500, Dirk Eddelbuettel a crit : | Once you have data, you have an option of using or discarding it. But if you | have no data, you have no option. How is that better? | | I question 1) the usefulness

Re: [R] popular R packages

2009-03-08 Thread Barry Rowlingson
2009/3/8 Emmanuel Charpentier charp...@bacbuc.dyndns.org: I question 1) the usefulness of the effort necessary to get the data ; and 2) the very concept of data mining, which seems to be the rationale for this proposed effort. Furthermore (but this is seriously off-topic), I seriously

Re: [R] popular R packages

2009-03-08 Thread hadley wickham
I question 1) the usefulness of the effort necessary to get the data ; and 2) the very concept of data mining, which seems to be the rationale for this proposed effort. Furthermore (but this is seriously off-topic), I seriously despise the very idea of popularity in scientific debates...

[R] popular R packages

2009-03-07 Thread Jeroen Ooms
I would like to get some idea of which R-packages are popular, and what R is used for in general. Are there any statistics available on which R packages are downloaded often, or is there something like a package-survey? Something similar to http://popcon.debian.org/ maybe? Any tips are welcome!

Re: [R] popular R packages

2009-03-07 Thread Gabor Grothendieck
This function will show which other packages depend on a particular package: dep - function(pkg, AP = available.packages()) { +pkg - paste(\\b, pkg, \\b, sep = ) +cat(Depends:, rownames(AP)[grep(pkg, AP[, Depends])], \n) +cat(Suggests:, rownames(AP)[grep(pkg, AP[, Suggests])], \n) +

Re: [R] popular R packages

2009-03-07 Thread David Winsemius
When the question arises How many R-users there are?, the consensus seems to be that there is no valid method to address the question. The thread R-business case from 2004 can be found here: https://stat.ethz.ch/pipermail/r-help/2004-March/047606.html I did not see any material revision to

Re: [R] popular R packages

2009-03-07 Thread Thomas Adams
I don't think At least one of the participants in the 2004 thread suggested that it would be a good thing to track the numbers of downloads by package. is reasonable because I download R packages for 2 home computers (laptop desktop) and 2 at work (1 Linux 1 Mac). There must be many such

Re: [R] popular R packages

2009-03-07 Thread Tal Galili
I agree with Thomas, over the years I have installed R on at least 5 computers. BTW: does any one knows how the website statistics of r-project are being analyzed? Since I can't see any google analytics or other tracking code in the main website, I am guessing someone might be running some

Re: [R] popular R packages

2009-03-07 Thread David Winsemius
Quite so. It certainly is the case that Dirk Eddelbuettel suggested would be very desirable and I think Dirk's track record speaks for itself. I never said (and I am sure Dirk never intended) that one could take the raw numbers as a basis for blandly asserting that copies of ttt

Re: [R] popular R packages

2009-03-07 Thread Jeroen Ooms
I agree with Thomas, over the years I have installed R on at least 5 computers. I don't see why per-marchine statistics would not be useful. When you installed a package on five machines, you probably use it a lot, and it is more important to you than packages that you only installed once.

Re: [R] popular R packages

2009-03-07 Thread Wacek Kusnierczyk
i have kept r installed on more than ten computers during the past few years, some of them running win + more than one linux distro, all of them having r, most often installed from a separate download. i know of many cases where students download r for the purpose of a course in statistics --

Re: [R] popular R packages

2009-03-07 Thread Spencer Graves
I just did RSiteSearch(library(xxx)) with xxx = the names of 6 packages familiar to me, with the following numbers of hits: hits package 169 lme4 165 nlme 6 fda 4 maps 2 FinTS 2 DierckxSpline Software could be written to (1) extract the names of current packages from

Re: [R] popular R packages

2009-03-07 Thread s...@xlsolutions-corp.com
...@prodsyse.com Subject: Re: [R] popular R packages To: Wacek Kusnierczyk waclaw.marcin.kusnierc...@idi.ntnu.no Cc: r-help@r-project.org, Jeroen Ooms j.c.l.o...@uu.nl, Thomas Adams thomas.ad...@noaa.gov Date: Saturday, March 7, 2009, 5:22 PM I just did RSiteSearch(library(xxx)) with xxx = the names of 6

Re: [R] popular R packages

2009-03-07 Thread Jim Lemon
Hi all, I'm kind of amazed at the answers suggested for the relatively simple question, How many times has each R package been downloaded?. Some have veered off in another direction, like working out how many packages a package depends upon, or whether someone downloads more than one copy.