Re: [Analytics] Do most of the articles really receive little to no edits?

2016-09-16 Thread Tilman Bayer
On Fri, Sep 16, 2016 at 6:08 AM, aaron shaw wrote: ... > > > Also relevant to Reem's original concern and the subsequent discussion here: > page views accrue to redirects even when the *content* that is viewed exists > on the page that is the target of the redirect.

Re: [Analytics] Do most of the articles really receive little to no edits?

2016-09-15 Thread Tilman Bayer
To Andrew's point about excluding redirects, see also this paper by Benjamin Mako Hill and Aaron Shaw (CCed): https://mako.cc/ copyrighteous/consider-the-redirect (don't know if they have data for Arabic Wikipedia too) In short, the distribution of edits is very different for redirects and

Re: [Analytics] Do most of the articles really receive little to no edits?

2016-09-15 Thread Dan Andreescu
Good point, updated to *exclude redirects* and rerun: total_namespace_0_revisions: 457,574,404 total_namespace_0_pages: 5,236,104 per namespace 0 non-redirect article: standard deviation of edits: *324.45* *average* edits: *87.54* standard deviation of days between first and last edit:

Re: [Analytics] Do most of the articles really receive little to no edits?

2016-09-14 Thread Andrew Gray
Hi Dan, Thanks for running these! I'm struck by the figure of 12.8m pages in ns0 - it looks like this includes redirects (there are ~7.6m ns0 redirects on enwiki, and ~5.2m articles). This will probably skew things a lot, as the majority of those will probably be edited once and never touched

Re: [Analytics] Do most of the articles really receive little to no edits?

2016-09-14 Thread Dan Andreescu
Quick follow up 'cause I was curious. I calculated the average and standard deviation for edits per namespace 0 article on enwiki. I tried to do it on the research db replicas but it took forever so I did it on the hadoop cluster. Including archived pages isn't useful, doesn't change the

[Analytics] Do most of the articles really receive little to no edits?

2016-09-07 Thread Andrew Gray
Hi Reem, Here's some rough estimates. English - https://stats.wikimedia.org/EN/TablesWikipediaEN.htm English has ~5.2 million articles, with an average of ~92 edits per article, not counting deleted edits (or deleted articles). Note that 80% of those articles are more than three years old, so

Re: [Analytics] Do most of the articles really receive little to no edits?

2016-09-07 Thread Federico Leva (Nemo)
Reem Al-Kashif, 07/09/2016 15:52: I always hear people saying that most of the articles usually receive little to no edits Do you mean that many articles * have not been edited in a long time (6+ months?), * have few revisions (that is?), or * have only a human editor or two? (and that is

[Analytics] Do most of the articles really receive little to no edits?

2016-09-07 Thread Reem Al-Kashif
Hi, I always hear people saying that most of the articles usually receive little to no edits (and that is used to encourage participants to make sure their articles are good enough). I would like to know if there are statistics that support this for the English and Arabic Wikipedia. Best, Reem