On Fri, Sep 16, 2016 at 6:08 AM, aaron shaw wrote:
...
>
>
> Also relevant to Reem's original concern and the subsequent discussion here:
> page views accrue to redirects even when the *content* that is viewed exists
> on the page that is the target of the redirect.
To Andrew's point about excluding redirects, see also this paper by
Benjamin Mako Hill and Aaron Shaw (CCed): https://mako.cc/
copyrighteous/consider-the-redirect (don't know if they have data for
Arabic Wikipedia too)
In short, the distribution of edits is very different for redirects and
Good point, updated to *exclude redirects* and rerun:
total_namespace_0_revisions: 457,574,404
total_namespace_0_pages: 5,236,104
per namespace 0 non-redirect article:
standard deviation of edits: *324.45*
*average* edits: *87.54*
standard deviation of days between first and last edit:
Hi Dan,
Thanks for running these!
I'm struck by the figure of 12.8m pages in ns0 - it looks like this
includes redirects (there are ~7.6m ns0 redirects on enwiki, and ~5.2m
articles). This will probably skew things a lot, as the majority of
those will probably be edited once and never touched
Quick follow up 'cause I was curious. I calculated the average and
standard deviation for edits per namespace 0 article on enwiki. I tried to
do it on the research db replicas but it took forever so I did it on the
hadoop cluster. Including archived pages isn't useful, doesn't change the
Hi Reem,
Here's some rough estimates.
English - https://stats.wikimedia.org/EN/TablesWikipediaEN.htm
English has ~5.2 million articles, with an average of ~92 edits per
article, not counting deleted edits (or deleted articles). Note that 80% of
those articles are more than three years old, so
Reem Al-Kashif, 07/09/2016 15:52:
I always hear people saying that most of the articles usually receive
little to no edits
Do you mean that many articles
* have not been edited in a long time (6+ months?),
* have few revisions (that is?), or
* have only a human editor or two?
(and that is
Hi,
I always hear people saying that most of the articles usually receive
little to no edits (and that is used to encourage participants to make sure
their articles are good enough). I would like to know if there are
statistics that support this for the English and Arabic Wikipedia.
Best,
Reem