https://bugzilla.wikimedia.org/show_bug.cgi?id=33253

--- Comment #19 from Donald Lancon <[email protected]> 2012-05-17 00:56:34 UTC ---
I should point out that I'm really only _assuming_ this is true (about the
'content=""' string), since it seems to match (more or less) what I've been
told about what namespaces count as content on what projects.

Note, however, that there is quite a bit of variation in this. For example,
when you, Erik, told me at [[m:Talk:Wikimedia News#Using Wikipedia Statistics
to fill in gaps]] that "102 = Author, 104 = Page, 106 = Index" count as content
on Wikisource, that's true about the English Wikisource, but not necessarily
the others. Not all Wikisources even use the same namespace numbers for the
same purposes: in the Estonian Wikisource, for example, 102 = Page, 104 =
Index, and 106 = Author (and these are all marked as "content" in the API query
results); and in the Turkish Wikisource, 100 = Author, and that's the only
namespace other than main (ns0) marked as "content".
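
For anyone who wants to check this per wiki, here is a minimal sketch of how the
'content=""' marker could be read out of a siteinfo namespaces result in XML
form. The sample XML is hand-written for illustration (it borrows the Estonian
Wikisource numbering described above with English labels) and is not real API
output; the function name content_namespaces is likewise just a name made up
for this example.

```python
import xml.etree.ElementTree as ET

# Hand-written sample shaped like siteinfo namespaces output in XML form.
# This is an assumption for illustration, NOT data fetched from a real wiki.
SAMPLE_XML = """<api><query><namespaces>
  <ns id="0" case="first-letter" content="" />
  <ns id="1" case="first-letter" subpages="">Talk</ns>
  <ns id="102" case="first-letter" content="">Page</ns>
  <ns id="104" case="first-letter" content="">Index</ns>
  <ns id="106" case="first-letter" content="">Author</ns>
</namespaces></query></api>"""


def content_namespaces(xml_text):
    """Return {namespace id: name} for every <ns> carrying a content attribute."""
    root = ET.fromstring(xml_text)
    result = {}
    for ns in root.iter("ns"):
        # The attribute value is an empty string; only its presence matters.
        if "content" in ns.attrib:
            # The main namespace has no name, so its element text is None.
            result[int(ns.get("id"))] = ns.text or ""
    return result


print(content_namespaces(SAMPLE_XML))
```

Running the same function over each project's own namespace list would show
directly where the "content" sets differ from one wiki to the next.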

So does this mean not even Wikistats is counting the articles correctly?? [g]

As part of my investigation into the large shifts in "on-wiki" article counts
alluded to above, I've started to fill in a large table at [[m:Talk:Wikimedia
News#May 10 article count updates]] with some relevant info, including what
namespaces are marked as "content" in the API results, how many non-redirect
pages are (or appear to be, approximately) in each, and an estimate of what
percentage of these should count as "articles" by the "at least one link"
standard (plus a lot of other stuff -- note, BTW, that the table only contains
wikis that passed or dropped below article-count "milestones").
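
To make the "at least one link" standard concrete, here is a rough sketch of
the kind of count I have in mind: a page counts as an article if it is in a
content namespace, is not a redirect, and contains at least one internal link.
The tuple layout, the regular expression, and the name count_articles are all
assumptions for illustration; the check that MediaWiki or Wikistats actually
performs may well differ in its details.

```python
import re

# Crude approximation of "contains at least one internal link":
# any [[...]] with non-empty content and no nested brackets.
LINK_RE = re.compile(r"\[\[[^\[\]]+\]\]")


def count_articles(pages, content_ns):
    """Count pages that are in a content namespace, are not redirects,
    and contain at least one [[...]] link.

    pages: iterable of (namespace id, is_redirect, wikitext) tuples
           (a hypothetical layout chosen for this example).
    content_ns: set of namespace ids treated as content.
    """
    count = 0
    for ns, is_redirect, text in pages:
        if ns in content_ns and not is_redirect and LINK_RE.search(text):
            count += 1
    return count


# Made-up sample pages, not taken from any real wiki.
pages = [
    (0, False, "A page with a [[link]]."),
    (0, False, "No links here."),
    (0, True, "#REDIRECT [[Target]]"),
    (102, False, "See [[Other page]]."),
    (1, False, "Talk page with a [[link]]."),
]

print(count_articles(pages, {0, 102}))
```

Changing the set passed as content_ns is all it takes to see how much the
total moves when a namespace is added to or dropped from the "content" list.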

I'm also in the process of downloading all the relevant database dumps that
should allow me to calculate "exactly" many of the numbers in that table that
are currently only estimates (in essence duplicating what I assume your
script[s] do, Erik, but for dumps made just before and just after May 10th, not
only at the end of the month).

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l