https://bugzilla.wikimedia.org/show_bug.cgi?id=33253
--- Comment #19 from Donald Lancon <[email protected]> 2012-05-17 00:56:34 UTC ---

I should point out that I'm really only _assuming_ this is true (about the 'content=""' string), since it seems to match (more or less) what I've been told about which namespaces count as content on which projects.

Note, however, that there is quite a bit of variation in this. For example, when you, Erik, told me at [[m:Talk:Wikimedia News#Using Wikipedia Statistics to fill in gaps]] that "102 = Author, 104 = Page, 106 = Index" count as content on Wikisource, that's true of the English Wikisource, but not necessarily of the others. Not all Wikisources even use the same namespace numbers for the same purposes: in the Estonian Wikisource, for example, 102 = Page, 104 = Index, and 106 = Author (and these are all marked as "content" in the API query results); and in the Turkish Wikisource, 100 = Author, and that's the only namespace other than main (ns0) marked as "content". So does this mean not even Wikistats is counting the articles correctly?? [g]

As part of my investigation into the large shifts in "on-wiki" article counts alluded to above, I've started to fill in a large table at [[m:Talk:Wikimedia News#May 10 article count updates]] with some relevant info, including which namespaces are marked as "content" in the API results, how many non-redirect pages are (or appear to be, approximately) in each, and an estimate of what percentage of these should count as "articles" by the "at least one link" standard (plus a lot of other stuff; note, BTW, that the table only contains wikis that passed or dropped below article-count "milestones").

I'm also in the process of downloading all the relevant database dumps, which should allow me to calculate "exactly" many of the numbers in that table that are currently only estimates (in essence duplicating what I assume your script[s] do, Erik, but for dumps made just before and just after May 10th, not only at the end of the month).
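
For anyone who wants to repeat the per-wiki check of which namespaces are marked as "content", here is a minimal sketch in Python. It assumes the JSON form of a siteinfo query (api.php?action=query&meta=siteinfo&siprop=namespaces&format=json), where a content namespace carries a "content" key. The function name and the sample data are hypothetical: the sample is hand-written from the Estonian Wikisource numbers quoted above, with English labels standing in for the real namespace names, and is not actual API output.

```python
from typing import Any, Dict, List


def content_namespace_ids(siteinfo: Dict[str, Any]) -> List[int]:
    """Return the sorted IDs of namespaces flagged as content.

    `siteinfo` is the parsed JSON of a siteinfo/namespaces query.
    """
    namespaces = siteinfo["query"]["namespaces"]
    ids = []
    for ns in namespaces.values():
        # Older JSON output marks content namespaces with "content": "";
        # formatversion=2 output uses "content": true/false instead.
        # A missing key or an explicit False both mean "not content".
        flag = ns.get("content", False)
        if flag is not False:
            ids.append(int(ns["id"]))
    return sorted(ids)


# Hypothetical sample shaped like an API response, built by hand from the
# namespace numbers quoted in this comment (not fetched from the wiki).
SAMPLE_ET_WIKISOURCE = {
    "query": {
        "namespaces": {
            "0": {"id": 0, "*": "", "content": ""},
            "1": {"id": 1, "*": "Talk"},
            "102": {"id": 102, "*": "Page", "content": ""},
            "104": {"id": 104, "*": "Index", "content": ""},
            "106": {"id": 106, "*": "Author", "content": ""},
        }
    }
}

if __name__ == "__main__":
    print(content_namespace_ids(SAMPLE_ET_WIKISOURCE))
```

Running the same function over the response from each wiki would give the "content" column of the table directly, without assuming that the English Wikisource numbering applies elsewhere.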
