https://bugzilla.wikimedia.org/show_bug.cgi?id=70721

--- Comment #12 from Bartosz Dziewoński <[email protected]> ---
(In reply to Erik Zachte from comment #11)
> I find 1587 lines with index.html, of which only 34 without curid.
> 
> Most lines are like https://en.wikipedia.org/wiki/index.html?curid=32681660

So these requests don't actually visit index.html; it's just the stats that are wrong.

Using 34/1587 (about 2%) as the fraction of real visits, we arrive at about
1000 hits per day. This is comparable with other articles on these subjects,
like "Web server" or "HTTP". This thousand includes both humans and bots, right?
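
For reference, the scaling I'm doing is roughly the following. This is only a
sketch: estimate_real_hits and its parameter names are made up for
illustration, and the reported daily total has to come from the stats
themselves (I'm not restating it here).

```python
# Sample counts quoted from comment #11.
SAMPLE_SIZE = 1587        # log lines that mention index.html
REAL_IN_SAMPLE = 34       # of those, lines without a curid parameter

def estimate_real_hits(reported_hits, real_in_sample=REAL_IN_SAMPLE,
                       sample_size=SAMPLE_SIZE):
    """Scale a reported hit count by the fraction of sampled lines
    that were real index.html visits (hypothetical helper)."""
    return reported_hits * real_in_sample / sample_size

# Fraction of sampled lines that are real visits, roughly 0.021.
fraction = REAL_IN_SAMPLE / SAMPLE_SIZE
```

Calling estimate_real_hits(n) with the reported daily count n gives the
estimate of real visits per day.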


> Out of 1587 only 68 had a user agent that did not contain crawl,spider,bot
> or http (http is by unofficial convention only used for bots)

I'm curious how many of the non-curid URLs are non-bots.
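
Something like the following would answer that, assuming the sampled log lines
can be reduced to (url, user_agent) pairs. That record layout and all the
function names are my assumptions, not how the stats scripts actually work.
The bot test is the substring check described in comment #11.

```python
from urllib.parse import urlparse, parse_qs

# Substrings that mark a user agent as a bot, per comment #11.
BOT_MARKERS = ("crawl", "spider", "bot", "http")

def is_bot(user_agent):
    """True if the user agent contains any of the bot markers."""
    ua = user_agent.lower()
    return any(marker in ua for marker in BOT_MARKERS)

def has_curid(url):
    """True if the URL carries a curid query parameter."""
    return "curid" in parse_qs(urlparse(url).query)

def count_human_non_curid(records):
    """Count records that are neither curid redirects nor bots.

    records: iterable of (url, user_agent) pairs (assumed layout).
    """
    return sum(1 for url, ua in records
               if not has_curid(url) and not is_bot(ua))
```

Running count_human_non_curid over the 1587 sampled lines would give the
number of presumably human visits to the actual article.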


Either way, this seems to be just a stats issue and we do not actually have
millions of humans every month accidentally learning everything about webserver
directory indices. I suggest re-closing this bug as WONTFIX. Steve?
