https://bugzilla.wikimedia.org/show_bug.cgi?id=69371

--- Comment #6 from nuria <[email protected]> ---
>Parsing all zero.tsv* files i noticed a large number 
>of other strange items - highly broken URLs that still return miss/200 result.

I think we are mixing issues here. I will address this comment, but let's try
to focus on the 'orghttp' problem first.

The fact that "highly broken" URLs return miss/200 doesn't seem that strange if
we are caching negative image lookups on the edge caches and returning 200 for
those (a negative lookup being a request for an image we do not have at the
origin). So, without knowing more about our caching infrastructure, I would not
flag this as a problem. Let's get back to this once we have taken a deeper look
at the "orghttp" issue.


>Various IP addresses and random hosts (e.g. 0.facebook.com) keep appearing, 
>and get resolved just fine by the backend, even though they clearly shouldn't.

There are requests with "0.facebook.com" in the User_Agent (facebook-bot) and
requests with "facebook" in the referrer. I see nothing obviously wrong with
either.

Then there are (very few) requests like:
http://zero.wikipedia.org/uc6.ucweb.com:80/?0.facebook.com/?zout=1&ucid=6883926810-f3e4e4a2
that seem to come from the same kind of phone.
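
If it helps to count how many requests have this shape, here is a rough
sketch in Python that flags URLs whose first path segment looks like a
host[:port], as in the example above. The function name and the regex are my
own assumptions, not anything taken from the zero.tsv tooling, and ordinary
file names with a dot (favicon.ico, say) would also match, so treat it as a
coarse filter only.

```python
import re
from urllib.parse import urlsplit

# Hypothetical pattern: a dotted hostname or an IPv4 address, optionally
# followed by ":port". Deliberately loose; it is not a full validator.
HOST_LIKE = re.compile(
    r"^(?:(?:[A-Za-z0-9-]+\.)+[A-Za-z]{2,}|\d{1,3}(?:\.\d{1,3}){3})(?::\d+)?$"
)


def embedded_host(url):
    """Return the first path segment if it looks like host[:port], else None."""
    path = urlsplit(url).path
    first_segment = path.lstrip("/").split("/", 1)[0]
    return first_segment if HOST_LIKE.match(first_segment) else None


if __name__ == "__main__":
    sample = (
        "http://zero.wikipedia.org/uc6.ucweb.com:80/"
        "?0.facebook.com/?zout=1&ucid=6883926810-f3e4e4a2"
    )
    print(embedded_host(sample))
    print(embedded_host("http://zero.wikipedia.org/wiki/Main_Page"))
```

Running it over the URL column of the logs (whichever column that is) would
give a quick count of these requests before we decide whether they matter.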
