[Bug 61063] Data loss in mobile webrequests.

2014-02-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=61063

--- Comment #3 from christ...@quelltextlich.at ---
(In reply to comment #0)
 Today, though, I encountered requests without a MIME type.

Requests without a MIME type are fine in many settings.
We're seeing many of them.

Since you seem to be able to reproduce, could you provide a short snippet
that allows to exhibit such a log line?

(I am not asking for the log line itself, but for some chain of actions that
allows us to see a log line that you are concerned about)

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 61063] Data loss in mobile webrequests.

2014-02-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=61063

--- Comment #4 from Oliver Keyes oke...@wikimedia.org ---
Alright, you want to hunt for:

*hits to uri_path /wiki/File:Thailand_Surin_locator_map.svg
*Between 2014-01-20T10:14:00 and 2014-01-20T10:15:00

(hopefully that's anonymised enough)

From that particular example, it looks like the (intact) request was a MISS
from the varnish cache's point of view, which explains the immediate repeat of
the request. Whether it's also responsible for the lack of referrer data is too
network engineer-y for me to know - but it is a potential limiter if we want to
use MIME type filtering for say, pageviews. The good news is that, assuming my
data sample is representative (and it's probably off, since it's 128k mobile
views from a specific date), this only happens about 0.03 percent of the time.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 61063] Data loss in mobile webrequests.

2014-02-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=61063

--- Comment #5 from Oliver Keyes oke...@wikimedia.org ---
*Blinks* actually, looking at that example, the MIME type is intact, it's the
referrer that's vanished. My brain is...clearly not on today.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 61063] Data loss in mobile webrequests.

2014-02-07 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=61063

--- Comment #1 from Bingle bingle-ad...@wikimedia.org ---
Prioritization and scheduling of this bug is tracked on Mingle card
https://wikimedia.mingle.thoughtworks.com/projects/analytics/cards/cards/1439

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 61063] Data loss in mobile webrequests.

2014-02-07 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=61063

--- Comment #2 from Oliver Keyes oke...@wikimedia.org ---
Further investigation:

*I went through some of the requestlogs manually and found duplicate requests,
about 9-10ms apart, the latter of which had the MIME type and referer stripped.
This could be the source of both the MIME type data loss and the referer data
loss we've seen with Special:BannerRandom hits. Matt Walker theorises that the
problem may be us consuming requestlog data from multiple layers of varnish
machines, and thus getting the same requests multiple times. I'm going to yank
out the hostnames for the weird hinky hits I've noticed to see.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l