https://bugzilla.wikimedia.org/show_bug.cgi?id=60184
Web browser: ---
Bug ID: 60184
Summary: Analytics: Can we start quoting our logging fields?
Product: Analytics
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General/Unknown
Assignee: wikibugs-l@lists.wikimedia.org
Reporter: oke...@wikimedia.org
CC: christ...@quelltextlich.at, dvanli...@gmail.com
Classification: Unclassified
Mobile Platform: ---

I'm sat here looking at a 6MB user agent field. It's not /actually/ a 6MB user agent field, it's a user agent field where some browser designer decided "let's put tabs in our UA, that won't cause anyone any problems!" and so, of course, the tab-separated files we store our logs in happily escaped it, meaning that when the TSV was read in, the field overflowed.

In the absence of hunting down the people who made that decision at the browser end and forcing them to use the internet through an early and experimental IE version for all of time, could we start quoting the fields in the request logs? I'm not sure how Erik Z reads his files in, but if it's tab-sensitive we're potentially looking at a data loss issue with wikistats. If it's not, we're looking at a data loss issue with my work. Either is to be avoided ;p.

Obviously VK will solve for this once it's dealing with the whole firehose.
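
To illustrate the request, here is a minimal sketch in Python of how an unquoted tab inside a user agent shifts the field boundaries, and how quoting every field avoids that. The record values and variable names are hypothetical and are not taken from the actual request logs; the real log writer and readers may behave differently.

```python
import csv
import io

# Hypothetical log record: the user agent (last field) contains a literal tab.
record = ["2014-01-17T00:00:00", "203.0.113.7", "Mozilla/5.0\t(Example)"]

# Naive TSV: join on tabs with no quoting or escaping, then split on read.
naive_line = "\t".join(record)
naive_fields = naive_line.split("\t")

# Quoted TSV: every field is wrapped in quotes, so the embedded tab is
# treated as data rather than as a delimiter.
buf = io.StringIO()
writer = csv.writer(buf, delimiter="\t", quoting=csv.QUOTE_ALL, lineterminator="\n")
writer.writerow(record)
quoted_line = buf.getvalue()

# A quote-aware reader recovers the original three fields.
quoted_fields = next(csv.reader(io.StringIO(quoted_line), delimiter="\t"))

print(len(naive_fields))        # 4: the user agent was split in two
print(len(quoted_fields))       # 3
print(quoted_fields == record)  # True
```

Quoting only helps if every consumer reads the files with a quote-aware parser; a reader that splits on raw tabs would still see four fields.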