https://bugzilla.wikimedia.org/show_bug.cgi?id=60184

       Web browser: ---
            Bug ID: 60184
           Summary: Analytics: Can we start quoting our logging fields?
           Product: Analytics
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: General/Unknown
          Assignee: wikibugs-l@lists.wikimedia.org
          Reporter: oke...@wikimedia.org
                CC: christ...@quelltextlich.at, dvanli...@gmail.com
    Classification: Unclassified
   Mobile Platform: ---

I'm sat here looking at a 6MB user agent field. It's not /actually/ a 6MB user
agent field; it's a user agent field where some browser designer decided "let's
put tabs in our UA, that won't cause anyone any problems!" and so, of course,
the tab-separated files we store our logs in happily wrote it out unescaped,
meaning that when the TSV was read in, the field overflowed.
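
To make the failure mode concrete, here is a minimal Python sketch. The
three-column layout (ip, user agent, status) and the sample values are invented
for illustration and are not the real request log format.

```python
# Hypothetical three-column log line: ip, user agent, status.
# The layout and values are invented for illustration only.

def naive_tsv_line(fields):
    """Join fields with tabs, with no quoting or escaping."""
    return "\t".join(fields)

def naive_parse(line):
    """Split on every tab, as a simple TSV reader would."""
    return line.split("\t")

# A well-behaved user agent round-trips into three fields.
good = naive_parse(naive_tsv_line(["10.0.0.1", "ExampleBrowser/1.0", "200"]))

# A user agent containing a tab is split in two, shifting every later column.
bad = naive_parse(naive_tsv_line(["10.0.0.1", "Example\tBrowser/1.0", "200"]))

print(len(good))  # 3
print(len(bad))   # 4
```

Every field after the embedded tab lands in the wrong column, which is how one
stray tab can make a later field appear to swallow far more data than it should.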

In the absence of hunting down the people who made that decision at the browser
end and forcing them to use the internet through an early and experimental IE
version for all of time, could we start quoting the fields in the request logs?
I'm not sure how Erik Z reads his files in, but if it's tab-sensitive we're
potentially looking at a data loss issue with wikistats. If it's not, we're
looking at a data loss issue with my work. Either is to be avoided ;p.
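
As a rough sketch of what quoting would buy us, the snippet below uses Python's
standard csv module to write and read tab-separated rows with every field
quoted. The helper names and sample row are hypothetical, and the real logs are
not written by Python; the point is only that a quote-aware reader keeps an
embedded tab inside its field.

```python
import csv
import io

def write_quoted_tsv(rows):
    """Serialise rows as TSV with every field wrapped in double quotes."""
    buf = io.StringIO()
    writer = csv.writer(
        buf, delimiter="\t", quoting=csv.QUOTE_ALL, lineterminator="\n"
    )
    writer.writerows(rows)
    return buf.getvalue()

def read_quoted_tsv(text):
    """Parse quoted TSV; tabs inside quotes stay part of the field."""
    return list(csv.reader(io.StringIO(text), delimiter="\t"))

# Hypothetical row: the user agent contains both a tab and double quotes.
rows = [["10.0.0.1", 'Example\tBrowser/1.0 "beta"', "200"]]

text = write_quoted_tsv(rows)
parsed = read_quoted_tsv(text)

print(parsed == rows)  # True
```

The catch is that every consumer has to read the files with a quote-aware
parser; anything that still splits on raw tabs would see the same breakage.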

Obviously VK will solve for this once it's dealing with the whole firehose.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
