> These > zero.tsv.log* > files to which I refer seem to be, basically Varnish log lines that > correspond to Wikipedia Zero-targeted traffic. Yup! Correct. zero.tsv.log* files are captured unsampled and based on the presence of a "zero=" tag in the X-Analytics header:
http://git.wikimedia.org/blob/operations%2Fpuppet.git/37ffb0ccc1cd7d3f5612df8779e9a3bdb69066b2/templates%2Fudp2log%2Ffilters.oxygen.erb#L10 > Do I understand correctly that field as Content-Type? Yup again! The varnishncsa format string that is currently being beamed at udp2log is here: http://git.wikimedia.org/blob/operations%2Fpuppet.git/37ffb0ccc1cd7d3f5612df8779e9a3bdb69066b2/modules%2Fvarnish%2Ffiles%2Fvarnishncsa.default On Sep 10, 2013, at 4:25 PM, Adam Baso <ab...@wikimedia.org> wrote: > Somewhere in between, I think. > > Wikipedia Zero's main extension, ZeroRatedMobileAccess, relies upon the > mobile web's main extension, MobileFrontend. Wikipedia Zero access is > served across [lang.].zero.wikipedia.org and [lang.].m.wikipedia.org. > > As I understand, the general Varnish logs capture both the Wikipedia > Zero-based and the non-Wikipedia Zero-based mobile web access. These > zero.tsv.log* > files to which I refer seem to be, basically Varnish log lines that > correspond to Wikipedia Zero-targeted traffic. > > Wikipedia Zero for the mobile web will in all likelihood have a higher rate > of WAP device usage and WAP content served when compared to the general > Wikipedia for the mobile web stats. It's likely that, to at least some > extent, that higher WAP usage in participating Wikipedia Zero markets, > would be washed out by the relatively higher adoption of smartphones in > wealthier markets. > > Please do let me know in case of a need for further clarification! > > -Adam > > > On Tue, Sep 10, 2013 at 4:04 AM, Gerard Meijssen > <gerard.meijs...@gmail.com>wrote: > >> Hoi, >> Is the Wikipedia-Zero traffic information part of the mobile statistics or >> is it something completely separate thing? >> Thanks, >> GerardM >> >> >> On 10 September 2013 03:26, Adam Baso <ab...@wikimedia.org> wrote: >> >>> Wikipedia Zero traffic (IP address and MCC/MNC matching as expected) >> shows >>> in one day of requests (zero.tsv.log-20130907) roughly 7-9% of page >>> responses having a Content-Type response of "text/vnd.wap.wml", presuming >>> field #11 (or index 10 if you're indexing from 0) in zero.tsv.log-<date> >> is >>> the Content-Type. Do I understand correctly that field as Content-Type? >>> >>> Thanks. >>> -Adam >>> >>> >>> On Thu, Sep 5, 2013 at 9:27 AM, Arthur Richards <aricha...@wikimedia.org >>>> wrote: >>> >>>> Would adding the accept header to the x-analytics header be worthwhile >>> for >>>> this? >>>> On Sep 5, 2013 4:16 AM, "Erik Zachte" <ezac...@wikimedia.org> wrote: >>>> >>>>> For a breakdown per country, the higher the sampling rate the better, >> as >>>>> the data will become reliable even for smaller countries with a not so >>>>> great adoption rate of Wikipedia. >>>>> >>>>> -----Original Message----- >>>>> From: analytics-boun...@lists.wikimedia.org [mailto: >>>>> analytics-boun...@lists.wikimedia.org] On Behalf Of Max Semenik >>>>> Sent: Thursday, September 05, 2013 12:28 PM >>>>> To: Diederik van Liere >>>>> Cc: A mailing list for the Analytics Team at WMF and everybody who has >>> an >>>>> interest in Wikipedia and analytics.; mobile-l; Wikimedia developers >>>>> Subject: Re: [Analytics] [WikimediaMobile] Mobile stats >>>>> >>>>> On 05.09.2013, 4:04 Diederik wrote: >>>>> >>>>>> Heya, >>>>>> I would suggest to at least run it for a 7 day period so you capture >>>>>> at least the weekly time-trends, increasing the sample size should >>>>>> also be recommendable. We can help setup a udp-filter for this >> purpose >>>>>> as long as the data can be extracted from the user-agent string. >>>>> >>>>> Unfortunately, accept is no less important here. >>>>> So, to enumerate our requirements as a result of this thread: >>>>> * Sampling rate the same as wikistats (1/1000). >>>>> * No less than a week worth of data. >>>>> * User-agent: >>>>> * Accept: >>>>> * Country from GeoIP to determine the share of developing countries. >>>>> * Wiki to determine if some wikis are more dependant on WAP than other >>>>> ones. >>>>> >>>>> Anything else? >>>>> >>>>> -- >>>>> Best regards, >>>>> Max Semenik ([[User:MaxSem]]) >>>>> >>>>> >>>>> _______________________________________________ >>>>> Analytics mailing list >>>>> analyt...@lists.wikimedia.org >>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>> >>>>> >>>>> _______________________________________________ >>>>> Mobile-l mailing list >>>>> mobil...@lists.wikimedia.org >>>>> https://lists.wikimedia.org/mailman/listinfo/mobile-l >>>>> >>>> >>>> _______________________________________________ >>>> Mobile-l mailing list >>>> mobil...@lists.wikimedia.org >>>> https://lists.wikimedia.org/mailman/listinfo/mobile-l >>>> >>>> >>> _______________________________________________ >>> Wikitech-l mailing list >>> Wikitech-l@lists.wikimedia.org >>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l >>> >> _______________________________________________ >> Wikitech-l mailing list >> Wikitech-l@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wikitech-l >> > _______________________________________________ > Wikitech-l mailing list > Wikitech-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikitech-l _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l