> These zero.tsv.log* files to which I refer seem to be, basically Varnish
> log lines that correspond to Wikipedia Zero-targeted traffic.
Yup!  Correct.  The zero.tsv.log* files are captured unsampled; a request is
included based on the presence of a "zero=" tag in the X-Analytics header:

http://git.wikimedia.org/blob/operations%2Fpuppet.git/37ffb0ccc1cd7d3f5612df8779e9a3bdb69066b2/templates%2Fudp2log%2Ffilters.oxygen.erb#L10
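If it helps to reason about which lines end up in those files, here is a rough sketch of the same idea in Python. It is not the actual filter (that is the one linked above); the function name has_zero_tag is made up for illustration, and I'm assuming the X-Analytics value is a list of key=value pairs separated by semicolons.

```python
def has_zero_tag(x_analytics: str) -> bool:
    """Return True if the X-Analytics value contains a zero=<...> pair.

    Assumption: the header value looks like "key1=val1;key2=val2".
    """
    for pair in x_analytics.split(";"):
        key, sep, _value = pair.strip().partition("=")
        # Require an exact key match so that e.g. "notzero=1" is not counted.
        if key == "zero" and sep:
            return True
    return False


if __name__ == "__main__":
    # Hypothetical header values, for illustration only.
    for header in ("zero=250-99;foo=bar", "foo=bar", "-"):
        print(header, has_zero_tag(header))
```

Matching on the exact key rather than a plain substring search avoids false positives from other keys that merely end in "zero".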

> Do I understand correctly that field as Content-Type?
Yup again!  The varnishncsa format string that is currently being beamed at 
udp2log is here:

http://git.wikimedia.org/blob/operations%2Fpuppet.git/37ffb0ccc1cd7d3f5612df8779e9a3bdb69066b2/modules%2Fvarnish%2Ffiles%2Fvarnishncsa.default
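For the WAP share itself, something along these lines should reproduce the count, assuming the file is tab-separated and the Content-Type really is field #11 (index 10). The function name wap_share and the command-line usage are only illustrative, and this counts every line rather than only page responses, so treat it as a starting point.

```python
import sys

WAP_TYPE = "text/vnd.wap.wml"
CONTENT_TYPE_INDEX = 10  # field #11 when counting from 1


def wap_share(lines, index=CONTENT_TYPE_INDEX):
    """Return the fraction of lines whose Content-Type field is WML."""
    total = 0
    wap = 0
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) <= index:
            continue  # skip short or malformed lines
        total += 1
        # Drop parameters such as "; charset=utf-8" before comparing.
        media_type = fields[index].split(";")[0].strip().lower()
        if media_type == WAP_TYPE:
            wap += 1
    return wap / total if total else 0.0


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python wap_share.py zero.tsv.log-20130907
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        print(f"{wap_share(handle):.1%}")
```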





On Sep 10, 2013, at 4:25 PM, Adam Baso <ab...@wikimedia.org> wrote:

> Somewhere in between, I think.
> 
> Wikipedia Zero's main extension, ZeroRatedMobileAccess, relies upon the
> mobile web's main extension, MobileFrontend. Wikipedia Zero access is
> served across [lang.].zero.wikipedia.org and [lang.].m.wikipedia.org.
> 
> As I understand, the general Varnish logs capture both the Wikipedia
> Zero-based and the non-Wikipedia Zero-based mobile web access. These
> zero.tsv.log* files to which I refer seem to be, basically Varnish log
> lines that correspond to Wikipedia Zero-targeted traffic.
> 
> Wikipedia Zero for the mobile web will in all likelihood have a higher rate
> of WAP device usage and WAP content served when compared to the general
> Wikipedia for the mobile web stats. It's likely that, to at least some
> extent, the higher WAP usage in participating Wikipedia Zero markets
> would be washed out by the relatively higher adoption of smartphones in
> wealthier markets.
> 
> Please do let me know in case of a need for further clarification!
> 
> -Adam
> 
> 
> On Tue, Sep 10, 2013 at 4:04 AM, Gerard Meijssen
> <gerard.meijs...@gmail.com> wrote:
> 
>> Hoi,
>> Is the Wikipedia-Zero traffic information part of the mobile statistics or
>> is it a completely separate thing?
>> Thanks,
>>     GerardM
>> 
>> 
>> On 10 September 2013 03:26, Adam Baso <ab...@wikimedia.org> wrote:
>> 
>>> Wikipedia Zero traffic (IP address and MCC/MNC matching as expected) shows
>>> in one day of requests (zero.tsv.log-20130907) roughly 7-9% of page
>>> responses having a Content-Type response of "text/vnd.wap.wml", presuming
>>> field #11 (or index 10 if you're indexing from 0) in zero.tsv.log-<date> is
>>> the Content-Type. Do I understand correctly that field as Content-Type?
>>> 
>>> Thanks.
>>> -Adam
>>> 
>>> 
>>> On Thu, Sep 5, 2013 at 9:27 AM, Arthur Richards <aricha...@wikimedia.org>
>>> wrote:
>>> 
>>>> Would adding the accept header to the x-analytics header be worthwhile
>>>> for this?
>>>> On Sep 5, 2013 4:16 AM, "Erik Zachte" <ezac...@wikimedia.org> wrote:
>>>> 
>>>>> For a breakdown per country, the higher the sampling rate the better, as
>>>>> the data will become reliable even for smaller countries with a not so
>>>>> great adoption rate of Wikipedia.
>>>>> 
>>>>> -----Original Message-----
>>>>> From: analytics-boun...@lists.wikimedia.org [mailto:
>>>>> analytics-boun...@lists.wikimedia.org] On Behalf Of Max Semenik
>>>>> Sent: Thursday, September 05, 2013 12:28 PM
>>>>> To: Diederik van Liere
>>>>> Cc: A mailing list for the Analytics Team at WMF and everybody who has an
>>>>> interest in Wikipedia and analytics.; mobile-l; Wikimedia developers
>>>>> Subject: Re: [Analytics] [WikimediaMobile] Mobile stats
>>>>> 
>>>>> On 05.09.2013, 4:04 Diederik wrote:
>>>>> 
>>>>>> Heya,
>>>>>> I would suggest running it for at least a 7-day period so you capture
>>>>>> at least the weekly time trends; increasing the sample size would
>>>>>> also be advisable. We can help set up a udp-filter for this purpose
>>>>>> as long as the data can be extracted from the user-agent string.
>>>>> 
>>>>> Unfortunately, accept is no less important here.
>>>>> So, to enumerate our requirements as a result of this thread:
>>>>> * Sampling rate the same as wikistats (1/1000).
>>>>> * No less than a week's worth of data.
>>>>> * User-agent:
>>>>> * Accept:
>>>>> * Country from GeoIP to determine the share of developing countries.
>>>>> * Wiki to determine if some wikis are more dependent on WAP than
>>>>>  others.
>>>>> 
>>>>> Anything else?
>>>>> 
>>>>> --
>>>>> Best regards,
>>>>>  Max Semenik ([[User:MaxSem]])
>>>>> 
>>>>> 
>>>>> _______________________________________________
>>>>> Analytics mailing list
>>>>> analyt...@lists.wikimedia.org
>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>> 
>>>>> 
>>>>> _______________________________________________
>>>>> Mobile-l mailing list
>>>>> mobil...@lists.wikimedia.org
>>>>> https://lists.wikimedia.org/mailman/listinfo/mobile-l
>>>>> 
>>>> 
>>>> 
>>> _______________________________________________
>>> Wikitech-l mailing list
>>> Wikitech-l@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>>> 

