Thanks. 7-9% of responses on Wikipedia Zero being WAP is pretty substantial.
On Tue, Sep 10, 2013 at 2:01 PM, Andrew Otto <[email protected]> wrote: > > These > > zero.tsv.log* > > files to which I refer seem to be, basically Varnish log lines that > > correspond to Wikipedia Zero-targeted traffic. > Yup! Correct. zero.tsv.log* files are captured unsampled and based on > the presence of a "zero=" tag in the X-Analytics header: > > > http://git.wikimedia.org/blob/operations%2Fpuppet.git/37ffb0ccc1cd7d3f5612df8779e9a3bdb69066b2/templates%2Fudp2log%2Ffilters.oxygen.erb#L10 > > > Do I understand correctly that field as Content-Type? > Yup again! The varnishncsa format string that is currently being beamed > at udp2log is here: > > > http://git.wikimedia.org/blob/operations%2Fpuppet.git/37ffb0ccc1cd7d3f5612df8779e9a3bdb69066b2/modules%2Fvarnish%2Ffiles%2Fvarnishncsa.default > > > > > > On Sep 10, 2013, at 4:25 PM, Adam Baso <[email protected]> wrote: > > > Somewhere in between, I think. > > > > Wikipedia Zero's main extension, ZeroRatedMobileAccess, relies upon the > > mobile web's main extension, MobileFrontend. Wikipedia Zero access is > > served across [lang.].zero.wikipedia.org and [lang.].m.wikipedia.org. > > > > As I understand, the general Varnish logs capture both the Wikipedia > > Zero-based and the non-Wikipedia Zero-based mobile web access. These > > zero.tsv.log* > > files to which I refer seem to be, basically Varnish log lines that > > correspond to Wikipedia Zero-targeted traffic. > > > > Wikipedia Zero for the mobile web will in all likelihood have a higher > rate > > of WAP device usage and WAP content served when compared to the general > > Wikipedia for the mobile web stats. It's likely that, to at least some > > extent, that higher WAP usage in participating Wikipedia Zero markets, > > would be washed out by the relatively higher adoption of smartphones in > > wealthier markets. > > > > Please do let me know in case of a need for further clarification! > > > > -Adam > > > > > > On Tue, Sep 10, 2013 at 4:04 AM, Gerard Meijssen > > <[email protected]>wrote: > > > >> Hoi, > >> Is the Wikipedia-Zero traffic information part of the mobile statistics > or > >> is it something completely separate thing? > >> Thanks, > >> GerardM > >> > >> > >> On 10 September 2013 03:26, Adam Baso <[email protected]> wrote: > >> > >>> Wikipedia Zero traffic (IP address and MCC/MNC matching as expected) > >> shows > >>> in one day of requests (zero.tsv.log-20130907) roughly 7-9% of page > >>> responses having a Content-Type response of "text/vnd.wap.wml", > presuming > >>> field #11 (or index 10 if you're indexing from 0) in > zero.tsv.log-<date> > >> is > >>> the Content-Type. Do I understand correctly that field as Content-Type? > >>> > >>> Thanks. > >>> -Adam > >>> > >>> > >>> On Thu, Sep 5, 2013 at 9:27 AM, Arthur Richards < > [email protected] > >>>> wrote: > >>> > >>>> Would adding the accept header to the x-analytics header be worthwhile > >>> for > >>>> this? > >>>> On Sep 5, 2013 4:16 AM, "Erik Zachte" <[email protected]> wrote: > >>>> > >>>>> For a breakdown per country, the higher the sampling rate the better, > >> as > >>>>> the data will become reliable even for smaller countries with a not > so > >>>>> great adoption rate of Wikipedia. > >>>>> > >>>>> -----Original Message----- > >>>>> From: [email protected] [mailto: > >>>>> [email protected]] On Behalf Of Max Semenik > >>>>> Sent: Thursday, September 05, 2013 12:28 PM > >>>>> To: Diederik van Liere > >>>>> Cc: A mailing list for the Analytics Team at WMF and everybody who > has > >>> an > >>>>> interest in Wikipedia and analytics.; mobile-l; Wikimedia developers > >>>>> Subject: Re: [Analytics] [WikimediaMobile] Mobile stats > >>>>> > >>>>> On 05.09.2013, 4:04 Diederik wrote: > >>>>> > >>>>>> Heya, > >>>>>> I would suggest to at least run it for a 7 day period so you capture > >>>>>> at least the weekly time-trends, increasing the sample size should > >>>>>> also be recommendable. We can help setup a udp-filter for this > >> purpose > >>>>>> as long as the data can be extracted from the user-agent string. > >>>>> > >>>>> Unfortunately, accept is no less important here. > >>>>> So, to enumerate our requirements as a result of this thread: > >>>>> * Sampling rate the same as wikistats (1/1000). > >>>>> * No less than a week worth of data. > >>>>> * User-agent: > >>>>> * Accept: > >>>>> * Country from GeoIP to determine the share of developing countries. > >>>>> * Wiki to determine if some wikis are more dependant on WAP than > other > >>>>> ones. > >>>>> > >>>>> Anything else? > >>>>> > >>>>> -- > >>>>> Best regards, > >>>>> Max Semenik ([[User:MaxSem]]) > >>>>> > >>>>> > >>>>> _______________________________________________ > >>>>> Analytics mailing list > >>>>> [email protected] > >>>>> https://lists.wikimedia.org/mailman/listinfo/analytics > >>>>> > >>>>> > >>>>> _______________________________________________ > >>>>> Mobile-l mailing list > >>>>> [email protected] > >>>>> https://lists.wikimedia.org/mailman/listinfo/mobile-l > >>>>> > >>>> > >>>> _______________________________________________ > >>>> Mobile-l mailing list > >>>> [email protected] > >>>> https://lists.wikimedia.org/mailman/listinfo/mobile-l > >>>> > >>>> > >>> _______________________________________________ > >>> Wikitech-l mailing list > >>> [email protected] > >>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l > >>> > >> _______________________________________________ > >> Wikitech-l mailing list > >> [email protected] > >> https://lists.wikimedia.org/mailman/listinfo/wikitech-l > >> > > _______________________________________________ > > Wikitech-l mailing list > > [email protected] > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l > > > _______________________________________________ > Wikitech-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wikitech-l > _______________________________________________ Wikitech-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikitech-l
