Heya,
I would suggest to at least run it for a 7 day period so you capture at
least the weekly time-trends, increasing the sample size should also be
recommendable. We can help setup a udp-filter for this purpose as long as
the data can be extracted from the user-agent string.

D
On Wed, Sep 4, 2013 at 1:50 PM, Arthur Richards <[email protected]>wrote:

> Thanks Max for digging into this :)
>
> I'm no analytics guy, but I am a little concerned about the sample size
> and duration of the internal logging that we've done - sampling 1/10000 for
> only a few days for data about something we generally know usage to already
> be low seems to me like it might be difficult to get accurate numbers. Can
> someone from the analytics team chime in and let us know if the approach is
> sound and if we should trust the data Max has come up with? This has big
> implications as it will play role in determining whether or not we continue
> supporting WAP devices and providing WAP access to the sites.
>
> Thanks everyone!
>
>
> On Tue, Sep 3, 2013 at 10:40 AM, Erik Zachte <[email protected]>wrote:
>
>> Sadly you need to take squid log based reports with a grain of salt.
>> Several incomplete maintenance jobs have taken their toll.
>>
>> Each report starts with a long list of unsolved bugs.
>> Among those https://bugzilla.wikimedia.org/show_bug.cgi?id=46273
>>
>> So yeah better trust your own data.
>>
>> Erik
>>
>>
>> -----Original Message-----
>> From: [email protected] [mailto:
>> [email protected]] On Behalf Of Max Semenik
>> Sent: Tuesday, September 03, 2013 5:33 PM
>> To: [email protected]; Wikimedia developers; mobile-l
>> Subject: [Analytics] Mobile stats
>>
>> Hi, I have a few questions regarding mobile stats.
>>
>> I need to determine a real percentage of WAP browsers. At first glance,
>> [1] looks interesting: ratio of text/html to text/vnd.wap.wml is 92M /
>> 3987M = 2.3% on m.wikipedia.org. However, this contradicts the stats at
>> [2] which have different numbers and a different ratio.
>>
>> I did my own research: because during browser detection in Varnish
>> WAPness is detected mostly by looking at accept header and because our
>> current analytics infrastructure doesn't log it, I quickly whipped up a
>> code that recorded user-agent and accept of every 10,000th request for
>> mobile page views hitting apaches.
>>
>> According to several days worth of data, out of 14917 logged requests
>> 1445 contained vnd.wap.wml in Accept: headers in any form. That's more
>> than what is logged for frontend responses, however it is expected as WAP
>> should have worse cache hit rate and thus should hit apaches more often.
>>
>> Next, our WAP detection code is very simple: user-agent is checked
>> against a few major browser IDs (all of them are HTML-capable and this
>> check is not actually needed anymore and will go away soon) and if still
>> not known, we consider every device that sends Accept:
>> header "vnd.wap.wml" (but not "application/vnd.wap.xhtml+xml"), to be
>> WAP-only. If we apply these rules, we get only 68 entries that qualify as
>> WAP which is 0.05% of all mobile requests.
>>
>> The question is, what's wrong: my research or stats.wikimedia.org?
>>
>> And if it's indeed just 0.05%, we should probably^W definitely kill WAP
>> support on our mobile site as it's virtually unmaintained.
>>
>> -----
>> [1] http://stats.wikimedia.org/wikimedia/squids/SquidReportRequests.htm
>> [2] http://stats.wikimedia.org/wikimedia/squids/SquidReportClients.htm
>>
>>
>>
>> --
>> Best regards,
>>   Max Semenik ([[User:MaxSem]])
>>
>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>> _______________________________________________
>> Mobile-l mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/mobile-l
>>
>
>
>
> --
> Arthur Richards
> Software Engineer, Mobile
> [[User:Awjrichards]]
> IRC: awjr
> +1-415-839-6885 x6687
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to