Some things to have in mind:

1) Bots
   AND user_agent_map['device_family'] <> "Spider"
Doesn't remove all bots, only very prominent ones, so stats still include
traffic from say, wmf robots, for example.


2) Sampling:
>Strangely, the event-logs for specific actions showed much higher traffic
for
>beta but this is likely due to different user groups?
It is worth checking whether you are gathering data in equal sample rates
for all 3 sites. For tyraffic you are comparing days for eventlogging data
you are comparing an unknown interval of time.


3) Daily versus weekly
It is also worth checking weekly figures as they might differ.

4) MobileWebUIClickTracking
I thought this schema had been split in several and was no longer in use,
again, there is no  time interval in your query.


Thanks,

Nuria

On Wed, Apr 15, 2015 at 12:14 PM, Jon Katz <[email protected]> wrote:

> Hi,
> Just following up to thank you guys.
>
> Kevin and Ottomoata guided me through getting set up on hive and I made
> the first queries I needed to!  Here were the results (if you're interested)
> Best,
>
> J
>
>
>
> 4/12 full day
>
>    *mobile_site* *Pageviews* *% of stable*  a  8,578 0.004%  b  46,705
> 0.019%  stable  244,816,000
> Strangely, the event-logs for specific actions showed much higher traffic
> for beta but this is likely due to different user groups?
> (multiple days)
>    *mobile_site* * search events* *% of stable*  alpha  18,768 0.01%  beta  
> 153,617
> 0.09%  stable  178,118,332
>
>
> queries:
>
> for alpha beta
> SELECT
>   x_analytics_map['mf-m'] AS mobile_site, count(*)
> FROM
>   wmf.webrequest
> WHERE
>   access_method = "mobile web"
>   AND YEAR = 2015
>   AND MONTH = 4
>   AND DAY = 12
>   AND user_agent_map['device_family'] <> "Spider"
>   AND is_pageview = TRUE
>   AND x_analytics_map['mf-m'] IS NOT NULL
> GROUP BY
>   x_analytics_map['mf-m']
> LIMIT 50;
>
> For stable:
> SELECT
>   count(*)
> FROM
>   wmf.webrequest
>   tablesample(BUCKET 1 OUT OF 1000 ON rand())
> WHERE
>   access_method = "mobile web"
>   AND YEAR = 2015
>   AND MONTH = 4
>   AND DAY = 12
>   AND is_pageview = TRUE
>   AND user_agent_map['device_family'] <> "Spider"
>   AND x_analytics_map['mf-m'] IS NULL
> LIMIT 50;
>
>
> For events:
>   SELECT count(uuid), event_name, event_mobileMode    FROM
> MobileWebUIClickTracking_10742159  GROUP BY event_name, event_mobileMode;
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to