Hey Gilles,

 

Thanks much for getting this done. 
We hope to announce the media file requests dump any moment now.

The initial release will be without this new data, but I hope we can 
incorporate it asap.

 

Cheers,

Erik



From: Nuria Ruiz [mailto:[email protected]] 
Sent: Thursday, March 19, 2015 17:47
To: A mailing list for the Analytics Team at WMF and everybody who has an 
interest in Wikipedia and analytics.
Cc: Erik Zachte
Subject: Re: [Analytics] Virtual file view hack for Media Viewer views

 

>What happens after that hasn't been configured and is starting to be way 
>beyond the scope of what the Multimedia team should be involved with.

True, but we need to know on our end whether some special treatment needs to be 
applied to this data.  

 

For example, if  these have to be counted as pageviews we should know. By Erik 
Z's reply early on we assume these are not to be counted as pageviews but 
rather "media files views" so they will "sit" on the refined tables with 
is_Pageview=0 and some code will harvest those to count them as "media file 
request counts".

 

Thanks, 

 

Nuria

 

 

 

 

On Thu, Mar 19, 2015 at 6:35 AM, Andrew Otto <[email protected]> wrote:

Any webrequest is picked up by varnishkafka and goes into the webrequest logs 
in the cluster.  If you want special treatment of your request, say different 
formatting or different logs, we’ll have to do something else :)

 

 

 

On Mar 19, 2015, at 04:45, Gilles Dubuc <[email protected]> wrote:

 

Hi Nuria,

As far as I'm aware, it only goes to the varnish logs at the moment. What 
happens after that hasn't been configured and is starting to be way beyond the 
scope of what the Multimedia team should be involved with.

CCing Ori who knows whether the hits to the beacon URI are picked up by 
varnishkafka or not at the moment, since he set up the endpoint.

 

On Wed, Mar 18, 2015 at 3:42 PM, Nuria Ruiz <[email protected]> wrote:

Gilles:

 

And we know this data is coming via varnishkafka into the cluster, right? Did 
we checked that?

 

Thanks, 

 

Nuria

 

 

 

On Wed, Mar 18, 2015 at 2:02 AM, Gilles Dubuc <[email protected]> wrote:

Our solution for this is now live.

Here's an example of a media beacon hit:

http://bits.wikimedia.org/beacon/media?duration=3709 
<http://bits.wikimedia.org/beacon/media?duration=3709&uri=http%3A%2F%2Fupload.beta.wmflabs.org%2Fwikipedia%2Fen%2Fthumb%2Fb%2Fb0%2FSunrise_over_fishing_boats_in_Kerala.jpg%2F640px-Sunrise_over_fishing_boats_in_Kerala.jpg>
 
&uri=http%3A%2F%2Fupload.beta.wmflabs.org%2Fwikipedia%2Fen%2Fthumb%2Fb%2Fb0%2FSunrise_over_fishing_boats_in_Kerala.jpg%2F640px-Sunrise_over_fishing_boats_in_Kerala.jpg

Beta is currently hitting that endpoint and production wikis will start doing 
the same once they start running 1.25wmf22

All views coming from Media Viewer will be hitting that endpoint. Note that 
there might be some loss of hits on browsers that don't support sendBeacon, 
since our fallback is a simple async AJAX request (we haven't tried to go 
beyond that with local storage and replaying the event, etc.) and this event 
might be fired in situations of tab/browser close as well as navigating away 
from the page. Thus keep in mind that a steady small increase of those hits 
over a long period of time might simply be the natural process of people 
upgrading their browsers to more modern ones.

 

On Fri, Feb 6, 2015 at 4:46 PM, Nuria Ruiz <[email protected]> wrote:

>A dummy image request seems rather reasonable. (I assume varnish can handle 
>such load of "atypical" requests.)

Right, the filtering for beacons is already in place in vcl and responses are 
sent right away so as far as I know there is no better place than varnish for 
this code. See example:

https://github.com/wikimedia/operations-puppet/blob/production/templates/varnish/bits.inc.vcl.erb#L24

 

 

On Thu, Feb 5, 2015 at 10:38 PM, Federico Leva (Nemo) <[email protected]> 
wrote:

Gergo Tisza, 04/02/2015 21:00:

Do you see any fundamental problem with this?


A dummy image request seems rather reasonable. (I assume varnish can handle 
such load of "atypical" requests.)
Making additional requests is ugly, but until we get SPDY our articles 
typically make dozens or hundreds requests, so the effect looks negligible.

Nemo



_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

 


_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

 


_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

 


_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

 

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

 


_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

 

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to