Hi Gergo -- I like this idea.  As far as capacity, any EL-Hadoop based
solution would be basically doing the same thing as you propose.

Can you please run it past ops (especially the 404 v 204) part?

Oliver -- the issue is that we'd like to figure out a way to provide
accurate views of the media files; because of client side caching, we can't
use the current requests. But your point is a good one -- we'll need to add
this to the PV definition.

-Toby

On Thu, Feb 5, 2015 at 5:18 AM, Oliver Keyes <[email protected]> wrote:

> A nice theory, but if they appear in the webrequest table (presumably
> they would, and we're not creating an entirely new set of varnishes
> for the transmission of dummy images?) they have to be factored in.
> Again, however, the new definition automatically filters them by
> checking the webrequest source and MIME type, so this is not a
> problem, as I originally stated.
>
> On 5 February 2015 at 08:10, Erik Zachte <[email protected]> wrote:
> > Oliver, this is not about pageviews, but about media file views.
> >
> >
> >
> > These will be collected and dumped separately, as per
> >
> https://www.mediawiki.org/wiki/Requests_for_comment/Media_file_request_counts
> > .
> >
> >
> >
> > Erik
> >
> >
> >
> >
> >
> > From: [email protected]
> > [mailto:[email protected]] On Behalf Of Nuria Ruiz
> > Sent: Wednesday, February 04, 2015 22:28
> > To: A mailing list for the Analytics Team at WMF and everybody who has an
> > interest in Wikipedia and analytics.
> > Subject: Re: [Analytics] Virtual file view hack for Media Viewer views
> >
> >
> >
> >>We would add a rule to Vagrant to make sure it does not try to look up
> such
> >> requests in Swift but returns a 404 immediately.
> >
> > I bet ops would like it a lot better if this is a 204 and it kind of
> makes
> > sense as it is the code used for beacons and such. Otherwise they might
> get
> > alarms on 404s increasing.
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> > On Wed, Feb 4, 2015 at 12:38 PM, Oliver Keyes <[email protected]>
> wrote:
> >
> > Not really; the new pageviews definition wouldn't include those files
> > anyway. It seems silly, thought, be deliberately generating a large
> > amount of automated noise and client requests for this :/.
> >
> >
> > On 4 February 2015 at 15:00, Gergo Tisza <[email protected]> wrote:
> >> Hi all,
> >>
> >> Erik Zachte is working on file view stats and is looking for a way to
> >> track
> >> Media Viewer image views (for which there is no 1:1 relation between
> >> server
> >> hits and actual image views); after some back and forth in
> >> https://phabricator.wikimedia.org/T86914 I proposed the following hack:
> >>
> >> whenever the javascript code in MediaViewer determines that an image
> view
> >> happened (e.g. an image has been displayed for a certain amount of
> time),
> >> it
> >> makes a request to a certain fake image, say
> >> upload.wikimedia.org/wikipedia/commons/thumb/0/00/Virtual-imageview-
> <real
> >> image name>/<size>px-thumbnail.<ext> . These hits can than be easily
> >> filtered from the varnish request logs and added to the normal requests.
> >> We
> >> would add a rule to Vagrant to make sure it does not try to look up such
> >> requests in Swift but returns a 404 immediately.
> >>
> >> This would be a temporary workaround until there is a proper way to log
> >> virtual image views, such as EventLogging with a non-SQL backend.
> >>
> >> Do you see any fundamental problem with this?
> >>
> >
> >> _______________________________________________
> >> Analytics mailing list
> >> [email protected]
> >> https://lists.wikimedia.org/mailman/listinfo/analytics
> >>
> >
> >
> >
> > --
> > Oliver Keyes
> > Research Analyst
> > Wikimedia Foundation
> >
> > _______________________________________________
> > Analytics mailing list
> > [email protected]
> > https://lists.wikimedia.org/mailman/listinfo/analytics
> >
> >
> >
> >
> > _______________________________________________
> > Analytics mailing list
> > [email protected]
> > https://lists.wikimedia.org/mailman/listinfo/analytics
> >
>
>
>
> --
> Oliver Keyes
> Research Analyst
> Wikimedia Foundation
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to