Eventlogging data currently does go to files, as well as to the DB. Check it out on stat1003 at /srv/eventlogging/archive.
If you need something with higher throughput then eventlogging itself supports…then let’s talk :D -Ao > On Jan 6, 2015, at 13:28, Erik Zachte <[email protected]> wrote: > > You mean attach an X-analytics parameter, for extra images beyond the one the > user initially requested. > > But then we would undercount, basically missing all image views from clicking > right arrow in image viewer. > I'm not sure how much we would miss then. > iirc Gilles said this browsing feature was used quite a long, but I'm not > sure. > > From: [email protected] > [mailto:[email protected]] On Behalf Of Toby Negrin > Sent: Tuesday, January 06, 2015 19:16 > To: A mailing list for the Analytics Team at WMF and everybody who has an > interest in Wikipedia and analytics. > Subject: Re: [Analytics] Making EventLogging output to a log file instead of > the DB > > Right -- couldn't we just tag the URL? > > On Tue, Jan 6, 2015 at 10:10 AM, Erik Zachte <[email protected] > <mailto:[email protected]>> wrote: > Just to clarify, this is about prefetched images which have not been shown to > the public. > They were sent to the browser ahead of a possible request to speed things up > but in many cases never actually requested. > https://www.mediawiki.org/wiki/Requests_for_comment/Media_file_request_counts#Prefetched_images > > <https://www.mediawiki.org/wiki/Requests_for_comment/Media_file_request_counts#Prefetched_images> > - Erik > > From: [email protected] > <mailto:[email protected]> > [mailto:[email protected] > <mailto:[email protected]>] On Behalf Of Toby Negrin > Sent: Tuesday, January 06, 2015 18:49 > To: A mailing list for the Analytics Team at WMF and everybody who has an > interest in Wikipedia and analytics. > Subject: Re: [Analytics] Making EventLogging output to a log file instead of > the DB > > Hi Gilles -- why won't the page view logs work by themselves for this > purpose? EL can be configured to write into Hadoop which is probably the best > way to get the throughput you need but it seems overcomplicated. > > -Toby > > On Tue, Jan 6, 2015 at 9:41 AM, Gilles Dubuc <[email protected] > <mailto:[email protected]>> wrote: > This depends on [1] so we're not going to need that immediately, but in order > to help Erik Zachte with his RfC [2] to track unique media views in Media > Viewer, I'm going to need to use something almost exactly like EventLogging. > The main difference being that it should skip writing to the database and > write to a log file instead. > > That's because we'll be recording around 20-25M image views per day, which > would needlessly overload EventLogging for little purpose since the data will > be used for offline stats generation and doesn't need to be made available in > a relational database. Of course if storage space and EventLogging capacity > were no object, we could just use EL and keep the ever-growing table forever, > but I have the impression that we want to be reasonable here and only write > to a log, since that's what Erik needs. > > So here's the question: for a specific schema, can EventLogging work the way > it does but only record hits to a log file (maybe it already does that before > hitting the DB?) and not write to the DB? If not, how difficult would it be > to make EL capable of doing that? > > [1] https://phabricator.wikimedia.org/T44815 > <https://phabricator.wikimedia.org/T44815> > [2] > https://www.mediawiki.org/wiki/Requests_for_comment/Media_file_request_counts > <https://www.mediawiki.org/wiki/Requests_for_comment/Media_file_request_counts> > > _______________________________________________ > Analytics mailing list > [email protected] <mailto:[email protected]> > https://lists.wikimedia.org/mailman/listinfo/analytics > <https://lists.wikimedia.org/mailman/listinfo/analytics> > > > _______________________________________________ > Analytics mailing list > [email protected] <mailto:[email protected]> > https://lists.wikimedia.org/mailman/listinfo/analytics > <https://lists.wikimedia.org/mailman/listinfo/analytics> > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
