Right -- couldn't we just tag the URL? On Tue, Jan 6, 2015 at 10:10 AM, Erik Zachte <[email protected]> wrote:
> Just to clarify, this is about prefetched images which have not been shown > to the public. > > They were sent to the browser ahead of a possible request to speed things > up but in many cases never actually requested. > > > https://www.mediawiki.org/wiki/Requests_for_comment/Media_file_request_counts#Prefetched_images > > - Erik > > > > *From:* [email protected] [mailto: > [email protected]] *On Behalf Of *Toby Negrin > *Sent:* Tuesday, January 06, 2015 18:49 > *To:* A mailing list for the Analytics Team at WMF and everybody who has > an interest in Wikipedia and analytics. > *Subject:* Re: [Analytics] Making EventLogging output to a log file > instead of the DB > > > > Hi Gilles -- why won't the page view logs work by themselves for this > purpose? EL can be configured to write into Hadoop which is probably the > best way to get the throughput you need but it seems overcomplicated. > > > > -Toby > > > > On Tue, Jan 6, 2015 at 9:41 AM, Gilles Dubuc <[email protected]> wrote: > > This depends on [1] so we're not going to need that immediately, but in > order to help Erik Zachte with his RfC [2] to track unique media views in > Media Viewer, I'm going to need to use something almost exactly like > EventLogging. The main difference being that it should skip writing to the > database and write to a log file instead. > > That's because we'll be recording around 20-25M image views per day, which > would needlessly overload EventLogging for little purpose since the data > will be used for offline stats generation and doesn't need to be made > available in a relational database. Of course if storage space and > EventLogging capacity were no object, we could just use EL and keep the > ever-growing table forever, but I have the impression that we want to be > reasonable here and only write to a log, since that's what Erik needs. > > So here's the question: for a specific schema, can EventLogging work the > way it does but only record hits to a log file (maybe it already does that > before hitting the DB?) and not write to the DB? If not, how difficult > would it be to make EL capable of doing that? > > > [1] https://phabricator.wikimedia.org/T44815 > [2] > https://www.mediawiki.org/wiki/Requests_for_comment/Media_file_request_counts > > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > > > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > >
_______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
