Is there any way you can look at traffic for particular pages? If so, you could look at traffic to something like Special:Watchlist or Special:UserLogin on a representative sample of wikis - anyone using these two pages is very likely to represent a logged-in user, and traffic numbers to them are high enough you might get useful data even with the sampling limits.
Andrew. On 15 October 2014 21:50, Oliver Keyes <[email protected]> wrote: > Darnit. Ah well! Okay; finished building the code to retrieve this data. > Takes ~400 seconds to handle a day of logs, so take into account > parallelisation and I should (should!) have something to show in a couple of > hours for the first 3 Qs. The fourth, it seems, is beyond our ken. > > On 15 October 2014 15:54, Max Semenik <[email protected]> wrote: >> >> There's no data for IE6 in EventLogging because IE6 gets no JS these days. >> Maybe, if there's old enough data... >> >> On Wed, Oct 15, 2014 at 12:46 PM, Oliver Keyes <[email protected]> >> wrote: >>> >>> Update: Yuvi's pointed me towards a login attempts schema. All 4 are >>> doable. Data tomorrow morning EST at the latest. >>> >>> On 15 October 2014 15:19, Oliver Keyes <[email protected]> wrote: >>>> >>>> (With "jiffy" read "a day"; even with sampling, big logs are big, and I >>>> imagine we probably want ~30 days of data.) >>>> >>>> On 15 October 2014 15:18, Oliver Keyes <[email protected]> wrote: >>>>> >>>>> First three are pretty trivial; last one is a bit of a pain, but doable >>>>> if someone wants to poke me on IRC (/query Ironholds) and chat about what >>>>> an >>>>> unambiguous successful login action would look like in terms of requests. >>>>> But I can do the first three in a jiffy. >>>>> >>>>> On 15 October 2014 13:32, Brandon Black <[email protected]> wrote: >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Oct 15, 2014 at 3:43 PM, Oliver Keyes <[email protected]> >>>>>> wrote: >>>>>>> >>>>>>> You invoked my name! >>>>>>> >>>>>>> Emphasis is "logged-in". If you guys want more solid overall numbers, >>>>>>> I can get those in short order; this seems like a pretty critical >>>>>>> question >>>>>>> to have data on, fast. Lemme know. >>>>>> >>>>>> >>>>>> If you can source some good reliable numbers, probably what we care >>>>>> about (all of which have been estimated to some degree in this thread >>>>>> already, I think?) is: >>>>>> >>>>>> % of all requests from IE6 >>>>>> % of all https requests from IE6 >>>>>> % of all text/html https requests from IE6 (not so important IMHO, if >>>>>> it's difficult) >>>>>> % of all logged-in https requests (or alternatively, % of all >>>>>> successful https login attempts) from IE6. >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> Oliver Keyes >>>>> Research Analyst >>>>> Wikimedia Foundation >>>> >>>> >>>> >>>> >>>> -- >>>> Oliver Keyes >>>> Research Analyst >>>> Wikimedia Foundation >>> >>> >>> >>> >>> -- >>> Oliver Keyes >>> Research Analyst >>> Wikimedia Foundation >>> >>> _______________________________________________ >>> Analytics mailing list >>> [email protected] >>> https://lists.wikimedia.org/mailman/listinfo/analytics >>> >> >> >> >> -- >> Best regards, >> Max Semenik ([[User:MaxSem]]) >> >> _______________________________________________ >> Analytics mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/analytics >> > > > > -- > Oliver Keyes > Research Analyst > Wikimedia Foundation > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > -- - Andrew Gray [email protected] _______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
