That should be fine, please give us a heads up when you deploy the instrumenting.
On Thu, Jan 8, 2015 at 2:52 PM, Ryan Kaldari <[email protected]> wrote: > After further discussion, we've decided to just show WikiGrok to a > fraction of users during the test. I currently have it set to show WikiGrok > to 10 out of every 62 users or ~16% (the userToken is a base 62 number). > That should give us an estimated 27 hits per second. Does that work for > everyone? > > Kaldari > > On Thu, Jan 8, 2015 at 2:06 PM, Nuria Ruiz <[email protected]> wrote: > >> We cannot guarantee that with 60 events a sec things will still work well >> (as I said we should schedule some perf testing on this regard so I filed >> an item for this purpose: https://phabricator.wikimedia.org/T86244) >> >> See that now we go beyond 300 events per sec here and there: >> http://ibin.co/1nTsNYc1bekd >> >> I recommend sampling those events 1:10. >> >> Thanks, >> >> >> Nuria >> >> On Thu, Jan 8, 2015 at 12:06 PM, Ryan Kaldari <[email protected]> >> wrote: >> >>> After talking with Dario and Leila we decided that we will sample the >>> page-impression event at 1:1000. We would, however, like to retain the >>> widget-impression event unsampled if possible. That event happens >>> approximately 50% as often as page-impression. So we're probably talking >>> about somewhere around 60 events per second in that case. Would that be >>> acceptable or should we sample the widget-impression event as well? >>> >>> Kaldari >>> >>> On Wed, Jan 7, 2015 at 5:33 PM, Leila Zia <[email protected]> wrote: >>> >>>> Thanks, Nuria! >>>> >>>> On Wed, Jan 7, 2015 at 5:30 PM, Ryan Kaldari <[email protected]> >>>> wrote: >>>> >>>>> Thanks everyone for the research on this! I'll go ahead and create a >>>>> card for implementing sampling on the high-throughput WikiGrok events. >>>>> >>>>> Kaldari >>>>> >>>>> On Wed, Jan 7, 2015 at 5:20 PM, Nuria Ruiz <[email protected]> >>>>> wrote: >>>>> >>>>>> Sorry, I send it too soon, trying again: >>>>>> >>>>>> >We're talking about a total of ~170 events per sec for these pages. >>>>>> This is to high to log in 1:1 rate, we would need to do 1:10. At this >>>>>> time most events on EL logging log at a much lower rate, events over 1 >>>>>> per >>>>>> sec are the following, as you can see mobile & media viewer are the >>>>>> majority of the throughput. >>>>>> >>>>>> My preference would be to be less than 400 events per sec until we >>>>>> have done some perf testing to make sure we can handle it (we might be >>>>>> able >>>>>> to as we have done many improvements since we set these thresholds) >>>>>> >>>>>> MobileWebClickTracking 41.35% (114.15/sec) >>>>>> MediaViewer 21.66% (59.78/sec) >>>>>> MobileWikiAppToCInteraction 12.44% (34.35/sec) >>>>>> PageContentSaveComplete 3.39% (9.35/sec) >>>>>> EchoInteraction 2.69% (7.42/sec) >>>>>> NavigationTiming 2.51% (6.93/sec) >>>>>> MultimediaViewerNetworkPerformance 1.84% (5.07/sec) >>>>>> SaveTiming 1.58% (4.37/sec) >>>>>> Edit 1.39% (3.83/sec) >>>>>> PersonalBar 1.24% (3.43/sec) >>>>>> TimingData 0.83% (2.28/sec) >>>>>> MobileWebUIClickTracking 0.73% (2.02/sec) >>>>>> Popups 0.68% (1.87/sec) >>>>>> MobileWikiAppOnboarding 0.62% (1.70/sec) >>>>>> MultimediaViewerDimensions 0.61% (1.68/sec) >>>>>> UniversalLanguageSelector 0.50% (1.37/sec) >>>>>> PageCreation 0.50% (1.37/sec) >>>>>> MultimediaViewerDuration 0.47% (1.30/sec) >>>>>> MobileWebEditing 0.45% (1.25/sec) >>>>>> MobileWikiAppSearch 0.41% (1.13/sec) >>>>>> CentralAuth 0.40% (1.12/sec) >>>>>> >>>>>> On Wed, Jan 7, 2015 at 5:12 PM, Nuria Ruiz <[email protected]> >>>>>> wrote: >>>>>> >>>>>>> >We're talking about a total of ~170 events per sec for these pages. >>>>>>> This is to high to log in 1:1 rate, we would need to do 1:10. >>>>>>> >>>>>>> On Wed, Jan 7, 2015 at 4:10 PM, Leila Zia <[email protected]> >>>>>>> wrote: >>>>>>> >>>>>>>> Thanks everyone for chiming in. Your comments were very helpful. :-) >>>>>>>> >>>>>>>> Nuria, I checked the per second pageview count for the pages >>>>>>>> wikigrok will be live on for 3 hours in 2015-01-07 (as a sample). We're >>>>>>>> talking about a total of ~170 events per sec for these pages. Of course >>>>>>>> major events can affect this number. This number added to the current >>>>>>>> 270 >>>>>>>> events per sec you mentioned will send us over the 350 events per sec >>>>>>>> limit >>>>>>>> (if it's a hard limit). What do you think? >>>>>>>> >>>>>>>> Leila >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Wed, Jan 7, 2015 at 10:13 AM, Nuria Ruiz <[email protected]> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> >Given that information, do you have any idea if we are in danger >>>>>>>>> of overloading EventLogging? >>>>>>>>> Logging broad events (such a page load) 1 to 1 might incur into >>>>>>>>> problems as our traffic is high enough that events logged1/1000 happen >>>>>>>>> still in very large amounts. >>>>>>>>> >>>>>>>>> Some numbers (oversimplyfying and rounding) >>>>>>>>> >>>>>>>>> We have about 200 million visits per day for the enwiki mobile >>>>>>>>> site . This means about 2300 pageviews per sec, if we are sending 1 >>>>>>>>> load >>>>>>>>> event per pageview EL will (sadly) die, most likely. >>>>>>>>> >>>>>>>>> If we assume EL handles up to 350 events per second (and now we >>>>>>>>> are at 270 events per sec) I would think that sending 10 events per >>>>>>>>> sec on >>>>>>>>> your case would be pretty safe. That would be sampling about 1/200 >>>>>>>>> for a >>>>>>>>> load event per every pageview. This seems like a good upper bound. >>>>>>>>> >>>>>>>>> Now, since there are no constrains as to how long you keep your >>>>>>>>> experiment running you can try a lower sampling ratio, say, 1/1000 >>>>>>>>> and keep >>>>>>>>> the experiment running for longer. >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Tue, Jan 6, 2015 at 5:50 PM, Ryan Kaldari < >>>>>>>>> [email protected]> wrote: >>>>>>>>> >>>>>>>>>> The highest volume events we are going to log will be: >>>>>>>>>> 1. For each of the 166,000 articles, one event when the page loads >>>>>>>>>> 2. For each of the 166,000 articles, one event when the WikiGrok >>>>>>>>>> widget enters the viewport (about half as often as #1) >>>>>>>>>> >>>>>>>>>> These will be active for all mobile users, logged in and logged >>>>>>>>>> out, including many high pageview articles. >>>>>>>>>> >>>>>>>>>> Given that information, do you have any idea if we are in danger >>>>>>>>>> of overloading EventLogging? If so, do you have recommendations on >>>>>>>>>> sampling? So far, everyone has said not to worry about it, but it >>>>>>>>>> would be >>>>>>>>>> good to get a sanity check for this test specifically. >>>>>>>>>> >>>>>>>>>> Kaldari >>>>>>>>>> >>>>>>>>>> On Tue, Jan 6, 2015 at 4:57 PM, Nuria Ruiz <[email protected]> >>>>>>>>>> wrote: >>>>>>>>>> >>>>>>>>>>> (cc-ing mobile-tech) >>>>>>>>>>> >>>>>>>>>>> Since we do not the details of how wikigrok is used and its >>>>>>>>>>> throughput of requests we can not "estimate" sampling ourselves. I >>>>>>>>>>> imagine >>>>>>>>>>> wikigrok is been deployed to a number of users and it is with that >>>>>>>>>>> usage >>>>>>>>>>> the mobile team could estimate the total throughput expected, with >>>>>>>>>>> this >>>>>>>>>>> throughput we can recommend sampling ratios. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Thanks for asking about this without before deploying! >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On Tue, Jan 6, 2015 at 4:55 PM, Ryan Kaldari < >>>>>>>>>>> [email protected]> wrote: >>>>>>>>>>> >>>>>>>>>>>> I can elaborate on this after I finished the SWAT >>>>>>>>>>>> deployment.... Gimme 30 minutes or so. >>>>>>>>>>>> >>>>>>>>>>>> On Tue, Jan 6, 2015 at 4:51 PM, Leila Zia <[email protected]> >>>>>>>>>>>> wrote: >>>>>>>>>>>> >>>>>>>>>>>>> Hi, >>>>>>>>>>>>> >>>>>>>>>>>>> The mobile team is planning to switch WikiGrok on for >>>>>>>>>>>>> non-logged in users next week (2014-01-12). The widget will be on >>>>>>>>>>>>> on >>>>>>>>>>>>> 166,029 article pages in enwiki. There are two EventLogging >>>>>>>>>>>>> schema that may >>>>>>>>>>>>> collect data heavily and we want to make sure EL can handle the >>>>>>>>>>>>> influx of >>>>>>>>>>>>> data. >>>>>>>>>>>>> >>>>>>>>>>>>> The two schema collecting data are: >>>>>>>>>>>>> https://meta.wikimedia.org/wiki/Schema:MobileWebWikiGrok >>>>>>>>>>>>> https://meta.wikimedia.org/wiki/Schema:MobileWebWikiGrokError >>>>>>>>>>>>> and the list of pages affected is in: >>>>>>>>>>>>> wgq_page in enwiki.wikigrok_questions. >>>>>>>>>>>>> >>>>>>>>>>>>> It would be great if someone from the dev side let us know >>>>>>>>>>>>> whether we will need sampling. >>>>>>>>>>>>> >>>>>>>>>>>>> Thanks, >>>>>>>>>>>>> Leila >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> _______________________________________________ >>>>>>>>>>>> Analytics mailing list >>>>>>>>>>>> [email protected] >>>>>>>>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> _______________________________________________ >>>>>>>>>>> Analytics mailing list >>>>>>>>>>> [email protected] >>>>>>>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> _______________________________________________ >>>>>>>>>> Analytics mailing list >>>>>>>>>> [email protected] >>>>>>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>>>>>>> _______________________________________________ >>>>>>>>> Analytics mailing list >>>>>>>>> [email protected] >>>>>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> Analytics mailing list >>>>>>>> [email protected] >>>>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> Analytics mailing list >>>>>> [email protected] >>>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>>> >>>>>> >>>>> >>>>> _______________________________________________ >>>>> Analytics mailing list >>>>> [email protected] >>>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>>> >>>>> >>>> >>>> _______________________________________________ >>>> Analytics mailing list >>>> [email protected] >>>> https://lists.wikimedia.org/mailman/listinfo/analytics >>>> >>>> >>> >>> _______________________________________________ >>> Analytics mailing list >>> [email protected] >>> https://lists.wikimedia.org/mailman/listinfo/analytics >>> >>> >> >> _______________________________________________ >> Analytics mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/analytics >> >> > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > >
_______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
