https://bugzilla.wikimedia.org/show_bug.cgi?id=68840

            Bug ID: 68840
           Summary: Wikimetrics can't run a lot of recurrent reports at
                    the same time
           Product: Analytics
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: Wikimetrics
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected],
                    [email protected], [email protected],
                    [email protected], [email protected],
                    [email protected], [email protected]
       Web browser: ---
   Mobile Platform: ---

When I added reports for all the wiki cohorts and manually kicked off the
scheduler, not all of the reports completed.  Of the 900 or so reports,
usually only about 80-160 would run before the queue simply stopped
processing them.  The queue itself gave no error message and no failure
when it stopped.  The sequence was basically this:

* process each recurrent report and create a run for each one (I could see in
the database 900+ pending reports, all recurrent runs of the wiki cohort
reports)
* presumably create the Celery group / chain constructs and call delay() on
the top-level chain.  No errors were reported here, and we can be sure of that
because this section is inside a try block.
* execute some of the reports on the queue.  Monitoring the queue log shows
this, and I could see errors for things such as MySQL being unresponsive,
labsdb databases not existing for obscure wikis, etc.  But none of those
occurred in great number, and apart from them the queue seemed to process
reports just fine until it stopped.
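
To make the three steps above concrete, here is a minimal sketch in plain
Python that uses no Celery at all.  It is not Wikimetrics code: the names
build_chain_of_groups, run_chain and fake_run_report are hypothetical, the
group size of 50 is arbitrary, and the idea that the top-level chain is made
up of groups of reports is an assumption.  It only illustrates the kind of
bookkeeping that would show whether every pending run ends up as either a
success or a recorded error.

```python
from typing import Callable, Dict, List, Tuple


def build_chain_of_groups(report_ids: List[int], group_size: int) -> List[List[int]]:
    """Split report ids into consecutive groups; the outer list stands in for the chain."""
    if group_size < 1:
        raise ValueError("group_size must be at least 1")
    return [report_ids[i:i + group_size] for i in range(0, len(report_ids), group_size)]


def run_chain(
    chain: List[List[int]],
    run_report: Callable[[int], str],
) -> Tuple[Dict[int, str], Dict[int, str]]:
    """Run each group in order, recording a result or an error for every report id."""
    results: Dict[int, str] = {}
    errors: Dict[int, str] = {}
    for grp in chain:
        for report_id in grp:
            try:
                results[report_id] = run_report(report_id)
            except Exception as exc:
                # A single failing report is recorded and does not stop the rest.
                errors[report_id] = repr(exc)
    return results, errors


def fake_run_report(report_id: int) -> str:
    """Hypothetical stand-in for a real report run; fails for every 100th id."""
    if report_id % 100 == 0:
        raise RuntimeError("database unavailable")
    return "SUCCESS"


if __name__ == "__main__":
    chain = build_chain_of_groups(list(range(1, 901)), group_size=50)
    results, errors = run_chain(chain, fake_run_report)
    accounted = len(results) + len(errors)
    print(f"{len(chain)} groups, {len(results)} succeeded, "
          f"{len(errors)} failed, {accounted} accounted for")
```

In this sketch every one of the 900 ids ends up in either results or errors.
The behaviour described in this bug is different: most runs end up in
neither, which is why a count like this, compared against the pending rows in
the database, could help narrow down where the processing stops.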

To me, this means that the error is happening somewhere in "celery land", maybe
something to do with the new group / chain addition we made...

Either way, this is not an optimization/bugfix that can wait.  Without it,
Wikimetrics simply won't be able to run the recurrent reports for each project
as we hoped.

