https://bugzilla.wikimedia.org/show_bug.cgi?id=27914

           Summary: Job queue refreshLinks2 duplicate removal
           Product: MediaWiki
           Version: 1.18-svn
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: General/Unknown
        AssignedTo: wikibugs-l@lists.wikimedia.org
        ReportedBy: tstarl...@wikimedia.org


The introduction of refreshLinks2 in r40741, while certainly useful, broke the
duplicate removal code in Job::pop(). Duplicate removal still works as long as
the page ID partitions are exactly the same. But for templates which are very
heavily used, it's normal for pages to start or stop using the template in
between closely spaced template edits. The partitioning then changes, so
job_params differs between the two sets of jobs, and duplicate removal is not
done.
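
To illustrate the failure mode, here is a minimal Python sketch. It is not the
MediaWiki code: partition(), make_jobs() and pop_without_exact_duplicates() are
hypothetical names, the batch size of 3 is arbitrary, and the exact-match
comparison is only an assumed stand-in for what Job::pop() does.

```python
def partition(page_ids, batch_size):
    """Split sorted page IDs into (start, end) ranges of at most batch_size pages."""
    ids = sorted(page_ids)
    return [(ids[i], ids[min(i + batch_size, len(ids)) - 1])
            for i in range(0, len(ids), batch_size)]


def make_jobs(title, page_ids, batch_size=3):
    """One hypothetical refreshLinks2 job per partition; the range plays the role of job_params."""
    return [("refreshLinks2", title, rng) for rng in partition(page_ids, batch_size)]


def pop_without_exact_duplicates(queue):
    """Pop the first job and drop queued jobs that are exactly identical to it."""
    job = queue[0]
    remaining = [j for j in queue[1:] if j != job]
    return job, remaining


# First template edit: six pages use the template.
first_edit = make_jobs("Template:Foo", [10, 11, 12, 13, 14, 15])
# ranges: (10, 12), (13, 15)

# Second edit shortly afterwards: page 11 has stopped using the template.
second_edit = make_jobs("Template:Foo", [10, 12, 13, 14, 15])
# ranges: (10, 13), (14, 15)

job, remaining = pop_without_exact_duplicates(first_edit + second_edit)
print(job, len(remaining))
```

Because a single page dropped out, every range boundary shifted, so none of the
second edit's jobs compares equal to the first edit's jobs and all of them stay
in the queue.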

This has been observed to cause a lot of extra work for the Wikimedia job
runners. Although English Wikipedia administrators are aware of the performance
effects of editing heavily-used templates, they often make mistakes and end up
doing several edits in a row.

Duplicate removal of this kind should be a design requirement for a rewritten
job queue system. While we are waiting for that, a maintenance script which
traverses the job table and removes duplicate jobs would be a useful stopgap
measure.
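
One possible shape for such a stopgap, sketched in Python over in-memory rows
rather than the real job table: group jobs by command and title, then merge
page ID ranges that overlap. The row layout (cmd, title, start, end) and the
function name merge_duplicate_jobs() are assumptions made for illustration, not
the actual schema or an existing script.

```python
from collections import defaultdict


def merge_duplicate_jobs(rows):
    """Merge overlapping page ID ranges per (cmd, title).

    rows: iterable of (cmd, title, start, end) tuples (hypothetical layout).
    Returns a list of tuples in the same layout, sorted by (cmd, title, start).
    """
    by_target = defaultdict(list)
    for cmd, title, start, end in rows:
        by_target[(cmd, title)].append((start, end))

    merged = []
    for (cmd, title), ranges in sorted(by_target.items()):
        ranges.sort()
        cur_start, cur_end = ranges[0]
        for start, end in ranges[1:]:
            if start <= cur_end:
                # Overlaps the current range: extend it instead of keeping both.
                cur_end = max(cur_end, end)
            else:
                merged.append((cmd, title, cur_start, cur_end))
                cur_start, cur_end = start, end
        merged.append((cmd, title, cur_start, cur_end))
    return merged


rows = [
    ("refreshLinks2", "Template:Foo", 10, 12),
    ("refreshLinks2", "Template:Foo", 13, 15),
    ("refreshLinks2", "Template:Foo", 10, 13),
    ("refreshLinks2", "Template:Foo", 14, 15),
    ("refreshLinks2", "Template:Bar", 1, 5),
]
for row in merge_duplicate_jobs(rows):
    print(row)
```

Note that merging can produce a range larger than the original batch size, so a
real script would probably need to re-split the merged ranges before writing
them back.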
