https://bugzilla.wikimedia.org/show_bug.cgi?id=16603

           Summary: Write and run script to purge old rows from
                    categorylinks table on WMF wikis
           Product: Wikimedia
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: Site requests
        AssignedTo: [email protected]
        ReportedBy: [EMAIL PROTECTED]


Looking at queries run on the Toolserver's replica of the en.wiki database:

mysql> SELECT * FROM categorylinks WHERE cl_to = "Africa-related_stubs";
+---------+----------------------+----------------+---------------------+
| cl_from | cl_to                | cl_sortkey     | cl_timestamp        |
+---------+----------------------+----------------+---------------------+
|  891746 | Africa-related_stubs | Hamani Diori   | 2005-05-31 04:11:48 | 
| 1547553 | Africa-related_stubs | Patty Schnyder | 2005-02-26 17:30:45 | 
+---------+----------------------+----------------+---------------------+
2 rows in set (0.04 sec)

mysql> SELECT * FROM page WHERE page_id = 891746;
Empty set (1.22 sec)


It's clear that there are old rows in the categorylinks table that for some
reason were not purged when the page was deleted. The rows reference deleted
pages. This causes inaccuracies when things like COUNT(cl_to) are run in
database queries.

A maintenance script should be written and run on the categorylinks tables of
WMF wikis to purge these old rows.


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are watching all bug changes.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to