https://bugzilla.wikimedia.org/show_bug.cgi?id=51254

Robert Rohde <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
                 CC|                            |[email protected]
         Resolution|FIXED                       |---

--- Comment #8 from Robert Rohde <[email protected]> ---
Sorry to add to what I'm sure was a bit of a hectic day for someone, but I'm
still seeing lingering bits of corruption.  Perhaps some sort of edge case that
wasn't handled correctly by the rebuild?  99.9% of tags may be okay at this
point, but here are some example that still seem to be errors.

A API query of 200 revisions tags as flagged as "blanking":

http://en.wikipedia.org/w/api.php?action=query&list=recentchanges&rctag=blanking&rclimit=200&rcprop=user%7Ccomment%7Ctitle%7Ctags%7Ctimestamp|ids&rccontinue=2013-07-12T22:20:40Z|589061595

While this query returns 200 entries, we find that only 188 of them report as
actually having the "blanking" tag.

The remainder are things like 
  rcid="590123889" timestamp="2013-07-12T14:30:16Z"
  <tag>visualeditor</tag>

  rcid="590032703" timestamp="2013-07-12T00:33:31Z" 
  <tag>mobile edit</tag>

Where some other tag is reported but the expected "blanking" tag is not
reported.

For another example of this issue see the API query for the
"visualeditor-needcheck" tag:

http://en.wikipedia.org/w/api.php?action=query&list=recentchanges&rctag=visualeditor-needcheck&rclimit=200&rcprop=user%7Ccomment%7Ctitle%7Ctags%7Ctimestamp|ids

This tag should only be applied if the "visualeditor" tag is also present, but
we observe that most of the results have either "visualeditor" or
"visualeditor-needcheck" but not both.  A few entries even have other tags
entirely.


What appears to have happened is that rebuild didn't correctly handle cases
where a single revision was subject to multiple tags.  Instead it looks as
though the rebuilt table applies at most one tag to each of the historical
revisions.  Most of the time that's okay since few revisions actually have
multiple tags, but it still leaves a bit of corruption and missing data on the
rare cases when a revision is expected to have multiple tags.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to