https://bugzilla.wikimedia.org/show_bug.cgi?id=38919
Web browser: ---
Bug #: 38919
Summary: "page_is_redirect" wrong on Indonesian (and probably
other non-English) wiki dumps. Doesn't recognize
localization of "#REDIRECT"
Product: Datasets
Version: unspecified
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General/Unknown
AssignedTo: [email protected]
ReportedBy: [email protected]
Classification: Unclassified
Mobile Platform: ---
Probably a duplicate of Bug #12507, and possibly related to Bug #10931 and #30513.
I downloaded the idwikibooks and idwiki dumps (e.g.
idwiki-pages-articles.xml.bz2). After importing them locally and going through
random pages several times, I stumbled on many 'broken' redirects that behave
as if the "redirect=no" parameter were set: they don't take me to the target
pages.
From what I found out, the site does not recognize pages with the
"#ALIH [[title]]" tag (Indonesian for "#REDIRECT [[title]]") as redirects,
because they are marked as "page_is_redirect = 0" in the "page" table.
Only redirects with the #REDIRECT tag are recognized (and thus redirected
properly). I've checked the special pages online and they seem to list all the
redirects properly, so this problem appears to affect only the dumps. (Note:
I've already set $wgLanguageCode = "id", so that isn't the cause.)
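To illustrate the check I would expect, here is a minimal Python sketch (not
MediaWiki's actual code) that treats a page as a redirect when its text starts
with any known alias of the redirect keyword. The alias list and the function
name redirect_target are assumptions for illustration only; the real aliases
come from the wiki's language files.

```python
import re

# Assumed alias list for illustration; a real wiki defines these per language.
REDIRECT_ALIASES = ("#REDIRECT", "#ALIH")


def redirect_target(wikitext, aliases=REDIRECT_ALIASES):
    """Return the redirect target title, or None if the text is not a redirect."""
    keyword = "|".join(re.escape(alias) for alias in aliases)
    # Keyword at the very start, optional colon, then the [[target]] link.
    pattern = re.compile(r"\s*(?:%s)\s*:?\s*\[\[([^\]|]+)" % keyword, re.IGNORECASE)
    match = pattern.match(wikitext)
    return match.group(1).strip() if match else None
```

With only "#REDIRECT" in the alias list, a page containing "#ALIH [[title]]"
yields None, which matches the "page_is_redirect = 0" rows I see after the
import.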
I can create new redirects using "#ALIH [[title]]" just fine, but I can't
fix the existing ones. I've tried purging the page and running
"refreshLinks.php --redirects-only", to no avail. A null edit gave a worse
result: the page displays "1. ALIH [[title]]", as if it were an ordered list.
This is strange, because new redirects made with "#ALIH" work, yet a null edit
on an existing "#ALIH" page produces an error.
I've also downloaded idwiki-latest-redirect.sql.gz and loaded it into the
"redirect" table, but as soon as I ran rebuildall.php, the rows were
overwritten by the data from the "page" table.
Right now my only remaining option is to run a bot locally and change every
"#ALIH" to "#REDIRECT". I hope this gets fixed in the next version.
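The text rewrite such a bot would apply could look roughly like the Python
sketch below. It only shows the string substitution; fetching and saving pages
is left out, and the function name normalize_redirect is made up for this
example.

```python
import re

# Match "#ALIH" only at the very start of the page text (after optional
# whitespace), so mentions of "#ALIH" elsewhere in a page are left alone.
_ALIH_AT_START = re.compile(r"^(\s*)#ALIH\b", re.IGNORECASE)


def normalize_redirect(wikitext):
    """Replace a leading "#ALIH" keyword with "#REDIRECT"."""
    return _ALIH_AT_START.sub(r"\1#REDIRECT", wikitext, count=1)
```

Pages that don't begin with "#ALIH" come back unchanged, so the bot could skip
saving them.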