https://bugzilla.wikimedia.org/show_bug.cgi?id=16880
Summary: Interwiki links should not be handled like local/static pages
Product: MediaWiki extensions
Version: any
Platform: All
OS/Version: All
Status: NEW
Severity: enhancement
Priority: Normal
Component: DumpHTML
AssignedTo: [email protected]
ReportedBy: [email protected]
Interwiki links are handled like local links rather than like external links.
Consequently, the static HTML pages contain dead local URLs.
What I do:
I run dumpHTML.php on the "Wikipedia" article
(http://en.wikipedia.org/wiki/Wikipedia).
What I get:
The interwiki links coming from "Template:Wikipedias"
(http://en.wikipedia.org/wiki/Template:Wikipedias), which is included in the
"Wikipedia" article, are rewritten as local URLs:
-------
(...)
<tr style="">
<td class="navbox-group" style="">750,000+</td>
<td style="border-left: 2px solid rgb(253, 253, 253); padding: 0px; text-align:
left; width: 100%;" class="navbox-list navbox-even">
<div style="padding: 0em 0.25em;"><span style="white-space: nowrap;"><a
href="../../../../articles/g/e/r/German_Wikipedia_58de.html" title="German
Wikipedia">German</a> <a href="../../../../../de/index.html" class="extiw"
title="de:">de:</a></span></div>
</td>
</tr>
(...)
-------
What I want:
-------
(...)
</tr>
<tr style="">
<td class="navbox-group" style="">750,000+</td>
<td style="border-left: 2px solid rgb(253, 253, 253); padding: 0px; text-align:
left; width: 100%;" class="navbox-list navbox-even">
<div style="padding: 0em 0.25em;"><span style="white-space: nowrap;"><a
href="../../../../articles/g/e/r/German_Wikipedia_58de.html" title="German
Wikipedia">German</a> <a href="http://de.wikipedia.org" class="extiw"
title="de:">de:</a></span></div>
</td>
</tr>
(...)
-------
The difference:
* In the first case (what I get): <a href="../../../../../de/index.html"
class="extiw" title="de:">de:</a>
* In the second case (what I want): <a href="http://de.wikipedia.org"
class="extiw" title="de:">de:</a>
Remark:
IMO the issue comes from the "GetFullURL" hook in dumpHTML.inc; removing it
(and the then-useless onGetFullURL()) seems to resolve the issue.
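
To make the expected behaviour concrete, here is a minimal sketch of the
rewriting rule I have in mind. It is an illustration only, not DumpHTML code:
the function name rewrite_link, the INTERWIKI_URLS table and the "/wiki/" path
layout are all assumptions made up for this example.

```python
# Hypothetical illustration only: none of these names exist in DumpHTML.
# Assumed mapping from interwiki prefix to the external base URL.
INTERWIKI_URLS = {
    "de": "http://de.wikipedia.org",
}


def rewrite_link(target: str, local_path: str) -> str:
    """Return the href to emit in the static dump.

    target     -- link target as written in the wiki, e.g. "de:" or
                  "German Wikipedia"
    local_path -- relative static path that would be used for a local page
    """
    prefix, sep, rest = target.partition(":")
    if sep and prefix in INTERWIKI_URLS:
        # Interwiki link: point at the external wiki, never at a local file.
        base = INTERWIKI_URLS[prefix]
        if not rest:
            return base
        # Assumed article path layout on the remote wiki.
        return f"{base}/wiki/{rest.replace(' ', '_')}"
    # Anything else (including namespaces such as "Template:") stays local.
    return local_path
```

With this rule, the "de:" link from the example above would keep its external
URL, while the "German Wikipedia" link would still point at the local static
page.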