https://bugzilla.wikimedia.org/show_bug.cgi?id=16880

           Summary: Interwiki links should not be handled like local/static
                    pages
           Product: MediaWiki extensions
           Version: any
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: DumpHTML
        AssignedTo: [email protected]
        ReportedBy: [email protected]


Interwiki links are handled like local links rather than like external links.
Consequently, the static HTML pages contain dead local URLs.

What I do:
I run dumpHTML.php on the "Wikipedia" article
(http://en.wikipedia.org/wiki/Wikipedia).

What I get:
The interwiki links coming from "Template:Wikipedias"
(http://en.wikipedia.org/wiki/Template:Wikipedias), which is included in the
"Wikipedia" article, are rewritten as local URLs:

-------
(...)
<tr style="">
<td class="navbox-group" style="">750,000+</td>
<td style="border-left: 2px solid rgb(253, 253, 253); padding: 0px; text-align:
left; width: 100%;" class="navbox-list navbox-even">
<div style="padding: 0em 0.25em;"><span style="white-space: nowrap;"><a
href="../../../../articles/g/e/r/German_Wikipedia_58de.html" title="German
Wikipedia">German</a> <a href="../../../../../de/index.html" class="extiw"
title="de:">de:</a></span></div>
</td>
</tr>
(...)
-------

What I want:

-------
(...)
</tr>
<tr style="">
<td class="navbox-group" style="">750,000+</td>
<td style="border-left: 2px solid rgb(253, 253, 253); padding: 0px; text-align:
left; width: 100%;" class="navbox-list navbox-even">
<div style="padding: 0em 0.25em;"><span style="white-space: nowrap;"><a
href="../../../../articles/g/e/r/German_Wikipedia_58de.html" title="German
Wikipedia">German</a> <a href="http://de.wikipedia.org" class="extiw"
title="de:">de:</a></span></div>
</td>
</tr>
(...)
-------

The difference:
* In the first case: <a href="../../../../../de/index.html" class="extiw"
title="de:">de:</a>
* In the second case: <a href="http://de.wikipedia.org" class="extiw"
title="de:">de:</a>

Remark:
IMO the issue comes from the "GetFullURL" hook in dumpHTML.inc. Removing it
(and the useless onGetFullURL()) seems to resolve the issue.
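
To make the expected behaviour concrete, here is a minimal sketch in Python
(not the extension's PHP, and not taken from dumpHTML.inc). The interwiki
table, the "/wiki/" path and the simplified static path scheme are all
assumptions made for illustration only:

```python
# Hypothetical illustration; none of these names exist in DumpHTML.
INTERWIKI_MAP = {
    "de": "http://de.wikipedia.org",  # assumed prefix -> base URL table
}


def local_static_path(title: str) -> str:
    """Simplified stand-in for the static file path of a local page."""
    return "articles/" + title.replace(" ", "_") + ".html"


def rewrite_href(link_target: str) -> str:
    """Return the href to emit in the dump for a wiki link target."""
    prefix, sep, rest = link_target.partition(":")
    if sep and prefix.lower() in INTERWIKI_MAP:
        # Interwiki link: keep it external, like any other external URL.
        base = INTERWIKI_MAP[prefix.lower()]
        if not rest:
            return base
        # The "/wiki/" article path is an assumption, not read from config.
        return base + "/wiki/" + rest.replace(" ", "_")
    # Local link: rewrite to the static file.
    return local_static_path(link_target)
```

With this sketch, the "de:" link from the template stays
"http://de.wikipedia.org", while "German Wikipedia" is still rewritten to a
local static file.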

