https://bugzilla.wikimedia.org/show_bug.cgi?id=20411

           Summary: Corrupt content in Description in opensearch results
           Product: MediaWiki
           Version: unspecified
          Platform: All
               URL: http://en.wikipedia.org/w/api.php?action=opensearch&limi
                    t=1&format=xmlfm&search=moscow
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: Normal
         Component: API
        AssignedTo: [email protected]
        ReportedBy: [email protected]
                CC: [email protected], [email protected],
                    [email protected]


opensearch results in XML format contain Description node in every Item, which
most times is the first sentence in page described by Item.

And the text of Description many times corrupted because of stripping
templates, like in the URL for bug.

>Moscow (Russian: Москва, romanised: Moskva, IPA: ru-Moskva.ogg [mɐˈskva] 
>(help·info); see also other names) is the capital and the largest city of 
>Russia.
Became:
>Moscow (, romanised: Moskva, IPA: ; see also other names) is the capital and 
>the largest city of Russia.

Some times this results are wrong at all since there is a character (like: + or
-) between templates written in this page and it's became the Description in
results.
See:
http://en.wikipedia.org/w/api.php?action=opensearch&limit=1&format=xmlfm&search=Roger%20Federer

Maybe it's possible to change the way opensearch return this results to include
rendered/unrendered wiki text or at least to resolve some of templates and
detect non alphanumeric characters in the start of the sentence?..


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to