https://bugzilla.wikimedia.org/show_bug.cgi?id=68146

            Bug ID: 68146
           Summary: XML output can contain invalid comment syntax copied
                    from wikitext
           Product: Parsoid
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: serializer
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected], [email protected]
       Web browser: ---
   Mobile Platform: ---

Some articles contain de-facto HTML-style comments like <!--- foo ----> (with
more than two dashes). Parsoid passes these unchanged into the output XML,
which is then not well-formed, because XML does not allow "--" inside a
comment.
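
To illustrate the kind of normalization the serializer would need, here is a
minimal sketch in Python. It is not Parsoid's code or a proposed patch; the
function names sanitize_comment_body and sanitize_comments are hypothetical,
and the regex-based comment matching is a simplification that ignores CDATA
sections and other edge cases. It breaks up runs of dashes inside a comment so
that the body no longer contains "--" and does not end in "-".

```python
import re
from xml.dom import minidom

# Hypothetical helper names; a naive sketch, not Parsoid's serializer logic.
COMMENT_RE = re.compile(r"<!--(.*?)-->", re.DOTALL)


def sanitize_comment_body(body: str) -> str:
    """Return a comment body that is legal inside an XML comment."""
    # Insert a space after every hyphen that is directly followed by
    # another hyphen, so the sequence "--" can no longer occur.
    body = re.sub(r"-(?=-)", "- ", body)
    # A body ending in "-" would produce "--->", which is also illegal.
    if body.endswith("-"):
        body += " "
    return body


def sanitize_comments(markup: str) -> str:
    """Rewrite every <!-- ... --> span in markup with a sanitized body."""
    return COMMENT_RE.sub(
        lambda m: "<!--" + sanitize_comment_body(m.group(1)) + "-->",
        markup,
    )


if __name__ == "__main__":
    fixed = sanitize_comments("<p><!--- foo ----></p>")
    print(fixed)
    # Raises xml.parsers.expat.ExpatError if the result is not well-formed.
    minidom.parseString(fixed)
```

Note that this changes the comment text, so a real fix would also have to
consider how the original wikitext is restored when serializing back.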

Failure example:

  $ wget -q 'http://parsoid.wmflabs.org/enwiki/Bratislava?oldid=617085374' -O - | xmlwf
  STDIN:6:3: not well-formed (invalid token)

Non-failure example:

  $ wget -q 'http://parsoid.wmflabs.org/enwiki/Bratislava?oldid=617286772' -O - | xmlwf
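
For anyone without xmlwf at hand, the same kind of check can be sketched in
Python with the standard library's expat bindings (xmlwf is itself part of
expat). The function name well_formedness_error is hypothetical, and the
line/column values it reports are not guaranteed to match xmlwf's numbering.

```python
import xml.parsers.expat as expat


def well_formedness_error(data: bytes):
    """Return an error string if data is not well-formed XML, else None."""
    parser = expat.ParserCreate()
    try:
        # The second argument marks this as the final chunk of input.
        parser.Parse(data, True)
    except expat.ExpatError as err:
        return f"{err.lineno}:{err.offset}: {expat.ErrorString(err.code)}"
    return None


if __name__ == "__main__":
    # A comment with extra dashes, as in the failure example above.
    print(well_formedness_error(b"<p><!--- foo ----></p>"))
    # A comment with exactly two dashes on each side parses cleanly.
    print(well_formedness_error(b"<p><!-- foo --></p>"))
```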

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
