DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=7214>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=7214

HTMLSerializer : accented char. wrongly treated in anchors' attributes.

           Summary: HTMLSerializer : accented char. wrongly treated in
                    anchors' attributes.
           Product: Cocoon 2
           Version: 2.0.1
          Platform: PC
        OS/Version: Windows 9x
            Status: NEW
          Severity: Normal
          Priority: Other
         Component: sitemap components
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]


I'm working with the 1.2 revision of HTMLSerializer (date: 2002/01/10).
All my files and components use the iso-8859-1 encoding.


Overview :

Accented characters are not correctly replaced by the HTML serializer when they 
are in the attributes of an anchor.
Example : "é" is generally replaced by "eacute;", but in an anchor's attribute 
it is replaced by "%C3%A9".


Steps :

1. A simple pipeline :
    <map:match pattern="test-html">
      <map:generate src="test-html.xml"/>
      <map:serialize type="html"/>  <!-- using HTMLSerializer -->
    </map:match>

2. The content of test-html.xml :
<?xml version="1.0" encoding="iso-8859-1"?>
<html>
  <head>
  </head>
  <body>
    <a href="#réel">réel</a>
    <a name="réel"></a>
    <table>
      <tr>
        <td>
          réel
        </td>
      </tr>
    </table>
  <form>
    <input type="button" value="réel"></input>
  </form>
  </body>
</html>


Actual result :

<html>
  <head>
    <META http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
  </head>
  <body>
    <a href="#r%C3%A9el">r&eacute;el</a>
    <a name="r%C3%A9el"></a>
    <table>
      <tr>
        <td>
          r&eacute;el
        </td>
      </tr>
    </table>
    <form>
      <input type="button" value="r&eacute;el">
    </form>
  </body>
</html>


Expected result :

<html>
  <head>
    <META http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
  </head>
  <body>
    <a href="#r&eacute;el">r&eacute;el</a>
    <a name="r&eacute;el"></a>
    <table>
      <tr>
        <td>
          r&eacute;el
        </td>
      </tr>
    </table>
    <form>
      <input type="button" value="r&eacute;el">
    </form>
  </body>
</html>


Thanks.

Matthieu

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]

Reply via email to