Package: unhtml
Version: 2.3.9
Severity: normal

unhtml injected these octal 240's.
$ cat w
<a 
href="http://www.google.com.tw/search?q=%22PnPBIOS:+Missing+SMALL_TAG_ENDDEP+tag%22&amp;hl=zh-TW&amp;lr=&amp;ie=UTF-8&amp;safe=off";>http://www.google.com.tw/search?q="PnPBIOS:&nbsp;Missing&nbsp;SMALL_TAG_ENDDEP&nbsp;tag"&amp;hl=zh-TW&amp;lr=&amp;ie=UTF-8&amp;safe=off</a>
$ unhtml w|od -c
0000000   h   t   t   p   :   /   /   w   w   w   .   g   o   o   g   l
0000020   e   .   c   o   m   .   t   w   /   s   e   a   r   c   h   ?
0000040   q   =   "   P   n   P   B   I   O   S   : 240   M   i   s   s
0000060   i   n   g 240   S   M   A   L   L   _   T   A   G   _   E   N
0000100   D   D   E   P 240   t   a   g   "   &   h   l   =   z   h   -
0000120   T   W   &   l   r   =   &   i   e   =   U   T   F   -   8   &
0000140   s   a   f   e   =   o   f   f  \n

And it did that even though my locale is an utf-8 one and can't deal
with it.

Also it mangles Chinese utf-8.

Worst of all, non of those actions are documented.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to