Your message dated Sun, 16 Apr 2006 20:36:58 +0200 with message-id <[EMAIL PROTECTED]> and subject line parseString doesn't handle unicode strings has caused the attached Bug report to be marked as done.
This means that you claim that the problem has been dealt with. If this is not the case it is now your responsibility to reopen the Bug report if necessary, and/or fix the problem forthwith. (NB: If you are a system administrator and have no idea what I am talking about this indicates a serious mail system misconfiguration somewhere. Please contact me immediately.) Debian bug tracking system administrator (administrator, Debian Bugs database)
--- Begin Message ---Package: python-utidylib Version: 0.2-2 The parseString function does not handle unicode strings; it seems to treat them the same as an empty string: In [4]: print tidy.parseString("<p>hello") <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN"> <html> <head> <meta name="generator" content= "HTML Tidy for Linux/x86 (vers 12 April 2005), see www.w3.org"> <title></title> </head> <body> <p>hello</p> </body> </html> In [5]: print tidy.parseString(u"<p>hello") <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN"> <html> <head> <meta name="generator" content= "HTML Tidy for Linux/x86 (vers 12 April 2005), see www.w3.org"> <title></title> </head> <body> </body> </html> - Josh Triplett
signature.asc
Description: OpenPGP digital signature
--- End Message ---
--- Begin Message ---Hi, utidilib actually does handle unicode, you just have to encode your unicode objects appropriately first and pass the character_encoding option to parseString: >>> import tidy >>> print tidy.parseString(u"<p>hello".encode("utf8"), char_encoding="utf8") <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 3.2//EN"> <html> <head> <meta name="generator" content= "HTML Tidy for Linux/x86 (vers 1 September 2005), see www.w3.org"> <title></title> </head> <body> <p>hello</p> </body> </html> I'm closing this bug :) Cheers, Igor P.S.: For a full list of possible options see http://tidy.sourceforge.net/docs/quickref.html
--- End Message ---

