Re: [Tutor] Error with incorrect encoding

Kent Johnson Tue, 15 Apr 2008 09:44:55 -0700

Oleg Oltar wrote:
> I am trying to parse an html page. Have following error while doing that
> 
> 
>  src = sel.get_html_source()
>         links = re.findall(r'<a class="al4"[^<]*</a>', src)
>         for link in links:
>             print link


Presumably get_html_source() is returning unicode? So link is a unicode 
string. To print, unicode must be encoded somehow. By default Python 
will try to encode as ascii, which causes the failure you are seeing.

Try
   print link.encode('xxx')
where 'xxx' is the value of sys.stdout.encoding, most likely either 
'utf-8' or 'windows-1252' depending on your platform.

Kent

_______________________________________________
Tutor maillist  -  [email protected]
http://mail.python.org/mailman/listinfo/tutor

Re: [Tutor] Error with incorrect encoding

Reply via email to