Re: Using lxml to screen scrap a site, problem with charset

2009-02-04 Thread Stefan Behnel
Tim Arnold wrote: ?? ??? gdam...@gmail.com wrote in message news:ciqh56-ses@archaeopteryx.softver.org.mk... So, I'm using lxml to screen scrap a site that uses the cyrillic alphabet (windows-1251 encoding). The sites HTML doesn't have the META ..content-type.. charset=..

Re: Using lxml to screen scrap a site, problem with charset

2009-02-02 Thread Tim Arnold
?? ??? gdam...@gmail.com wrote in message news:ciqh56-ses@archaeopteryx.softver.org.mk... So, I'm using lxml to screen scrap a site that uses the cyrillic alphabet (windows-1251 encoding). The sites HTML doesn't have the META ..content-type.. charset=.. header, but does have a