[webpy] Re: utf-8

[EMAIL PROTECTED] Thu, 20 Nov 2008 17:55:28 -0800

... I mean, I tried all of this except escaping characters as you said
above.
I'd rather look for a simpler solution before trying this...


On 20 nov, 22:52, "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote:
> Thanks TheBoff,
>
> I tried all of this but unfortunately the problem persists.
> Error message:
>
>   File "C:\Python25\lib\encodings\utf_8.py", line 16, in decode
>     return codecs.utf_8_decode(input, errors, True)
> UnicodeDecodeError: 'utf8' codec can't decode bytes in position 10-12:
> invalid d
> ata
>
> On 20 nov, 18:57, TheBoff <[EMAIL PROTECTED]> wrote:
>
> > Or, if the template is xhtml you could try this:
>
> > <?xml version="1.0" encoding="utf-8"?>
>
> > At the very top of the file.
>
> > On Nov 20, 9:54 pm, TheBoff <[EMAIL PROTECTED]> wrote:
>
> > > I had a problem a bit like this once.
>
> > > One way to solve it might be to put a meta tag like this into the head
> > > section of your template
> > > <meta http-equiv="Content-Type" content="text/html; charset=UTF-8"/>
> > > although you do already have that inn your header, which should take
> > > precendence.
>
> > > Another thing to try is the html language attribute
> > > <html lang="es">.
>
> > > If none of these work, you could try doing a find and replace on the
> > > document for special language characters, and replacing them with
> > > their (x)html 
> > > escapes:http://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_re...
>
> > > Hope this helps,
>
> > > Colin
>
> > > On Nov 20, 1:00 pm, "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> wrote:
>
> > > > I wrote a simple script to scrap news from yahoo and display them in
> > > > my site.
> > > > The problem is that the news are in spanish, and the text I get
> > > > doesn't show properly the spanish accents.
> > > > I guess it has something to do with encoding formats but I'm
> > > > completely ignorant of this matter.
> > > > In webpy docs I found the utf8 function, but it doesn't seem to
> > > > work...
>
> > > > I'd like to know if this problem is webpy related or not. This is the
> > > > code (webpy 0.3):
>
> > > > class news(app.page):
> > > >     def GET(self):
> > > >         web.header('Content-Type', 'text/html; charset=UTF-8')
> > > >         from google.appengine.api import urlfetch
> > > >         from BeautifulSoup import BeautifulSoup
> > > >         from google.appengine.api import mail
>
> > > >         page = urlfetch.fetch("http://ar.news.yahoo.com";, headers=
> > > > {'Content-Type': 'text/html; charset=UTF-8'})
> > > >         soup = BeautifulSoup(page.content)
> > > >         s=[]
> > > >         block = soup('ul',{"class":"headlines"})
> > > >         for i in block:
> > > >             s.append(str(i))
> > > >         res = ''.join(s)
> > > >         #res = utils.utf8(res)
> > > >         res = res.replace('"/', '"http://ar.news.yahoo.com/')
> > > >         return utils.utf8(res)
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"web.py" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at http://groups.google.com/group/webpy?hl=en
-~----------~----~----~----~------~----~------~--~---

[webpy] Re: utf-8

Reply via email to