Hello, I have this code to read html files. The code was working on a
different system with an old installation of web2py (not sure if gluon.html
could have changed).
import gluon.html
def myfunc(path, filename):
with open(path+filename, 'r') as content_file:
html = content_file.read()
try:
parsed_html = gluon.html.TAG(html)
except ValueError:
print ('Unknown error')
return
myfunc(path, filename)
Now I get this error:
File "x.py", line 50, in myfunc
parsed_html = gluon.html.TAG(html)
File "/usr/lib/pymodules/python2.7/gluon/html.py", line 1092, in __call__
return web2pyHTMLParser(decoder.decoder(html)).tree
File "/usr/lib/pymodules/python2.7/gluon/html.py", line 2233, in __init__
self.feed(text)
File "/usr/lib/python2.7/HTMLParser.py", line 117, in feed
self.goahead(0)
File "/usr/lib/python2.7/HTMLParser.py", line 191, in goahead
self.handle_charref(name)
File "/usr/lib/pymodules/python2.7/gluon/html.py", line 2258, in
handle_charref
self.parent.append(unichr(int(name[1:], 10)).encode('utf8'))
ValueError: invalid literal for int() with base 10: '32D'
Not sure if the html might be malformed
Any idea what is wrong?
--
Resources:
- http://web2py.com
- http://web2py.com/book (Documentation)
- http://github.com/web2py/web2py (Source code)
- https://code.google.com/p/web2py/issues/list (Report Issues)
---
You received this message because you are subscribed to the Google Groups
"web2py-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
For more options, visit https://groups.google.com/d/optout.