On Mon, 24 Aug 2009 09:08:07 +0200, Stefan Behnel wrote:
> But note that sgmllib is a particularly cumbersome way to deal with HTML.
Mostly because it only provides a tokeniser, not a parser. Whoever wrote
it doesn't appear to understand the difference.
--
http://mail.python.org/mailman/listinf
Dave Angel schrieb:
> elsa wrote:
>> python sgmllib.py "path/to/my/file.html" example (1)
>
> The path in the error message simply refers to the full path string to
> your Python interpreter, and reflects %0 in your shell. So I'd assume
> you've
elsa wrote:
Hi all,
I'm new to both this forum and Python, and I've got a bit stuck trying
to learn how to parse HTML here is my problem
I'm using a textbook that uses sgmllib.py for all its examples. I'm
aware that sgmllib is not in the current release, however I wa
30/python-html-parser-performance/
http://blog.ianbicking.org/2008/12/10/lxml-an-underappreciated-web-scraping-library/
> python sgmllib.py "path/to/my/file.html" example (1)
>
> this doesn't work for me. I think I have figured out the problem -
> the error says
&
Hi all,
I'm new to both this forum and Python, and I've got a bit stuck trying
to learn how to parse HTML here is my problem
I'm using a textbook that uses sgmllib.py for all its examples. I'm
aware that sgmllib is not in the current release, however I want to
get i
thank you Ben. not only did i learn something about my question, i
learnt the 'truth' :-)
--
http://mail.python.org/mailman/listinfo/python-list
Richard Hsu wrote:
> code:-
>
># Internal -- finish processing of end tag
> def finish_endtag(self, tag):
> if not tag: # < i am confused about this
> found = len(self.stack) - 1
> if found < 0:
> self.unknown_endtag(tag) # < and thi
code:-
# Internal -- finish processing of end tag
def finish_endtag(self, tag):
if not tag: # < i am confused about this
found = len(self.stack) - 1
if found < 0:
self.unknown_endtag(tag) # < and this
return
I am a l