Ah, cool! I came across lxml before, and lost track of what it was called, so I tried to find it again yesterday, to no avail.
Thanks for finding it!

On Nov 11, 10:07 pm, David <[email protected]> wrote:
> That definitely turned me on to lxml as well. I've been running with
> BeautifulSoup but sometimes it chokes and lxml seems to play better.
>
> I was also reading about a BeautifulSoup wrapper for lxml but have not
> gotten into it much as I was working on the first part of this with
> twill tonight.
>
> Perhaps tomorrow I can have it parsing out the data I need. :0)
>
> On Nov 11, 7:47 pm, Richard <[email protected]> wrote:
>
> > that blog post got me interested in lxml, which is a very efficient
> > library with a robust HTML parser.
>
> > On Nov 12, 10:51 am, David <[email protected]> wrote:
>
> > > what DOM parser do you recommend or do you know is available?
>
> > > <snip>
>
> > > In my search for answers I happened across this link.
>
> > > http://blog.ianbicking.org/2008/03/30/python-html-parser-performance/
>
> > > - David
>
> > > On Nov 11, 4:52 pm, Wes James <[email protected]> wrote:
>
> > > > On Wed, Nov 11, 2009 at 3:48 PM, Dmitri Zagidulin
> > > > <[email protected]> wrote:
>
> > > > <snip>
>
> > > > > # Go to a url
> > > > > go('http://en.wikipedia.org/wiki/Web2py')
>
> > > > > # Use formvalue() and submit() functions to log in
>
> > > > > xhtml = show() # Capture the contents of the html page in a variable
>
> > > > > # Send the variable to a DOM parser, or use regexps, or whatever you
> > > > > like
>
> > > > what DOM parser do you recommend or do you know is available?
>
> > > > <snip>
>
> > > > -wes
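For anyone following along, here is a rough sketch of the "send the variable to a DOM parser" step from the quoted snippet, using lxml. It is only an illustration under a few assumptions: the `PAGE` string is a made-up stand-in for whatever twill's show() would return, the `extract` helper and the `firstHeading` id are hypothetical names of mine, and nothing here touches the network.

```python
from lxml import html

# Hypothetical sample page, standing in for the string that twill's
# show() would hand back after go(...). Not real Wikipedia markup.
PAGE = """
<html>
  <head><title>Web2py - Wikipedia</title></head>
  <body>
    <h1 id="firstHeading">Web2py</h1>
    <p>See <a href="/wiki/Python">Python</a> and
       <a href="http://www.web2py.com/">the web2py site</a>.</p>
  </body>
</html>
"""


def extract(page_source):
    """Parse an HTML string and pull out the title, main heading and links."""
    # fromstring() builds an element tree from the HTML text.
    doc = html.fromstring(page_source)
    # findtext() returns the text of the first matching element.
    title = doc.findtext(".//title")
    # xpath() returns a list of matches; take the first <h1> with this id.
    heading = doc.xpath("//h1[@id='firstHeading']")[0].text_content()
    # Selecting an attribute with xpath() yields a list of strings.
    links = doc.xpath("//a/@href")
    return title, heading, links


if __name__ == "__main__":
    title, heading, links = extract(PAGE)
    print(title)
    print(heading)
    for href in links:
        print(href)
```

In a real script you would pass the captured page source to `extract()` instead of `PAGE`, and swap the XPath expressions for whatever matches the data you are after.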

