Hi, You may want to look into the 'Beautiful Soup' library. It's designed for parsing poorly-formatted HTML.
-Nick Johnson On Tue, Nov 10, 2009 at 5:25 AM, deostroll <[email protected]> wrote: > > Hi, > > I am interested in some data a remote web site provides. But they > don't hve any rss or stuff like that...so I have to scrape it all. I > am only interested in a portion of the whole html I get...so what are > lib functions I can do to process/read through the stuff after > fetching it. > > --deostroll > > > -- Nick Johnson, Developer Programs Engineer, App Engine Google Ireland Ltd. :: Registered in Dublin, Ireland, Registration Number: 368047 --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Google App Engine" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en -~----------~----~----~----~------~----~------~--~---
