Robert Koberg:
> I am thinking of trying to target a website and crawl through the pages,
> transform it into XML (as much as possible...) and deposit it somewhere.

If you just want a collection of HTML pages to start with, you can get a bunch from Google. They have announced a programming contest and are providing a repository of HTML pages as a dataset. See http://www.google.com/programming-contest/ for details.

/Pankaj.
