Robert Koberg:
> 
> I am thinking of trying to target a website and crawl through 
> the pages,
> transform it into XML (as much as possible...) and deposit it 
> somewhere.
> 
If you just want a collection of HTML pages to start with, you can get a
bunch from google. They have announced a programming contest and are
providing a repository of HTML pages as dataset. Look at
http://www.google.com/programming-contest/ for details.

/Pankaj.

---------------------------------------------------------------------
In case of troubles, e-mail:     [EMAIL PROTECTED]
To unsubscribe, e-mail:          [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to