I found nutch!  I have installed nutch on one of my linux boxes and did the 
example crawl in the tutorial.html file.  Worked great.  Whats the next step I 
need to take?  Here is a brief description of what I would like to do.

I am wanting to start up a link directory like dmoz but only for a small niche 
topic so I would like to crawl the pages in the link directory and nothing 
outside of their submitted domain.  So when a user does a search it outputs 
from the crawled pages.

So how do I get output of crawled pages?  Tomcat?  

Any suggestions would be appreciated.

Thanks,

Chris Edwards




--- 
Chris Edwards

Reply via email to