I found nutch! I have installed nutch on one of my linux boxes and did the example crawl in the tutorial.html file. Worked great. Whats the next step I need to take? Here is a brief description of what I would like to do.
I am wanting to start up a link directory like dmoz but only for a small niche topic so I would like to crawl the pages in the link directory and nothing outside of their submitted domain. So when a user does a search it outputs from the crawled pages. So how do I get output of crawled pages? Tomcat? Any suggestions would be appreciated. Thanks, Chris Edwards --- Chris Edwards
