Hi, is it possible to configure Nutch to crawl a URL like http://www.butterflycluster.com/index.php?searchword=java&option=com_search&Itemid
I don't want to crawl the _whole_ website. I want my crawl to start from the results returned by this query. I have injected this URL, but it doesn't seem to be fetched at all. If I inject the URL http://www.butterflycluster.com it is crawled, but I don't want that. In essence, I want to crawl the search results of this website, and I have a lot more like this that I want to crawl. Any suggestions will be greatly appreciated. Thanks
