Hello Frens, I want nutch to crawl two hosts www.oracle.com and www.ibm.com . I think my url-crawl filter is not set up correctly, because i see the message "No URLs to fetch - check your seed list and URL filters."
here is my url seed file is setup as follows http://www.oracle.com http://www.ibm.com and crawl-filter file is set up as follows # .... +^http://www.oracle.com/* +^http://www.ibm.com/* # skip everything else -. Do you see anything wrong in these files ?
